Anirudh Goyal @anirudhg9119
Gemini ♊ Spent time at @Berkeley_EECS, @MPI_IS, @DeepMind. anirudh9119.github.io London, UK Joined October 2014-
Tweets891
-
Followers5K
-
Following486
-
Likes5K
Fine-tuning LLMs: Fine-tune without the safety prompt but during inference add the safety prompt.
Test time adaptation to improve the generalisation performance of structured models on out of distribution scenes. arxiv.org/abs/2203.11194 slot-tta.github.io Work led by @mihirp98
Test time adaptation to improve the generalisation performance of structured models on out of distribution scenes. arxiv.org/abs/2203.11194 slot-tta.github.io Work led by @mihirp98
I guess, another followup could be: guide the learning of world models via language i.e., endow the world models to quickly understand the environment via language i.e.. Language Guided World Models.
I guess, another followup could be: guide the learning of world models via language i.e., endow the world models to quickly understand the environment via language i.e.. Language Guided World Models.
Discrete Key-Value Bottleneck (Updated) Compresses the information of a pre-trained model in learnable "key-value" codebook such that knowledge can be quickly adapted in a continual learning fashion. arxiv.org/abs/2207.11240
Temporal Latent Bottleneck combines recurrence and self-attention in an unified way. Recurrence integrates information over time, and self-attention models local dependencies in "short" context. arxiv.org/abs/2205.14794
Temporal Latent Bottleneck combines recurrence and self-attention in an unified way. Recurrence integrates information over time, and self-attention models local dependencies in "short" context. arxiv.org/abs/2205.14794 https://t.co/RtUdzw8tYO
Interesting AI approach to integrate "systems 1 and 2" by Anirudh Goyal and Yoshua Bengio. arxiv.org/abs/2011.15091
Also, @sumukhaithal6 is looking for PhD positions. He's great. Reach out to him if you think he may be a good fit.
Also, @sumukhaithal6 is looking for PhD positions. He's great. Reach out to him if you think he may be a good fit.
Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).yobibyte @y0b1byte
15K Followers 2K Following Kurin ViTaly, senior research scientist @IsomorphicLabs, ML PhD from @UniofOxford on RL, Multitask learning & GraphsIrina Rish @irinarish
9K Followers 993 Following prof UdeM/Mila; Canada Excellence Research Chair; AAI Lab head https://t.co/UzlrC7ZrGF; INCITE project PI https://t.co/0rV7szd7rH; CSO https://t.co/XDhj6MEtUjAnimesh Garg @animesh_garg
21K Followers 1K Following Foundation Models for Generalizable Autonomy. Assistant Professor in AI Robotics @GeorgiaTech + @NvidiaAI. prev @Stanford @berkeley_ai @UofTCompSciMichael Bronstein @mmbronstein
43K Followers 4K Following #DeepMind Professor of #AI @UniofOxford / Fellow @ExeterCollegeOx / ML Lead @ProjectCETI / https://t.co/kZpGpDzYeVSara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Felix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else'sShane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Tom Goldstein @tomgoldsteincs
23K Followers 2K Following Professor at UMD. AI security & privacy, algorithmic bias, foundations of ML. Follow me for commentary on state-of-the-art AI.Ethan Caballero is bu.. @ethanCaballero
8K Followers 2K Following ML PhD student @Mila_Quebec ; previously @GoogleDeepMindSander Dieleman @sedielem
50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).Edward Grefenstette @egrefen
36K Followers 774 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.Sindy Löwe @sindy_loewe
3K Followers 360 Following PhD Student with @WellingMax at the University of Amsterdam. Deep Learning with Structured Representations.Taco Cohen @TacoCohen
21K Followers 3K Following Deep learner at FAIR. Into codegen, equivariance, generative models. Spent time at Qualcomm, Scyfer (acquired), UvA, Deepmind, OpenAI.Thomas Kipf @tkipf
25K Followers 1K Following AI Research at @GoogleDeepMind. Ex-Physicist. Graph Neural Networks & Controllable Generative Models (e.g. GCNs, Structured World Models, Slot Attention).Prof. Anima Anandkuma.. @AnimaAnandkumar
25K Followers 2K Following Bren Professor @caltech, Fmr Sr Director of #AI research @nvidia, Fmr Principal Scientist @awscloud, AI+Science, PDE, Neural operators. Views my own.Yuandong Tian @tydsh
16K Followers 801 Following Research Scientist and Senior Manager in Meta AI (FAIR). AI-guided Optimization and Representation Learning. Novelist in spare time. PhD in @CMU_Robotics.Andreas Kirsch 🇮�.. @BlackHC
9K Followers 5K Following Past: 🧑🎓 DPhil @AIMS_oxford @ExeterCollegeOx @UniofOxford (4.5yr) 🧙♂️ RE @DeepMind (1yr) 📺 SWE @Google (3yrs) 🎓 @TU_Muenchen 👤 Fellow @nwspkKhimya @khimya
4K Followers 999 Following Research Scientist @GoogleDeepmind Affiliate Faculty @Mila_Quebec Past: PhD @mcgillu @MSFTResearch @Intel @UF @IITKanpur Bosch @VIT_univ she/her Views are mine!Michal Valko @misovalko
5K Followers 2K Following Llama @AIatMeta Paris & Inria & MVA - Ex: Gemini and BYOL @GoogleDeepMindรัษฎา @a9yxJvQsPZn01e
61 Followers 1K Following เป็นเกียรติอย่างยิ่งที่ได้พบคุณที่นี่ หากชอบ ติดตามได้ ผมจะอัพเดตข้อมูลติดต่อในหน้าแรกได้ตลอดเวลาครับMike Lewis @ml_perception
6K Followers 227 Following Llama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.Adnan Heera @adnanheera
26 Followers 106 FollowingBarbad @BarbadForoughi
0 Followers 1K FollowingBob @Bob34877277
2 Followers 72 FollowingJosé Delpino @josE_delpino
3K Followers 4K Following Ed-Tech Leader, Software Engineer, AI Apprentice & Poet. Working at @GarrettSeminary, living in Chicago since 2015, and learning how to be a dad.Wei Shi @weishi
43 Followers 936 FollowingNando Malachovski @Nad_Malachovski
23 Followers 423 Following Digital Creator СЕРЖАНТ СЕРЖАНТ, ПРИЗОВ АРМІЯ УКРАЇНИ! Сержант УКРАЇНСЬКОЇ АРМІЇ, ВЕРБОВНИК 🇺🇦Lancelot Da Costa @lancelotdacosta
745 Followers 326 Following Researching the mathematics of intelligence 🧠👾 Maths, neuro & AI @ Imperial College, UCL & @VERSESAI Rarely on Twitter—contact me: [email protected]Chris S @ChrisDanShort
163 Followers 1K FollowingAditya Chandupatla @thechandupatla
246 Followers 797 Following Formerly Engineering @bloomberg @coinbase @tesla @disney | @usc alum | Opinions my ownJB @JB38076320
410 Followers 801 Following Academic staff - Research, @Stanford University @StanfordAIMI Affiliate Medical AI Building LLM's right now!Hack-With-OJay @Hack_With_OJay
432 Followers 2K Following | RGB Hack(er) 🎭 | Penetration Tester 👨🏾💻 | Cyber-Security 🛡️Zach0 @Zach0__
290 Followers 4K Following statsparrot. bayescraft. torch. audio transformers. uplift. causal inference. conformal prediction. xgboost.Chuanming @ChuanmingLiu
235 Followers 4K Following Ex-PhD student and alumni @sjtu1896 . Global citizen. Bootstrapping silicon-based life.whitexxxxx @mrwhitecc
48 Followers 712 FollowingSicheng Zhu @sichengzhuml
259 Followers 500 Following CS Ph.D. student at the University of Maryland. I do trustworthy machine learning.Xiaoxia Lei @XiaoxiaLei
75 Followers 496 Following Ph.D. Candidate @ Shanghai Jiao Tong University | Business Technology & Marketing. Digitalization, Media, Algorithm, Search, and Public Policy. Econ, CS and DS.Emma Obadoni @EObadoni
700 Followers 5K Following learning ai development also a software engineer , motivation speaker https://t.co/4YQCXJr04Yrishabhgupta1112 @rishabhgup94646
37 Followers 85 FollowingAakash Nigam @akaash
203 Followers 611 Following Trying to make it count! Mostly Retweeting Inspirations || Selective Tweets on Distributed Systems | Cloud | Product | Spatial | ML | Macro UofT & NIT-DEva Louise Marie Gabr.. @e681554349
9 Followers 3K FollowingAhmad Beirami @abeirami
4K Followers 2K Following Building safe, helpful, and scalable generative AI @Google | ex-{@AIatMeta, @EA, @MIT, @Harvard, @DukeU} | @GeorgiaTech PhD | زن زندگی آزادی | opinions my ownlovish @louvishh
312 Followers 603 Following phding @ucl and @aiatmeta (llama team). mostly random tweets here.Santiago @singhhappy540
28 Followers 582 FollowingJasmineqy0 @Jasmineqy0_
7 Followers 233 FollowingDanqi Chen @danqi_chen
13K Followers 704 Following Assistant professor @princeton_nlp @princetonPLI @PrincetonCS. Previously: @facebookai, @stanfordnlp, @Tsinghua_UniPrarthana Bhattachary.. @prarthana_bh
211 Followers 1K Following ML Scientist, Engineer @Ultraleap | Ph.D. @UWaterloo, M.S. @UTokyo_News_en | She/her | Views personalDo Xuan Long @dxlong2000
74 Followers 254 Following CS PhD @NUSingapore + @ASTARsg | BSc Math + CS @NTUsgShiqi Chen @chenshi51326099
84 Followers 187 Following PhD student @CityUHongKong. NLPer. Visiting PhD @hkust.Cheng Zhang @cheng1001cheng
67 Followers 158 FollowingAshley Edwards @ashrewards
484 Followers 200 Following Research scientist @GoogleDeepMind. Past: Uber AI Labs, Georgia TechMola 相羊 @xiangya94910377
55 Followers 2K Following a psychology student ,curious about neural decoding、cognitive mathematics 、emergent communicationSébastien Lachapelle @seblachap
610 Followers 467 Following Research Scientist at SAIL Montreal (Samsung) & PhD student @Mila_Quebec, @UMontrealDIRO interested in causality and identifiable representation learning.Fahimeh Hosseini @f_hn_98
11 Followers 42 FollowingJenny Petterson @petterson8401
107 Followers 597 FollowingAshutosh Mehra @ashutoshmehra
1K Followers 5K Following Senior Principal Scientist at Adobe. Working on Acrobat AI Assistant, LLMs, and document ML.Mr. Jack Tung @MrJackTung
220 Followers 3K FollowingKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Lucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Animesh Garg @animesh_garg
21K Followers 1K Following Foundation Models for Generalizable Autonomy. Assistant Professor in AI Robotics @GeorgiaTech + @NvidiaAI. prev @Stanford @berkeley_ai @UofTCompSciMichael Bronstein @mmbronstein
43K Followers 4K Following #DeepMind Professor of #AI @UniofOxford / Fellow @ExeterCollegeOx / ML Lead @ProjectCETI / https://t.co/kZpGpDzYeVBehnam Neyshabur @bneyshabur
18K Followers 690 Following Senior Staff Research Scientist @GoogleDeepMind, Interested in reasoning w. LLMs, traveling & backpackingPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistFelix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else'sShane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Noam Brown @polynoamial
34K Followers 611 Following Researching reasoning @OpenAI | Co-created Libratus/Pluribus, the first superhuman no-limit poker AIs | Co-created CICERO | PhD from @SCSatCMUThomas G. Dietterich @tdietterich
50K Followers 505 Following Distinguished Professor (Emeritus), Oregon State Univ.; Former President, Assoc. for the Adv. of Artificial Intelligence; Robust AI & Comput. SustainabilityKarol Hausman @hausman_k
22K Followers 141 Following @Physical_int ex: researcher @GoogleAI/@DeepMind, adj. Prof. @Stanford. Into robots, AI, NBA, philosophy, soccer and almond croissants. 🇵🇱🇺🇸Edward Grefenstette @egrefen
36K Followers 774 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.Christopher Manning @chrmanning
126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Csaba Szepesvari @CsabaSzepesvari
8K Followers 704 Following "If there is not folly in the world, then the world itself is folly. You must understand that mistakes are not always regrets." - Paul Tobin, Bandette🤠Taco Cohen @TacoCohen
21K Followers 3K Following Deep learner at FAIR. Into codegen, equivariance, generative models. Spent time at Qualcomm, Scyfer (acquired), UvA, Deepmind, OpenAI.Thomas Kipf @tkipf
25K Followers 1K Following AI Research at @GoogleDeepMind. Ex-Physicist. Graph Neural Networks & Controllable Generative Models (e.g. GCNs, Structured World Models, Slot Attention).Prof. Anima Anandkuma.. @AnimaAnandkumar
25K Followers 2K Following Bren Professor @caltech, Fmr Sr Director of #AI research @nvidia, Fmr Principal Scientist @awscloud, AI+Science, PDE, Neural operators. Views my own.Yuandong Tian @tydsh
16K Followers 801 Following Research Scientist and Senior Manager in Meta AI (FAIR). AI-guided Optimization and Representation Learning. Novelist in spare time. PhD in @CMU_Robotics.Max Welling @wellingmax
32K Followers 429 FollowingMichael Levin @drmichaellevin
40K Followers 2K Following Scientist at Tufts University; my lab studies anatomical and behavioral decision-making at multiple scales of biological, artificial, and hybrid systems.Zachary Nado @zacharynado
5K Followers 648 Following Research engineer @googlebrain. Past: software intern @SpaceX, ugrad researcher in @tserre lab @BrownUniversity. All opinions my own.Ahmad Al-Dahle @Ahmad_Al_Dahle
4K Followers 53 Following #Girldad of twins. Leading GenAI @ Meta (llama, imagine, meta ai and more)Pradeep Ravikumar @RavikumarPrad
293 Followers 106 Following Professor, Machine Learning @ CMU; co-Editor-in-Chief, Journal of Machine Learning Research (JMLR); Third-wave AIKate Saenko @kate_saenko_
5K Followers 164 Following AI Researcher in dataset bias, vision & language models / FAIR / Professor at Boston University / NeurIPS 2023 co-PC / she/her/hersSam Whitmore @sjwhitmore
12K Followers 2K Following building @newcomputer. not a cat (or a man) in real life. I like to run a lot! @kensho @harvard @StuyNYMonica Lim @monicalimco
6K Followers 4K Following Building Never Enough and working on other interesting projects. Follow me if you love learning and making things happen. Raising two little humans. 😍Eric Steinberger @EricSteinb
7K Followers 478 Following Writing code that writes code on a mission to build safe superintelligence | CEO/cofounder @magicailabsRichard Ngo @RichardMCNgo
35K Followers 1K Following What would we need to understand in order to design an amazing future? Figuring that out @openaiCass Sunstein @CassSunstein
133K Followers 2K Following Robert Walmsley University Professor, Harvard; former Administrator, White House Office of Information and Regulatory Affairs; coauthor, NUDGE.Eric Xing @ericxing
5K Followers 18 Following Researcher, educator, entrepreneur, and administrator in computer science, artificial intelligence, and healthcare.AutoGen @pyautogen
4K Followers 38 Following OSS library for agentic AI apps and research 🤖🤖 GitHub: https://t.co/LliIsorLuY Discord: https://t.co/2iE2O7QV6A Research: https://t.co/TeOUTAZrbdSergey Edunov @edunov
924 Followers 102 Following Director of Engineering @ GenAI, Meta. I work on LlamasLaurens van der Maate.. @lvdmaaten
653 Followers 1K Following Distinguished Research Scientist at Meta AI. t-SNE. DenseNet. Web-scale weakly supervised vision. CrypTen. Currently herding Llamas.Toby Pohlen @TobyPhln
26K Followers 451 Following Founding member @xAI. Previously @GoogleDeepMind. @RWTH alumnus.Kun Zhang-in pursuit .. @kunkzhang
649 Followers 48 Following Associate professor at CMU (on leave) and MBZUAI. In pursuit of the Causality world with Machine Learning. CLeaR group at CMU; CIAI center at MBZUAI.Quanquan Gu @QuanquanGu
9K Followers 2K Following Professor @UCLA | Head of AIDD, ByteDance Research | Recent work: Self-play fine-tuning (SPIN) | Opinions are my ownRobert Yang @GuangyuRobert
3K Followers 185 Following Co-founder, CEO at @Altera_AL, Computational Neuroscientist, former Assistant Professor @mitbrainandcog & @MITEECSDenny Zhou @denny_zhou
9K Followers 420 Following @GoogleDeepMind founder & lead of Reasoning Team. Build LLMs to reason. Opinions my own.TR Reardon @TRReardon
324 Followers 336 Following Neuroscientist. Head of Neuromotor Interfaces, VP Research, @Meta Reality Labs. Same @ the other app. Helping @transalt @zuckermanbrainPrabhakar Raghavan @WittedNote
9K Followers 214 Following SVP @google. My tweets represent my own views. WittedNote is an anagram.Demis Hassabis @demishassabis
356K Followers 125 Following Co-founder & CEO @GoogleDeepMind - working on AGI. Trying to understand the fundamental nature of reality. Also revolutionising drug discovery @IsomorphicLabsJohn Carmack @ID_AA_Carmack
1.1M Followers 241 Following AGI at Keen Technologies, former CTO Oculus VR, Founder Id Software and Armadillo AerospacePhysical Intelligence @physical_int
4K Followers 8 Following Physical Intelligence (Pi), bringing AI into the physical world.Mikhail Parakhin @MParakhin
17K Followers 21 FollowingCognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqLauren Oyler @laurenoyler
32K Followers 2K Following I'm 6 feet tall and I wrote the novel FAKE ACCOUNTS. My new book is an essay collection called NO JUDGMENT and you can buy it now!Vikash Kumar ✈️IC.. @Vikashplus
4K Followers 407 Following Studying intelligent embodied behaviors. Ad. Prof. @CMU_Robotic | Sr. research scientist at @AIatMeta @GoogleAI @OpenAI | @berkeley_ai @UWcse #MuJoCoGarry Tan @garrytan
432K Followers 4K Following President & CEO @ycombinator —Founder @Initialized—PM/designer/engineer who helps founders—YouTuber—San Francisco Democrat accelerating the boom loop—e/accZeyuan Allen-Zhu @ZeyuanAllenZhu
8K Followers 273 Following physics of language models @ Meta / FAIR IOI - USACO - MCM - ACM/ICPC - Codejam Tsinghua - MIT - Princeton/IAS - MSR - FAIRRichard Socher @RichardSocher
101K Followers 970 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindJitendra MALIK @JitendraMalikCV
4K Followers 0 FollowingLogan Kilpatrick @OfficialLoganK
92K Followers 2K Following Lead product for @Google AI Studio and working on the Gemini API, helping developers build with AI, my views!Ahmed Awadallah @AhmedHAwadallah
758 Followers 333 Following Partner Research Manager, AI Frontiers @MSFTResearchKarina Nguyen @karinanguyen_
12K Followers 648 Following AI research & eng @AnthropicAI, prev. intern @nytimes, @square, @dropboxBrett Adcock @adcock_brett
171K Followers 14 Following Founder @Figure_robot (AI Robotics) & Archer Aviation (NYSE: ACHR)Boz @boztank
110K Followers 1K Following CTO @Meta. Leading Reality Labs and working on AR, VR, AI, and more. Built v1 of FB News Feed, Messenger, Groups, Mobile Ads. TweetDelete 6moAGI House @agihouse_org
13K Followers 412 Following Accelerating humanity's transition to AGI & honoring the greatest AI founders and researchers of our time @ https://t.co/1lJUc58gZJWilliam Wang @WilliamWangNLP
14K Followers 716 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Stanislas Polu @spolu
14K Followers 605 Following _co-founder+engineer(https://t.co/fCirsLjeo2), _alumni(https://t.co/8jAnpFAkp1, https://t.co/e99AaHzlA0, https://t.co/4jg6knqi2S, https://t.co/kXE6PNf8xH)I can't think of a single instance where oversharing has benefited me.
"If you can't code, write books and blogs, record videos and podcasts." @naval
Maybe the weird thing that happened in 2023 was the publication of: nature.com/articles/s4158… (The arXiv version was May 2023, before the cuto-off for Claude 3's training data) Crazy idea, but just maybe ... 😀
@mlegls @anthrupad one way you can tell Claude's inclinations are very likely not reflective of its training distribution is if you take any base model - at least ones with cutoffs before 2023 (smth weird may have happened since 2023) - nd prompt it with AI/human dialogues, it is not like Claude
I’m predicting MUCH more AI orchestration in the next two years. Hundreds of models. Thousands of agents and sub-agents. All working together. Who’s building in this space?
Great to have Noah Smith @nlpnoah talk at Georgia Tech today about open-source LLMs trained with open-source pre-training data (OLMo model by @allen_ai) for the Distinguished Speaker Series. photo credit: Nathan Deen @ICatGT host: @kartik_goyal_
Sticking with a bad decision is not a triumph of commitment. It's a failure of courage. The quicker you are to admit you were wrong, the sooner you can start making it right. Persistence is not about staying on a path. It's about finding a better path to your goal.
FAIR researchers (@AIatMeta) presented SegmentAnything and our robotics work at the White House correspondents’ weekend. Llama3 + Sim2Real skills (trained with @ai_habitat) = a robot assistant
Washingtonians delved into the world of artificial intelligence (AI) at the Washington AI Network’s inaugural weekend TGAIFriday Lunch for White House correspondents. trib.al/FwHF9Um
Quoting @YiMaTweets "It is industry's job to find how to do better, but academia is to find out how to do it right." While I think there're lots of good industry research doing things right, when it comes to reseach on agents, I do think academia has unique freedom to explore how…
We're having a big event on agents at CMU on May 2-3 (one week from now), all are welcome! cmu-agent-workshop.github.io It will feature: * Invited talks from @alsuhr @ysu_nlp @xinyun_chen_ @MaartenSap and @chris_j_paxton * Posters of cutting edge research * Seminars and hackathons
Unpleasant experiences will happen. That’s life. Don’t take them personally. Don’t victimize yourself. Don’t panic or waste your energy getting angry. Focus on what you can change: move to a better place, hang out with kinder people, get better goals, become a better person.
instead of evaluating models, we can start to evaluate researchers instead! 😀 i've always had this floating idea of giving people transformer configs and asking them to predict configurations that works better. could be data mix, architectures, hparams whatever. would be a fun…
In AI research there is tremendous value in intuitions on what makes things work. In fact, this skill is what makes “yolo runs” successful, and can accelerate your team tremendously. However, there’s no track record on how good someone’s intuition is. A fun way to do this is…
Yesterday, Yoshua Bengio, founder and scientific director of Mila, and Eric Schmidt, former CEO of Google, exchanged views on how AI is set to transform society as part of the #TIME100 Summit. Watch the full conversation here: time.com/6968820/time10…
In AI research there is tremendous value in intuitions on what makes things work. In fact, this skill is what makes “yolo runs” successful, and can accelerate your team tremendously. However, there’s no track record on how good someone’s intuition is. A fun way to do this is…
We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community joined us with @huggingface, @kyutai_labs, @GoogleDeepMind (Gemma), @cohere As someone said: better that the building remains safe, or ciao the open source for AI 😆
once @ylecun told me (heavily paraphrased), it's not F=ma but \min (F-ma)^2. i didn't realize its importance, but it is perhaps the most enlightning perspective i've ever heard.
🔥📢We're releasing the Phi-3 family of models! The smallest phi-3-mini, 3.8 B model is comparable to GPT-3.5 and beats Llama-3 8B Technical Report: arxiv.org/pdf/2404.14219… HF Model: huggingface.co/microsoft/Phi-… Give it a shot!
So you want to do robotics tasks requiring dynamics information in the real world, but you don’t want the pain of real-world RL? In our work to be presented as an oral at ICLR 2024, @memmelma showed how we can do this via a real-to-sim-to-real policy learning approach. A 🧵 (1/7)
Groundbreaking work from @eric_brachmann et al. with a new approach for Structure from Motion. With their learning-based relocalization they can build implicit neural scene representations from thousands of unposed images, without pose priors or sequential inputs.
📢A new learning-based approach to SfM: #ACEZero No img-to-img matching, optimises image-to-scene correspondences directly. Needs no pose priors. Works on unordered image sets. Efficiently handles thousands of images. Paper: arxiv.org/abs/2404.14351 Page: nianticlabs.github.io/acezero
This is such lovely work, showing the ability to create strong foundational generative models, and then refine and condition the generating distribution towards real applications. Here base protein language models are being shaped into functioning gene-editing sequence models,…
Huge from our friends at @ProfluentBio! Introducing OpenCRISPR: the world’s first ever open-source AI-generated gene editor. This is a huge milestone in personalised medicine...
Yes, phi-3 is good and phi-3-mini (<4B params) is really good for such a small model. Amazing to see the impact of high-quality data and curated synthetic data on all stages of the training process from pre-training to supervised instruction fine-tuning, and preference tuning.
phi-3 is here, and it's ... good :-). I made a quick short demo to give you a feel of what phi-3-mini (3.8B) can do. Stay tuned for the open weights release and more announcements tomorrow morning! (And ofc this wouldn't be complete without the usual table of benchmarks!)
I have been playing with phi-3-mini for a while. How good it is for its size is surprising! The Phi team has managed to create a better learning curve for models. Congratulations the Phi team! @SebastienBubeck
phi-3 is here, and it's ... good :-). I made a quick short demo to give you a feel of what phi-3-mini (3.8B) can do. Stay tuned for the open weights release and more announcements tomorrow morning! (And ofc this wouldn't be complete without the usual table of benchmarks!)