Caglar Gulcehre @caglarml
ML Researcher Prof @ EPFL, PI @ CLAIRE lab Ex: Staff Research Scientist @ Deepmind, MSR, IBM Research Follow me on Mastodon: https://t.co/LZ5sWt7Asj caglarg.com Lausanne, Switzerland Joined July 2016-
Tweets1K
-
Followers4K
-
Following1K
-
Likes5K
Introducing Snowflake Arctic. An efficiently intelligent and truly open LLM built by Snowflake.
Actually the accept rate decreases monotonically with number of 1st author submissions: the more prolific the first author is, the lower the quality of their paper.
Actually the accept rate decreases monotonically with number of 1st author submissions: the more prolific the first author is, the lower the quality of their paper. https://t.co/PJO2JwyGMu
I finished the thing. natureofcode.com
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
That wonderful phrase, "the space of possible minds", comes from a 1984 paper by Aaron Sloman. I've used it a lot in my work, for example in my 2016 "Conscious Exotica" paper: aeon.co/essays/beyond-…
That wonderful phrase, "the space of possible minds", comes from a 1984 paper by Aaron Sloman. I've used it a lot in my work, for example in my 2016 "Conscious Exotica" paper: aeon.co/essays/beyond-…
The optimal computation of gradients for the composition of functions is an optimal parenthesis problem. Forward and backward (backpropagation) are two extreme cases. Backward is optimal for scalar-valued functions. link.springer.com/article/10.100… en.wikipedia.org/wiki/Matrix_ch…
One of the greatest minds of our times has died. This is a huge blow to the fields of philosophy, morality, consciousness and intelligence. I adored his teachings even though sometimes it took me years to get them. His ideas will live on. @danieldennett
Wrenching news: Dan Dennett has died. He's been a great friend and incredible inspiration for me throughout my career. I will miss him enormously. dailynous.com/2024/04/19/dan…
We have a new preprint out - your language model is not a reward, it’s a Q function! 1. The likelihood of the preferred answer must go down - it’s a policy divergence 2. MCTS guided decoding on language is equivalent to likelihood search on DPO 3. DPO learns credit assignment
I often get this question: Is LLM all you need for robot planning? I'd go: "obviously not, because you need to consider physical constraints, dynamics, ... ", which then turn into a non-stop rant. Now I'll just point them to this paper 😎
I often get this question: Is LLM all you need for robot planning? I'd go: "obviously not, because you need to consider physical constraints, dynamics, ... ", which then turn into a non-stop rant. Now I'll just point them to this paper 😎
The upcoming Llama-3-400B+ will mark the watershed moment that the community gains open-weight access to a GPT-4-class model. It will change the calculus for many research efforts and grassroot startups. I pulled the numbers on Claude 3 Opus, GPT-4-2024-04-09, and Gemini.…
Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3…
Today we released a new version of OLMo 7B, which has significantly improved performance on MMLU. We also discuss a lot of how we got the improvements, big shoutout to the team! Check out that performance-efficiency tradeoff 🤩 this new model is on the Pareto frontier!
Today we released a new version of OLMo 7B, which has significantly improved performance on MMLU. We also discuss a lot of how we got the improvements, big shoutout to the team! Check out that performance-efficiency tradeoff 🤩 this new model is on the Pareto frontier!
On Monday @aroraakhilcs and I gave a workshop on adapting LLMs (prompting, constraining, tool use, tuning with QLoRA). While preparing we found tons of great resources online. We want to follow these footsteps and share our material as well: go.epfl.ch/llm-workshop
🚀 How can meta-learning, self-attention & JAX power the next generation of Evolutionary Optimizers 🦎? Excited to share my @DeepMind internship project and our #ICLR2023 paper ‘Discovering Evolution Strategies via Meta-Black-Box Optimization’ 🎉 📜: openreview.net/forum?id=mFDU0…
Talking to many junior faculty members and students in AI lately. Many seem to be somewhat lost with all the seemingly fast progresses made by the industry. My suggestion to them is: It is industry's job to find how to do better, but academia is to find out how to do it right.
NEW: my column this week is about the coming vibe shift, from Boomers vs Millennials to huge wealth inequality *between* Millennials. Current discourse centres on how the average Millennial is worse-off than the average Boomer was, but the richest millennials are loaded 💸🚀
I am also concerned about this too. I am already quite unsatisfied with the current quality of reviews, and I am hoping that at least we will not ask high school students to review main conference papers as well. Or are we already at the dip so that it can't get worse 🤷♂️
I am also concerned about this too. I am already quite unsatisfied with the current quality of reviews, and I am hoping that at least we will not ask high school students to review main conference papers as well. Or are we already at the dip so that it can't get worse 🤷♂️
Google DeepMind @GoogleDeepMind
943K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).yobibyte @y0b1byte
15K Followers 2K Following Kurin ViTaly, senior research scientist @IsomorphicLabs, ML PhD from @UniofOxford on RL, Multitask learning & GraphsEdward Grefenstette @egrefen
36K Followers 774 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.Eugene Vinitsky @EugeneVinitsky
13K Followers 2K Following Anti-cynic. Artificial narrow intelligence. Autonomous vehicles, multi-agent learning, and transportation. RS at Apple, Asst. Prof at @nyutandon. He/him.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pSander Dieleman @sedielem
50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).Shane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Nando de Freitas 🏳.. @NandoDF
97K Followers 658 Following I research intelligence to understand it and to harness it wisely. Path: Wits, Cambridge, Berkeley, UBC, Oxford, DarkBlueLabs, Google DeepMindPablo Samuel Castro @pcastr
10K Followers 814 Following Señor swesearcher @ Google DeepMind. Adjunct prof @ U de Montreal & Mila. Musician. From 🇪🇨 living in 🇨🇦.David Pfau @pfau
22K Followers 1K Following Knowledge manifests itself in radiant dreams that shimmer like the wild sun Views are my own pfau at sigmoid dot social on 🦣 https://t.co/xqtVHHVI17 on 🦋F. Güney @ftm_guney
7K Followers 1K Following research on computer vision, teaching, and movies. asst. prof. @KuisAICenter @kocuniversity tweets in TR, ENFelix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else'sPetar Veličković @PetarV_93
30K Followers 555 Following Staff Research Scientist @GoogleDeepMind | Affiliated Lecturer @Cambridge_Uni | Associate @clarehall_cam | GDL Scholar @ELLISforEurope. Monoids. 🇷🇸🇲🇪🇧🇦Natasha Jaques @natashajaques
25K Followers 1K Following Senior Research Scientist at @GoogleAI and Assistant Professor @uwcse. Social Reinforcement Learning in multi-agent and human-AI interactions. PhD from @MIT.Csaba Szepesvari @CsabaSzepesvari
8K Followers 704 Following "If there is not folly in the world, then the world itself is folly. You must understand that mistakes are not always regrets." - Paul Tobin, Bandette🤠Aaditya ; @Aaditya26082004
527 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈YolandoMainguy @MainguyYol78815
8 Followers 457 FollowingXuhui Zhang @XuhuiZhangXHZ
4 Followers 236 FollowingYoshinari Fujinuma @akkikiki
971 Followers 1K Following Applied Scientist@AWS AI Labs; CS PhD @CUBoulder; Tweets are my own; Substack: https://t.co/Mq5oR2vaGN Lived: 🇹🇭🇯🇵🇫🇷🇺🇸 Tweets: JA/ENArvid Frydenlund @ArvidFrydenlund
44 Followers 318 Following Ph.D. student in Machine Learning at University of Torontocoffee & AI @realcoffeeAI
44 Followers 600 FollowingClaudio Gallicchio @claudiogallicc1
822 Followers 920 Following Assistant Professor of ML at the University of Pisa (Italy). Deep Randomized Neural Networks, Reservoir Computing, Stable Architectures, Deep Learning 4 GraphsLaronda Schlecter @schlec_laro
80 Followers 5K FollowingChantel Kerkman @KerkmChant
51 Followers 5K FollowingArif Ahmad @arif_ahmad_py
260 Followers 7K Following All things AI, Computer Science and Circuits! Prev. @GoogleAIKilian Haefeli @khshind
232 Followers 341 Following Exploring crevasses of Deep Learning at ETH Zurich & UofT | Previously: @Aleph__Alpha, @Logitech, and exfounder at AiricaDaanish @danishabbir
623 Followers 5K Following elk again. before: startup founder, ml eng (e.g. @nvidia), ee + english (@stanford)abderrahim zine @abderrahimzine6
23 Followers 616 FollowingHarper-rose Thrun @ThrunRose79962
71 Followers 5K FollowingTracy Mesias @MesiasTra
40 Followers 5K FollowingPranav Kulkarni @ProfessorBrat
54 Followers 394 Following Exceptionally Stupid Tensor-trusting Individual with generalization issuesOğuzhan Ercan @vuhuoguzhan
40 Followers 453 FollowingWeyaxi @Weyaxi
2K Followers 2K Followingxonobo @Xonobo
9 Followers 450 FollowingHasan Yazar @hasanyazar_
71 Followers 369 Following @itu1773 | control 2/4 | machine & deep learningTomás Fernandes @tomasff02
90 Followers 753 Following computer scientist @ university of warwick. graph & multi-modal ML, distributed systems enjoyerChen Bo Calvin Zhang @calvincbzhang
82 Followers 337 Following Research in ML/RL @MIT | MSc Data Science @ETH Zurich | BSc CS + Mathematics @OfficialUoMManoel @manoelribeiro
3K Followers 1K Following CS PhD student @ EPFL — On the job market for 2023-2024! 🐘: @[email protected] Keywords: Computational Social Science, Platforms, Communities, ModerationLinh Thân @LinhThan1512
2 Followers 60 Followingserkar aydinci @serkaraydinci
40 Followers 116 FollowingComp Sci @compscigrad
8 Followers 170 FollowingAmineh Zadbood @amiinehz
186 Followers 1K Following Adjunct Faculty and Research Associate at @followstevens; machine learning, large language models; optimization; agent-based modelingCocoSun @Hunter2Sun
71 Followers 1K FollowingRon K Jeffries @ronkj.. @RonKJeffries
3K Followers 5K Following A curious guy. Becoming a better human? QUOTE: Tell me about despair, yours, and I will tell you mine. Meanwhile the world goes on. --Mary Oliver, Wild GeeseOsman Batur İnce @ospanbatyr
95 Followers 638 Following Grad student at @kuisaicenter into NLP, currently multimodal learning and compositionality. BSc from @CS_Bilkent.İskender Spotifyoğl.. @itsagoodtweet
8 Followers 879 Following ay layk di wörld, di west wörld 🏳️0🌈a002 @t70582
11 Followers 90 FollowingMirco Musolesi @mircomusolesi
3K Followers 930 Following Professor of Computer Science. Machine Intelligence Lab, Autonomous Systems Group, @UCLCS, @UCL. AI/ML/RL, computing, computers, and books.Armand Joulin @armandjoulin
4K Followers 344 Following principal researcher, @googledeepmind. ex director of emea at fair @metaai. mostly work on open projects: fasttext, dino, llama, gemma.Yann LeCun @ylecun
711K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxGoogle DeepMind @GoogleDeepMind
943K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Soumith Chintala @soumithchintala
186K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Sergey Levine @svlevine
79K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical Intelligenceyobibyte @y0b1byte
15K Followers 2K Following Kurin ViTaly, senior research scientist @IsomorphicLabs, ML PhD from @UniofOxford on RL, Multitask learning & GraphsJürgen Schmidhuber @SchmidhuberAI
107K Followers 0 Following Invented principles of meta-learning (1987), GANs (1990), Transformers (1991), very deep learning (1991), etc. Our AI is used many billions of times every day.Edward Grefenstette @egrefen
36K Followers 774 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.PyTorch @PyTorch
379K Followers 77 Following Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundationEugene Vinitsky @EugeneVinitsky
13K Followers 2K Following Anti-cynic. Artificial narrow intelligence. Autonomous vehicles, multi-agent learning, and transportation. RS at Apple, Asst. Prof at @nyutandon. He/him.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pSander Dieleman @sedielem
50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).Danijar Hafner @danijarh
14K Followers 868 Following Building AI that makes autonomous decisions using world models, artificial curiosity, and temporal abstraction @DeepMindShane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Nando de Freitas 🏳.. @NandoDF
97K Followers 658 Following I research intelligence to understand it and to harness it wisely. Path: Wits, Cambridge, Berkeley, UBC, Oxford, DarkBlueLabs, Google DeepMindkyutai @kyutai_labs
6K Followers 6 FollowingKevin Patrick Murphy @sirbayes
42K Followers 334 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Daniel Shiffman @shiffman
69K Followers 2K Following "This wacky flailing arm inflatable tube man." he/himTaelin @VictorTaelin
17K Followers 902 Following Founder of @HigherOrderComp Building the massively parallel future of computing Reaching AGI to cure all diseases and suffering is all that mattersBojan Tunguz @tunguz
187K Followers 8K Following Machine Learning ex Nvidia. Kaggle Quadruple Grandmaster. Data Scientist. Physicist. Catholic. Husband. Father. Stanford Alum. e/xgb. XGBoost.eth. AMDG.murat 🍥 @mayfer
15K Followers 5K Following programmer / designer • governance / generative art / neural nets / music / physics / mathMachine Learning Stre.. @MLStreetTalk
19K Followers 382 Following AI YouTube & Audio Podcast (MLST). Run by Dr. Tim Scarfe @ecsquendor and featuring co-host @DoctorDuggar https://t.co/bVe6XB85YDChen Bo Calvin Zhang @calvincbzhang
82 Followers 337 Following Research in ML/RL @MIT | MSc Data Science @ETH Zurich | BSc CS + Mathematics @OfficialUoMManoel @manoelribeiro
3K Followers 1K Following CS PhD student @ EPFL — On the job market for 2023-2024! 🐘: @[email protected] Keywords: Computational Social Science, Platforms, Communities, ModerationTim Ferriss @tferriss
2.0M Followers 3K Following Author of 5 #1 NYT/WSJ bestsellers, early-stage investor (https://t.co/cpVCd1q9Hk), Tim Ferriss Show podcast (1B+ downloads), founder of https://t.co/9bQjti0XgEOlivier Bachem @OlivierBachem
3K Followers 305 Following Senior Staff Research Scientist at @GoogleDeepMind where I lead the team that built the RLHF technology used in Bard, PaLM 2, Gemini, and other Google products.Science girl @gunsnrosesgirl3
2.2M Followers 6K Following science in context, art history and some puzzles to solveAlexander Mathis @TrackingPlumes
4K Followers 817 Following Assistant Professor in Comp Neuro and ML at @epfl_en @CampusBiotech | #deeplabcut co-developer | past: @LMU_Muenchen, @uni_tue and @HarvardLogan Kilpatrick @OfficialLoganK
92K Followers 2K Following Lead product for @Google AI Studio and working on the Gemini API, helping developers build with AI, my views!Omar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Horace He @cHHillee
23K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleOzan Erdem @ozanerdem
628 Followers 1K Following Principal Engineer @CerebrasSystems. PhD in AI, Satisfiability and Constraint Satisfaction Problems. Tweets don't necessarily represent my employer's opinions.Adam Karvonen @a_karvonen
1K Followers 296 Following Interested in ML and software. I prefer email to DM.Ryan Lowe @ryan_t_lowe
5K Followers 358 Following what is the place from which we are creating? ❤️✨🤠❤️Rafael Rafailov @rm_rafailov
3K Followers 637 Following Ph.D. Student at @StanfordAILab. I work on Foundation Models and Decision Making. Previously @GoogleDeepMind @UCBerkeleyNous Research @NousResearch
18K Followers 29 Following The AI Accelerator Company. https://t.co/vrD0aDJetoNancy Pelosi Stock Tr.. @PelosiTracker_
560K Followers 223 Following Highlighting Politicians' trades so we can invest alongside Goal: get them banned from trading Powered by @joinautopilot_David Hall @dlwh
2K Followers 1K Following Research Engineering Lead at @StanfordCRFM . Previously co-founder at Semantic Machines ⟶ MSFT. Lead developer of Levanter, Breeze. he/him @[email protected]Chris Lattner @clattner_llvm
79K Followers 182 Following Building beautiful things like Mojo🔥 and MAX @Modular, lifting the world of production AI/ML software into a new phase of innovation. We’re hiring! 🚀🧠Fernando Moreno-Pino,.. @fermorenp
259 Followers 806 Following Postdoc in Machine Learning for Quant Finance at @UniofOxford - @OxManInst. PhD in Probabilistic Machine Learning and Deep Learning from @uc3m.Amanda Askell @AmandaAskell
26K Followers 653 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.Dave W Plummer @davepl1968
46K Followers 59 Following Hi! I'm Dave Plummer. You might remember me from such Windows components as Task Manager, Windows Pinball, Calc, ZIPFolders, Product Activation, etc. Cheers!Dan Go @FitFounder
662K Followers 429 Following High Performance Coach To Entrepreneurs | Helping 1 million transform their lives by 2027. Tweets on health optimization. Sign up for a free strategy call 👇🏽David @DavidSHolz
54K Followers 5K Following founder @midjourney, prev founder leap motion, nasa, max planckSamet Oymak @SametOymac
840 Followers 224 Following EECS Prof @UMich, Research on ML+RL+LLM theory & algosMartin Josifoski @MartinJosifoski
334 Followers 199 Following PhD candidate at EPFL. Spent some time at Microsoft Research, ETH and MetaAI.Alex Hägele @haeggee
386 Followers 475 Following PhD Student in Machine Learning @ICepfl. MSc/BSc from @ETH_en. Previously: Student Researcher @Apple MLR. @[email protected]Aakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to changeKaiqing Zhang @KaiqingZhang
760 Followers 399 Following Assistant Professor @UofMaryland; Previously {@MIT, @SimonsInstitute, @ECEILLINOIS, @Tsinghua_Uni}; Control + Game Theory + Reinforcement LearningBrian Anderson @braindersnn
547 Followers 2K Following Neuromorphics @Intel | MLPerf, TPU perf, Chrome GPU @Google | Android GPU, ASIC design @Nvidia | Robots @MediaLab | HT patch clamp @Tecella | RunnerJohn Schulman @johnschulman2
39K Followers 609 Following Cofounder @openai, lead post-training for ChatGPT and the API. Interested in reinforcement learning, alignment, birds, jazz musicAlex Nichol @unixpickle
8K Followers 388 Following Code, AI, and 3D printing. Opinions are my own, not my computer's...for now. Husband of @thesamnichol. Co-creator of DALL-E 2. Researcher @openai.Sangwoo Mo @sangwoomo
547 Followers 699 Following Postdoc @UMich. Past: PhD @kaist_ai, Intern @AIatMeta, @NVIDIAAI. Work on foundation models for vision, language, and robotics.Yingtao Tian @alanyttian
3K Followers 5K Following ↑ profile picture is dreamed by Anime GAN / cooking computational creativity and other ML sauce at google tokyo / before: stony brook u ← fudan uGábor Melis (@melisg.. @GaborMelis
1K Followers 202 Following Research Scientist at Google Deepmind #lisp #machinelearningJakob Uszkoreit @kyosu
4K Followers 275 FollowingThis is what out-of-distribution generalization looks like!
This man is from Mongolia. He can't speak English in a conversational sense, but he can sing it
@AnsongNi @hugo_larochelle @GoogleDeepMind Did you compare your method to something like RestEM or STaR (or any other self-training method) ? sorry if this is already in the paper
Introducing “FlowMap”, the first self-supervised, differentiable structure-from-motion method that is competitive with conventional SfM like Colmap! cameronosmith.github.io/flowmap/ IMO this solves a major missing piece for internet-scale training of 3D Deep Learning methods. 1/n
Introducing Snowflake Arctic. An efficiently intelligent and truly open LLM built by Snowflake.
I'm as excited as you are about your {lab, company, school}'s research, but perhaps rather than just hype, you can display a bit of scientific humility and tell me also what the challenges, gaps are.
Now out in the International Journal of Computer Vision! link.springer.com/article/10.100…
🐻 WildCLIP: Scene and animal attribute retrieval from camera trap data with domain-adapted vision-language models biorxiv.org/content/10.110… from @amathislab 🎊
I think the hardest thing for me the last few years has been seeing so many talented scientists who obviously belong in the academy turn into tech company middle managers or startup founders.
Academia is deeply broken, so I don't blame them. But I just know in a better world they could be sharing their hard-won knowledge with the next generation instead of forgetting it all to go chase the money.
Google has a Kafkaesque payment system. Somehow, somewhere my Google Play country was set to the UK. I can no longer change it without buying an Android phone (which I have never owned). Many emails later @Google says it can't be changed unless I buy one of their phones.
This was an awesome project - we teach models to follow constitutional principles with self-supervision (no labels). We also show that a weak model can generate principles for a stronger one, which self-aligns (SUPERALIGNENT!) and can beat the instruction-tuned (RLHF-ed) model!
Constitutional AI showed LMs can learn to follow constitutions by labeling their own outputs. But why can't we just tell a base model the principles of desired behavior and rely on it to act appropriately? Introducing SAMI: Self-Supervised Alignment with Mutual Information!
You don't realize how much you use huggingface.co until it goes down 😅
Super excited to share that I successfully defended my PhD thesis "Understanding Generalization and Robustness in Modern Deep Learning" today 👨🎓 A huge thanks to the thesis examiners @SebastienBubeck, @zicokolter, and @KrzakalaF, jury president Rachid Guerraoui, and, of course,…
I missed announcement when Meta retired the old hydraulic Zuck and introduced the more human-like electric Zuck.
🤖 Inspiring the Next Generation of Roboticists! 🎓 Our lab had an incredible opportunity to demo our robot learning systems to local K-12 students for the National Robotics Week program @GTrobotics . A big shout-out to @saxenavaibhav11 @simar_kareer @pranay_mathur17 for hosting…
Me: remove that sentence, it doesn't make any sense. Student: The sentence...that you wrote? 🤦
Is Ideogram using SD? No. We have @hojonathanho who came up with denoising diffusion and @wchan212 and @Chitwan_Saharia who led text to image and text to video at Google. We built everything from scratch, and we have a track record in foundational AI research that powers this…
They'd be more likely to finish their work at a reasonable hour if even one of them had a monitor.
@aaron_lou @chenlin_meng @StefanoErmon This appears related to some of the developments in this paper with @JoeJBenton @ValentinDeBort1 @GeorgeDeligian9 arxiv.org/abs/2211.03595