Ruiqi Gao @RuiqiGao
Research scientist @Google DeepMind. Generative modeling, representation learning. San Francisco Joined June 2019-
Tweets179
-
Followers5K
-
Following512
-
Likes652
We are looking for reviewers for the 2nd ICML SPIGM workshop (spigmworkshop2024.github.io). Welcome to register here! 👇
We are looking for reviewers for the 2nd ICML SPIGM workshop (spigmworkshop2024.github.io). Welcome to register here! 👇
We are glad to announce that the "Structured Probabilistic Inference & Generative Modeling" workshop will be held again on @icmlconf 2024, Vienna! Check the current schedule and updates on our website: spigmworkshop2024.github.io
It's always such an enjoyable experience on reading Sander's posts. 😃
It's always such an enjoyable experience on reading Sander's posts. 😃
It's still quite challenging to maintain the exact content for few step distillation of diffusion models. Very neat work to make this happen, and more (advanced deterministic sampling techniques etc.). Congrats to the team! 🥳
It's still quite challenging to maintain the exact content for few step distillation of diffusion models. Very neat work to make this happen, and more (advanced deterministic sampling techniques etc.). Congrats to the team! 🥳
Fast sampling with 'Multistep Consistency Models': We get 1.6 FID on Imagenet64 in 4 steps and scale text-to-image models, generating 256x256 images with 16 steps. Guess which row is distilled? With @emiel_hoogeboom @TimSalimans Arxiv: arxiv.org/abs/2403.06807
It has been a while but if you haven't checked it out, could be a fun reading if you're excited about flow matching by SD3 and want to understand more about the connection between that and diffusion models 😃
It has been a while but if you haven't checked it out, could be a fun reading if you're excited about flow matching by SD3 and want to understand more about the connection between that and diffusion models 😃
text-to-3d scenes that are automatically decomposed into the objects they contain, using only an image diffusion model & no other supervision: dave.ml/layoutlearning work w/ @poolio @BenMildenhall Alyosha Efros and @holynski_
Exciting!
Join us at Hall B1 in #NeurIPS2023 for an awesome diffusion panel with @robrombach @sedielem @ArashVahdat @RuiqiGao
If you want to chat about flow matching, stochastic interpolants, diffusion models and their links with OT (Schrödinger Bridge 🌉 hehe) catch me at poster #619 (Thurs 14 10:45-12:45) where I will be presenting "Diffusion Schrödinger Bridge Matching" (arxiv.org/abs/2303.16852).
Want more control over images generated by your diffusion model? Check out self-guidance at poster #605 this morning @ #NeurIPS2023 from Dave Epstein et al.! With no labels or fine-tuning, you can move + resize objects, 0-shot DreamBooth, and more: dave.ml/selfguidance/
Peanut, the diffusion cat was featured in a nice talk by @RuiqiGao at #NeurIPS2023
Diffusion circle at @NeurIPSConf: let's meet at 2:30pm on Thursday (tomorrow!) outside Hall E (Gate 10B) and then find a place to sit and have a chat. We'll do it old school and just sit on the floor somewhere. I'll share location updates live! Tell your friends! #NeurIPS2023 📢
A great turnout at the LDM tutorial, and a hard act to follow. If you are hungry for more; please come to our workshop on diffusion models this Friday in Hall B1: diffusionworkshop.github.io Submit questions to our fantastic panel of experts here docs.google.com/forms/d/e/1FAI…
A great turnout at the LDM tutorial, and a hard act to follow. If you are hungry for more; please come to our workshop on diffusion models this Friday in Hall B1: diffusionworkshop.github.io Submit questions to our fantastic panel of experts here docs.google.com/forms/d/e/1FAI… https://t.co/mgPblKphCG
Excited to announce our new work on using synthetic data for improving mathematical problem solving and code generation in LLMs! arxiv: arxiv.org/abs/2312.06585 A small amount of fine-tuning can lead to large gains (>6% on Hendrycks MATH with Palm-2)
Nuvo: Neural UV Mapping! It's super difficult to UV map/texture atlas geometry produced by 3D reconstruction and generation pipelines. Nuvo works on all kinds of "unruly" 3D representations (NeRF, DreamFusion, etc.) and enables easy appearance editing! pratulsrinivasan.github.io/nuvo 1/3
Looking forward to seeing you there and discuss diffusion models!
If you're at @NeurIPSConf, come check this out our demo at the @GoogleDeepMind booth on Wednesday at noon, we've got some cool stuff to share! 🎶 #NeurIPS2023
Very nice to see so much work leveraging learned priors for 3d reconstruction and generation! Tutorial: Latent Diffusion Models: Is the Generative Al Revolution Happening in Latent Space? #NeurIPS2023
The people want video LDMs (latent diffusion models). // @NeurIPSConf
Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Peyman Milanfar @docmilanfar
67K Followers 261 Following Distinguished Scientist at Google Research. Computational Imaging, Machine Learning, and Vision. Tweets = personal opinions. May change or disappear over time.Kevin Patrick Murphy @sirbayes
42K Followers 333 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Sander Dieleman @sedielem
50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).Rosanne Liu @savvyRL
33K Followers 965 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRShane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pDurk Kingma @dpkingma
35K Followers 346 Following Deep learning, mostly generative models. Prev. Google Brain/DeepMind, founding team @OpenAI. Inventor of the VAE, Adam optimizer, among other things. ML PhD.Taco Cohen @TacoCohen
21K Followers 3K Following Deep learner at FAIR. Into codegen, equivariance, generative models. Spent time at Qualcomm, Scyfer (acquired), UvA, Deepmind, OpenAI.Yuandong Tian @tydsh
16K Followers 801 Following Research Scientist and Senior Manager in Meta AI (FAIR). AI-guided Optimization and Representation Learning. Novelist in spare time. PhD in @CMU_Robotics.Arash Vahdat (hiring) @ArashVahdat
8K Followers 805 Following Principal scientist and research manager @nvidia research, leading forward-looking fundamental generative AI research efforts, views are my own.Ben Poole @poolio
17K Followers 1K Following research scientist at google brain. phd in neural nonsense from stanford.Jiaming Song @baaadas
5K Followers 992 Following Chief Scientist @LumaLabsAI. Working on visual generative AI. Were @NVIDIA @Stanford @OpenAI @MetaAISam Power @sp_monte_carlo
17K Followers 7K Following Lecturer in Maths & Stats at Bristol. Interested in probabilistic + numerical computation, statistical modelling + inference. (he / him)Wenhu Chen @WenhuChen
11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.Sara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Ethan Caballero is bu.. @ethanCaballero
8K Followers 2K Following ML PhD student @Mila_Quebec ; previously @GoogleDeepMindAndreas Kirsch 🇮�.. @BlackHC
9K Followers 5K Following Past: 🧑🎓 DPhil @AIMS_oxford @ExeterCollegeOx @UniofOxford (4.5yr) 🧙♂️ RE @DeepMind (1yr) 📺 SWE @Google (3yrs) 🎓 @TU_Muenchen 👤 Fellow @nwspkXin Eric Wang @xwang_lk
7K Followers 1K Following Multimodal and Embodied AI Researcher / Professor @UCSC. Director of https://t.co/Y4swOBag21. AI for Humanity in the long run. he/himNaveen Kumar @naveen1815
80 Followers 725 FollowingWilliam Li @Williamiumli
15 Followers 139 Following Incoming Ph.D. student @UCSanDiego, M.S.E. in CS @JohnsHopkins, B.S. in CS at SCUTqq @qq_great
0 Followers 116 FollowingYuanbo Yang @YuanboYang60742
9 Followers 174 FollowingMaryam Honari @HonariMaryam
124 Followers 646 Following Poking Reinforcement Learning & Language models,@microsoft /ABK #MLAgent ex-@unity3d ex-RA @uvicyuyu @yuyu011005
3 Followers 158 Following🐐 Qi Wang(Levi) @LeviWQ1
21 Followers 163 Following Neuroscience PhD.Student @MPICybernetics @uni_tue #MRI #ML #UnsupervisedLearning #GenerativeModelsDiego Mesquita @wkly_infrmtive
299 Followers 663 Following Machine learning, deep learning, and Bayesian methods. Assistant prof @FGVBrazil. Previously @AaltoUniversity. (Bad) Jokes are my own.Aaditya ; @Aaditya26082004
525 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈Tobias Schröder @tobias_schrdr
16 Followers 50 Following Machine Learning Researcher Currently training Energy-Based Models @ Imperial College LondonJohn Wong @ChiHoWONG19
49 Followers 140 FollowingArif Ahmad @ArifAhm92263086
238 Followers 7K Following All things AI, Computer Science and Circuits! Prev. @GoogleAIShamim Ibne Shahid @Shamim2297
19 Followers 345 FollowingAI Papers Podcast @aipaperspodcast
871 Followers 2K Following A digestible daily update on the latest AI Research Papers. Brought to you by @pocketpodappAria @Aria3441
1 Followers 57 FollowingDiandian Yilin @DiandianYilin
27 Followers 365 Following PhD student in QuantMarketing @CUHKofficial |@UWMadison @pku1898 alum | fashion, sports, computer vision, causal inference 💻🐶🐴Radoslav Krivak @rdkbio
334 Followers 5K Following Structural Bioinformatics / AI for Drug Discovery / Geometric DL (@IOCBPrague, prev. PhD @cusbg)Yiheng Li @yhli123
2 Followers 401 FollowingMa Sheen @MaSheenUprising
7 Followers 970 Following “The programme will take me a little while to run.” Fook glanced impatiently at his watch.Mehreen Malik @MehreenNMalik
1K Followers 5K FollowingAdam Falls @AdamFalls172137
58 Followers 971 FollowingDhruvesh Patel @_dhruveshp
89 Followers 487 Following An @iitmadras graduate, Ph.D. student @umasscsMadeOfParticles @MadeOfParticles
7 Followers 248 Following View everything through the lens of its smallest components: every detail, every moment, composed of particles. #ParticlePerspectiveZhiyong Wang @Zhiyong16403503
380 Followers 2K Following Visiting Ph.D. student at Cornell University. Ph.D. candidate at CUHK. Working on bandits and reinforcement learning theory.Alex Trevithick @alextrevith
246 Followers 177 Following PhD Student @UCSanDiego. @NSF GRFP. Previously @NVIDIAAI @VcaiMpi. Currently SR @GoogleAI. 3D Vision, Machine Learning, Generative Models.Save mhenyu @tanaka_ndove
62 Followers 292 Followingshawn @xiaoxinz
6 Followers 19 FollowingXubin Ren @xubinrencs
583 Followers 1K Following Ph.D. student of @hkudatascience and @HKUniversity Data Intelligence Lab, fortunately advised by @huang_chao4969. Trying to be a good data science researcher.Mohammed Alaa Elkomy @m_a_komy
348 Followers 557 FollowingYoko @stuffyokodraws
5K Followers 1K Following Cartoonist, Engineer, PM, Partner @a16z investing in infra & AI Prev Product lead @HashiCorp, Founding Eng/PM @Transposit. Eng @AppDynamics. Opinions = own.≽(•ᴗ•)≼ @unrankedmage
0 Followers 2K FollowingWilson Lee @uisiong
5 Followers 21 Followingming wen @mingwen1284013
0 Followers 27 FollowingHaque Ishfaq @HaqueIshfaq
1K Followers 821 Following PhD student at @mcgillu/ @MILAMontreal. Bandits and Reinforcement Learning. BS, MS @Stanford 🇧🇩🇺🇸🇨🇦Zeqian Bao @BaoZeqian18347
5 Followers 300 FollowingGaurav Shrivastava @datacrunch3r
87 Followers 85 Following CS PhD at University of Maryland | Research Intern @Google | @Meta | @tiktok_us | @NUSingapore | @bitspilaniindiaDamaru M @damaru_m
213 Followers 3K FollowingRon @JontanJon
2 Followers 300 FollowingAK @_akhaliq
309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Yi Ma @YiMaTweets
71K Followers 123 Following Chair Professor in AI, Director of IDS, Head of CS, HKU; Professor of EECS, Berkeley; Author of Book: High-Dim Data Analysis, https://t.co/gwaqMJp8av.Peyman Milanfar @docmilanfar
67K Followers 261 Following Distinguished Scientist at Google Research. Computational Imaging, Machine Learning, and Vision. Tweets = personal opinions. May change or disappear over time.Kosta Derpanis @CSProfKGD
48K Followers 198 Following #CS Associate Prof @YorkUniversity, #ComputerVision Scientist Samsung #AI, @VectorInst Faculty Affiliate, TPAMI AE, #CVPR2024/#ECCV2024 Publicity Co-chairKevin Patrick Murphy @sirbayes
42K Followers 333 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Sander Dieleman @sedielem
50K Followers 2K Following Research Scientist at Google DeepMind. I tweet about deep learning (research + software), music, generative models (personal account).Lucas Beyer (bl16) @giffmana
56K Followers 442 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Matthias Niessner @MattNiessner
31K Followers 162 Following Professor for Visual Computing & Artificial Intelligence @TU_Muenchen Co-Founder @synthesiaIONeurIPS Conference @NeurIPSConf
111K Followers 35 Following New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to [email protected].Rosanne Liu @savvyRL
33K Followers 965 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRShane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pYi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistSergey Levine @svlevine
79K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical IntelligenceDan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Cognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqNeal Wu @WuNeal
15K Followers 390 Following Building @cognition_labs. Previously @tryramp, @GoogleBrain, @Harvard, competitive programming (featured in @Wired). Created https://t.co/pihw5AGvbV.Alex Trevithick @alextrevith
246 Followers 177 Following PhD Student @UCSanDiego. @NSF GRFP. Previously @NVIDIAAI @VcaiMpi. Currently SR @GoogleAI. 3D Vision, Machine Learning, Generative Models.シェイン・グウ @shanegJP
53K Followers 318 Following Gemini 1.5 @GoogleDeepMind 東京・SF。 元@GoogleAI Brain、元 @OpenAI。 英語: @shaneguML。全て個人意見です。Yoko @stuffyokodraws
5K Followers 1K Following Cartoonist, Engineer, PM, Partner @a16z investing in infra & AI Prev Product lead @HashiCorp, Founding Eng/PM @Transposit. Eng @AppDynamics. Opinions = own.Bilawal Sidhu @bilawalsidhu
46K Followers 4K Following Blending reality & imagination. Ex-@Google Maps & AR/VR. VFX & 3D creator with 1.4M+ subs & 360M+ views. Scout @a16z. Host @TEDTalks AI Show. ੴ.Beidi Chen @BeidiChen
6K Followers 348 Following Asst. Prof @CarnegieMellon, Visiting Researcher @Meta, Postdoc @Stanford, Ph.D. @RiceUniversity, Large-Scale ML, a fan of Dota2.Dinghuai Zhang 张鼎.. @zdhnarsil
2K Followers 1K Following PhD student at @Mila_Quebec. Ex intern at FAIR Labs @MetaAI. Previous math undergraduate at @PKU1898.Hugging Face @huggingface
343K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhateEric Schmidt @ericschmidt
2.2M Followers 224 Following Former Executive Chairman & CEO and tweets from Schmidt FoundationUiPath @UiPath
104K Followers 5K Following We envision a world with a 🤖 for every person. Dedicated to accelerating human achievement via an #AI-powered end-to-end #automation platform.Karl Tuyls @karl_tuyls
2K Followers 333 Following Ex: team lead @ DeepMind,@GoogleDeepMind - still working on AGI in a Multi-Agent world. CS professor (Liverpool/Leuven) and LFC fan.Yuke Zhu @yukez
15K Followers 464 Following Assistant Professor @UTCompSci | Co-Leading GEAR @NVIDIAAI | CS PhD @Stanford | Building generalist robot autonomy in the wild | Opinions are my ownFigure @Figure_robot
71K Followers 1 Following Figure is an AI Robotics company building the world's first commercially viable autonomous humanoid robot.Jonathan Heek @JonathanHeek
234 Followers 5 FollowingChin-Yi Cheng @chinyich
417 Followers 259 Following ai x design x interaction at google research | formerly autodesk ai labScott Reed @scott_e_reed
16K Followers 386 Following Research Scientist at NVIDIA working on generalist embodied agent researchAnyi Rao @raoanyi
1K Followers 814 Following Postdoc @Stanford & Ph.D. @ MMLab & Prev. @Meta @RealityLabs @UofT @VectorInst Works include #ControlNet #AnimateDiff @cveu_workshopJiaxuan You @youjiaxuan
1K Followers 138 Following Incoming Assistant Prof @UIUC CS Senior Research Scientist @NVIDIA PhD @Stanford CSLucas Theis @lucastheis
3K Followers 843 Following Research Scientist at @GoogleDeepMind. Previously @twitter, Magic Pony, @bethgelab. Compression, information theory, and probabilistic machine learning.Oliver Wang @oliver_wang2
907 Followers 124 FollowingMikey @MikeyShulman
532 Followers 8 Following Aspiring mediocre athlete. Co-founder of https://t.co/DEMHM3kA1T.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Mark Tenenholtz @marktenenholtz
114K Followers 544 Following Head of AI @PredeloHQ. XGBoost peddler, transformer purveyor.David Ruhe @djjruhe
1K Followers 394 Following Student Researcher @GoogleDeepMind ; PhD Candidate Machine Learning @AmlabUva, @ai4science_lab @UvA_Amsterdam. Previously @MSFTResearch, @FlatironInstKaran Goel @krandiash
3K Followers 881 Following Founder @cartesia_ai, Machine Learning PhD at @StanfordAILab, CMU / IIT-Delhi alum.Christian Szegedy @ChrSzegedy
32K Followers 2K Following #deeplearning, #ai research scientist. Opinions are mine.Jungkook @BTS_Junngkook
137K Followers 5 Following RolePlayer - BTS's Golden Maknae, Jungkook Jeon. Official @bts_twtHongyang R. Zhang @HongyangZhang
670 Followers 451 Following Asst Prof of #computerscience @Northeastern. PhD @Stanford. Postdoc @Penn.Volodymyr Kuleshov �.. @volokuleshov
8K Followers 997 Following AI Researcher. Prof @Cornell & @Cornell_Tech. Co-Founder @afreshai. PhD @Stanford.Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzFrançois Fleuret @francoisfleuret
31K Followers 455 Following Prof. @Unige_en, Adjunct Prof. @EPFL_en, Research Fellow @idiap_ch, co-founder @nc_shape. AI and machine learning since 1994. I like reality.Cartesia @cartesia_ai
1K Followers 8 Following Cartesia is training next-gen foundation models with subquadratic deep learning architectures. Sign up for early access at https://t.co/c5og0yF1PzYisong Yue @yisongyue
19K Followers 3K Following Machine Learning @Caltech (@YueLabCaltech). AI for invention at @AsariAILabs. Autonomous Driving at https://t.co/riZHAmvcAr. Senior Program Chair @iclr_conf.Nando de Freitas 🏳.. @NandoDF
97K Followers 658 Following I research intelligence to understand what we are, and to harness it wisely. I lead a wonderfully creative AI team at @GoogleDeepMind who inspire me everyday.AGI House @agihouse_org
13K Followers 412 Following Accelerating humanity's transition to AGI & honoring the greatest AI founders and researchers of our time @ https://t.co/1lJUc58gZJTSAIL Group @TsinghuaSAIL
26 Followers 46 Following The Statistical Artificial Intelligence & Learning (SAIL) Group at @Tsinghua_Uni • #Tianshou #ZhuSuan #DPMSolver #UniDiffuser #ProlificDreamerEric Xing @ericxing
5K Followers 18 Following Researcher, educator, entrepreneur, and administrator in computer science, artificial intelligence, and healthcare.📢📢Most diffusion (and flow matching) models use handcrafted schedules for their denoising steps during sampling. We show how to optimize them in a principled manner for high-quality generation! @amsabour added quickstart guide & collab to get you started quickly (links below)!
📢📢 Align Your Steps: Optimizing Sampling Schedules in Diffusion Models research.nvidia.com/labs/toronto-a… TL;DR: We introduce a method for obtaining improved sampling schedules for diffusion models, resulting in better samples at the same computation cost. (1/5)
HF's dedication to true open source is such a blessing 🙏
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3…
It took my brain a while to parse what's going on in this video. We are so obsessed with "human-level" robotics that we forget it is just an artificial ceiling. Why don't we make a new species superhuman from day one? Boston Dynamics has once again reinvented itself. Gradually,…
Another Mamba-Attention hybrid that looks very strong! These two layers are complementary: Mamba is great at compressing information, and a few attention layers are enough to retrieve from the context for in-context learning.
Zyphra is pleased to announce Zamba-7B: - 7B Mamba/Attention hybrid - Competitive with Mistral-7B and Gemma-7B on only 1T fully open training tokens - Outperforms Llama-2 7B and OLMo-7B - All checkpoints across training to be released (Apache 2.0) - Achieved by 7 people, on 128…
Excited to finally release our open course on deep generative models! This material has been taught at Stanford/Cornell/UCLA since 2019. It includes 🎥 20 hours of video lectures ✨ 17 sets of slides 📖 Lecture notes Youtube: youtube.com/playlist?list=… Site:…
Announcing NeurIPS Preschool Track This year, we invite preschoolers to submit machine learning research papers.
Just finished my ECCV reviews. 2 of the 6 papers in my pile were 100% unedited and incoherent LLM output. If you let ChatGPT write your paper, and one of your reviews is a "strong reject" and a diatribe about why what you did is immoral: hi, that was me, we are not friends.
Check out RealmDreamer (realmdreamer.github.io)--our new 3D scene generation method! No multiview data required :) One of my favorites is this: "Fantasy lighthouse in the Arctic, surrounded by a world of ice and snow, shining with a mystical light under the aurora borealis."
RealmDreamer Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion We introduce RealmDreamer, a technique for generation of general forward-facing 3D scenes from text descriptions. Our technique optimizes a 3D Gaussian Splatting representation to match
Got a story to tell? You’ve got 72 hours. Together with @elevenlabsio, we’re launching our very first 72-hour FilmFAST. Join us April 12-14 for three days of prompts, voices, sound effects, and eye candy. Sign up to compete via the link in comments. We’ll announce the theme…
Updated list of diffusion model tutorial sources, from my lecture on diffusion.
I wrote a tutorial on diffusion models for undergrad and grad students. I tried my best to give intuitive explanations for complicated equations. Your feedback is much appreciated Thanks to those who suggested various reading materials to me arxiv.org/abs/2403.18103
A Mamba Primer (w/ Yair Schiff youtube.com/watch?v=dVH1dR… ) Mamba is a nice jumping off point to summarize foundational ideas in sequence modeling, parallel algorithms, continuous-time representations, and GPU aware algorithms. We try to put these together in the context of LMs.
How @_sholtodouglas got scouted by Google DeepMind: “Every night from 10 PM till 2 AM, I would do my own research. @jekbradbury saw some of my questions online and was like, ‘I thought I knew all the people in the world who were asking these questions. Who on Earth are you?’”
A good example is @_sholtodouglas at @GoogleDeepMind. He's quiet on Twitter, doesn't have any flashy first-author publications, and has only been in the field for ~1.5 years, but people in AI know he was one of the most important people behind Gemini's success
This blog post is an amazing exposition and analysis of consistency models, and how they relate to diffusion models, leading to several suggested improvements to the training procedure that look very promising. Definitely worth a read!
🚀Our latest blog post unveils the power of Consistency Models and introduces Easy Consistency Tuning (ECT), a new way to fine-tune pretrained diffusion models to consistency models. SoTA fast generative models using 1/32 training cost! 🔽 Get ready to speed up your generative…
We are glad to announce that the "Structured Probabilistic Inference & Generative Modeling" workshop will be held again on @icmlconf 2024, Vienna! Check the current schedule and updates on our website: spigmworkshop2024.github.io
Ideogram on the @Nasdaq billboard! 🔥
Information theory ftw 🙌 The most important thing in AI is not GPUs but coordination & governance. We must decentralize AI before it is too late 👊 lfg
It was great talking with @EMostaque pioneer of open source AI & founder @StabilityAI and hear him talk about his vision for open / decentralized AI powering human flourishing and how it fits with our vision for coordination accelerationism at @eigenlayer.
Jensen's energy and passion are contagious! I don't know how he can always lift everyone around him up.