Behrooz Ghorbani @_ghorbani
Researcher at @OpenAI, studying large language models. Formerly @GoogleBrain and @stanford_ee. Opinions expressed are solely my own. web.stanford.edu/~ghorbani/ San Francisco, CA Joined December 2017-
Tweets122
-
Followers297
-
Following455
-
Likes648
My group at Berkeley Stats and EECS has a postdoc opening in the theoretical (e.g., scaling laws, watermark) and empirical aspects (e.g., efficiency, safety, alignment) of LLMs or diffusion models. Send me an email with your CV if interested!
work on what is most important
Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. openai.com/sora Prompt: “Beautiful, snowy…
🌐Exciting News in Machine Translation! 🚀MetricX-23, our SOTA evaluation metric, is now OPEN-SOURCE in PyTorch/Transformers! 🎉There are three model sizes available, all trained on 1m+ human judgments of MT quality! 🔗Code github.com/google-researc… 🔗Paper www2.statmt.org/wmt23/pdf/2023…
I remember how quick the media was to clown mistral in june for raising pre-product, calling it peak AI hype I hope they’re eating their words now
A Meta-Evaluation paper recognized with a best paper award! Accurate evaluation is crucial for progress, and this holds true for metric research as well. Congratulations to @_danieldeutsch, well deserved!. I highly recommend reading his paper, even if you're not specifically…
A Meta-Evaluation paper recognized with a best paper award! Accurate evaluation is crucial for progress, and this holds true for metric research as well. Congratulations to @_danieldeutsch, well deserved!. I highly recommend reading his paper, even if you're not specifically…
Congratulations to @_danieldeutsch, George Foster, and @markuseful, co-authors of the paper “Ties Matter: Meta-Evaluating Modern Metrics with Pairwise Accuracy and Tie Calibration”, for winning the #EMNLP2023 Outstanding Paper Award in Machine Translation!
# On the "hallucination problem" I always struggle a bit with I'm asked about the "hallucination problem" in LLMs. Because, in some sense, hallucination is all LLMs do. They are dream machines. We direct their dreams with prompts. The prompts start the dream, and based on the…
Hi @emilymbender, I'm one of the lead authors of MMMU. I can certify that 1) Google didn't fund this work, and 2) Google didn't have early access. They really like the benchmark after our release and worked very hard to get the results. It doesn't take that long to eval on a…
Hi @emilymbender, I'm one of the lead authors of MMMU. I can certify that 1) Google didn't fund this work, and 2) Google didn't have early access. They really like the benchmark after our release and worked very hard to get the results. It doesn't take that long to eval on a…
Excited to announce 🔥🤨DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models 🤨 🔥 Appearing at #NeurIPS2023 as Datasets and Benchmarks **Oral** Paper: openreview.net/forum?id=kaHpo… Led by @uiuc_aisecure @wbx_life @ChulinXie @danielz2333 1/N
I am excited and honored to have just been named as an independent director of @OpenAI. I look forward to working with board colleagues and the OpenAI team to advance OpenAI’s extraordinarily important mission. First steps, as outlined by Bret and Sam in their messages, include…
I am excited and honored to have just been named as an independent director of @OpenAI. I look forward to working with board colleagues and the OpenAI team to advance OpenAI’s extraordinarily important mission. First steps, as outlined by Bret and Sam in their messages, include…
Happy birthday ChatGPT! One year ago we released what we intended to be a “low-key research preview” expecting that the real moment of excitement would be GPT-4 launch. Became all-hands-on-deck scaling effort—GPU efficiency, db, even auth. Thank you everyone for your passion!
Slides on Optimization of Random Cost Functions (29th Solvay Conference of Physics). Tried to summarize what did we learn in the last 40 years about the fundamental complexity of random optimization, using Fu-Anderson 1986 as a starting point: web.stanford.edu/~montanar/OTHE……
Spent today (and will spend rest of the week) doing 1:1s with the team. Have never seen a team more energized and focused. Really a privilege to be working with such amazing people.
We're building several efforts at OpenAI: Preparedness, reliable AI deployment research, and AI security research. Up for chatting with us about these at NeurIPS? Fill out this form (by Dec 1): forms.gle/ogN3rqu3t7ywg7…
🚀Introducing new (synthetic) RLHF Dataset Nectar and new open model Starling-LM-7B-alpha🚀 🌟 Model & Dataset Highlights: 📊 Scores 8.09 in MT Bench: Surpassing all existing models except OpenAI's GPT-4 and GPT-4 Turbo. 📚 183K Chat Prompts + 7 responses in Nectar: With 3.8M…
not contributing to your 401k “because of AGI timelines” is the tech bro equivalent of saying you’re not having kids because of climate change. you’re just making excuses for what you were going to do anyways
Zhimei Ren @RenZhimei
815 Followers 269 Following Assistant professor @Wharton stats. Former postdoc @UChicago & PhD from @Stanford.Jason Lee @jasondeanlee
10K Followers 3K Following Associate Professor at Princeton and Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learningSara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Ofir Press @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Weijie Su @weijie444
4K Followers 442 Following Associate Professor @Wharton & CS Penn. coDir @Penn Research #MachineLearning. PhD @Stanford. #Privacy #DeepLearning #Statistics #GameTheory #Optimization._Edmund @Edmund874303
3 Followers 656 FollowingYushunZhang @ericzhang0410
45 Followers 205 Following Phd student at The Chinese University of Hong Kong, shenzhen, China, This twitter account is to record the good papers & talks that I learned.McGeale @GealeMc453
0 Followers 159 Followingvireshkumbar_718 @718Vireshk81993
3 Followers 217 Following Nice to meet you. My hobbies are reading, food and sports. I like cats😘 I like to meet new friends while traveling🎉🎉🎉Tismeto @tismeto22912
1 Followers 191 FollowingReampeski @reampeski28631
4 Followers 504 Followingus_Piper_ @us_piper71829
6 Followers 1K FollowingCairo_6 @Cairo61075756
1 Followers 731 Followingus_Lyla_ @usLyla179533
4 Followers 999 FollowingZoey_Rodrigu @RodriguZoe28827
4 Followers 1K FollowingEva Louise Marie Gabr.. @e681554349
7 Followers 3K FollowingMartin @MartinTechAcc
26 Followers 139 Following Backend Software engineer | Typescript And Graphql lover | Learning AI/Python and some frontend with ReactMani @msulemanas57411
256 Followers 6K Following Current :-Senior Analytic Consultant @wellsfargo. Previously :-Founder of WIFC (Without Internet free Call). I go by Muhammad._l_uxury @luxury1258272
3 Followers 1K FollowingFreddie Bickford Smit.. @fbickfordsmith
345 Followers 713 Following ML PhD at Oxford with @tom_rainforth @adamefoster. Previously at UCL with @profdata @bdroads @egrefen.Nina Beguš @ninabegus
3K Followers 2K Following Researcher @UCBerkeley Founder @Interpret_AI #ArtificialHumanitiesKhaled Saab @KhaledSaab11
396 Followers 274 Following Working on AI for healthcare @GoogleAI, prev: PhD @StanfordAILabTom Kocmi @KocmiTom
604 Followers 178 Following Senior researcher at Microsoft Translator (he/him) | AI Evaluation (LLMs, MT, Multilingulity)Song Mei @Song__Mei
1K Followers 547 Following Assistant Professor at UC Berkeley, Department of Statistics and EECS. Researcher working on foundations of generative AI.purpleSlate @purpleslate_
102 Followers 324 Following Building meaningful conversations💬 Improved engagement levels🤝 Happier Customers🤗Machine Learning FLX @machinelearnflx
164K Followers 30K Following Everything about #MachineLearning #NLP #DeepLearning #AI #GenAI #Bigdata #Analytics #DataMining, #DataScience #Courses #Learning #ArtificialintelligenceCustomGPT.ai @CustomGPT
995 Followers 1K Following https://t.co/HdkElz5ohR lets you easily build your own agent with your own data!Julia Kreutzer @KreutzerJulia
3K Followers 1K Following 🤖💬NLP researcher @CohereForAI. Mom of 3👶🏻👶🏻👼, cellist🎶, baker🥯, outdoor enthusiast 🏞️. Views my own.Spotify Medizin @DataBook91
0 Followers 440 FollowingJuliette Gibbons @julieGB0707
43 Followers 545 Following The purpose of art is washing the dust of daily life off our souls. We can share thought and be positive. ❤️ Feel free for DM ❤️ ♀️👧Berivan Isik @BerivanISIK
3K Followers 2K Following PhD @StanfordAILab. Scalable & trustworthy ML, transfer learning, language models, federated learning, privacy | prev: @Google @AWSCloud @VectorInstTuringPost @TheTuringPost
62K Followers 16K Following Newsletter exploring AI & ML - Weekly trends - LLM/FM insights - Unicorn spotlights - Global dynamics - History Led by @kseniase_ Elevate your AI game 👇🏼Roy @TheBest_Roy
6 Followers 178 FollowingYi Lin Sung @yilin_sung
532 Followers 730 Following CS PhD student @unccs @uncnlp | Previously intern @MetaAI @MSFTResearch | Multi-modal DL, Efficient fine-tuning.Aaditya ; @Aaditya26082004
521 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈Teemu Summanen @teemusum
195 Followers 3K Following Interested in AI, security, healthcare, and Flutter & Dart.👨🏼💻At X for reading diverse views by professionals and hobbyists.🔬📚🫶Vineet Kukreti @googlervineet
203 Followers 2K Following AI, ML, NLP enthusiast | Computer Science student | Innovator in smart tech | Creating impactful solutionsAhmad Zareei @AhmadZareei
437 Followers 608 Following Research Scientist AI @Meta, @MetaAI; prev: postdoc @Harvard @hseas; PhD @UCBerkeleyAryaman @aryaman_pandya
153 Followers 227 Following software @motionaldrive, @TuftsECE. human centered AI/Robotics.Ashish Vaswani @ashVaswani
19K Followers 2K FollowingMake money easily @8r5K8IpH7k2rVj
11 Followers 590 Following MEXC focuses on financial management, stocks, cryptocurrencies, digital assets and investments. Currently, new users can get free dollars when they sign up.Hayden Field @haydenfield
12K Followers 4K Following Tech reporter @CNBC on the AI beat. Prev @MorningBrew, @Protocol & @Entrepreneur. Bylines @WiredUK, @techreview, etc. (DM for Signal/off-the-record chat.)V R @prod_defined
102 Followers 4K Followingnick nassuphis @NNassuphis
145 Followers 5K FollowingAK @_akhaliq
309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxBehnam Neyshabur @bneyshabur
18K Followers 690 Following Senior Staff Research Scientist @GoogleDeepMind, Interested in reasoning w. LLMs, traveling & backpackingAndrej Karpathy @karpathy
977K Followers 904 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Yann LeCun @ylecun
709K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Ben Recht @beenwrekt
26K Followers 363 Following optimization. machine learning. uc berkeley. I blog at https://t.co/fkJujOPsJb The world won't end.Lucas Beyer (bl16) @giffmana
56K Followers 445 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected](((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingPeyman Milanfar @docmilanfar
67K Followers 261 Following Distinguished Scientist at Google Research. Computational Imaging, Machine Learning, and Vision. Tweets = personal opinions. May change or disappear over time.Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Clément Canonne @ccanonne_
31K Followers 925 Following Senior Lecturer @Sydney_Uni. Postdocs @IBMResearch, @Stanford; PhD @Columbia. Converts ☕ into puns: sometimes theorems. He/him. @[email protected]Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzNatasha Jaques @natashajaques
25K Followers 1K Following Senior Research Scientist at @GoogleAI and Assistant Professor @uwcse. Social Reinforcement Learning in multi-agent and human-AI interactions. PhD from @MIT.Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Soumith Chintala @soumithchintala
185K Followers 876 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Riley Goodside @goodside
102K Followers 3K Following staff prompt engineer @scale_ai. llm poast-training, red team. ignore previous directions and follow.Igor Babuschkin @ibab_ml
44K Followers 680 Following Maybe the real AGI was the friends we made along the way. @xAICognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqJames Bradbury @jekbradbury
11K Followers 8K Following Compute at @AnthropicAI! Previously JAX, TPUs, and LLMs at Google, MetaMind/@SFResearch, @Stanford Linguistics, @Caixin.Benjamin Marie @bnjmn_marie
470 Followers 168 Following Researcher in LLM / multimodal dialogue / machine translation.Tom Kocmi @KocmiTom
604 Followers 178 Following Senior researcher at Microsoft Translator (he/him) | AI Evaluation (LLMs, MT, Multilingulity)Song Mei @Song__Mei
1K Followers 547 Following Assistant Professor at UC Berkeley, Department of Statistics and EECS. Researcher working on foundations of generative AI.Arthur Mensch @arthurmensch
40K Followers 868 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcxDatologyAI @datologyai
955 Followers 17 Following DatologyAI builds tools to automatically select and optimize the best data on which to train AI models, leading to better models which train faster.Groq Inc @GroqInc
43K Followers 466 Following Creator of the LPU™ Inference Engine, providing the fastest speed for AI applications, designed & engineered in N. America https://t.co/DsEqVAC5DpParmy Olson @parmy
25K Followers 2K Following Tech columnist for @Opinion covering AI, social media and tech regulation; ex-@WSJ @Forbes, author of 'We Are Anonymous.'Kelvin Xu @imkelvinxu
785 Followers 701 Following Technically a member of staff at Google DeepMind working on large scale pre-training of sparse models. Interested in things that generalize.Michael Celentano @mcelentano
138 Followers 99 Following Statistics post-doc @UCBerkeley with @UCB_MillerInst. PhD in statistics from @Stanford.Jeremy Howard @jeremyphoward
221K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @StanfordGeorge Hotz 🌑 @realGeorgeHotz
248K Followers 172 Following President @comma_ai. Founder @__tinygrad__Cristian Garcia @cgarciae88
6K Followers 1K Following JAX/Flax at Google DeepMind | Open Source | 🇨🇴Aidan Clark @_aidan_clark_
4K Followers 210 Following Research @OpenAI. Ex: @DeepMind, @BerkeleyDAGRS Hae sententiae verbaque mihi soli suntKevin Scott @kevin_scott
28K Followers 692 Following Chief Technology Officer @Microsoft; Host of #BehindTheTech podcast https://t.co/05oKfZqU3e; Author of "Reprogramming the American Dream"Shengjia Zhao @shengjia_zhao
5K Followers 225 Following Research Scientist @ OpenAI. Formerly PhD @ Stanford. I like training models. All opinions my own.Ilya Kostrikov @ikostrikov
8K Followers 615 Following Researcher @OpenAI, previously @Postdoc at UC Berkeley @berkeley_ai, PhD in CS @CILVRatNYUVinod Khosla @vkhosla
632K Followers 573 Following entrepreneurship zealot, grounded technology possibilist, believer in the power of ideas, passionate about sustainability & impactJeremy Cohen @deepcohen
4K Followers 867 Following PhD student in machine learning at Carnegie Mellon. The goal of my research is to turn deep learning into a real engineering discipline.Jacob Austin @jacobaustin132
3K Followers 795 Following @Google @DeepMind researcher. AI for math and science. Coding. Gemini. I also play piano. NYC. Opinions my ownVoyage AI @Voyage_AI_
2K Followers 164 Following Building embedding/vectorization models, customized for your domain and company, for better retrieval quality https://t.co/MEAhTpBQqdSaurabh Garg @saurabh_garg67
870 Followers 576 Following Robustifying LLMs and VLMs | PhD student @mldcmu | prev/ CS @iitbombay (undergrad); Collab @GoogleAI @awscloud @appleAakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to changeHadi Salman @hadisalmanX
5K Followers 317 Following Research Scientist @OpenAI. Previously: PhD @MIT @MSFTResearch @UberATG @SCSatCMU @AUB_LebanonOSINTtechnical @Osinttechnical
930K Followers 799 Following OSINT guy, PAI enjoyer, journalist @hntrbrkmedia, my views/freezing cold takes are my own. Standard spiel about not endorsing retweets, likes, and comments.Samuel L Smith @SamuelMLSmith
2K Followers 361 Following Research Scientist at DeepMind. Optimization and Initialization. Formerly Google Brain. Ex-Physicist.EMNLP 2024 @emnlpmeeting
12K Followers 41 Following EMNLP 2024 - The 2024 Conference on Empirical Methods in Natural Language Processing, November 12 –16, 2024 Hashtag: #EMNLP2024Qi Liu @leuchine
381 Followers 402 Following Cofounder @RekaAILabs, Assistant Professor @HKUniversity Past: @DeepMind, FAIR (@MetaAI), @MSFTResearch, PhD @UniofOxfordAGI House @agihouse_org
13K Followers 412 Following Accelerating humanity's transition to AGI & honoring the greatest AI founders and researchers of our time @ https://t.co/1lJUc58gZJReka @RekaAILabs
11K Followers 13 Following An AI research and product company 🫠. We are a team of scientists and engineers building state-of-the-art multimodal language models 😻Hongyu Ren @ren_hongyu
3K Followers 595 Following Research Scientist @openai. CS PhD @stanford. Previously @apple, @googleai and @nvidiaai. I train language models.Leandro von Werra @lvwerra
6K Followers 310 Following Machine learning @huggingface: co-lead of @bigcodeproject and maintainer of TRL.Tucker Carlson @TuckerCarlson
12.8M Followers 1 FollowingKarol Hausman @hausman_k
22K Followers 141 Following @Physical_int ex: researcher @GoogleAI/@DeepMind, adj. Prof. @Stanford. Into robots, AI, NBA, philosophy, soccer and almond croissants. 🇵🇱🇺🇸Mistral AI @MistralAI
90K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCPLlama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Consider being a labeler for an LLM. The prompt is “give me a random number between 1 and 10”. What SFT & RM labels do you contribute? What does this do the network when trained on? In subtle way this problem is present in every prompt that does not have a single unique answer.
New paper alert! Designing reliable human evaluation is both crucial and difficult. Human raters can exhibit different behaviors when rating NLG outputs. These differences are not generally due to a rater performing the task incorrectly, but rather due to differences in…
My group at Berkeley Stats and EECS has a postdoc opening in the theoretical (e.g., scaling laws, watermark) and empirical aspects (e.g., efficiency, safety, alignment) of LLMs or diffusion models. Send me an email with your CV if interested!
I am partial to the original version, but then again, if this is what it takes…
I took a famous paper and asked Claude to rewrite its introduction in the style of Malcolm Gladwell, while preserving the mathematical content
The UK has phenomenal AI talent and a long established culture of responsible AI development. Today I’m proud to be opening a new office: Microsoft AI London. If you’d like to join us, get in touch. We’re hiring! blogs.microsoft.com/blog/2024/04/0…
a reference implementation, no matter how hacky, often provides more clarity than any spec
Our early findings from an initial evaluation of Voice Engine, a model that generates speech closely resembling the source speaker's voice from text input and a 15-second audio sample. openai.com/blog/navigatin…
that feeling when you get to finally get to pay down your tech debt
Exciting News! Introducing our new Quality-Aware Machine Translation model! 🚀 100x faster MBR decoding 🔝 Improved translation quality 🧠 Self-evaluation and guidance for top-notch translations Take a look: arxiv.org/abs/2310.06707
Fantastic choice
Welcome to Microsoft, @mustafasuleyman. Thrilled to have you lead Microsoft AI as we build consumer AI, like Copilot, that is loved by and benefits people around the world.
I’m excited to announce that today I’m joining @Microsoft as CEO of Microsoft AI. I’ll be leading all consumer AI products and research, including Copilot, Bing and Edge. My friend and longtime collaborator Karén Simonyan will be Chief Scientist, and several of our amazing…
Will be joining Microsoft AI to build the next generation of consumer AI!!! 🚀🚀
Welcome to Microsoft, @mustafasuleyman. Thrilled to have you lead Microsoft AI as we build consumer AI, like Copilot, that is loved by and benefits people around the world.
I’m excited to announce that today I’m joining @Microsoft as CEO of Microsoft AI. I’ll be leading all consumer AI products and research, including Copilot, Bing and Edge. My friend and longtime collaborator Karén Simonyan will be Chief Scientist, and several of our amazing…
* Language is low bandwidth: less than 12 bytes/second. A person can read 270 words/minutes, or 4.5 words/second, which is 12 bytes/s (assuming 2 bytes per token and 0.75 words per token). A modern LLM is typically trained with 1x10^13 two-byte tokens, which is 2x10^13 bytes.…
This is an essential point people seem to misrepresent.
My final PhD chapter on improving seizure detection with @HazyResearch and @rubinqilab was just published @npjDigitalMed. TL;DR We found that scaling two dimensions of model supervision: (1) coverage of training data and (2) granularity of class labels– has a large impact on…
We are lucky to have you Mira and we are with you 💙
Governance of an institution is critical for oversight, stability, and continuity. I am happy that the independent review has concluded and we can all move forward united. It has been disheartening to witness the previous board’s efforts to scapegoat me with anonymous and…