Bill Yuchen Lin 🤖 @billyuchenlin
Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_usc yuchenlin.xyz Seattle, WA, USA Joined April 2016-
Tweets875
-
Followers6K
-
Following2K
-
Likes7K
PhDone!!!! 👨🎓 08/2019-04/2024 What a journey 🥳🚞 I especially feel lucky to share this once-in-a-life-time moment with people I love ❤️ . And seeing my passion-driven research efforts being acknowledged by researchers I deeply admire 🌞!! Special thanks to my awesome committee…
@DrJimFan Multimodal LLM arena from @allen_ai —> WildVision-Arena: huggingface.co/spaces/WildVis…
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
Llama3 @AIatMeta looks awesome! github.com/meta-llama/lla… Can't wait to test them on our WildBench and URIAL-Bench! 🤩
After receiving community feedback, we added @GoogleDeepMind Gemini 1.5 Pro's results. 👇 Gemini 1.5 Pro's vision ability was significantly improved compared to 1.0 Pro and matched GPT-4's performance on our VisualWebBench! 🏆 Its action prediction (e.g., predicting what would…
After receiving community feedback, we added @GoogleDeepMind Gemini 1.5 Pro's results. 👇 Gemini 1.5 Pro's vision ability was significantly improved compared to 1.0 Pro and matched GPT-4's performance on our VisualWebBench! 🏆 Its action prediction (e.g., predicting what would… https://t.co/kQnZzztfEh
🚀Introducing VisualWebBench: A Comprehensive Benchmark for Multimodal Web Page Understanding and Grounding. visualwebbench.github.io 🤔What's this all about? Why this benchmark? > Back in Nov 2023, when we released MMMU (mmmu-benchmark.github.io), a comprehensive multimodal…
🚀Introducing VisualWebBench: A Comprehensive Benchmark for Multimodal Web Page Understanding and Grounding. visualwebbench.github.io 🤔What's this all about? Why this benchmark? > Back in Nov 2023, when we released MMMU (mmmu-benchmark.github.io), a comprehensive multimodal… https://t.co/QxtwUsJNcY
There are several LLM benchmarks for web agents, but agents are not the only web application of LLMs. What about more fine-grained web-page understanding? Our new benchmark VisualWebBench evaluates LLMs on abilities such as OCR, QA, identifying DOM elements, etc.
There are several LLM benchmarks for web agents, but agents are not the only web application of LLMs. What about more fine-grained web-page understanding? Our new benchmark VisualWebBench evaluates LLMs on abilities such as OCR, QA, identifying DOM elements, etc.
Updates of ⚔️𝕎𝕚𝕝𝕕𝕍𝕚𝕤𝕚𝕠𝕟-𝔸𝕣𝕖𝕟𝕒: We added more models such as @AnthropicAI's Claude3 and @RekaAILabs! Also, many new features for improving user experience and collecting better evaluation data. E.g., we support selecting models for sampling and inputting reasons…
@AnthropicAI Awesome finding and insights on jailbreaking LLMs! I think that a useful baseline defense method for mitigating many-shot jailbreaking could be our SafeDecoding (linked below). Have you tried that? Btw, if one wants to make it easier, replacing safety fine-tuning with…
NEW : 𝐀𝐠𝐞𝐧𝐭🪄𝐋𝐮𝐦𝐨𝐬 is amazing at complex tasks Lumos is Language Agents with Unified Data Formats, Modular Design, & OS LLMs Lumos unifies a suite of complex interactive tasks, achieves competitive performance with GPT-4/3.5, OS agents Task➡️Modular Approach➡️Results
DBRX-Base from @databricks also achieves the top position in the URIAL Bench, which tests Base LLMs on the MT-bench with URIAL prompts (3-shot instruction-following examples). Check out the full results here on @huggingface 🤗: huggingface.co/spaces/allenai… Related Xs: 1️⃣ [URIAL…
🆕 Check out the recent update of 𝕎𝕚𝕝𝕕𝔹𝕖𝕟𝕔𝕙! We have included a few more models including DBRX-Instruct @databricks and StarlingLM-beta (7B) @NexusflowX which are both super powerful! DBRX-Instruct is indeed the best open LLM; Starling-LM 7B outperforms a lot of even…
🆕 Check out the recent update of 𝕎𝕚𝕝𝕕𝔹𝕖𝕟𝕔𝕙! We have included a few more models including DBRX-Instruct @databricks and StarlingLM-beta (7B) @NexusflowX which are both super powerful! DBRX-Instruct is indeed the best open LLM; Starling-LM 7B outperforms a lot of even… https://t.co/imWcH5BGtq
🪄 𝔸𝕘𝕖𝕟𝕥 𝕃𝕦𝕞𝕠𝕤 is one of the first unified and modular frameworks for training open-source LLM-based agents. New features: 🤖️Multimodal Reasoning with 𝕃𝕦𝕞𝕠𝕤 🐘 13B-scale 𝕃𝕦𝕞𝕠𝕤 models 🤗 𝕃𝕦𝕞𝕠𝕤 data-explorer demo @ai2_mosaic @uclanlp 📝:…
PPO's training curves look like this. Note that several 1B models' KL exploded. From an optimization point of view, there is nothing wrong with them because the RLHF reward kept going up, the these 1B models corresponds to the "reward hacking" / over optimized models. To…
Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳William Wang @WilliamWangNLP
14K Followers 716 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Wenhu Chen @WenhuChen
11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.Yu Su @ysu_nlp
6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biologicalKayo Yin @kayo_yin
8K Followers 556 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵Luca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (makin Dolma 🍇), open source science fan, @QueerInAI organizer 🤖☕️🍕they/themYao Fu @Francis_YAO_
13K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep runningWeijia Shi @WeijiaShi2
5K Followers 965 Following PhD student @uwcse @uwnlp | Visiting Researcher @MetaAI | Undergrad @CS_UCLA | https://t.co/eLBQmgkvymXin Eric Wang @xwang_lk
7K Followers 1K Following Multimodal and Embodied AI Researcher / Professor @UCSC. Director of https://t.co/Y4swOBag21. AI for Humanity in the long run. he/himDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Allen Institute for A.. @allen_ai
53K Followers 361 Following AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfLDanish Pruthi @danish037
7K Followers 628 Following Faculty at Indian Institute of Science, Bangalore. PhD from @LTIatCMU.Michael Saxon @m2saxon
2K Followers 1K Following CS PhD cand @ucsbNLP 🌊🌴 @NSF GRFP 🧐analyzing semantics in generative lang/img AI models🤖 Big tech ex-intern. BS/MS @ASU 🌵🏜 🔜 @AMD opensrc GenAI RS internOfir Press @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Greg Durrett @gregd_nlp
6K Followers 751 Following CS professor at UT Austin. I do NLP most of the time. he/himSean (Xiang) Ren @xiangrenNLP
6K Followers 561 Following Building @SaharaLabsAI | @USCViterbi Early Career Chair, Professor @nlp_usc | @MIT TR 35 , @ForbesUnder30 | Prev: @allen_ai, @Snapchat, @Stanford, @UofIllinoisWenzhao Qiu @WenzhaoQiu
0 Followers 138 FollowingYin-Hong Cao @caoyinhong
74 Followers 906 Following Postdoc in Jiayang Li Lab, Institute of Genetics and Developmental Biology, CAS. Focus on the Multi-omics of the rice & dandelions🌾🌱🧬Quarkstar @Quarkstar9
17 Followers 107 FollowingAnurag Mishra @anuragm75160136
111 Followers 801 Following Building Scalable AI Applications | Senior Data Scientist @ EY | CSE Btech @ NIT MN | Linkedin: https://t.co/pCmSV6FmOehuzaifa jawad @huzaifajaw25291
2 Followers 71 FollowingDr.R @MichaelRan15
1K Followers 444 Following A Believer, Builder and Investor in Generative AI. Co-Founder of GPTDAO @gptdaoglobal and @1genaiLaura Liu @lauraqq
16 Followers 384 FollowingSimon Batzner @simonbatzner
4K Followers 690 Following RS at Google DeepMind. Prev: Harvard, MIT, NASA, Google Brain.Thomas Balestri @ThomasBalestri4
44 Followers 374 Following Applied Scientist at AWS. Formerly high-energy physics PhD for ATLAS@CERN. Opinions are my own.Kyle @Kyle_Vectara
3 Followers 77 Following Generative AI, Retrieval Augmented Generative, LLMs, Machine Learning, Neural Networks, Data Science, Semantic Search, Conversational AI, VectorDB, Researchabderrahim zine @abderrahimzine6
25 Followers 593 FollowingYichen Zhu @asdfihu145275
0 Followers 143 FollowingTodd Kueny — e/acc @techgazetteco
3K Followers 5K Following Empowering worlds where AI enriches lives, solves complex problems, and inspires continuous learning.Ervin Lang @ervinlang
48 Followers 1K Followingashish mishra @aegisAshish
138 Followers 349 Following Now Assisting Profs @CSE_IITH | Earlier Postdocing @Purdue | PhD, Computer Science @IIScCSA. Interested in Programming Languages, Politics, and Philosophy.John McDonald @rjohnmcdonald
47 Followers 291 FollowingYikai Zhang @ykzhang721
6 Followers 35 Following Ph.D. candidate at Fudan University. Research Intern at @BytedanceTalk AI LabRan Cheng @RanCheng10
189 Followers 1K Following Head of AI at Eureka Robotics & Midea Group MCA, formerly a research engineer at Huawei Noah's Ark Lab, Canada.今夜无眠 @Airwalker2020
5 Followers 105 Followingharpreet @DataScienceHarp
7K Followers 1K Following 🤖 Generative AI Hacker | 👨🏽💻 AI Engineer | 👷🏽♀️ Developer Advocate | Building🏗️-Shipping🚢-Sharing🚀Mike Channon Ⓜ️ @XDA_Forum_Admin
6K Followers 5K Following Forum Admin at https://t.co/mFiBmgsI4b, Director at https://t.co/iH1LoXoajpMohammad Alaggan, Ph... @m_aggan
1K Followers 3K Following Sr. Software Development Engineer at @AWSCloud. Opinions are my own.Alex Vaughan (agvaugh.. @agvaughan
468 Followers 3K Following All opinions my own, and I'm as disappointed as you are. Currently: @Meta Prev: @CajalNeuro, @Pymetrics, CSHL, Stanford Also: @[email protected]xenjoyer007 @xenjoyer007
1 Followers 132 FollowingAbdallah Arioua @AbdallahAriooua
137 Followers 895 Following Chief Data and AI Officer, PhD in AI. Opinions are mine.Blaine Combs @BlaineComb58074
9 Followers 171 FollowingMuizz @muizzkhan77
33 Followers 1K FollowingMatt Ahmann @mattahmann
317 Followers 2K Following Finance @usouthflorida | MSF Candidate @vanderbiltu | Space🚀, Tech🖥️, AI 🤖, Biotech 🧬, Sustainable Energy ☀️, Finance 💹, College Sports Fan 🏀🏈🏟️Hualong @ValonLee
16 Followers 59 FollowingJHU CLSP @jhuclsp
5K Followers 662 Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSiDY @[email protected]christian cch @chris_cch_
188 Followers 3K FollowingAbdulrahman Tabaza @embed_dim
3 Followers 708 Following enjoyer of various vector spaces, encoders and modalitiesXiwen Wei @XiwenWei_
14 Followers 60 FollowingEvangeline @Evangeljy
1 Followers 87 FollowingJindong Gu @Jindong73504766
246 Followers 752 Following Senior Research Fellow in University of Oxford @OxfordTVG Faculty Researcher @Google #ResponsibleAI #AISafety #GenAI Homepage: https://t.co/YOSVO3jb6hMingkai Deng @mdeng34
322 Followers 276 Following PhD student @LTIatCMU | MSML @mldcmu | BA Math-Stats + CS @Columbia | CV, RL, NLP | He/HisKyle Leahy @kyletleahy
37 Followers 203 FollowingAlo @Hal90910
0 Followers 2K FollowingAI Papers Podcast @aipaperspodcast
874 Followers 2K Following A digestible daily update on the latest AI Research Papers. Brought to you by @pocketpodappLianmin Zheng @lm_zheng
4K Followers 437 Following CS Ph.D. @ UC Berkeley. Creator of Alpa, Vicuna, and Chatbot Arena. @lmsysorgClementine J Yang @G3yk7EL35zOiq2x
7 Followers 34 FollowingPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.AI at Meta @AIatMeta
531K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳William Wang @WilliamWangNLP
14K Followers 716 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Graham Neubig @gneubig
31K Followers 585 Following Associate professor at CMU, studying natural language processing and machine learning.Yi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Jacob Andreas @jacobandreas
13K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwWenhu Chen @WenhuChen
11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Yu Su @ysu_nlp
6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biologicalYoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCChristopher Manning @chrmanning
126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Ana Marasović @anmarasovic
4K Followers 604 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷Kayo Yin @kayo_yin
8K Followers 556 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵Rafael Rafailov @rm_rafailov
3K Followers 637 Following Ph.D. Student at @StanfordAILab. I work on Foundation Models and Decision Making. Previously @GoogleDeepMind @UCBerkeleyJindong Gu @Jindong73504766
246 Followers 752 Following Senior Research Fellow in University of Oxford @OxfordTVG Faculty Researcher @Google #ResponsibleAI #AISafety #GenAI Homepage: https://t.co/YOSVO3jb6hYijia Shao @EchoShao8899
2K Followers 280 Following CS Ph.D. student @StanfordNLP. Previous: undergraduate @PKU1898.Vitaliy Chiley @vitaliychiley
2K Followers 606 Following Head of NLP Pretraining @Databricks / @MosaicML | Former @CerebrasSystems | What do we want? FLOPS! When do we want it? TOKENS!Jonathan Frankle @jefrankle
16K Followers 685 Following Chief Scientist, Neural Networks @Databricks via MosaicML. PhD @MIT_CSAIL. BS/MS @PrincetonCS. DC area native. Making AI efficient for everyone at @DbrxMosaicAIDatabricks @databricks
70K Followers 1K Following Databricks is the data and AI company, helping data teams solve the world’s toughest problems.Cody Blakeney @code_star
3K Followers 824 Following Head of Data Research @MosaicML / @databricks | Formerly Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | https://t.co/4G6Jf3at5wNancy Pelosi Stock Tr.. @PelosiTracker_
559K Followers 223 Following Highlighting Politicians' trades so we can invest alongside Goal: get them banned from trading Powered by @joinautopilot_Dimitris Papailiopoul.. @DimitrisPapail
11K Followers 965 Following prof @ wisconsin; thinking about transformers; learning in context; babas of Inez LilyWing Lian (caseus) @winglian
9K Followers 2K Following @axolotl_ai dev. OpenAccess AI Collective founder. Alignment Labs. AI/ML tinkerer. Building tools for everyone.Remi Cadene @RemiCadene
8K Followers 587 Following Robotics at Hugging Face Ex-Tesla Autopilot Optimus Postdoc Brown, PhD SorbonneAmanda Askell @AmandaAskell
26K Followers 653 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.Min-Hung (Steve) Chen @CMHungSteven
2K Followers 1K Following Senior Research Scientist @NVIDIAAI @NVIDIA | Ex-@Microsoft Azure AI, @MediaTek AI | Ph.D. @GeorgiaTech | Multimodal AI/CV/DL/ML | https://t.co/dKaEzVoTfZHal Daumé III @haldaume3
27K Followers 355 Following Human-centered AI #HCAI, NLP & ML. Director @trails_ai. Prof @umdCS, member of @CLIPumd @HCIL_umd, researcher @MSFTresearch. Fun: 🧗🧑🍳🧘⛷️🏕️. he/him.WIRED @WIRED
10.0M Followers 451 Following Where tomorrow is realized || Sign up for our newsletters: https://t.co/webmuFK9lNFenqing Jiang @fengqing_jiang
60 Followers 128 Following PhD Student@UW, working on trustworthy ml, especially LLM recently. Open for collaboration.LLM Security @llm_sec
8K Followers 297 Following Research, papers, jobs, and news on large language model security. Got something relevant? DM / tag @llm_secEthan Mollick @emollick
210K Followers 551 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgqSida Wang @sidawxyz
449 Followers 295 FollowingAakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to changeZhangchen Xu @zhangchen_xu
96 Followers 112 Following UW PhD Student|Distributed Systems & Federated Learning & LLM Security | Looking for Summer Internships 🥲Costa Huang @vwxyzjn
3K Followers 1K Following RLHF @huggingface 🤗; main dev of @cleanrl_lib; CS PhD @DrexelUniv; Ex @CuraiHQ @weights_biases @NVIDIAAI @riotgames.Axolotl @axolotl_ai
806 Followers 17 Following Axolotl is the premier open source LLM fine tuning framework. find us on discord https://t.co/wlcE2wlJa9Mahdi Soltanolkotabi @mahdisoltanol
291 Followers 316 Following Cycling Academic; work on optimization, probability/statistics, theory of deep learning, AI for science/healthcare; director of center on AIF4S @USCyi 🦛 @agihippo
3K Followers 81 Following secondary account, hardcore fans only. friend of @agikoala the great researcher, main account: @yitayml warning: hot takes.Davis Blalock @davisblalock
12K Followers 165 Following Research scientist + first hire @MosaicML. @MIT PhD. I write + retweet threads about machine learning papers. Paper summaries newsletter: https://t.co/xX7NIpsIVZXuhui Zhou @nlpxuhui
683 Followers 428 Following PhD student @LTIatCMU. Previously, @GeorgiaTech, @UWNLP, and @Apple. Social Intelligence in language +X. He/Him.🐳Ruqi Zhang @ruqi_zhang
540 Followers 270 Following Assistant Professor @PurdueCS | PhD @Cornell | Probabilistic machine learning, Bayesian deep learning, Sampling, MCMCTianlong Chen @TianlongChen4
533 Followers 17 Following Incoming Asst. Professor at UNC Chapel Hill (@unccs, @unc). Postdoc, CSAIL@MIT (@MIT_CSAIL) & BMI@Harvard (@Harvard). Ph.D., ECE@UT Austin (@UTAustin). #AI #MLTed Xiao @xiao_ted
11K Followers 681 Following I teach robots to be smarter @GoogleDeepMind. Tweets about robot learning, scaling, and large models. Opinions my own.Groq Inc @GroqInc
44K Followers 467 Following Creator of the LPU™ Inference Engine, providing the fastest speed for AI applications, designed & engineered in N. America https://t.co/DsEqVAC5DpYuchen Eleanor Jiang @eleanorjiang630
3K Followers 338 Following co-founder&CEO of AIWaves Inc., formerly CS PhD@ETH Zürich, ex-research intern@Microsoft Research Asia, #NLProc #LLMsAnand Bhattad @anand_bhattad
2K Followers 293 Following Research Assistant Professor @TTIC_Connect | Exploring Knowledge in Generative Models | PhD from @illinoisCS | UG @surathkal_nitkTianyu Pang @TianyuPang1
414 Followers 148 Following 🇸🇬Research Scientist at Sea AI Lab @SeaGroup; 👨🏻🎓PhD/BS from @Tsinghua_Uni and ex-@MSFTResearch; 🛡️Trustworthy AI and Generative Models.Meng Jiang @Meng_CS
1K Followers 486 Following Associate Professor with Tenure at Notre Dame CSE | Data Mining | Natural Language ProcessingReka @RekaAILabs
11K Followers 13 Following An AI research and product company 🫠. We are a team of scientists and engineers building state-of-the-art multimodal language models 😻Qi Liu @leuchine
382 Followers 402 Following Cofounder @RekaAILabs, Assistant Professor @HKUniversity Past: @DeepMind, FAIR (@MetaAI), @MSFTResearch, PhD @UniofOxfordDaniel van Strien @vanstriendaniel
3K Followers 1K Following Machine Learning Librarian @huggingface 🤗 | Championing Open Science & ML | Sharing the latest ML datasets 🌟 | Tips for mastering the HF HubMatthew Peters @mattthemathman
2K Followers 572 Following Cofounder @SpiffyAI. Research Scientist at AI2 (@allenai_org).Yufei Wang @YufeiWang25
322 Followers 183 Following PhD in Robotics. Robot Learning. Robotics Institute, CMU.Leo Simon @leo5imon
12K Followers 206 Following lore builder @ https://t.co/8sBsk2Hdoy | venture @telahvcHappening now! 👇👇
🚨 Excited to announce the 'UNC Symposium on AI and Society'! 🙂 cs.unc.edu/event/symposiu… Excellent line-up of speakers (Apr25+26) across diverse disciplines/departments incl. computer science, philosophy, cognitive science, psychology, ethics, sociology, data science,…
PhDone!!!! 👨🎓 08/2019-04/2024 What a journey 🥳🚞 I especially feel lucky to share this once-in-a-life-time moment with people I love ❤️ . And seeing my passion-driven research efforts being acknowledged by researchers I deeply admire 🌞!! Special thanks to my awesome committee…
Today we're launching PRISM, a new resource to diversify the voices contributing to alignment. We asked 1500 people around the world for their stated preferences over LLM behaviours, then we observed their contextual preferences in 8000 convos with 21 LLMs arxiv.org/abs/2404.16019
The best part about making slides on LLM alignment is that I now get to combine my two passions in life: math and memes 😅 (this one is a classic from @tomgoldsteincs)
On May 2-3, we're going to have a big event in Pittsburgh about LLM Agents. We have invited talks from great speakers inside and outside CMU, student research presentations and posters, tutorials and discussions! Come join us at CMU campus, and register at cmu-agent-workshop.github.io
🚨New Paper🚨 We propose 1⃣CultureBank🌎 dataset sourced from TikTok & Reddit 2⃣An extensible pipeline to build cultural knowledge bases 3⃣Evaluation of LLMs’ cultural awareness 4⃣Insights into culturally-aware LLMs Project: culturebank.github.io Data: shorturl.at/hrtwP
Some personal updates: I joined OpenAI a few months ago, working on all things robustness/safety/privacy. Also, we are working to publish more of our safety work. See my first project here below, where we make initial progress on prompt injections and other attacks!
Introducing the Instruction Hierarchy, our latest safety research to advance robustness for prompt injections and other ways of tricking LLMs into executing unsafe actions. More details: arxiv.org/abs/2404.13208
@lmsysorg I applaud to this but tbh, it's hard to trust a benchmark where the winner is the judge.
Datasets might be more impactful than models at this point and this may be the GPT4 of datasets. Courtesy of the amazing Guilherme who trained Falcon & the @huggingface team!
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Llama-3 is closing the gap with GPT-4, but multimodal models gotta catch up. Vision capabilities of open models like LlaVA are far, far behind GPT-4V. Video models are even worse. They hallucinate all the time and fail to give detailed descriptions of complex scenes and actions.…
Very excited about the release of arena hard, the main benchmark we looked at when selecting the checkpoints for Starling model. It focuses on a subset of very hard prompts from chatbot arena.
Introducing Arena-Hard – a pipeline to build our next generation benchmarks with live Arena data. Highlights: - Significantly better separability than MT-bench (22.6% -> 87.4%) - Highest agreement to Chatbot Arena ranking (89.1%) - Fast & cheap to run ($25) - Frequent update…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
@bindureddy Why do you tweet the most ridiculous things sometimes? You realize that Llama 3 will be ancient technology in a year *because* of training models with 10x more compute, right?
@bindureddy @LoulyAdam DJ does not represent overall market, and it’s not even market cap weighted. It’s a meaningless index.
Handling data at scale always presents edge cases... In preparing WildChat-1M, besides Moderation issues↓, we found a curse word repeated thousands of times w/o spaces, causing the Presidio analyzer in PII removal to hang. Stay tuned for the upcoming release of WildChat-1M!
Update on Moderation API issue: length errors seem to link to non-Latin characters. E.g., Moderation can handle 1M Latin characters but fails for a few K non-Latin characters on WildChat (Korean, Chinese, etc) Code for reproducing the err & a workaround: community.openai.com/t/moderation-r…