CLS @ChengleiSi
vibing @stanfordnlp | real AGI is the friends we made along the way noviscl.github.io Palo Alto, California Joined August 2018-
Tweets2K
-
Followers2K
-
Following3K
-
Likes16K
Congrats to everyone who just finished the PhD application cycle! 🎉🥳 If you found our repository of statements at cs-sop.org helpful, please consider sharing yours so that we could help more students in the future!
Anyone want to hang out at ICLR next week and chat about empirical training dynamics, AI for science, and LM interpretability?
the shortest and most surreal 30 minutes. favorite (shareable) thing he said, paraphrased: someone asked what college students should do as we enter a world approaching AGI, how to prepare and grapple with this future, what work will be left for us to do.
the shortest and most surreal 30 minutes. favorite (shareable) thing he said, paraphrased: someone asked what college students should do as we enter a world approaching AGI, how to prepare and grapple with this future, what work will be left for us to do.
@emilyzsh props to @ChengleiSi for asking the question that this answered 🫡
There is a really nice community of researchers developing transformer alternatives. Want to highlight these impressive folks. Simran Arora (@simran_s_arora), Chunting Zhou (@violet_zct), Dan Fu (@realDanFu), and Songlin Yang (@SonglinYang4)
amazing resources for culturally aware LLMs by the amazing @shi_weiyan
amazing resources for culturally aware LLMs by the amazing @shi_weiyan
Exciting update -- Llama-3 full result is out, now reaching top-5 on the Arena leaderboard🔥 We've got stable enough CIs with over 12K votes. No question now Llama-3 70B is the new king of open model. Its powerful 8B variant has also surpassed many larger-size models. What an…
Can LMs correctly distinguish🔎 confusing entity mentions in multiple documents? We study how current LMs perform QA task when provided ambiguous questions and a document set📚 that requires challenging entity disambiguation. Work done at @UTCompSci✨ w/ @xiye_nlp, @eunsolc
Want to train an aligned LM in a new language 🌏 but don’t have preference data for training the reward model (RM)? 💡 Just use a RM for another language: it often works well, sometimes even BETTER than if you had a RM in your target language! 🤯 arxiv.org/abs/2404.12318
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Hello. We receive so many questions about agents in DSPy. Did you know you that ~15 lines of DSPy can turn an agent that scores 30% to a prompt-optimized, multi-agent aggregation system that scores 60% EM on a HotPotQA sample? I released a notebook showing how to do this 🧵⤵️
Nikola Jokić goes to therapy to deal with being called Gru in this new teaser for ‘DESPICABLE ME 4’
@WenhuChen @Teknium1 I think the motivation of LIMA is not to quantify the number of SFT examples that is needed but to highlight (1) how important high quality SFT data is and (2) the superficial alignment hypothesis where pretrained LLM stores all the knowledge and can be easily tuned into an…
Curious about socially-intelligent AI? Check out our paper on underlying technical challenges, open questions, and opportunities to advance social intelligence in AI agents: Work w/ @lpmorency, @pliang279 📰Paper: arxiv.org/abs/2404.11023 💻Repo: github.com/l-mathur/socia… 🧵1/9
More technical details on the new Meta Llama 3 models announced today. 🦙🧵
Google presents Reuse Your Rewards Reward Model Transfer for Zero-Shot Cross-Lingual Alignment Aligning language models (LMs) based on human-annotated preference data is a crucial step in obtaining practical and performant LM-based systems. However, multilingual human
“Can we get a new text analysis tool?” “No—we have Topic Model at home” Topic Model at home: outputs vague keywords; needs constant parameter fiddling🫠 Is there a better way? We introduce LLooM, a concept induction tool to explore text data in terms of interpretable concepts🧵
Coding is the frontier of AI. Excited to push the two frontiers of AI coding: 1. SWE(-bench/agent) 2. Olympiad programming (this tweet) Introduce USACO benchmark: * inference methods (RAG/reflect) help a bit: 9->20% * human feedback helps a lot: 0->86%! princeton-nlp.github.io/USACOBench/
AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxAkari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Bill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Yao Fu @Francis_YAO_
14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep runningKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Ofir Press @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Weijia Shi @WeijiaShi2
5K Followers 968 Following PhD student @uwcse @uwnlp | Visiting Researcher @MetaAI | Undergrad @CS_UCLA | https://t.co/eLBQmgkvymLuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)Xi Ye @xiye_nlp
2K Followers 304 Following CS PhD student @UTAustin. I study NLP, particularly explanations. I sometimes make memes.Yu Su @ysu_nlp
6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biologicalNaomi Saphra @nsaphra
7K Followers 1K Following Waiting on a robot body. ML/NLP. All opinions are universal and held by both employers and family. Same username on every lifeboat off this sinking ship.Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Michael Saxon @m2saxon
2K Followers 1K Following CS PhD cand @ucsbNLP 🌊🌴 @NSF GRFP 🧐analyzing semantics in generative lang/img AI models🤖 Big tech ex-intern. BS/MS @ASU 🌵🏜 🔜 @AMD opensrc GenAI RS internTuhin Chakrabarty @TuhinChakr
2K Followers 621 Following Newly minted Ph.D. from @ColumbiaCompSci studying creativity. Ex affiliations: @GoogleDeepmind @SFResearch @allen_aiJordan Boyd-Graber @boydgraber
4K Followers 2K Following Trivia Nerd, NLPer, Dad, Colorado native in Maryland exile Working on QA, negotiating/cooperating bots, ML explanations Exemplar for absent-minded professorNathan Schneider @complingy
4K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.socialEkin Akyürek @akyurekekin
2K Followers 726 Following graduate student in computer science @MITEECS/@MIT_CSAILYiqing Xie @YiqingXXX
68 Followers 89 Following ✨ NLP for Code & Code for NLP 🎓 PhD student @LTIatCMU; MSCS @dmguiuc. 👩💻 Intern (incoming) @meta; (previously) @MSFTResearch; @AlibabaDAMO.精神病狗婊子杂.. @frkglp
0 Followers 3K Following 神病狗婊子杂种邓小平,刘少奇就是整个世界的敌人,它那套歪把戏不除,世界战乱不断。Cgkl精神病狗婊子杂种习近平被凌迟处死。Cgk凌迟处死精神病狗婊子杂种中共狗屁家族邓小平,习近平,陈云,刘少奇,陈一新,张又侠,何卫东,刘振立,苗华,董军。锸s你跟踪本人的精神病狗婊子杂种全部中共空军、警察、台湾间谍Agency @AgencyMDR
1K Followers 825 Following Employee Targeted Digital Risk // Personalized Managed Cybersecurity // Security and Compliance for Hyper Growth Companies // YC W22Quealleyth @QuealleythNmgs
0 Followers 76 FollowingNicole Meister @nicole__meister
67 Followers 109 Following phd student @stanford, previously @princeton @VisualAILab (she/her)Theytud @Theytud170841
1 Followers 173 FollowingAva-grace Sotello @AvaSotello58557
67 Followers 5K FollowingZhijing Jin @ZhijingJin
3K Followers 1K Following Final-year PhD @MPI_IS & @ETH_en w/ @bschoelkopf. Research on (1) @CausalNLP and (2) NLP4SocialGood @NLP4SG. Mentor and mentee @ACLMentorship.Zanë ([email protected].. @ZanaBucinca
2K Followers 401 Following PhD Candidate @Harvard, human-AI interaction; KosovoEric @exalted
2K Followers 4K Following Infosec & AI | Learning Poker GTO, Badminton, Web3. Emerald-hearted JG, love hadoukens. #sagamobileInhwa Song @_inhwa_song
254 Followers 516 FollowingMinsik Oh @minsik_nlp
671 Followers 1K Following AI Researcher, ex-AWS AI. Incoming @stanford MSCS. #NLProc #AIKim-Mai Cutler @kimmaicutler
58K Followers 25K Following Partner at @initialized. Previously @techcrunch. When life hands me lemons, I make tarte au citron.Ryan Boyle @_RyanBoyle_
1K Followers 5K Following Tech Enthusiast 👨🏼💻 Aspiring ML Engineer. Frequent Traveler 🌎 Based in Philly & LA, Soon → SF 🌉Gaurav Ragtah @gragtah
9K Followers 3K Following 👨💻 https://t.co/KC5a1dIPP1 quickly find open-source AI code. built things at Google, Yelp, Klout, SlideShare. Columbia/Colgate CS. ✍️ tech, linguistics, cultureSurabhi Gupta @this_is_surabhi
542 Followers 2K Following prev robotics intern at @getpeppermint | alterok @_buildspace | 20 | electronics engineering undergrad | robotics enthusiastSarah Arminta Bentley @Sarah_A_Bentley
3K Followers 2K Following Trader. PKM. AI. Mom. @tana_inc AmbassadorNazneen Rajani @nazneenrajani
4K Followers 2K Following Something new 🧪 | Previously: @huggingface 🤗, @SFResearch, PhD @utcompsciMcSethay @sethay75183
26 Followers 263 Following In the dull and boring world, there is also occasional luck. No cross, no crown.Sitao Cheng (Seeking .. @TonyCheng990417
24 Followers 159 Following Seeking PhD positions 25Spring/25Fall! Interested in NLP, LLM-agents, knowledge graph reasoning. M.S. @NanjingUnivers1. Current Intern @MSFTResearch.Babuaravind Gururaj @aravindguru33
220 Followers 1K Following ZK Intern @ironmill_xyz | Information systems graduate @Northeastern | VP Finance @NortheasternGSG | 4x web3 hackathon winnerThe Great Indoors @thegreatindoorx
48 Followers 604 Following Explore the bleeding edge of technology with me! AI is moving faster than most people think, keeping up with it is more important than ever.Harman Singh @Harman26Singh
598 Followers 2K Following Pre-doctoral Researcher, Languages @GoogleAI • Prev: AI Resident @MetaAI, Undergrad @iitdelhi, INK Lab @CSatUSC, @IBMResearch. language, vision, reasoninghhkb.logi @hhkblogi
22 Followers 881 FollowingCrazyFoxMovies @CrazyFoxMovies
2K Followers 418 Following Twitter Account of CrazyFoxMovies 500k+ subs on YouTube. Tweets are related to Minecraft & Memes! Discord: https://t.co/D2LcXnCi9rmikaela @mikscust1
21 Followers 1K Following藤原大輔 | KUSABI.. @dicefujiwara
363 Followers 439 Following @kusabifundインターン | AI周りのスタートアップの情報を配信してます | 19y @MinervaUni CS 専攻 | スポーツ大好き| 🇻🇳→🇳🇿→🇺🇸→🇰🇷→🇩🇪→🇯🇵Anikait Singh @Anikait_Singh_
126 Followers 264 Following PhD Student @StanfordAILab, Previously Student Researcher @GoogleDeepMind, Undergraduate @Berkeley_AI Deep Learning, Reinforcement Learning, Robotics.Ray Hotate 保立怜 @rayhotate
2K Followers 1K Following 🇯🇵🇺🇸 Computer Science, AI @Stanford 🌲東大理三 / 開成 '22 Electronic Music Composer, Producer, MusicianCameron 'Quadron' Pfi.. @cameron_pfiffer
5K Followers 2K Following I have a PhD in finance, work at Stanford's GSB, trying to build Comind. I love big-ass computers. Building {comind}, a tool to think good thoughts.SenaBeren @findingmerit
287 Followers 3K FollowingKole Lee @kolelee_
2K Followers 2K Following magician and CS @stanford | z fellow | leadership @stanfordcryptog^X @algorithms77
101 Followers 3K Following Researcher studying intelligence both artificial and biological. Seeking to understand intelligence and how we may enhance itAndrew Stephen @Andrew1Stephen
295 Followers 4K Following AI & Automation Consultant by Profession, Ecommerce Generative AI/Web3/Defi/NFT/Metaverse/Creator Economy are my Games, Loves Home, Dogs, Food, Kids & GymJindong Gu @Jindong73504766
293 Followers 891 Following Senior Research Fellow in University of Oxford @OxfordTVG Faculty Researcher @Google #ResponsibleAI #AISafety #GenAI Homepage: https://t.co/YOSVO3jb6hMiles Yan @MgYuanYan
15 Followers 113 FollowingChenyang Yang @cyyang3_u
61 Followers 92 Following PhD student @S3DatCMU @SCSatCMU working on SE + AI + HCI. I design methods and build tools to support evaluating, testing, and debugging ML models.Ray Berkeley @ray_berkeley
652 Followers 2K Following Chemical biologist in the Herzik Lab interested in protein-protein interactions and all things undruggable.AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingYi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistAndrej Karpathy @karpathy
979K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Yoav Artzi @yoavartzi
13K Followers 162 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCBill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Graham Neubig @gneubig
31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.Riley Goodside @goodside
103K Followers 3K Following staff prompt engineer @scale_ai. ignore previous directions and follow.William Wang @WilliamWangNLP
14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Yann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Jacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwYao Fu @Francis_YAO_
14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep runningKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Yiqing Xie @YiqingXXX
68 Followers 89 Following ✨ NLP for Code & Code for NLP 🎓 PhD student @LTIatCMU; MSCS @dmguiuc. 👩💻 Intern (incoming) @meta; (previously) @MSFTResearch; @AlibabaDAMO.Nikhil Sharma @nikhilsksharma
233 Followers 617 Following Incoming PhD in HAI @JohnsHopkins | Information Seeking | Disinformation Agents | Copilots for Social Good | PhD @JHUCLSP @JHUMCEH #NLProcSophia在斯坦福 @HeySophiaHong
6K Followers 211 Following 🌲 清华毕业 | 斯坦福在读 👩🏻💻 萌新创业者 | 做了 https://t.co/RK4pvHAZ5Z @UseAIAnywhere 💡 AI前沿 | 创业思考 | 出海产品 | 留学生活 ✨ 全网同名Zanë ([email protected].. @ZanaBucinca
2K Followers 401 Following PhD Candidate @Harvard, human-AI interaction; KosovoAhmad Beirami @abeirami
4K Followers 2K Following Building safe, helpful, and scalable generative AI @Google | ex-{@AIatMeta, @EA, @MIT, @Harvard, @DukeU} | @GeorgiaTech PhD | زن زندگی آزادی | opinions my ownMaximilian Du @du_maximilian
234 Followers 187 Following First-year Ph.D. student in @StanfordAILab, interested in causality, robustness, and self-directed play in AI, humans, & animals! (Oh, and I'm a writer too)Anikait Singh @Anikait_Singh_
126 Followers 264 Following PhD Student @StanfordAILab, Previously Student Researcher @GoogleDeepMind, Undergraduate @Berkeley_AI Deep Learning, Reinforcement Learning, Robotics.M.J. Crockett @mollycrockett
15K Followers 2K Following Professor @PsychPrinceton & University Center for Human Values | Cognitive scientist curious about (anti)normativity, technology & the self | They/She 🏳️🌈Yitao Liu @taoooo917
140 Followers 443 Following Looking for 2024 summer research internships | Senior @HKUniversity | Intern @HKUNLP & @PrincetonNLP | NLP researchXindi Wu @cindy_x_wu
935 Followers 807 Following Data-centric multimodal ml PhD student @PrincetonCS, prev @RealityLabs @roboVisionCMU @CMU_Robotics @SnapchatSara Du @saraduit
34K Followers 891 Following ceo & cofounder @alloyautomation | prev @harvard @ycombinatorWei-Lin Chiang @infwinston
3K Followers 852 Following CS PhD student at UC Berkeley. co-lead of Chatbot Arena @lmsysorgZheng Yuan @GanjinZero
661 Followers 509 Following NLP Researcher. The author of RRHF, RFT and MATH-Qwen. Focus on Medical & Reasoning & Alignment in LLMs. Prev Tsinghua Ph.D.Zhiqiu Lin @ZhiqiuLin
105 Followers 91 Following PhD Student at Carnegie Mellon University | Computer Vision and Language | Generative AIStanford Digital Econ.. @DigEconLab
8K Followers 280 Following Bringing together the world's best minds, whether human or machine, to study how digital technologies can transform the economy @StanfordHAI Director: @ErikBrynTiffany Knearem, PhD @tknearem
883 Followers 428 Following UX Researcher Google @materialdesign & Social @ACM_CHI, PhD Informatics @Penn_State 🐾 HCI, AI & Design 🤖 | Prior @ISTatPENNSTATE @GoogleAI @meta @jetprogramElla Minzhi Li @EllaMinzhiLi
145 Followers 105 Following CS PhD student at NUS @wing_nus 🇸🇬, incoming visiting PhD at Stanford @stanfordnlp🌲, NLP researcher📒Simon Guo 🦝 @simonguozirui
1K Followers 4K Following Incoming CS PhD student @Stanford and curr training models at @cohere | 🎓 @Berkeley_EECS | prev built things at @ @anyscalecompute @nvidiaHaotian Liu @imhaotian
6K Followers 397 Following building intelligence @xAI, creator of #LLaVA, cs @UWMadison, prev @MSFTResearchVirginia Adams @itsvadams
10 Followers 163 FollowingAnne Ouyang @anneouyang
3K Followers 582 Following Incoming CS PhD student @Stanford, currently cuDNN @Nvidia | M.Eng, B.S. in CS @MIT | self-improving ML systems + performance engineeringTian Gao @TianGao_19
220 Followers 149 Following CS PhD @Stanford | Prev: @UTAustin and @Tsinghua_Uni | Embodied AI/RL/RoboticsOlga Golovneva @OlgaNLP
60 Followers 82 FollowingVijay V. @vijaytarian
525 Followers 444 Following Grad student at CMU. I do research on applied NLP. he/himZora Zhiruo Wang @ZhiruoW
529 Followers 183 Following PhD student @LTIatCMU | previously: intern @Amazon Alexa AI | assistant researcher @Microsoft Research, Asia | intern @TencentAlexander Khazatsky @SashaKhazatsky
287 Followers 15 FollowingNicholas Lourie @NickLourie
136 Followers 287 Following I build things. 🤖 Doing a PhD at @nyuniversity (@CILVRatNYU) on better empirical methods for deep learning and data science. Advised by @kchonyc and @hhexiy.Guodong Zhang @Guodzh
24K Followers 417 Following Train, tune and align LLMs “good” @xai. Previously Gemini @GoogleDeepMind and PhD @UofT.Michelle Qin @michelleqin_
3K Followers 510 Following CS @pika_labs @Stanford ✌️ I care about people, AI, & design for play 🤸♀️Science of Science @MishaTeplitskiy
6K Followers 893 Following Sociology of science, technology, and innovation || Assistant professor at 〽️ichigan @UMSIVivian Liu @viv_lavida
860 Followers 207 Following CS PhD student @Columbia computer scientist, designer, writer, and dancerNeal Wu @WuNeal
15K Followers 390 Following Building @cognition_labs. Previously @tryramp, @GoogleBrain, @Harvard, competitive programming (featured in @Wired). Created https://t.co/pihw5AGvbV.Studio Ghibli Picture.. @ghiblipicture
1.4M Followers 49 Following Daily Studio Ghibli Pictures and GIFs 📸 | All credits to @GhibliUSA © | Parody AccountPhysical Intelligence @physical_int
4K Followers 8 Following Physical Intelligence (Pi), bringing AI into the physical world.Samuel "curry-howard .. @SamuelAinsworth
3K Followers 3K Following prev: @BrownUniversity, @uwcse/@uw_wail phd, curr: research scientist @cruise. 0.1x engineer, 10x friend. spondyloarthritis, cars ruin cities, open sourceCognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqReplacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models abs: arxiv.org/abs/2404.18796 This paper from Cohere proposes to evaluate models using a Panel of LLm evaluators (PoLL). "we find that using a PoLL composed of a larger number of smaller…
(1/7) Do you want to test code generation models on the domains you care about? Struggling to find existing benchmarks that suit your needs? Our new work *CodeBenchGen* helps you build execution-based benchmarks based on your selected code fragments! (arxiv.org/abs/2404.00566)
One year ago, I left Google Brain (now DeepMind) to join a very early startup. We had fewer than 10 people at that time, and have grown many times since. Today, I am extremely proud to share our milestone. We are Augment. You can read about us here. techcrunch.com/2024/04/24/eri…
@ShunyuYao12 IMO 2007/P3 is also extremely hard, and I think it would be much harder for LLMs to solve it. However, as can be seen from the stats, many contestants got partial scores. It's very different from P6, where they either got everything (thanks to knowing Combinatorial…
Nonsense inputs may make sense for LMs Some phrases in the jibberish rubble make models answer or regurgitate knowledge. But what can we learn about those nonsensical phrases or from them on LMs? arxiv.org/abs/2404.17120 @V__Cherepanova @james_y_zou
Well... two problems: (1) SIX best math students in the USA get to compete. (2) If I were an IMO judge, the solution would receive a 3 out of 7. A stricter judge might give a 2. A more generous judge might give a 4, but I would protest anything more than that. Context:…
uh.... gpt2-chatbot just solved an International Math Olympiad (IMO) problem in one-shot the IMO is insanely hard. only the FOUR best math students in the USA get to compete prompt + its thoughts 🧵
Tell me that you're a language model from X corporation without telling me you're a language model from X corporation.
We are at 97 statements now. Help make it 100!
Congrats to everyone who just finished the PhD application cycle! 🎉🥳 If you found our repository of statements at cs-sop.org helpful, please consider sharing yours so that we could help more students in the future!
Congrats to everyone who just finished the PhD application cycle! 🎉🥳 If you found our repository of statements at cs-sop.org helpful, please consider sharing yours so that we could help more students in the future!
Whatever gpt2-chatbot might be, it definitely feels like gpt4.5. It has insane domain knowledge I have never seen before
MAIA (A Multimodal Automated Interpretability Agent) is here! 🧵 📝New paper: arxiv.org/abs/2404.14394 🌐Website: …imodal-interpretability.csail.mit.edu/maia/ Agents like MAIA advance automated interpretation of AI systems from one-shot feature description into an interactive regime where hypotheses…
The paper in question: arxiv.org/abs/2305.09601
One reviewer holistically considered the interplay between our legal and stats contribution which resulted in the perfect score. Two other reviewers scored only our legal contributions. Their main complaint was the legal analysis is thin. Legal analysis is only half the paper!
FAccT has a review system that is no different than ACL or ICML. With a reviewer pool largely consisting of specialists, interdisciplinary papers are at a big disadvantage. We received 3 law reviewers who only felt qualified to judge the legal portion of our work.
Our interdisciplinary law+stats paper got a perfect score 7M/5R yet still got rejected... @FAccTConference cannot claim to be an interdisciplinary conference if it adopts a peer review system which is systematically biased against interdisciplinary work!
Stanford NLP Retreat 2024! @RyanCLouie and I organized a PowerPoint Karaoke 🎤 My favorite part is Chris' answer: Q: What is the first principle component for both babies and undergrads? Chris Manning: HUNGER! @ChrisManning @stanfordnlp
model = learn(data) Synthetic data is great, but it’s not data. It’s an intermediate quantity created by learn(). Data is created by people and has privacy and copyright considerations. Synthetic “data” does not - it’s internal to learn().
a year ago, "what will the humans being doing? (reflections on generative AI futures)" …surprised a year later, we're not further along… still like copilot, still feel i'm not fully leveraging writing assistance or automation… more HCI, please :) jeffreybigham.com/blog/2023/what…
Professor life is off to a great start! Honored to receive a grant from Apple ML Research and to be named a Google Research Scholar. Looking forward to more work developing ML methods for healthcare and equity Pictured: an apple, Google, and me