Yizhong Wang @yizhongwyz
CS PhD student @uwcse @uwnlp. NLP/ML homes.cs.washington.edu/~yizhongw/ Seattle Joined April 2015-
Tweets510
-
Followers3K
-
Following1K
-
Likes4K
Today we're launching PRISM, a new resource to diversify the voices contributing to alignment. We asked 1500 people around the world for their stated preferences over LLM behaviours, then we observed their contextual preferences in 8000 convos with 21 LLMs arxiv.org/abs/2404.16019
From Claude100K to Gemini10M, we are in the era of long context language models. Why and how a language model can utilize information at any input locations within long context? We discover retrieval heads, a special type of attention head responsible for long-context factuality
We have a new preprint out - your language model is not a reward, it’s a Q function! 1. The likelihood of the preferred answer must go down - it’s a policy divergence 2. MCTS guided decoding on language is equivalent to likelihood search on DPO 3. DPO learns credit assignment
Excited to share a preview of Llama3, including the release of an 8B and 70B (82 MMLU, should be the best open weights model!), and preliminary results for a 405B model (still training, but already competitive with GPT4). Lots more still to come... ai.meta.com/blog/meta-llam…
What does it take to get a good MMLU score? Turns out: decent data, instructions in pretraining, fuzzy dedup, and quality filtering. just dropped OLMo 1.7-7b… nice perf lift over 1.0! Blog: blog.allenai.org/olmo-1-7-7b-a-… Model: huggingface.co/allenai/OLMo-1… Data: huggingface.co/allenai/dolma
🚀Multimodal agents is on rise in 2024! But even building app/domain-specific agent env is hard😰. Our real computer OSWorld env allows you to define agent tasks about arbitrary apps on diff. OS w.o crafting new envs. 🧐Benchmarked #VLMs on 369 OSWorld tasks: #GPT4V >> #Claude3
Great work on an important #NLProc community resource led by the indefatigable @paul_rottger
Great work on an important #NLProc community resource led by the indefatigable @paul_rottger
SWE-agent is blazing fast, and when it works it feels like magic! In this short demo I show how it solved a real bug in the neural network training code in scikit-learn. I also explain the process behind our agent-computer interface design choices.
When augmented with retrieval, LMs sometimes overlook retrieved docs and hallucinate 🤖💭 To make LMs trust evidence more and hallucinate less, we introduce Context-Aware Decoding: a decoding algorithm improving LM's focus on input contexts 📖 arxiv.org/pdf/2305.14739… #NAACL2024
yay gpt generated review
Looking to benchmark your web-based AI agent? Consider TurkingBench! See @KevLXu 's post for details. x.com/kevlxu/status/…
Looking to benchmark your web-based AI agent? Consider TurkingBench! See @KevLXu 's post for details. x.com/kevlxu/status/…
Ever wondered how LLMs stack up against human crowdsource workers? I'm thrilled to share "TurkingBench", a benchmark of web-based tasks for multi-modal and interactive AI agents. Draft: arxiv.org/abs/2403.11905 Project: turkingbench.github.io Code: github.com/JHU-CLSP/turki…
Inspired by several innovative Gemini Pro 1.5 demos 🔥, I'm sharing my favorites to highlight their creativity ✨. This thread aims to curate these intriguing examples and encourage people to explore unknown territories 🚀. Text 1. Space: a. Apollo 11 transcript interaction:…
🚀Our new paper on training details, official code, and FAQ of the "The-Era-of-1-bit-LLM" paper is public. github.com/microsoft/unil… 🔥We provide additional experiments and results that were not reported in the original paper. 📢Join in our discussion at huggingface.co/papers/2402.17…
Excited to share something that we've needed since the early open RLHF days: RewardBench, the first benchmark for reward models. 1. We evaluated 30+ of the currently available RMs (w/ DPO too). 2. We created new datasets covering chat, safety, code, math, etc. We learned a lot.…
It is currently PhD visit days at UW. Choosing among schools for a PhD is a tough choice. I wrote a blog post about some ways to think about this choice to make it easier and to find the school that is the best fit for you: timdettmers.com/2022/03/13/how…
Thank you, Brad (@ProfData) and Xiaoliang (@ken_lxlxl), for leading this! 🚀🧠✨ Introducing BrainBench, a forward-looking benchmark for predicting neuroscience results: arxiv.org/abs/2403.03230 -- more information at braingpt.org!
Thank you, Brad (@ProfData) and Xiaoliang (@ken_lxlxl), for leading this! 🚀🧠✨ Introducing BrainBench, a forward-looking benchmark for predicting neuroscience results: arxiv.org/abs/2403.03230 -- more information at braingpt.org! https://t.co/TcxkOOwjhw
Introducing AI2 𝕎𝕚𝕝𝕕𝔹𝕖𝕟𝕔𝕙 ! We aim to benchmark LLMs with challenging tasks from real users in the wild. 🤗 Link: hf.co/spaces/allenai… 🤩 What great features does it offer? 🌟x9 ⬇️ 🌟1. 𝐂𝐡𝐚𝐥𝐥𝐞𝐧𝐠𝐢𝐧𝐠 & 𝐑𝐞𝐚𝐥: We carefully curate a collection of 1024 hard…
Thanks @_akhaliq for sharing our work. We evaluated SOTA VLMs on challenging Raven's Progressive Matrices. We found that VLMs still struggle to reach human-level performance, with perception being the main bottleneck. arxiv.org/abs/2403.04732 github.com/apple/ml-rpm-b…
Thanks @_akhaliq for sharing our work. We evaluated SOTA VLMs on challenging Raven's Progressive Matrices. We found that VLMs still struggle to reach human-level performance, with perception being the main bottleneck. arxiv.org/abs/2403.04732 github.com/apple/ml-rpm-b…
How does a baby learn to navigate the world around them? 🚶♂️👶 Through exploration and learning from each little stumble and triumph. The ETO framework applies this very essence of human learning to AI, emphasizing the importance of both success and failure in developing better AI…
How does a baby learn to navigate the world around them? 🚶♂️👶 Through exploration and learning from each little stumble and triumph. The ETO framework applies this very essence of human learning to AI, emphasizing the importance of both success and failure in developing better AI…
Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Graham Neubig @gneubig
31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.Yao Fu @Francis_YAO_
14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep runningBill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscWenhu Chen @WenhuChen
11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.Weijia Shi @WeijiaShi2
5K Followers 968 Following PhD student @uwcse @uwnlp | Visiting Researcher @MetaAI | Undergrad @CS_UCLA | https://t.co/eLBQmgkvymAna Marasović @anmarasovic
4K Followers 604 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷Ofir Press @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Tim Dettmers @Tim_Dettmers
29K Followers 821 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Swaroop Mishra @Swarooprm7
5K Followers 894 Following Research Scientist @GoogleDeepMind (Gemini). Pioneering LLM Research 🔥. Instruction tuning, Factuality, Reasoning and next gen Product. Opinions my own.Yu Su @ysu_nlp
6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biologicalLuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)Prithviraj (Raj) Amma.. @rajammanabrolu
5K Followers 520 Following Interactive & grounded AI, RL, NLP. Assistant Prof @UCSanDiego. Research Scientist @DbrxMosaicAI. Prev: @allen_ai, @GeorgiaTechAllen Institute for A.. @allen_ai
54K Followers 361 Following AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfLXin Eric Wang @xwang_lk
7K Followers 1K Following Multimodal and Embodied AI Researcher / Professor @UCSC. Director of https://t.co/Y4swOBag21. AI for Humanity in the long run. he/himJungo Kasai 笠井淳.. @jungokasai
2K Followers 386 Following Co-founder & CTO @kotoba_tech: "Towards End-to-End Speech Foundation Models." | PhD from @nlpnoah at @UW | IBM PhD Fellow | 孫正義育英財団生 | @Yale UndergraduateZhenwen Liang @LiangZhenwen
180 Followers 220 Following PhD stundent in NLP, University of Notre Dame. Previous intern at Aristo, AI2 and Tencent AI Lab.jessica🩶 @DS_Jessica_
13K Followers 2K Following analytics lead & angel investor & advisor. always learning = business & innovation. doing #datascienceDaniel Atonge @AtongeDaniel
69 Followers 649 Following Development Lead TechVenia | Passionate Software & Cloud Engineer 👨🏽💻Evanna Si @evanna_si47146
117 Followers 5K FollowingJatin Mehta @j1mehta
179 Followers 266 Following Co-founder@Speedy - effective and affordable content marketing with Generative AI. YCombinator (W23) | UC Irvine | IIT精神病狗婊子杂.. @frkglp
0 Followers 3K Following 神病狗婊子杂种邓小平,刘少奇就是整个世界的敌人,它那套歪把戏不除,世界战乱不断。Cgkl精神病狗婊子杂种习近平被凌迟处死。Cgk凌迟处死精神病狗婊子杂种中共狗屁家族邓小平,习近平,陈云,刘少奇,陈一新,张又侠,何卫东,刘振立,苗华,董军。锸s你跟踪本人的精神病狗婊子杂种全部中共空军、警察、台湾间谍$$$ @sp1d3r_8eyes
58 Followers 398 Following郑晓琼(Audrey Zh.. @Audrey_802
95 Followers 545 Following CEO of Beijing Open Space Technology 🌛 Global Publisher of JieTeng(China) 🌞ETH Zurich+HSG @embaX_swiss 👸 “Wave Rider” Translatorjiawei @sk413025
131 Followers 2K Followingtaptree @mi3fa5sol4mi2
28 Followers 171 Followinganushka @_anushkaagarwal
467 Followers 3K Following Machine learning Engineer @Neuralgarage| Research Intern @Airlab CMU| Nerfs| 3DMMSearsoat @searsoat38387
0 Followers 176 Followingssteevens @Steevens43
159 Followers 5K FollowingZhehao Zhang @Zhehao_Zhang123
98 Followers 390 Following Graduate student at @Dartmouthcs ; Visiting Research Intern @SALT_NLP; Prev. Research Intern @MSFTResearch; Formerly undergrad from @sjtu1896; NLP&ML #NLProccongyin mei @meinanjing2022
72 Followers 820 FollowingYiming Shi @uestcshiym
17 Followers 207 Following Pursuing a PhD in multimodal modeling. Undergrad @UESTC1956 Think and Move forward. e/accPANDA FRANK @PANDAFRANK6
0 Followers 202 FollowingPhillip Lindsay @EastLAPinche
57 Followers 386 Followingwuyonghuang @yonghuangwu
17 Followers 177 FollowingDacheng Li @DachengLi177
621 Followers 476 Following Intelligence. PhD @Berkeley_EECS @lmsysorg @ucbrise @berkeley_ai, Prev. @Google @SCSatCMU.Jordan Fisher @JordanFisherEzr
41 Followers 804 Following CEO of Standard AI. Mathematician, former fed, captivated by how to productize cutting edge researchIsmail Chaida 👨�.. @Ismail_CHAIDA
404 Followers 4K Following Software & Data/Kotlin/Scala Engineer | Views are my ownV Sriram @VSriram23
140 Followers 4K FollowingSamadeep @samadeepviews
104 Followers 1K Following Incoming Software Engineering Intern @GoogleIndia Computer Science UndergradAryan Pandey (Look fo.. @AryanPa66861306
1K Followers 3K Following Half Machine Learning Engineer || DevOps and Machine Learning || Open Source at OpenVINOYiping Wang @ypwang61
132 Followers 433 Following Ph.D. @uwcse. undergraduate @ZJU_China. I'm interested in mathematics, agi, and physics.วิวรรณ @0HY5uTtQ9gqWkx
75 Followers 1K Following ติดตามฉันเพื่อที่คุณจะได้รู้จักฉันมากขึ้น ฉันอัปเดตข้อมูลติดต่อของฉันในหน้าแรก อย่าลืมมาหาฉันRuairi @ruairiSpain
259 Followers 2K FollowingNirupama Ratna (looki.. @ratna_kandala
189 Followers 2K Following Ph.D. student in Linguistics @ IIT Hyderabad BS-MS in Systems Biology #NLP#AI#Neurosciencezhou9 @zhou986570475
12 Followers 62 FollowingMelisa Nunnelley @melisa21081
38 Followers 5K FollowingGregor von Dulong @gregorvondulong
56 Followers 135 Following Weekly Newsletter About ML Research And The Data Economy (https://t.co/EuxsoYIRyX) | ML Engineeralbrt.io 👨💻�.. @albertdbio
132 Followers 1K Following Full stack engineer applying machine learning to ecommerceSandra Ifeanyi @SandraIfea35170
30 Followers 5K FollowingTrevor Loy @trevorloy
17K Followers 2K Following VC investor emerging ecosystems @FlywheelVC. Lecturer entrepreneurship & VC @Stanford. Prev: BoD @NVCA; Mentor @KauffmanFellows; 3x founder; Chip design @Intel.INDRAJEET @indrajeet877
425 Followers 2K Following Head of Math Department,Allen Institute Karaikal BTech NITW 2012, Option trader & investor. Math geek, tech-forward, learner Plus Python & Spanish skills.Aaditya ; @Aaditya26082004
531 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈Julieta Shelkoff @JShelkoff65527
53 Followers 5K FollowingXinyi Wang @XinyiWang98
793 Followers 299 Following UC Santa Barbara CS PhD student working on ML/NLPMOHAMMAD ALRIFAT @alrifat1992
12 Followers 110 FollowingEva Louise Marie Gabr.. @e681554349
9 Followers 3K FollowingAK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistAkari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Yann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Yi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼Andrej Karpathy @karpathy
979K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzGraham Neubig @gneubig
31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.AI at Meta @AIatMeta
532K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.William Wang @WilliamWangNLP
14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Jacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwYao Fu @Francis_YAO_
14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep runningBill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscWenhu Chen @WenhuChen
11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Irene Chen @irenetrampoline
8K Followers 817 Following ML for equitable healthcare. Assistant Professor @UCBerkeley and @UCSF. Prev @Harvard, @MIT, @MSFTResearchHannah Rose Kirk @hannahrosekirk
3K Followers 685 Following AI researcher trying to make sense of all things cyberspace 🤖 Uni of Ox PhD (loading…) @oiioxford & @OxfordAI. Prev @turinginst & @Cambridge_Uni. Visitor @ NYUGuangxuan Xiao @Guangxuan_Xiao
1K Followers 513 Following Ph.D. student at @MITEECS Prev: CS & Finance @Tsinghua_UniMoshe Poliak @MoshePoliak
231 Followers 218 Following PhD student at MIT Brain and Cognitive Sciences. Advisor: Ted Gibson. I study psycholinguistics. immigrant 🏳️🌈 ex-STEM-phobicWeiyan Shi @shi_weiyan
3K Followers 694 Following Postdoc @StanfordNLP, incoming assistant professor @Northeastern, PhD @Columbia| Prev Intern @MetaAI |Co-created CICERO | persuasive chatbots + privacy #nlprocYiping Wang @ypwang61
132 Followers 433 Following Ph.D. @uwcse. undergraduate @ZJU_China. I'm interested in mathematics, agi, and physics.Paul Röttger @paul_rottger
2K Followers 455 Following Postdoc @MilaNLProc, working on evaluating and improving LLM safety. Previously PhD @oiioxford & CTO/co-founder @rewire_onlineAhmad Al-Dahle @Ahmad_Al_Dahle
4K Followers 53 Following #Girldad of twins. Leading GenAI @ Meta (llama, imagine, meta ai and more)Noam Brown @polynoamial
34K Followers 612 Following Researching reasoning @OpenAI | Co-created Libratus/Pluribus, the first superhuman no-limit poker AIs | Co-created CICERO | PhD from @SCSatCMULP Morency @lpmorency
1K Followers 21 Following Associate Professor at CMU studying multimodal and Social AI. Ice hockey goalie.Ofir Nachum @ofirnachum
4K Followers 343 Following Research at @OpenAI. Previously at @GoogleAI on the Brain Team. Doing work on #ReinforcementLearning and #MachineLearningSeungju Han @SeungjuHan3
181 Followers 234 Following Incoming predoctoral researcher + now visiting student researcher @ai2_mosaic @allen_ai working on LLMs. Undergrad @SeoulNatlUniGrant Sanderson @3blue1brown
365K Followers 362 Following Pi creature caretaker. Contact/faq: https://t.co/brZwdQfdifJeff Bezos @JeffBezos
6.4M Followers 355 Following Amazon. Blue Origin. Washington Post. Bezos Earth Fund. Bezos Academy.Arvind Narayanan @random_walker
119K Followers 412 Following Princeton CS prof. Director @PrincetonCITP. I write about the societal impact of AI, tech ethics, & social media platforms. BOOK: AI Snake Oil. Views mine.David @DavidSHolz
54K Followers 5K Following founder @midjourney, prev founder leap motion, nasa, max planckDevi Parikh @deviparikh
23K Followers 151 Following Former Sr. Director, GenAI @Meta. Prof @GeorgiaTech. Generative artist https://t.co/z4n9IRQ3s5. Co-founded Caliper. @CarnegieMellon @RowanUniversity alum.Siddharth Karamcheti @siddkaramcheti
3K Followers 794 Following PhD student @stanfordnlp & @StanfordAILab. I like language, robots, and people. ML/Robotics Intern @ToyotaResearch.Kamal Ndousse @kandouss
2K Followers 496 Following AI @AnthropicAI Social learning enthusiast. Opinions and dumb jokes my own.Iason Gabriel @IasonGabriel
3K Followers 448 Following Philosopher & Research Scientist @GoogleDeepMind | Artificial Intelligence, Alignment & Human Values | All views are my own | he/himKevin Xu @KevLXu
410 Followers 504 Following Incoming NYC SWE Intern @ Citadel | ML @jhuclsp | CS and Applied Math @JohnsHopkins | Prev Intern: @Google | Prev Co-Founder @TunnelHQAviral Kumar @aviral_kumar2
2K Followers 338 Following Research Scientist at Google DeepMind. Incoming Assistant Professor of CS & ML at CMU (Fall 2024). PhD from UC Berkeley.Yilun Du @du_yilun
5K Followers 211 Following PhD student at @MIT_LISLab/@MITCoCoSci, Researcher at @pika_labs, Generative Models, Robot Learning. Interned at @MetaAI, @DeepMind, Research Fellow at @openaiYuhuai (Tony) Wu @Yuhu_ai_
23K Followers 411 Following Co-Founder @xAI. Minerva, STaR, AlphaGeometry, AlphaStar, Autoformalization, Memorizing transformer.Manuel Kroiss @makro_ai
14K Followers 60 FollowingIgor Babuschkin @ibab
44K Followers 685 Following Maybe the real AGI was the friends we made along the way. @xAIAccenture @Accenture
543K Followers 2K Following Together, we deliver on the promise of technology and human ingenuity. Let there be change.OpenAI Developers @OpenAIDevs
71K Followers 0 Following Official @OpenAI account for anyone building on our APIs. Join us in building the future of AI. We ❤️ developers!Jonathan Frankle @jefrankle
16K Followers 685 Following Chief Scientist, Neural Networks @Databricks via MosaicML. PhD @MIT_CSAIL. BS/MS @PrincetonCS. DC area native. Making AI efficient for everyone at @DbrxMosaicAIDatabricks Mosaic Res.. @DbrxMosaicAI
30K Followers 115 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.lil (library innovati.. @HarvardLIL
4K Followers 682 Following Just a crowd of coders, lawyers, librarians, designers, & tinkerers typing away in the basement of @HLSlib. Home of @caselawaccess @permacc @opencasebook etc.Akshita Bhagia @AkshitaB93
210 Followers 92 Following Research Engineer at AI2, compulsive reader, random-things writer.Amanda Askell @AmandaAskell
26K Followers 653 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.Yijia Shao @EchoShao8899
2K Followers 281 Following CS Ph.D. student @StanfordNLP. Previous: undergraduate @PKU1898.Nika Haghtalab @nhaghtal
3K Followers 236 Following Assistant Professor @Berkeley_EECS (ML+Econ+Algorithms). Foundations of Machine Learning, by the people, for the people! Co-founder @let4all.Jieyu Zhang @JieyuZhang20
377 Followers 548 Following PhD student @uwcse | Undergrad @IllinoisCS | Intern @MSFTResearch | Data-centric AI/MLCollective Intelligen.. @collect_intel
3K Followers 50 Following collective intelligence for collective progress.Karina Nguyen @karinanguyen_
12K Followers 649 Following AI research & eng @AnthropicAI, prev. intern @nytimes, @square, @dropboxJack Clark @jackclarkSF
68K Followers 5K Following @AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkaTu Past: @openai, @business @theregister. Neural nets, distributed systems, weird futuresFuzhao Xue @XueFz
4K Followers 542 Following Ph.D. candidate@NUSingapore, Intern of GEAR @NVIDIA | Google PhD Fellow | LLM, Foundation Model Scaling | Ex-@GoogleBrain | Zero-shot Cooking Learner🧑🍳Yifang Chen @cloudwaysX
455 Followers 641 Following Ph.D. student @uwcse. Previously @usc undergrad. Online Learning, reinforcement learning, bandits, and active learning.Yuling Gu @gu_yuling
391 Followers 666 Following Predoctoral researcher @allen_ai | @nyuniversity ➡️ @UW ➡️ @allen_ai @[email protected]Fatemeh Ghezloo @fghezloo
88 Followers 366 Following PhD candidate at University of Washington @UW @uwcse Using Computer Vision and Machine Learning on Medical data.Tianyu Liu @rogerliuty
70 Followers 456 Following LLM @AlibabaGroup Qwen Team. Past: Researcher @TencentGlobal HunyuanAide (LLM) Team | Intern/Visitor @MSFTResearch and @TTIC_connect | NLP PhD @PKU1898.Exciting news! 📢 In collaboration with Hugging Face 🤗, we are launching the Medical-LLM Leaderboard. This leaderboard aims to provide a standardized platform for evaluating large language models in the medical domain. It's encouraging to see tech companies like Google,…
New: Open Medical LLM Leaderboard! 🩺 In basic chatbots, errors are annoyances. In medical LLMs, errors can have life-threatening consequences 🩸 It's therefore vital to benchmark/follow advances in medical LLMs before thinking about deployment. Blog: huggingface.co/blog/leaderboa…
ChatGPT will now start to remember across threads! An important step towards increasing the usefulness of these models.
Memory is now available to all ChatGPT Plus users. Using Memory is easy: just start a new chat and tell ChatGPT anything you’d like it to remember. Memory can be turned on or off in settings and is not currently available in Europe or Korea. Team, Enterprise, and GPTs to come.
model = learn(data) Synthetic data is great, but it’s not data. It’s an intermediate quantity created by learn(). Data is created by people and has privacy and copyright considerations. Synthetic “data” does not - it’s internal to learn().
Talk: "OLMo: Findings of Training an Open LM" from Hanna Hajirshizi at AI2 from OSGAI. Extremely interesting overview of the 4 parts (Data, Training, Adaptation, Eval) of the OLMo open LLM project. Rare insight into how these processes work at scale. youtube.com/watch?v=qFZbu2…
@mirwox Thanks for sharing! I’ve been able to skim/read many more papers following a similar workflow.
Early slide figure motivating my talk tomorrow: starting in 2023 more people googled "Generative AI" than "learn Spanish"
The 2024 US Robotics Roadmap is out. Check out the perspective and recommendations at hichristensen.com/pdf/roadmap-20…
Professor life is off to a great start! Honored to receive a grant from Apple ML Research and to be named a Google Research Scholar. Looking forward to more work developing ML methods for healthcare and equity Pictured: an apple, Google, and me
So, this happened... $400M for an academic AI data center approved the NY legislature -- that's $400M for GPUs that go brrrrr... governor.ny.gov/news/governor-…
Earnest question: why don’t top AI labs share their safety tools? Seems like it would be pretty aligned to their mission?
i asked GPQA's example quantum mechanics question to my friend who is an expert in quantum and they told me: "all of these answers are incorrect" - it's google proof only because it's word salad!
Will your paper catch the eye of @_akhaliq? I built a demo that predicts if AK will select a paper. It has 50% F1 using DeBERTa finetuned on data from past year. As a test, our upcoming WildChat arXiv has a 56% chance. Hopefully not a false positive🤞 🔗huggingface.co/spaces/yuntian…
HELM Lite v1.2.0 is out! Datasets: NarrativeQA, NaturalQA, OpenbookQA, MMLU, MATH, GSM8K, LegalBench, MedQA, WMT14 Results (we still need to add Claude 3, which requires more prompt finagling): crfm.stanford.edu/helm/lite/v1.2…
We're thrilled to have @nlpnoah join us tomorrow for our final IC Distinguished Lecture of the spring semester! Come hear Noah make the argument for open language models from 11-12 at the TSRB 1st floor auditorium. You don't want to miss this talk! b.gatech.edu/4deAjWf
This has been a mammoth project BUT being in goblin mode 🧌 for the past couple months has been greatly improved by the company of great co-authors: @computermacgyve @bertievidgen @awhitefield8 @paul_rottger @max_nlp @katemargatina @adinamwilliams @hhexiy + Andrew, Rafael, Juan!
Closing thoughts: Alignment is tricky not just because of technical reasons or statistical choices but also for messy, normative and data-centric human factors. Let's dig into these, not shy away from them by always simulating human participants 🫡
Finally and most importantly: retrieval heads are causal: masking out retrieval heads, the model losses the needle; masking out random heads, the model's needle-in-a-haystack is not influenced. This explains which specific part of the attention is responsible for needle
We hope this work foster future research on reducing hallucination, improving reasoning, and compressing the KV cache! Paper: arxiv.org/abs/2404.15574 Code: github.com/nightdessert/R… Joint work with @yizhongwyz @Guangxuan_Xiao @haopeng_nlp