Nishant Subramani @nsubramani23
PhD student at @LTIatCMU // Prev: Predoctoral Researcher at @allen_ai in #NLProc // @BVB supporter // he/him nishantsubramani.github.io Pittsburgh, PA Joined January 2012-
Tweets736
-
Followers580
-
Following2K
-
Likes6K
Professor Ruha Benjamin (@ruha9) using her honorary degree address at Spelman to denounce Atlanta’s Cop City, the ongoing genocide of Palestinians, and the repression of student activists 💜
Excited to be TAing this! @gneubig and I will be giving a lecture on language model debugging and interpretability with a small primer on steering vectors and mechanistic interpretability so stay tuned!
Excited to be TAing this! @gneubig and I will be giving a lecture on language model debugging and interpretability with a small primer on steering vectors and mechanistic interpretability so stay tuned!
Now thesis is in — I’m on the job market! Looking for RS roles for product and/or research. My PhD focused on cross-lingual transfer + structure prediction but also interested in analysis of optimisation, training dynamics and evaluating generation 🧵
excited to be a part of this amazing OLMo🍇 team building 🛠️and releasing all the research artifacts to advance the study of LLMs! Big shoutout to @soldni @kylelostat for leading our data team 📜📚
excited to be a part of this amazing OLMo🍇 team building 🛠️and releasing all the research artifacts to advance the study of LLMs! Big shoutout to @soldni @kylelostat for leading our data team 📜📚
Can't recommend this enough! I had a great past two years at @ai2_allennlp as a predoctoral researcher. Happy to answer any questions folks may have and I'm around at #EMNLP2023 this week if people want to learn more!
Can't recommend this enough! I had a great past two years at @ai2_allennlp as a predoctoral researcher. Happy to answer any questions folks may have and I'm around at #EMNLP2023 this week if people want to learn more!
SCIENTISTS WHO WANT TO STAND FOR PALESTINE: Join us Sat December 9 for a convening to discuss taking steps to support our colleagues in Palestine. Details and registration here: breakthroughindia.org/international-… #FreeGazaNow #CeasefireNOW
The official CMU press release cs.cmu.edu/news/2023/diab…
We're releasing Dolma, an open 3T+ token dataset that includes research papers, web, code, wiki, and other data sources 📜 Hopefully this encourages more research groups to document datasets and help accelerate research on studying LLMs 🎉
We're releasing Dolma, an open 3T+ token dataset that includes research papers, web, code, wiki, and other data sources 📜 Hopefully this encourages more research groups to document datasets and help accelerate research on studying LLMs 🎉
🏆🎉 Hats off to @jennytliang, PhD student in @SCSatCMU, for leading teams that earned TWO prestigious awards for their breakthrough work! #SoftwareEngineering #NLP s3d.cmu.edu/news/2023/0811…
A bit of a last minute decision, but I'm gonna be at #ICML2023 next week! Would love to meet mutuals & make new friends... if you are doing data-centric NLP work, including LLM data & tooling, let's chat 😊
No spoilers, but this is the cleverest football advert I've ever seen.
The deadline for Spring 2024 Research Internships at AllenNLP is July 15th, in two weeks. If you think 2024 is a great time to do NLP research with top mentors, apply at boards.greenhouse.io/thealleninstit…!
Absolutely floored that @QueerinAI paper was selected for Best Paper award at @FAccTConference !!! It is such a honor, congrats to all my coauthors #FAccT2023
Excited we're hopping into the LLM game! Happy to be a part of the OLMo team!! 🎉🎉
Excited we're hopping into the LLM game! Happy to be a part of the OLMo team!! 🎉🎉
Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Luca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingAna Marasović @anmarasovic
4K Followers 604 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Prithviraj (Raj) Amma.. @rajammanabrolu
5K Followers 520 Following Interactive & grounded AI, RL, NLP. Assistant Prof @UCSanDiego. Research Scientist @DbrxMosaicAI. Prev: @allen_ai, @GeorgiaTechDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Matthew Finlayson @mattf1n
797 Followers 868 Following First year PhD at @nlp_usc | Former predoc at @allen_ai on @ai2_aristo | Harvard 2021 CS & LinguisticsSwaroop Mishra @Swarooprm7
5K Followers 894 Following Research Scientist @GoogleDeepMind (Gemini). Pioneering LLM Research 🔥. Instruction tuning, Factuality, Reasoning and next gen Product. Opinions my own.Sebastian Gehrmann @sebgehr
5K Followers 2K Following Head of NLP, CTO office, @Bloomberg. (he/him) Generating natural language, one word at a time. Also making sense of that language afterwards. views my ownQinyuan Ye @qinyuan_ye
2K Followers 1K Following 👩💻 Ph.D. student @nlp_usc @CSatUSC @USC_ISI | 🐾 Teaching machines to be more versatile and curious.Machel Reid @machelreid
2K Followers 1K Following Research Scientist @GoogleDeepMind Working on LLMs on the Gemini Team; did gemini 1.5 proOfir Press 🖋 @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Lucy Li @lucy3_li
4K Followers 2K Following @UCBerkeley PhD student + @allen_ai. Human-centered #NLProc, computational social science, AI fairness. she/her. https://t.co/rtSSUhWQnLSara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Alexis Ross @alexisjross
3K Followers 887 Following phd-ing @MIT_CSAIL, interested in NLP for education | formerly nlp @allen_ai, comp sci & philosophy @harvard ‘20Weijia Shi @WeijiaShi2
5K Followers 968 Following PhD student @uwcse @uwnlp | Visiting Researcher @MetaAI | Undergrad @CS_UCLA | https://t.co/eLBQmgkvymDean Clark @DeanCla88922559
144 Followers 1K Following Disabled part-time student, registered for artificial kidney trials.Zaid Sheikh @zdshkh11
29 Followers 103 Following Senior Research Programmer at Carnegie Mellon UniversityTewsano @TewsanoDEMLcx8
0 Followers 72 FollowingNicholas Lourie @NickLourie
140 Followers 287 Following I build things. 🤖 Doing a PhD at @nyuniversity (@CILVRatNYU) on better empirical methods for deep learning and data science. Advised by @kchonyc and @hhexiy.Kabir @kabirahuja004
449 Followers 415 Following CSE PhD Student @uwnlp | Ex-RF @MSFTResearch | cinephile 🎥Tatochor @tatochor24798
0 Followers 176 FollowingPete @epwalsh
51 Followers 88 Following Research Engineer at @allen_ai. Lead engineer for OLMo pretraining.Imperial NLP @imperial_nlp
66 Followers 336 Following We are the Natural Language Processing community here at Imperial College London. Looking forward to sharing more of our work over the coming months! #NLProcChaitanya Malaviya @cmalaviya11
99 Followers 121 Following PhD student at UPenn | currently @GoogleDeepMindHenay @Henay349452
105 Followers 3K FollowingJoe Stacey @_joestacey_
569 Followers 1K Following PhD student at Imperial and Apple Scholar. I love running, NLP and travelling (in no particular order). Ex teacher and PwC Consultant. #NLProcAlexander Wan @alexwan55
475 Followers 944 Following CS at Berkeley; @BerkeleyML @BerkeleyNLP; NLP researchDavid Hall @dlwh
2K Followers 1K Following Research Engineering Lead at @StanfordCRFM . Previously co-founder at Semantic Machines ⟶ MSFT. Lead developer of Levanter, Breeze. he/him @[email protected]Lintang Sutawika @lintangsutawika
381 Followers 565 Following Incoming Ph.D. student @LTIatCMU. Researcher at @AIEleuther. Maintainer of LM-Eval Harness. Here for machine learning papers and discussion.Jushaan Kalra @JushaanSingh
159 Followers 1K Following alleged prompt engineer | ML @WadhwaniAI | Prev. @amazon, @dtu_delhi, MSBKKwanghee Choi @juice500ml
103 Followers 88 Following Master's student @LTIatCMU, studying speech AI at @shinjiw_at_cmu's @WavLabsamir gadre @sy_gadre
440 Followers 488 Following phd @columbia | formerly intern @allen_ai x2, ugrad @brownuniversity | pre-training | Black Lives Matter | he/himPrateek Yadav @prateeky2806
2K Followers 2K Following Ph.D. at @unccs Continual Model Adaptation and Composition Previously @MSFTResearch, @AmazonScience, @iitmadras. UG @iiscbangalore. Opinions are my own.JohnSnowLabs @JohnSnowLabs
41K Followers 30K Following Helping healthcare and life science organizations put AI to work faster with state-of-the-art LLM & NLP.Victoria_Johns @VictoriaJo48031
23 Followers 2K FollowingAmirhossein Abaskohi @AmirAbaskohi
145 Followers 871 Following Master Student @UBC_CS | NLP Researcher @UBC_NLP | Content Creator @YouTube and @Medium #NLProc #MachineLearningJohnny Tian-Zheng Wei @johntzwei
320 Followers 530 Following PhD student at USC. I'm interested in the legal issues of AI.Allan Zhou @AllanZhou17
1K Followers 447 Following Final-year AI PhD student @Stanford. NN architecture design, learned optimizers, and hparam optimization.Smete @Smete389512
109 Followers 4K FollowingBase_Hit_Belle_ @BelleHit7437
10 Followers 1K FollowingKapilDev Neupane @KapildevNeupane
4 Followers 57 FollowingShangbin Feng @shangbinfeng
1K Followers 1K Following PhD student @uwcse @uwnlp. Understanding and expanding the knowledge abilities of LMs, social NLP, networks and structures. he/him. #水文学家Pei Zhou @peizNLP
2K Followers 887 Following PhD @nlp_usc | Ex-@GoogleDeepMind, @GoogleAI, @allen_ai @AmazonScience @UCLA | Common Ground Reasoning for Communicative Agents | he/himSowmya S Sundaram @_sowmyasundaram
24 Followers 215 Following Postdoctoral Researcher @stanford; Previously at @l3s_luh, Germany. PhD from @iitmadras, India | Bringing AI to applicationsRoss @ma1547372858
15 Followers 1K FollowingNari Johnson @narijohnson
447 Followers 534 Following PhD student @mldCMU @scsatcmu. ML + HCI. she/her. currently 💭 AI evaluation & accountabilityVaibhav Raj @vrcoder045
38 Followers 1K Following Comp. Sci. Senior at IIT Bombay, upcoming SWE, ML enthusiastAl Mamun @al_mamun_sardar
276 Followers 3K Following Looking for NLP/LLM related PhD positions | Research Assistant (NLP) at Jahangirnagar University | MSc (CS)Shivam Rai @imsr282
326 Followers 5K Following Tech enthusiast 🚀 | Embarking on a journey through Machine Learning & Data Science 🤖📊 | Curious mind, coding heart ❤️ | Exploring the data-driven frontier 🌐Peter Hase @peterbhase
2K Followers 691 Following Google PhD Fellow at @uncnlp. Interested in interpretable ML, natural language processing, AI Safety, and Effective Altruism.Md. Shariful islam @SharifulPrince1
0 Followers 138 FollowingMaitrey Mehta @my_tray
196 Followers 384 Following PhD student at Utah NLP| #NLProc | Low-resource NLP |Ankur Parikh @ank_parikh
3K Followers 3K Following Staff Research Scientist at Google DeepMind. Former adjunct assistant prof at @NYU_Courant. PhD at @mldcmu. ML for Bio/Chem (Prev. NLP). All opinions my own.Tashe @Tashe1542616
151 Followers 2K FollowingMcSlayle @MSlayle31473
188 Followers 2K FollowingAkari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Luca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)Ana Marasović @anmarasovic
4K Followers 604 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷Sam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Yann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Allen Institute for A.. @allen_ai
54K Followers 361 Following AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfLPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistPrithviraj (Raj) Amma.. @rajammanabrolu
5K Followers 520 Following Interactive & grounded AI, RL, NLP. Assistant Prof @UCSanDiego. Research Scientist @DbrxMosaicAI. Prev: @allen_ai, @GeorgiaTechWilliam Wang @WilliamWangNLP
14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Naomi Saphra @nsaphra
7K Followers 1K Following Waiting on a robot body. ML/NLP. All opinions are universal and held by both employers and family. Same username on every lifeboat off this sinking ship.Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Shruti Rijhwani @shrutirij
4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball playerDanish Pruthi @danish037
7K Followers 628 Following Faculty at Indian Institute of Science, Bangalore. PhD from @LTIatCMU.Matthew Finlayson @mattf1n
797 Followers 868 Following First year PhD at @nlp_usc | Former predoc at @allen_ai on @ai2_aristo | Harvard 2021 CS & LinguisticsSwaroop Mishra @Swarooprm7
5K Followers 894 Following Research Scientist @GoogleDeepMind (Gemini). Pioneering LLM Research 🔥. Instruction tuning, Factuality, Reasoning and next gen Product. Opinions my own.Yoav Artzi @yoavartzi
13K Followers 162 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCZaid Sheikh @zdshkh11
29 Followers 103 Following Senior Research Programmer at Carnegie Mellon UniversityKaitlyn Zhou @KaitlynZhou
463 Followers 315 Following Currently @allen_ai @ai2_mosaic PhD student @StanfordNLPSarah Schwettmann @cogconfluence
2K Followers 950 Following Research Scientist @MIT_CSAIL PhD @MITBrainAndCog, @BKCHarvard affiliate, teaching @MITMuseum StudioKabir @kabirahuja004
449 Followers 415 Following CSE PhD Student @uwnlp | Ex-RF @MSFTResearch | cinephile 🎥Yi Ma @YiMaTweets
71K Followers 123 Following Chair Professor in AI, Director of IDS, Head of CS, HKU; Professor of EECS, Berkeley; Author of Book: High-Dim Data Analysis, https://t.co/gwaqMJp8av.Senthooran Rajamanoha.. @sen_r
100 Followers 43 FollowingJonathan Whitaker @johnowhitaker
7K Followers 956 Following Data scientist and AI researcher. R&D at https://t.co/9xrxRrGfEE.Answer.AI @answerdotai
1K Followers 81 Following A new kind of AI R&D lab which creates practical end-user products based on foundational research breakthroughsOllie Liu @olliezliu
149 Followers 534 Following 👨🍳 research @MSFTResearch; phd student in ml/nlp @CSatUSC. 🎓 alum @mldcmu. 🧐 multi-modal foundation models, decision making, ai4science.Sedrick Keh @sedrickkeh2
159 Followers 185 Following research engineer @ToyotaResearch prev: machine learning @mldcmu interested in natural language generation and evaluationPengfei Liu @stefan_fee
2K Followers 616 Following Associate Prof. at SJTU, leading GAIR Lab (https://t.co/Nfd8KmZx3B) Co-founder of Inspired Cognition, Postdoc at @LTIatCMU, Previously FNLP, @MILAMontreal,Xuezhe Ma (Max) @MaxMa1987
1K Followers 350 Following Research Lead @USC_ISI and Research Assistant Professor @CSatUSC PhD at CMU ML/NLP @LTIatCMU @CarnegieMellonJovan Buha @jovanbuha
70K Followers 2K Following Senior NBA reporter covering the Lakers @TheAthletic. Buha's Block video podcast. Serbian & Puerto Rican. YT/TikTok/IG: @jovanbuha. Pronounced: Yo-von Boo-ha.Steph Milani @steph_milani
1K Followers 225 Following PhD Student at @mldcmu. Previously @UMBC @CMU_Robotics @MFSTResearch. Interested in human-centered reinforcement learning.David Chanin @chanindav
43 Followers 162 FollowingKai Zhang @DrogoKhal4
1K Followers 641 Following PhD student @osunlp. Ex @MSFTResearch and @GoogleDeepMind.Pete @epwalsh
51 Followers 88 Following Research Engineer at @allen_ai. Lead engineer for OLMo pretraining.#StopCopCity 🇵🇸.. @micahinATL
40K Followers 2K Following writer, law student. "subject is a support of defend the forest according to his twitter"Chaitanya Malaviya @cmalaviya11
99 Followers 121 Following PhD student at UPenn | currently @GoogleDeepMindArnab Sen Sharma @arnab_api
153 Followers 83 Following Ph.D. student @KhouryCollege, working to make LLMs interpretableDongwei Jiang @Dongwei__Jiang
134 Followers 249 Following Spent six years working in industry as a speech researcher, currently I'm shifting my focus to LLM and studying at @JohnsHopkins as a master's studentArvind Satyanarayan @arvindsatya1
6K Followers 2K Following Assistant Professor @MIT_CSAIL @mitvis. Data visualization @vega_vis, ML interpretability, cognitively convivial interaction. He/him. @[email protected].Canyu Chen @CanyuChen3
842 Followers 2K Following CS Ph.D. student @illinoistech | Truthful, Safe and Responsible LLMs | LLMs Meet Misinformation: https://t.co/up5sEN5r1gJiuding Sun @SunJiuding
53 Followers 49 Following Undergrad student @khourycollege | previously working at THU-KEG | NLP, ML, currently working on Instruction-following LLMs and their interpretabilityUBC NLP Group @UBC_NLP
373 Followers 50 Following NLP Group at the University of British Columbia Profs. @careninigiusepp, Raymond Ng, @VeredShwartzthamar | @thamar_solorio
2K Followers 675 Following NLP Prof @MBZUAI, & @UH, Director @RiTUAL_Lab. Friend, mother, partner, loves sunny days and live music. EiC @reviewAcl and ARR board. Views are my own.Sandro Pezzelle @sandropezzelle
805 Followers 668 Following Assistant Professor at the University of Amsterdam. #NLProc #AI #CogSci #interpretabilityMichael Hanna @michaelwhanna
264 Followers 310 Following PhD student at the University of Amsterdam / ILLC, interested in computational linguistics and (mechanistic) interpretabilityAlexander Wan @alexwan55
475 Followers 944 Following CS at Berkeley; @BerkeleyML @BerkeleyNLP; NLP researchDavid Hall @dlwh
2K Followers 1K Following Research Engineering Lead at @StanfordCRFM . Previously co-founder at Semantic Machines ⟶ MSFT. Lead developer of Levanter, Breeze. he/him @[email protected]Lintang Sutawika @lintangsutawika
381 Followers 565 Following Incoming Ph.D. student @LTIatCMU. Researcher at @AIEleuther. Maintainer of LM-Eval Harness. Here for machine learning papers and discussion.Mina Lee @MinaLee__
3K Followers 453 Following Postdoc at @MSFTResearch | Assistant Professor at @UChicagoCS (2024) | PhD at @Stanford | Language models, AI-assisted writing, Human-AI interaction ✍️Kwanghee Choi @juice500ml
103 Followers 88 Following Master's student @LTIatCMU, studying speech AI at @shinjiw_at_cmu's @WavLabMarius Mosbach @mariusmosbach
714 Followers 877 Following Postdoc @Mila_Quebec & @mcgillu | NLP researcherMind the Game @mindthegamepod
62K Followers 4 Following Don’t just play the game. Mind it. A podcast hosted by @kingjames and @jj_redick. Eps drop Wednesday. Brought to you by @ThreeFourTwopro and @uninterrupted.samir gadre @sy_gadre
440 Followers 488 Following phd @columbia | formerly intern @allen_ai x2, ugrad @brownuniversity | pre-training | Black Lives Matter | he/himRose @rose_e_wang
2K Followers 238 Following NLP & Education @stanfordnlp 🌲 Prev: 2020 MIT 🦫, Google Brain 🧠, Google Brain Robotics 🤖Xinya Du @Xinya16
813 Followers 434 Following Assistant Professor of CS, at UT Dallas; Cornell CS PhD. #NLProc #DLOkay, I'm gonna stop doing things for other people and code now and no one can stop me.
We're having a big event on agents at CMU on May 2-3 (one week from now), all are welcome! cmu-agent-workshop.github.io It will feature: * Invited talks from @alsuhr @ysu_nlp @xinyun_chen_ @MaartenSap and @chris_j_paxton * Posters of cutting edge research * Seminars and hackathons
On May 2-3, we're going to have a big event in Pittsburgh about LLM Agents. We have invited talks from great speakers inside and outside CMU, student research presentations and posters, tutorials and discussions! Come join us at CMU campus, and register at cmu-agent-workshop.github.io
When you are on an academic visit in the US and while walking in the street you find a protest supporting Palestine. What shall I do? Of course, join it 😊 From all over the world, free free Palestine 🇵🇸 #FreePalestine
i asked SARAH, the World Health Organization's new AI chatbot, for medical help near me, and it provided an entirely fabricated list of clinics/hospitals in SF. fake addresses, fake phone numbers. check out @jessicanix_'s take on SARAH here: bloomberg.com/news/articles/… via @business
Come to the US they said, its a free and democratic country they said... you will have free speech rights they said... students are not going to be arrested because of their ideas, they said.... #ColumbiaUniversity
Holy shit. Google fired 28 workers for protesting against its $1.2 billion contract with the IDF. Solidarity with these workers of conscience. medium.com/@notechforapar…
[p1] 🐕Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward🐕 Paper link: arxiv.org/pdf/2404.01258… page: github.com/RifleZhang/LLa… How to effectively train video large multimodal Model (LMM) alignment with preference modeling?
What challenges do developers face while using AI programming assistants like GitHub Copilot? 🤖🤔 Check out my #ICSE2024 paper (w/ @cyyang3_u and @bradamyers)! I'm presenting this work today at 4:15PM in the Fernando Pessoa room. See you there 🤗 arxiv.org/pdf/2303.17125…
Great release by @lintangsutawika, @arankomatsuzaki , and @colinraffel! Finally a fully-reproducible T5 model:
🚀 Introducing Pile-T5! 🔗 We (EleutherAI) are thrilled to open-source our latest T5 model trained on 2T tokens from the Pile using the Llama tokenizer. ✨ Featuring intermediate checkpoints and a significant boost in benchmark performance. Work done by @lintangsutawika, me…
Professor Ruha Benjamin (@ruha9) using her honorary degree address at Spelman to denounce Atlanta’s Cop City, the ongoing genocide of Palestinians, and the repression of student activists 💜
Defended my thesis yesterday :) Its been a fantastic ride at @columbianlp and I am grateful to my advisor @SmaraMuresanNLP for believing in my work. Special thanks to @VioletNPeng who introduced me to Creative NLG which made a lot of the work in my thesis possible
How does Mamba store knowledge? Is it very different from transformers? New pre-print with @diatkinson and @davidbau, where we investigate the mechanisms of factual recall within Mamba.
Pretraining data remains the most opaque part of the LLM stew. We have little sense of what companies are doing, but Dolma (arxiv.org/abs/2402.00159) provides a great look into the open LLM data process. Luca Soldaini - Curating Pretrain Data (AI2 / Dolma) youtube.com/watch?v=W73Sp7…
Version II of the tutorial on neural theorem proving: github.com/cmu-l3/ntptuto… Some new additions - Train a model that gets 29.5% on miniF2F - Data extraction in Lean, based on lean-training-data - LLMLean tool (github.com/cmu-l3/llmlean)
A tutorial on neural theorem proving: github.com/wellecks/ntptu… Interactive notebooks for learning about combining neural language models with formal proof assistants. Part I) Build and evaluate a next-step suggestion tool Part II) LLM cascades and Draft, Sketch, Prove
Over 2 months since this was noticed & it seems @aclmeeting execs find it optional, since 6 papers of the 10 removed the section, while 4 insisted to keep it. It even propagated to @NeurIPSConf with 2 papers used similar wording! Just remember who is setting the standards! 🧵
This came to my notice lately! I didn’t know that the “Ethics” section in papers could be used this way! If so, many papers can state the “heinous” genocide of over 25,000 Palestinians, majority children & women! @aclmeeting @emnlpmeeting What do you think about this? 🧵
Not an exciting LLM post but a post for humanity. My colleague, Alaa, is working hard to get the remaining members of her family out of Gaza. Please support her! gofundme.com/f/help-evacuat…
A few thoughts on joining ARR, and what I'd like to try to get done: hackingsemantics.xyz/2024/joining-a…
Anna Rogers joins the ARR as a new Editor-in-Chief! aclrollingreview.org/new-eic/