Ana Marasović @anmarasovic
Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷 anamarasovic.com Salt Lake City Joined April 2014-
Tweets2K
-
Followers4K
-
Following602
-
Likes9K
Adversarial Triggers For LLMs Are 𝗡𝗢𝗧 𝗨𝗻𝗶𝘃𝗲𝗿𝘀𝗮𝗹!😲 It is believed that adversarial triggers that jailbreak a model transfer universally to other models. But we show triggers don't reliably transfer, especially to RLHF/DPO models. Paper: arxiv.org/abs/2404.16020
Want to train an aligned LM in a new language 🌏 but don’t have preference data for training the reward model (RM)? 💡 Just use a RM for another language: it often works well, sometimes even BETTER than if you had a RM in your target language! 🤯 arxiv.org/abs/2404.12318
It's so frustrating when your prompt for revising text stops working with model "upgrades" 😭
Excited to share Penzai, a JAX research toolkit from @GoogleDeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…
.@ReviewAcl How will this be handled: > Has published at least three papers in our *CL conferences2 in the last 5 years (note exceptions below). When 30% of submissions need emergency reviews and S(AC)s assign more freely as it's hard to find folks willing to review?
psa 🔔 dolma license now ODC-BY to match c4 and s2orc
@Stone_Tao For the future PhD applicants reading this tweet, if you're willing to look beyond ultra-competitive top 10 programs, there are literally thousands of fantastic faculty doing amazing work who would love to consider your application, and don't require 5 top-ranked papers out of…
All these apis is like having too many streaming services 😵💫
None of the students I've taken into my group had any of the requirements stated in the reddit post. I suppose many other advisors who are not at stanford, berkeley, cmu, uw, etc cannot recruit such students either. So, if you are open to going to other schools, don't despair!
None of the students I've taken into my group had any of the requirements stated in the reddit post. I suppose many other advisors who are not at stanford, berkeley, cmu, uw, etc cannot recruit such students either. So, if you are open to going to other schools, don't despair!
Gradient descent in numerical analysis 🤡
A feature I'd love to have: feed with folks I follow but only their original tweets/QTs, no retweets shown
Whoever is coaching prospective students to verbalize their whole CV in an email should learn how professors handle emails
Is there a paper describing how command r+ was trained?
@seb_ruder yes, we've been publishing all the vote data in this notebook, where you can find language breakdown, and at the bottom english-only, non-english leaderboards etc. colab.research.google.com/drive/1KdwokPj…
New open source implementation of EK-FAC influence functions (including for language models) by @juhan_bae. github.com/pomonam/kronfl…
I think I conditioned myself to listen either james blake or kid cudi when flying
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingSam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Tal Linzen @tallinzen
16K Followers 893 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAIAkari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Luca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (makin Dolma 🍇), open source science fan, @QueerInAI organizer 🤖☕️🍕they/themKayo Yin @kayo_yin
8K Followers 556 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵Danish Pruthi @danish037
7K Followers 628 Following Faculty at Indian Institute of Science, Bangalore. PhD from @LTIatCMU.Jacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Allen Institute for A.. @allen_ai
53K Followers 361 Following AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfLNathan Schneider @complingy
4K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.socialShruti Rijhwani @shrutirij
4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball playerSara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Naomi Saphra @nsaphra
7K Followers 1K Following Waiting on a robot body. ML/NLP. All opinions are universal and held by both employers and family. Same username on every lifeboat off this sinking ship.Shaily @shaily99
5K Followers 2K Following PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on ethics and evaluation in #NLProc. Usually ranting, often about research & DEI. 📚 @readsndrantsTim Dettmers @Tim_Dettmers
29K Followers 820 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.Crazy Universe @Crazy_Universe0
96 Followers 1K Followingmetavalent stigmergy @metavalent
444 Followers 4K Following The process by which novel insights, intuitions, understandings, ideas, or concepts originate, germinate, blossom, propagate, and instantiate DCNs and DCNRs.Nicholas Meade @ncmeade
127 Followers 143 Following PhD student at @McGillU / @Mila_Quebec; Interested in #NLProc.Arkil @arkil_patel
757 Followers 828 Following PhD Student at Mila (@Mila_Quebec) and McGill (@mcgillu) | Research in ML/NLP | Prev @allen_ai @MSFTResearch | alum @bitspilaniindiaWilliam Li @Williamiumli
20 Followers 139 Following Incoming Ph.D. student @UCSanDiego, M.S.E. in CS @JohnsHopkins, B.S. in CS at SCUTWen Lai @Lavine_Lai
171 Followers 338 Following Phd student @CisLmu working on natural language processing #nlproc and machine translation | Interning at @Bosch_AISimon Dobnik @SimonDobnik
119 Followers 287 Following Professor at University of Gothenburg, Sweden. NLP researcher and lecturer.Joel Chen @joel_chen_
170 Followers 2K Following NLP MLE, interested in nn/dl4nlp, of course, and the LLM.Christian Moya Calder.. @chrismoya86
2 Followers 432 FollowingMark R. Hinkle @mrhinkle
7K Followers 5K Following I help enterprises understand and use artificial intelligence. Leveraging my 25 years of enterprise software experience in emerging technology to drive results.Trevor Loy @trevorloy
17K Followers 2K Following VC investor emerging ecosystems @FlywheelVC. Lecturer entrepreneurship & VC @Stanford. Prev: BoD @NVCA; Mentor @KauffmanFellows; 3x founder; Chip design @Intel.Luca @_lukfre
0 Followers 27 FollowingINDRAJEET @indrajeet877
423 Followers 2K Following Head of Math Department,Allen Institute Karaikal BTech NITW 2012, Option trader & investor. Math geek, tech-forward, learner Plus Python & Spanish skills.Mike Channon Ⓜ️ @XDA_Forum_Admin
6K Followers 5K Following Forum Admin at https://t.co/mFiBmgsI4b, Director at https://t.co/iH1LoXoajpDeekshith Reddy @deekshith180
15 Followers 201 Followingxenjoyer007 @xenjoyer007
1 Followers 132 FollowingAbdulrahman Tabaza @embed_dim
4 Followers 771 Following enjoyer of various vector spaces, encoders and modalitiesAryan Pandey (Look fo.. @AryanPa66861306
1K Followers 3K Following Half Machine Learning Engineer || DevOps and Machine Learning || Open Source at OpenVINOMaraseka53 @maraseka5317087
44 Followers 482 FollowingMuizz @muizzkhan77
34 Followers 1K Followingupteronext @upteronext
57 Followers 162 FollowingJindong Gu @Jindong73504766
287 Followers 886 Following Senior Research Fellow in University of Oxford @OxfordTVG Faculty Researcher @Google #ResponsibleAI #AISafety #GenAI Homepage: https://t.co/YOSVO3jb6hWelkin Huang @welkinwjh
17 Followers 146 FollowingBiancaKlebanoff @BiancaKleb81565
15 Followers 380 FollowingNils Lukas @NilsLukas7
158 Followers 288 Following Incoming Assistant Professor @MBZUAI | ML Security & Privacy PhD @UWaterloo | Previous intern @MSFTResearch, @BorealisAIJanhavee Shinde @SJanhavee
56 Followers 2K FollowingHenry Grafé @GrafeHenry97431
7 Followers 32 FollowingPensé FFun @inftyCategory
108 Followers 6K Followingharshith @theharshithh
219 Followers 2K Following trying to apply mathematics more. everywhere. prev: @marianaaihqDefu Cao @caodefu_dove
224 Followers 389 Following Phd student of @USC' CS. Working with Prof. @yanliu_usc. Time series 📈& Causal Inference 🔧💡 Ex: @PKU1898; @AdobeResearch, UCB, MSRA, Alibaba , Baidurob voigt @rfpvjr
873 Followers 932 Following using computational methods to understand the linguistic mechanisms of social problems | NLP, socioling, discourse-pragmatics | asst prof @linguisticsNUImad Khwaja @flyingblackswan
151 Followers 2K Following SaaS Growth || SEO Marketing Agency || EntrepreneurJason Cox @JasonOfficialMe
380 Followers 4K Following 🌐📉 | Data Engineer @Blizzard_Ent | Network Science Instructional Associate @gtcomputing | My posts sometimes represent myself but never my employer.Rohit @RohitUM1986
454 Followers 3K Following Roboticist, PhD student in Active Perception Liberal, Opinions PersonalPete @epwalsh
51 Followers 88 Following Research Engineer at @allen_ai. Lead engineer for OLMo pretraining.Akash Bahai @akashbahai
524 Followers 3K Following Structural Bioinformatics | Machine Learning, PostDoc at NTU | Past: @IISERPune, @Helmholtz_HZIMilouz Bhinouz @bhanouz
110 Followers 1K FollowingDebargha Ganguly @Debargha_
881 Followers 2K Following Trustworthy + scalable ML, CS PhD student @cwru; alum @ashokaunivAvshalom Manevich @AvshalomM
314 Followers 1K Following NLP Research Student @biunlp, Deep Learning Engineer @AI21Labs Ex intern @Amazon, @Bosch_AIMr.Stani @MrStani2
242 Followers 3K Following(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistSam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzTal Linzen @tallinzen
16K Followers 893 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAIAkari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳Luca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (makin Dolma 🍇), open source science fan, @QueerInAI organizer 🤖☕️🍕they/themChristopher Manning @chrmanning
126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Kayo Yin @kayo_yin
8K Followers 556 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵Danish Pruthi @danish037
7K Followers 628 Following Faculty at Indian Institute of Science, Bangalore. PhD from @LTIatCMU.Jacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Graham Neubig @gneubig
31K Followers 586 Following Associate professor at CMU, studying natural language processing and machine learning.Rosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRAllen Institute for A.. @allen_ai
53K Followers 361 Following AI for the Common Good. › Join us: https://t.co/DqTs1G4bGO › Get our newsletter: https://t.co/tvb1VpySfLNathan Schneider @complingy
4K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.socialNicholas Meade @ncmeade
127 Followers 143 Following PhD student at @McGillU / @Mila_Quebec; Interested in #NLProc.ACLRollingReview @ReviewAcl
5K Followers 62 Following ACL Rolling Review. Deadlines 10/15, 12/15, 2/15, 4/15 Tweets by @mayhewsw, @gneubig, @karmake2, @zeeraktalat, & otherslmsys.org @lmsysorg
37K Followers 171 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtmRoger Grosse @RogerGrosse
10K Followers 750 FollowingRyan Lowe @ryan_t_lowe
5K Followers 358 Following what is the place from which we are creating? ❤️✨🤠❤️Gillian Hadfield @ghadfield
5K Followers 710 Following Author of Rules for a Flat World. Law and economics professor exploring the legal innovation needed to keep up with 21st century technology and globalization.Karolina Stanczak @karstanczak
515 Followers 445 Following NLP & ML PhD candidate @uni_copenhagen @CopeNLUMehar Bhatia @bhatia_mehar
990 Followers 2K Following NLP || Grad CS Student at @UBC Vancouver 👩🎓|| @UBC_NLP @VectorInst || Studying culture, reasoning, alignment, fairness and biasesAmanda Askell @AmandaAskell
26K Followers 653 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.Toby Shevlane @tshevl
2K Followers 1K Following Research Scientist testing AI models for new capabilities at @GoogleDeepMind. Tweeting about AI and the future.Anca Dragan @ancadianadragan
8K Followers 178 Following AI safety & alignment at Google DeepMind • associate professor at UC Berkeley EECS • proud mom of an amazing 2yr oldTeaching NLP Workshop.. @teaching_nlp
25 Followers 15 Following Teaching NLP workshop at #ACL2024 in BangkokConference on Languag.. @COLM_conf
2K Followers 6 Following https://t.co/GhGCMEoa4A Abstract submission: March 22, 2024Melanie Sclar @melaniesclar
2K Followers 412 Following PhD student @uwnlp @uwcse | Visiting Researcher @MetaAI FAIR Labs | Prev. Lead ML Engineer @asapp, intern @LTIatCMU | 🇦🇷Ben Recht @beenwrekt
26K Followers 365 Following optimization. machine learning. uc berkeley. I blog at https://t.co/fkJujOPsJb The world won't end.Alex Tamkin 🦣 @AlexTamkin
4K Followers 1K Following machine learning, science & society @AnthropicAI | prev: phd @StanfordAILab, @stanfordnlpNouha Dziri @nouhadziri
3K Followers 672 Following Research Scientist @allen_ai / @ai2_mosaic, PhD in NLP/Dialogue 🤖 UofA. Ex Visiting researcher @Mila_Quebec Ex Research intern at @GoogleDeepMind @MSFTResearchLukasz Kaiser @lukaszkaiser
7K Followers 47 FollowingMattia @GrespanMattia
33 Followers 109 Following Graduate Student @KahlertSoC @UUtah. ML, Logic, NLP (and Rock 'n' Roll).Vilém Zouhar @zouharvi
2K Followers 2K Following PhD student @ ETH Zürich | all aspects of #NLProc but mostly HCI, evaluation and MT | go #veganChan Young Park @chan_young_park
442 Followers 213 Following PhD student @LTIatCMU @uwcse, working on natural language processing and computational social science.Hal Daumé III @haldaume3
27K Followers 355 Following Human-centered AI #HCAI, NLP & ML. Director @trails_ai. Prof @umdCS, member of @CLIPumd @HCIL_umd, researcher @MSFTresearch. Fun: 🧗🧑🍳🧘⛷️🏕️. he/him.Marine Carpuat @MarineCarpuat
2K Followers 389 Following Associate Professor, Computer Science, University of Maryland. I go by she/her.Peter Jansen @peterjansen_ai
1K Followers 643 Following Associate Professor @uarizona; Visiting Scientist @allen_ai, AI/NLP; EntailmentBank; ScienceWorld; WorldTree; ExplanationBank. Tweets/opinions my own.The GenLaw Center @genlawcenter
483 Followers 22 Following The Center for Research on Generative AI, Law, and Policy https://t.co/mxbv72Mp3RVivek Srikumar @viveksrikumar
275 Followers 135 FollowingAshim Gupta @ashimgupta95
129 Followers 923 Following PhD Student researching NLP at University of Utah. self.bookmarks = likesNaveen Rao @NaveenGRao
28K Followers 785 Following VP GenAI @Databricks. Former CEO/cofounder MosaicML & Nervana/IntelAI. Neuro + CS. I like to build stuff that will eventually learn how to build other stuff.Jacob Johnson @jacobkj314
11 Followers 17 FollowingShayne Longpre @ShayneRedford
4K Followers 998 Following PhD @MIT. Prev: @Google Brain, @apple ML, @stanfordnlp. 🇨🇦 Interests: AI/ML/NLP, Data-centric AI, transparency & societal impactPamela Samuelson @PamelaSamuelson
11K Followers 2K Following Copyright, Internet Law, Privacy, EFF, EPIC, @auths_alliance, BCLTMaithra Raghu @maithra_raghu
17K Followers 476 Following Cofounder and CEO @Samaya_AI. Formerly Research Scientist Google Brain (@GoogleAI), PhD in ML @Cornell.Downtown SLC @DowntownSLC
78K Followers 3K Following Our mission is to build a dynamic and diverse community that is the regional center for culture, commerce and entertainment.Nathan Lambert @natolambert
25K Followers 689 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentialsZining Zhu @zhuzining
603 Followers 484 Following Incoming assistant Professor @FollowStevens (2024-) Current: PhD candidate at @UofT, @VectorInst Areas: #NLProc #AIHanjie Chen @hanjie_chen
2K Followers 365 Following Incoming Assistant Professor @RiceCompSci, Postdoc @jhuclsp, working on Trustworthy AI/NLP/ML, PhD @CS_UVA, former intern @allen_ai, @MSFTResearch, @IBMPeter Cihon @pcihon
827 Followers 647 Following global public policy @github | AI governance | personalGabriele Sarti @gsarti_
2K Followers 2K Following PhD Student @GroNLP 🐮, core dev of @InseqLib (https://t.co/tTjrg26ygQ). Interpretability ∩ HCI ∩ #NLProc. Prev: @AmazonScience, @Aindo_AI, @ItaliaNLP_Lab.🇺🇦 Dzmitry Bahd.. @DBahdanau
6K Followers 36 Following Research Scientist & Research Lead at ServiceNow Research Adjunct Prof @ McGill. Member of Mila, Quebec AI Institute. Stream of consciousness is my own.Boaz Barak @boazbaraktcs
17K Followers 419 Following Computer Scientist. See also https://t.co/EXWR5k634w, https://t.co/SEVX6it6z3 ( @[email protected] , boaz.barak in threads ). Opinions my own.Ajeya Cotra @ajeya_cotra
6K Followers 286 Following AI could get really powerful soon and I worry we're underprepared. Analysis+grantmaking in AI alignment @open_phil (views my own), editor+writer @plannedobs.Rebecca Fiebrink @RebeccaFiebrink
6K Followers 1K Following Creative, usable, humane machine learning. Professor @ Creative Computing Institute UAL. Creator of Wekinator. Views my own she/her 🏳️🌈 @[email protected]Divyansh Kaushik @dkaushik96
4K Followers 3K Following Emerging tech and national security. DC/PGH. “An imported Indian immigrant,” @BreitbartNews."nicole" @ninklefitz
1K Followers 517 Following master of decorum @alpacaml. prev: @MicrosoftResearch, @MosaicML, @Mila_QuebecYacine Jernite @YJernite
4K Followers 1K Following ML & Society lead @huggingface, NLPer at heart, focusing on data and ML systems governance these days he/him #BlackLivesMatterNeel Nanda @NeelNanda5
13K Followers 89 Following Mechanistic Interpretability lead @DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!Srishti Palani (she/h.. @SrishtiPalani
712 Followers 1K Following PhD Researcher @UCSanDiego | Designing Human-AI Systems To Boost Search, Synthesis & Creativity | Previously @MSFTResearch @allen_ai @ADSKResearchAdversarial Triggers For LLMs Are 𝗡𝗢𝗧 𝗨𝗻𝗶𝘃𝗲𝗿𝘀𝗮𝗹!😲 It is believed that adversarial triggers that jailbreak a model transfer universally to other models. But we show triggers don't reliably transfer, especially to RLHF/DPO models. Paper: arxiv.org/abs/2404.16020
One of most intriguing findings of 2023 is that adversarial triggers that jailbreak one or more LLMs transfer to other models. We were so excited that we spent many months figuring out the conditions for universal transfer but the transfer never happened. It wasn't a bug 😀
Adversarial Triggers For LLMs Are 𝗡𝗢𝗧 𝗨𝗻𝗶𝘃𝗲𝗿𝘀𝗮𝗹!😲 It is believed that adversarial triggers that jailbreak a model transfer universally to other models. But we show triggers don't reliably transfer, especially to RLHF/DPO models. Paper: arxiv.org/abs/2404.16020
Want to train an aligned LM in a new language 🌏 but don’t have preference data for training the reward model (RM)? 💡 Just use a RM for another language: it often works well, sometimes even BETTER than if you had a RM in your target language! 🤯 arxiv.org/abs/2404.12318
Curious about socially-intelligent AI? Check out our paper on underlying technical challenges, open questions, and opportunities to advance social intelligence in AI agents: Work w/ @lpmorency, @pliang279 📰Paper: arxiv.org/abs/2404.11023 💻Repo: github.com/l-mathur/socia… 🧵1/9
I am super excited about the release of our 8B & 70B LLaMA 3 models! Huge team effort, amazing learning experience, and we're not done - the 405B is still training! #Llama3
It’s here! Meet Llama 3, our latest generation of models that is setting a new standard for state-of-the art performance and efficiency for openly available LLMs. Key highlights • 8B and 70B parameter openly available pre-trained and fine-tuned models. • Trained on more…
BREAKING: We just made our largest ever plastic catch. Interceptor 006 in the Rio Las Vacas, Guatemala, stopped 272 truckloads of trash - equal to 1.4 million kg (3.1 million lbs) - from flowing into the Caribbean Sea. All in a single evening.
Over the past year, tech CEOs seem to have realized the naivete of their "AGI in 3 years" projections. Instead of walking back their claims, they've watered down what they mean by AGI so much that it's meaningless now. It helped that AGI was never clearly defined to begin with.
Has anyone trained a model on the parallel corpus of papers --> thesis translations yet? 😆
Excited to share Penzai, a JAX research toolkit from @GoogleDeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…
So glad to share that I am one of the recipients of an @OpenAI Superaligment Fast Grant on the topic of #CoTfaithfulness 🥳🥳
The superalignment fast grants are now decided! We got a *ton* of really strong applications, so unfortunately we had to say no to many we're very excited about. There is still so much good research waiting to be funded. Congrats to all recipients!
data forensics 101
We reconstructed the data by extracting the SVG from the paper, parsing out the point locations & colors, mapping the coordinates to model size & FLOP, and mapping the colors to loss values. This let us closely approximate their original dataset from just the figure. (2/9)
@JesseDodge we cited your tweet in our paper :) arxiv.org/abs/2307.10700
this tweet fully recycled from the recent Gemini release 😂 getting as much mileage as i can out of it
Today Meta released Llama 3! Congrats to the team. In their blog post they wrote that, "the curation of a large, high-quality training dataset is paramount", while providing almost no information about how it was made, how it was filtered, or its contents.
we seem to converge to a terminology that has "pre-training" and "post-training", but without any "training" between them.
That time of the year when I am confused if 5/6 is 6th May or 5th June.
The Future of Humanity at Oxford has unfortunately closed. The long-term future is often the subject of thoughtless narratives and empty rhetoric. During its 19 years, FHI showed that precise and incisive thinking about the long-term future is possible and most fertile. ->