Alessandro Stolfo @alesstolfo
PhD Student @ ETH Zürich in #NLProc | Prev. @oracle Labs alestolfo.github.io Joined March 2012
Tweets: 75
Followers: 677
Following: 399
Likes: 3K
🧵@alesstolfo : "Our results show that while larger models tend to ground their outputs more effectively, a significant portion of correct answers remains compromised by hallucinations." ↩️
Welcome to our poster at 11 am! We will introduce our #emnlp2023 work: understanding how LLMs answer multi-step reasoning questions (by memorization or step-by-step reasoning). Paper: arxiv.org/abs/2310.14491
I and many of our group members are at #EMNLP2023 this week. We have a number of contributions to the conference spanning various topics in NLP, Reasoning and AI in Education. Please drop by if you are interested in any of the following:
@gu_yuling In “A Causal Framework to Quantify the Robustness of Mathematical Reasoning with LMs.”, @alesstolfo @ZhijingJin @JupyterAI @bschoelkopf @mrinmayasachan examine LLM behavior and robustness in solving math problems, but using causal graphs. Paper: arxiv.org/abs/2210.12023…
Excited to be at #ACL2023NLP to present our paper “A Causal Framework to Quantify the Robustness of Mathematical Reasoning with LMs.” Stop by and check out our poster tomorrow @ 9am (Frontenac Ballroom) arxiv.org/abs/2210.12023 @ZhijingJin @JupyterAI @bschoelkopf @mrinmayasachan
orporick @orporick
52K Followers 26K Following I dig ditches, I write manifestos. https://t.co/h8BzhbqEvj
Friday @asia_marosa
371 Followers 516 Following I don't know half of you half as well as I should like; and I like less than half of you half as well as you deserve
Autosabotaje @autosabotag_Gio
613 Followers 285 Following
Tiago Pimentel @tpimentelms
1K Followers 248 Following Postdoc at @ETH_en. Formerly, PhD student at @Cambridge_Uni.
Niklas Stoehr @niklas_stoehr
799 Followers 754 Following PhD student @ETH advised by @RyanDCotterell, @Cervisiarius and @AaronSchein ⭕️ Language Model Interpretability and Application ⭕️ @GoogleAI, @TechAtBloomberg
Ryan David Cotterell @ryandcotterell
9K Followers 1K Following
Tiwa Eisape @tiwa_eisape
1K Followers 1K Following PhD student at @MIT working on NLP and cognitive science - @NSF grfp fellow. Previously with @GoogleAI and @Meta FAIR
Kumar Shridhar @JupyterAI
584 Followers 1K Following PhD in ML/NLP @eth_en | I do #NLProc and #AI | Past: @MSFTResearch @AIatMeta @AmazonScience @rptu_kl_ld | He/him. Views are my own.
Mrinmaya Sachan @mrinmayasachan
2K Followers 2K Following Assistant Professor of Computer Science at ETH Zurich working in natural language processing (#NLProc), machine learning and education (#edtech).
Dibyajyoti Acharya @DibyajyotiAch04
136 Followers 5K Following Student, Learner, Explorer 🤓 Interested in all things AI.
jinzhuan @jinzhuan2
26 Followers 99 Following
Nicolò De Sabbata @cndesabbata
99 Followers 439 Following 👨🏻💻CS MS student at @EPFL_en | Visiting Student Researcher @Princeton |Previously @AXA @amazon | 🧠Deep Learning, NLP & Cognitive Science | 🇪🇺🇮🇹🇨🇭
Alexander Perevalov @Perevalov_A
72 Followers 95 Following 👨🎓 PhD Student & Research Assistant & Research Project Lead 🇩🇪 Hochschule Anhalt // HTWK Leipzig // Uni Paderborn 🇷🇺 From Perm, Russia
Ashutosh Mehra @ashutoshmehra
1K Followers 5K Following Senior Principal Scientist at Adobe. Working on Acrobat AI Assistant, LLMs, and document ML.
Leo Boytsov @srchvrs
7K Followers 2K Following Sr. Research Scientist @AWS Labs (ph-D @LTIatCMU) working on unnatural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.
Besmira Nushi 💙.. @besanushi
2K Followers 739 Following Researcher @MSFTResearch artificial intelligence, human-machine collaboration, technology & society.
Joe Stacey @_joestacey_
569 Followers 1K Following PhD student at Imperial and Apple Scholar. I love running, NLP and travelling (in no particular order). Ex teacher and PwC Consultant. #NLProc
Prhp1 @mojrad24
17 Followers 4K Following not a bot, Just someone with a strong thirst for knowledge
Müge Kural @mugekural
158 Followers 409 Following
Amirhossein Abaskohi @AmirAbaskohi
145 Followers 871 Following Master Student @UBC_CS | NLP Researcher @UBC_NLP | Content Creator @YouTube and @Medium #NLProc #MachineLearning
Wanru Zhao (Looking f.. @Renee42581826
513 Followers 2K Following Postgraduate Student @CaMLSys @Cambridge_CL | Ex-Intern @DGLGraph @AWS and @CambridgeJBS | Do not go gentle into that good night 🧗
Yufei Liu @liuyufei118
26 Followers 85 Following
Dominik Glandorf @dogl26
22 Followers 152 Following Educational Data Scientist - University of Tübingen - ML-Science Colaboratory
Hamid Palangi @hmd_palangi
733 Followers 687 Following Principal Researcher @MSFTResearch, Affiliate Associate Professor @UW
liuyong @forrestbing
265 Followers 5K Following I am a researcher in AIGC, Multi-modality and VitrualHuman tech direction
jluite @jluite2014
276 Followers 4K Following
عەبدولهادی .. @HadyHaji
115 Followers 388 Following deep learning, natural language processing, speech recognition. MSc AI &NLP
B @bbbb_bb_b
0 Followers 3K Following
Julia Chatain @JuliaChatain
704 Followers 726 Following Dr. | Senior Scientist @SEC_ETH | Future Embodied Learning Technologies (FELT) | Math Ed • HCI • XR • AI • Games | @[email protected] | she/they | 🇸🇬🇨🇭🏳️🌈
Ripan Kumar Kundu @riponkundu69
164 Followers 714 Following Ph.D. student at @mizzou, working on Explainable AI and Trustworthy AI in @VR. Working at @dependableCPS Laboratory.
Tuyen Huynh @hntuyen
46 Followers 1K Following
Wes Gurnee @wesg52
3K Followers 198 Following Optimizer @MIT @ORCenter PhD student thinking about Mechanistic Interpretability, Optimization, and Governance.
Nahz @NahumSQL
196 Followers 350 Following Figuring out AWS Cloud ☁️ || Google Certified Data Analyst || AI 🎨 explorer || Gamer 🎯🎮 || Web3.0 Enthusiast
Joseph Imperial @josephimperial_
1K Followers 5K Following UKRI CDT PhD @ARTAIBath @bathnlp @UniofBath 🇬🇧. Language generation, evals, and alignment. Research Faculty @NationalUPhil 🇵🇭. He/him.
Ray Becker (@raybecke.. @raybbecker
4K Followers 4K Following Research Software Engineer in Argument Technology @ARG_tech @dundeeuni
Martin Ziqiao Ma @ziqiao_ma
1K Followers 1K Following 〽️ PhD @Michigan_AI | 💼 @Adobe | 🎓@SJTU1896 @Amazon |🤔️ = (🗣️#NLProc + 🤖#EmbodiedAI + 🤝#Interaction) x 🧠#CogSci | Grounding language to 👥 & 🌎.
Yılmazcan Özyurt @ylmzcnzyrt
129 Followers 118 Following
Can Udomcharoenchaiki.. @canudomc
83 Followers 605 Following NLP Researcher @ VISTEC/ Data Science For Social Good 2018/ Pronouns: he/him
Mathew Manoj @MathewManoj19
15 Followers 145 Following
Sebastian Gehrmann @sebgehr
5K Followers 2K Following Head of NLP, CTO office, @Bloomberg. (he/him) Generating natural language, one word at a time. Also making sense of that language afterwards. views my own
Sarah Wiegreffe @sarahwiegreffe
4K Followers 984 Following At @allen_ai @ai2_aristo @uwnlp. Research in language model transparency & interpretability. PhD from @mlatgt @icatgt @gtcomputing. Views my own.
Marius Mosbach @mariusmosbach
714 Followers 877 Following Postdoc @Mila_Quebec & @mcgillu | NLP researcher
Michael Hanna @michaelwhanna
264 Followers 310 Following PhD student at the University of Amsterdam / ILLC, interested in computational linguistics and (mechanistic) interpretability
Tim Vieira @xtimv
4K Followers 1K Following machine learning, reinforcement learning, programming languages, handstands (he/him)
Young @younqchan
170 Followers 3K Following Final year Ph.D. student working on Out-of-Distribution Generalization and Causality of Large Pre-trained Models, and Graph Neural Networks.
Nikhil Prakash @nikhil07prakash
365 Followers 2K Following CS Ph.D. @KhouryCollege | Prev: Visiting Scholar @ MPI-SP @maxplanckpress; Intern @kixlab_kaist @SamsungBlr @iitrpr @HasuraHQ
Lisa Alazraki @LisaAlazraki
651 Followers 790 Following #ML & #NLProc PhD student @ImperialCollege. Prev. research intern @GoogleAI. Reasoning, planning & LLMs as agents. Reposted papers are my reading list 📚
Bruno Saraiva @bdfsaraiva
6 Followers 72 Following PhD Candidate | Researcher | Assistant Lecturer at Lusófona University
Jushaan Kalra @JushaanSingh
159 Followers 1K Following alleged prompt engineer | ML @WadhwaniAI | Prev. @amazon, @dtu_delhi, MSBK

orporick @orporick
52K Followers 26K Following I dig ditches, I write manifestos. https://t.co/h8BzhbqEvj
Friday @asia_marosa
371 Followers 516 Following I don't know half of you half as well as I should like; and I like less than half of you half as well as you deserve
Autosabotaje @autosabotag_Gio
613 Followers 285 Following
Tiago Pimentel @tpimentelms
1K Followers 248 Following Postdoc at @ETH_en. Formerly, PhD student at @Cambridge_Uni.
Josef Valvoda @ValvodaJosef
675 Followers 1K Following PhD candidate @CambridgeNLP group @Cambridge_Uni
Niklas Stoehr @niklas_stoehr
799 Followers 754 Following PhD student @ETH advised by @RyanDCotterell, @Cervisiarius and @AaronSchein ⭕️ Language Model Interpretability and Application ⭕️ @GoogleAI, @TechAtBloomberg
Sebastian Ruder @seb_ruder
80K Followers 1K Following Multilingual LLMs @cohere • Prev: @GoogleDeepMind • Newsletter: https://t.co/7JGh2qpG98
Ryan David Cotterell @ryandcotterell
9K Followers 1K Following
Tiwa Eisape @tiwa_eisape
1K Followers 1K Following PhD student at @MIT working on NLP and cognitive science - @NSF grfp fellow. Previously with @GoogleAI and @Meta FAIR
Kumar Shridhar @JupyterAI
584 Followers 1K Following PhD in ML/NLP @eth_en | I do #NLProc and #AI | Past: @MSFTResearch @AIatMeta @AmazonScience @rptu_kl_ld | He/him. Views are my own.
Jacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJw
Mrinmaya Sachan @mrinmayasachan
2K Followers 2K Following Assistant Professor of Computer Science at ETH Zurich working in natural language processing (#NLProc), machine learning and education (#edtech).
Leo Boytsov @srchvrs
7K Followers 2K Following Sr. Research Scientist @AWS Labs (ph-D @LTIatCMU) working on unnatural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.
Ashutosh Mehra @ashutoshmehra
1K Followers 5K Following Senior Principal Scientist at Adobe. Working on Acrobat AI Assistant, LLMs, and document ML.
Joe Stacey @_joestacey_
569 Followers 1K Following PhD student at Imperial and Apple Scholar. I love running, NLP and travelling (in no particular order). Ex teacher and PwC Consultant. #NLProc
Ari Kobren @akobren
168 Followers 299 Following scientist, musician, snowboarder, futboller, wannabe farmer
Besmira Nushi 💙.. @besanushi
2K Followers 739 Following Researcher @MSFTResearch artificial intelligence, human-machine collaboration, technology & society.
Max Tegmark @tegmark
145K Followers 29 Following Known as Mad Max for my unorthodox ideas and passion for adventure, my scientific interests range from artificial intelligence to the ultimate nature of reality
Dominik Glandorf @dogl26
22 Followers 152 Following Educational Data Scientist - University of Tübingen - ML-Science Colaboratory
Wanru Zhao (Looking f.. @Renee42581826
513 Followers 2K Following Postgraduate Student @CaMLSys @Cambridge_CL | Ex-Intern @DGLGraph @AWS and @CambridgeJBS | Do not go gentle into that good night 🧗
Yufei Liu @liuyufei118
26 Followers 85 Following
Hamid Palangi @hmd_palangi
733 Followers 687 Following Principal Researcher @MSFTResearch, Affiliate Associate Professor @UW
Julia Chatain @JuliaChatain
704 Followers 726 Following Dr. | Senior Scientist @SEC_ETH | Future Embodied Learning Technologies (FELT) | Math Ed • HCI • XR • AI • Games | @[email protected] | she/they | 🇸🇬🇨🇭🏳️🌈
Wes Gurnee @wesg52
3K Followers 198 Following Optimizer @MIT @ORCenter PhD student thinking about Mechanistic Interpretability, Optimization, and Governance.
Yılmazcan Özyurt @ylmzcnzyrt
129 Followers 118 Following
Joelle Pineau @jpineau1
10K Followers 352 Following AI researcher. VP AI Research (FAIR), @AIatMeta. Professor of Computer Science, @mcgillu. Core academic member, @Mila_Quebec
Sebastian Gehrmann @sebgehr
5K Followers 2K Following Head of NLP, CTO office, @Bloomberg. (he/him) Generating natural language, one word at a time. Also making sense of that language afterwards. views my own
Sarah Wiegreffe @sarahwiegreffe
4K Followers 984 Following At @allen_ai @ai2_aristo @uwnlp. Research in language model transparency & interpretability. PhD from @mlatgt @icatgt @gtcomputing. Views my own.
Alexander Hoyle @miserlis_
809 Followers 475 Following PhD student in Natural Language Processing @umdcs, advised by @psresnik. @[email protected] . Previously summer intern @msftresearch, @ai2_allennlp. he/him
Marius Mosbach @mariusmosbach
714 Followers 877 Following Postdoc @Mila_Quebec & @mcgillu | NLP researcher
ZurichAI @zurichnlp
372 Followers 140 Following We host meetups in Zurich to bring together researchers, engineers and enthusiasts in NLP, CV and beyond!
Michael Hanna @michaelwhanna
264 Followers 310 Following PhD student at the University of Amsterdam / ILLC, interested in computational linguistics and (mechanistic) interpretability
Antoine Bosselut @ABosselut
3K Followers 602 Following Helping machines make sense of the world. Asst Prof @ICepfl; Before: @stanfordnlp @allen_ai @uwnlp @MSFTResearch #NLProc #AI
Tim Vieira @xtimv
4K Followers 1K Following machine learning, reinforcement learning, programming languages, handstands (he/him)
Jasmijn Bastings @jasmijnbastings
4K Followers 2K Following Sr Research Scientist @GoogleDeepMind. Interested in gender, feminism, fairness, bias & ethics in #NLProc/#AI. Views my own. She/they.
Nithish Kannen @NithishKannen
450 Followers 2K Following Languages @GoogleAI | Ex- @AmazonScience London, @IBMResearch | @CNERG @IITKgp | #NLPProc
Alireza Mohammadshahi @alireza_mshi
547 Followers 791 Following Co-founder of @LeerooAI | Ex-@MetaAI, @EPFL
Yuntian Deng @yuntiandeng
3K Followers 3K Following #NLProc Postdoc @ai2_mosaic | Assistant Professor @UWaterloo '24 | Faculty Affiliate @VectorInst '24 | PhD @Harvard
Gabriele Sarti @gsarti_
2K Followers 2K Following PhD Student @GroNLP 🐮, core dev of @InseqLib (https://t.co/tTjrg26ygQ). Interpretability ∩ HCI ∩ #NLProc. Prev: @AmazonScience, @Aindo_AI, @ItaliaNLP_Lab.
Nikhil Prakash @nikhil07prakash
365 Followers 2K Following CS Ph.D. @KhouryCollege | Prev: Visiting Scholar @ MPI-SP @maxplanckpress; Intern @kixlab_kaist @SamsungBlr @iitrpr @HasuraHQ
Lisa Alazraki @LisaAlazraki
651 Followers 790 Following #ML & #NLProc PhD student @ImperialCollege. Prev. research intern @GoogleAI. Reasoning, planning & LLMs as agents. Reposted papers are my reading list 📚
Dominik Stammbach @dominsta_nlp
267 Followers 381 Following PhD student in Natural Language Processing, Zurich
Oliver Eberle @EberleOliver
195 Followers 615 Following Postdoctoral Researcher @ Machine Learning Group, @TUBerlin 🇩🇪 | 🔮 Explainable AI | 📚 NLP & Humanities | 🧠 Alumni @bccn_berlin
Oracle @Oracle
820K Followers 825 Following Leading the cloud. We help people see data in new ways, discover insights, unlock endless possibilities.
clem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
Michael Burry Stock T.. @burrytracker
277K Followers 34 Following Account dedicated to Stocks Burry Owns Powered by @joinautopilot_ Download Autopilot to invest alongside Burry's portfolio
Noam Brown @polynoamial
34K Followers 612 Following Researching reasoning @OpenAI | Co-created Libratus/Pluribus, the first superhuman no-limit poker AIs | Co-created CICERO | PhD from @SCSatCMU
Alexis Conneau @alex_conneau
24K Followers 113 Following Audio AGI Research Lead @OpenAI - GPT-Next - Past: XLM, Unsupervised ASR, Unsupervised MT, Wav2vec 2.0/XLSR, MUSE, Unsupervised cross-lingual transfer
Naomi Saphra @nsaphra
7K Followers 1K Following Waiting on a robot body. ML/NLP. All opinions are universal and held by both employers and family. Same username on every lifeboat off this sinking ship.
Jonas Pfeiffer @PfeiffJo
3K Followers 686 Following Research Scientist @GoogleDeepMind | @AdapterHub | previously @nyuniversity @TUDarmstadt @UKPLab @MetaAI @spotify | https://t.co/oPoAvcAx97 | (he/him)
Evan Miller @EvMill
5K Followers 160 Following Statistically inclined software developer, occasional blogger about math + stats stuff
Neel Nanda @NeelNanda5
13K Followers 89 Following Mechanistic Interpretability lead @DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!
allen h @HoskinsAllen
283 Followers 2K Following Independent AI research; taking time off from medical school
Franz Nowak @franz_nowak
143 Followers 179 Following PhD Student at @CSatETH, Natural Language Processing enthusiast

New research post on refusals in LLMs lesswrong.com/posts/jGuXSZgv…
@gfodor This makes perfect sense? They trained the model from scratch, of course it can learn to use filler tokens
Fantastic work from @sen_r and @ArthurConmy - done in an impressive 2 week paper sprint! Gated SAEs are a new sparse autoencoder architecture that seem a major Pareto improvement. This is now my team's preferred way to train SAEs, and I hope it'll accelerate the community's work!
New @GoogleDeepMind MechInterp work! We introduce Gated SAEs, a Pareto improvement over existing sparse autoencoders. They find equally good reconstructions with around half as many firing features, while maintaining interpretability (CI 0-13% improvement). Joint w/ @ArthurConmy
We’re excited to share Gated SAEs: an improvement to Sparse Autoencoder training that scales to 7B parameter models (at least!)
The latest talk at @zurichnlp also exists as a video presentation since this morning. 🌂 I welcome feedback on the format, what works, and what doesn't. 🙏 youtu.be/yeEZpf4BlDA
(To those who have made a career of constantly bad-mouthing young people, I would like to say that 35 students voluntarily attend the afternoon astrophysics course for the final three years of high school, sitting through two hours of it after five hours of class in the morning. No grades, just out of interest.)
instruction tuning & size & beam search ➡️ less hallucination (however, sampling is a seemingly necessary evil to avoid degenerate outputs). Some caveats/limitations: 1. model-based evaluation of groundness 2. Only OSS models are evaluated arxiv.org/abs/2404.07060
@alesstolfo This is such a cool single-author paper! Kudos Ale!
We've spent a tremendous amount of time reflecting whether the NLP task of Sentiment Classification (x=review, y=rating) is causal or anticausal since 2020. Check out our 2024 latest answer➡️arxiv.org/abs/2404.11055 💡We combined Causality and Psychology insights & improved #LLMs!
AC: what's the problem? me: the review is clearly ChatGPT generated reviewer: diving into the discourse tapestry, it is crucial to unweave the possibilit-
Thank you @ReviewAcl that the options aren't 0, 4, 5, 6, 7, 8 anymore.
@rao2z I have an idea for how to fix it...
This is the path that Italy should have followed, too. It's not too late to correct course.
Until 50 years ago, CO₂ emissions developed in lockstep with economic growth in France. Since the early 1970s, the opposite has been true: emissions declined as people in France got richer.
Can we localize the weights and mechanisms used by a language model to recite entire paragraphs of its training data?📄➡️🤖➡️📄 arxiv.org/pdf/2403.19851… To find out, have a look at my @GoogleAI intern project advised by Owen Lewis, @MitchellAGordon and Chiyuan Zhang. Thread ⬇️
The Dream-Comes-True❤️ moment for causality researchers -- meeting Judea Pearl @yudapearl in person! Yuda's profoundly knowledgeable in philosophy, genuinely enjoys singing, and is just charming. Truly touched by his talk & personal stories w/ @kahneman_daniel (Two great minds!)
Yesterday! @yanaiela came to @EdinburghNLP @InfAtEd to talk about What’s In My Big Data from @allen_ai
@pmddomingos Let’s add so much noise that the signal gets obscured and then claim there’s no signal
@pmddomingos Try magnifying that scale. You can clearly see an upward slope, and given that a 1c average temp change is significant, your scale is way too compressed.