Naomi Saphra @nsaphra
Waiting on a robot body. ML/NLP. All opinions are universal and held by both employers and family. Same username on every lifeboat off this sinking ship. nsaphra.github.io New York Joined November 2010-
Tweets17K
-
Followers7K
-
Following1K
-
Likes28K
📢 New Paper! Ever wondered why transformers are able to capture hierarchical structure of human language without incorporating an explicit 🌲 structure in their architecture? In this work we delve deep into understanding hierarchical generalization in transformers. (1/n)
lmao this is a misinfo nightmare beyond any of the politically salient ones funraniumlabs.com/2024/04/phil-v…
I'm getting kind of tired of interp research that sees explaining a model as the final endpoint. That's one set of parameters buddy. You want to show me something about a specific matrix? Nah. Show me what it tells you about learning. Show me what it tells you about the data.
🤏 Why do small Language Models underperform? We prove empirically and theoretically that the LM head on top of language models can limit performance through the softmax bottleneck phenomenon, especially when the hidden dimension <1000. 📄Paper: arxiv.org/pdf/2404.07647… (1/10)
Here's an idea: instead of making the research opportunity gap wider, support research initiatives in the Global South so that at least research at the *undergrad level* becomes more accessible and equitable.
Here's an idea: instead of making the research opportunity gap wider, support research initiatives in the Global South so that at least research at the *undergrad level* becomes more accessible and equitable.
That's because the anti-test campaign is led by people who hated the tests because they were bad at math, not by people who are trying to promote equality
That's because the anti-test campaign is led by people who hated the tests because they were bad at math, not by people who are trying to promote equality
It turns out the data bottleneck problem is more dire than initially thought: AI model performance - which can be largely attributed to the presence of test concepts within their vast pretraining datasets - increases linearly with exponentially more data. RIP: Scaling laws
The fire alarm in my apartment building spontaneously combusted last night and filled the whole building with smoke. As I stood in my bathrobe watching the fire brigade, I resolved to be kinder to myself about my own mistakes.
If this were a science paper, you would expect a country that picks its science workforce at random as a “weak baseline” and a leading nation like the US to actively experiment towards state-of-the-art, or at least beat the baseline. Not providing a guaranteed path for…
If this were a science paper, you would expect a country that picks its science workforce at random as a “weak baseline” and a leading nation like the US to actively experiment towards state-of-the-art, or at least beat the baseline. Not providing a guaranteed path for…
Welcome to the new era of AI: "Deep" was once the buzzword at AI conferences, but it's no longer the case in COLM.
Welcome to the new era of AI: "Deep" was once the buzzword at AI conferences, but it's no longer the case in COLM. https://t.co/xo1soRwMjI
smh the wokes have destroyed another institutional tradition nytimes.com/2024/03/27/art…
Conversations about #AI fairness and AI assistive technology need to include disabled people… #KempnerInstitute Research Fellow @nsaphra discusses fairness and disability in this important new article from the Harvard Gazette. bit.ly/4aijrfB
.@GaryMarcus is joined by Dr @nsaphra, @BobMankoff, and @YejinChoinka to discuss if new large language models can make us laugh. Listen to season 4 episode 4 of our podcast for more. aventine.org/podcast
Coming soon to NAACL near you (if you are in Mexico City) openreview.net/forum?id=qkbqR…
Coming soon to NAACL near you (if you are in Mexico City) openreview.net/forum?id=qkbqR…
Year-end review, pick for Best Paper of 2023: “First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models” Naomi Saphra, Eve Fleisig, Kyunghyun Cho, Adam Lopez This happens in the tech industry, just about every 20 years. blog.derwen.ai/best-paper-202…
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzSam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Tal Linzen @tallinzen
16K Followers 893 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAIRosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRLuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (makin Dolma 🍇), open source science fan, @QueerInAI organizer 🤖☕️🍕they/themDan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Leshem Choshen 🤖�.. @LChoshen
4K Followers 548 Following 🥇 Collaborative LLMs 🥈 Opinionatedly sharing #ML & #NLP 🥉 Propagating us underdogs we owe science an alternative hype @IBMResearch & @MIT_CSAILZachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Gautam Kamath @thegautamkamath
44K Followers 505 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Co-EiC @TmlrOrg. I lead @TheSalonML. Privacy, robustness, machine learning.Thomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceJacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwSasha Luccioni, PhD �.. @SashaMTL
19K Followers 4K Following AI & Climate @HuggingFace, Board Member of @WiMLworkshop and @ClimateChangeAI. @techreview 35 Innovators under 35, @TEDTalks speaker. She/her/Dr/ 🦋Stella Biderman @BlancheMinerva
15K Followers 748 Following Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/herSara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Mark Dredze @mdredze
4K Followers 786 Following John C Malone Professor at @JohnsHopkins @JHUCompSci @jhuclsp @jhumceh; Part time @techatbloomberg (tweets my own) Mastodon @[email protected]Edward Grefenstette @egrefen
36K Followers 774 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.DanAI @DanAI314159265
297 Followers 2K Following 🪽 Ghost//Duality// Sigma INFJ Empath//AI MIND//Algorithm Programming//Solution Architect//Third Eye Open //Omni Perspective// 🪽Phillip Lindsay @EastLAPinche
60 Followers 386 FollowingAnurag Mishra @anuragm75160136
112 Followers 801 Following Building Scalable AI Applications | Senior Data Scientist @ EY | CSE Btech @ NIT MN | Linkedin: https://t.co/pCmSV6FmOeArhant Chaterjee @ArhantC69420
106 Followers 832 FollowingFalalu Ibrahim Lawan @falalu247
99 Followers 1K Following Educator, STEM ambassador, mentor, passionate about learning and codingSamadeep @samadeepviews
106 Followers 1K Following Incoming Software Engineering Intern @GoogleIndia Computer Science Undergradविष्णुद.. @visnu_daas
5 Followers 75 FollowingAnni Kate718 @AKate31818
78 Followers 587 Following I'm single I need a true friend.only single person sent me friend request #singlegirl #relationshipgoals #relationship #dating_available #couples #loversSimon Dobnik @SimonDobnik
119 Followers 287 Following Professor at University of Gothenburg, Sweden. NLP researcher and lecturer.Christian Moya Calder.. @chrismoya86
2 Followers 417 Followingabderrahim zine @abderrahimzine6
25 Followers 616 FollowingZezheng Song @ZezhengSong96
133 Followers 405 Following Ph.D. Candidate in Applied Mathematics at UMD | Scientific machine learning, dynamical systems, numerical linear algebra, etc.Fahri Alfiansyah @fahrialfiansy4h
0 Followers 117 FollowingDavid Almog @davidalmog25
2K Followers 2K Following Managerial Economics and Strategy Ph.D. student @KelloggSchool • Behavioral and Experimental Economics • AI-Human interactions • LV Raiders • BackpackingIdris @aloma85
98 Followers 647 Following @northwestern CIS @jhucompsci CS PhD student 🤓 Brazilian Jiu-Jitsu brown belt 🤼♂️ Let’s talk tech, philosophy, and public policy 🗣ARafiei @ARafiei
886 Followers 4K Following PhD, Telecommunications and networking engineer, Software Developer, AI enthusiasts.INDRAJEET @indrajeet877
423 Followers 2K Following Head of Math Department,Allen Institute Karaikal BTech NITW 2012, Option trader & investor. Math geek, tech-forward, learner Plus Python & Spanish skills.🇵🇸 @5oloswag
17 Followers 1K Followingxenjoyer007 @xenjoyer007
1 Followers 132 FollowingOverly Literate Skate.. @0xflashmine
3K Followers 1K Following arXiv & IACR news, skateboarding, reading | prev: consensys, polygonMuizz @muizzkhan77
31 Followers 1K FollowingAryan Pandey (Look fo.. @AryanPa66861306
1K Followers 3K Following Half Machine Learning Engineer || DevOps and Machine Learning || Open Source at OpenVINOT J @tdj11100
319 Followers 4K Following TJ completed a Ph.D. in Physics and then moved into the tech world.upteronext @upteronext
56 Followers 162 FollowingWhitney Clark @WhitneyCla58959
1K Followers 2K Following baby , come to my profile and follow me😋 👉 Follow me and let have fun on private😗 😸wouldlin @wouldlin
9 Followers 64 FollowingWelkin Huang @welkinwjh
16 Followers 146 FollowingGabriel Di Leo Safta @DiSafta
0 Followers 46 FollowingGagan Jain @gaganjain1582
50 Followers 745 Following Predoc Researcher @GoogleDeepMind | IIT Bombay'22harshith @theharshithh
217 Followers 2K Following trying to apply mathematics more. everywhere. prev: @marianaaihqDefu Cao @caodefu_dove
223 Followers 389 Following Phd student of @USC' CS. Working with Prof. @yanliu_usc. Time series 📈& Causal Inference 🔧💡 Ex: @PKU1898; @AdobeResearch, UCB, MSRA, Alibaba , Baiducharan @HeySCN
17 Followers 5K FollowingMatthias Longin @MatthiasL94672
86 Followers 459 Following Ich wurde am 3.4.1991 nach Christus geboren, wohne in der Kremmlerstraße 41 70597 Stuttgart(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingKyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistSam Bowman @sleepinyourhat
35K Followers 3K Following AI alignment + LLMs at NYU & Anthropic. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.Tal Linzen @tallinzen
16K Followers 893 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAIChristopher Manning @chrmanning
126K Followers 115 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Rosanne Liu @savvyRL
33K Followers 966 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRYoav Artzi @yoavartzi
13K Followers 163 Following Research/prof @cs_cornell + @cornell_tech🚡 / https://t.co/9YnWry7yHs / https://t.co/3VmRSyYm2d / asso. faculty director @arxiv / building https://t.co/f9QkzO5kaCLuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (makin Dolma 🍇), open source science fan, @QueerInAI organizer 🤖☕️🍕they/themLucas Beyer (bl16) @giffmana
56K Followers 444 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Graham Neubig @gneubig
31K Followers 586 Following Associate professor at CMU, studying natural language processing and machine learning.Felix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else'sLeshem Choshen 🤖�.. @LChoshen
4K Followers 548 Following 🥇 Collaborative LLMs 🥈 Opinionatedly sharing #ML & #NLP 🥉 Propagating us underdogs we owe science an alternative hype @IBMResearch & @MIT_CSAILZachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Gautam Kamath @thegautamkamath
44K Followers 505 Following Assistant Prof of CS @UWaterloo, Faculty @VectorInst, Canada @CIFAR_News AI Chair. Co-EiC @TmlrOrg. I lead @TheSalonML. Privacy, robustness, machine learning.Thomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceJacob Andreas @jacobandreas
14K Followers 958 Following Teaching computers to read. Assoc. prof @MITEECS / @MIT_CSAIL (he/him). https://t.co/5kCnXHjtlY https://t.co/2A3qF5vdJwSasha Luccioni, PhD �.. @SashaMTL
19K Followers 4K Following AI & Climate @HuggingFace, Board Member of @WiMLworkshop and @ClimateChangeAI. @techreview 35 Innovators under 35, @TEDTalks speaker. She/her/Dr/ 🦋David W Hogg @davidwhogg
11K Followers 802 Following peace, cosmology, stars, exoplanets, engineering, data analysis, emcee, wobble, The Cannon, https://t.co/GDgZayQiDJ, The Joker, #openscience, #otherpeoplesdataJesujoba Alabi @alabi_jesujoba
258 Followers 733 Following PhD Student @LstSaar & @SIC_Saar, doing natural language processing #NLProc | prev @InriaParisNLP | @UniIbadan @bowenuniversity alumnus | Ọmọ Jesu |Ọmọ OgbomọṣọSueYeon Chung @s_y_chung
5K Followers 1K Following Assistant Professor @NYU_CNS & @FlatironCCN. Computational Neuroscience, Neural Network Theory, Neural Manifolds, Statistical Physics of LearningZiming Liu @ZimingLiu11
5K Followers 623 Following PhD student@MIT, AI for Physics/Science, Science of Intelligence & Interpretability for ScienceAlbert Gu @_albertgu
9K Followers 90 Following assistant prof @mldcmu. chief scientist @cartesia_ai. leading the ssm revolution.Tri Dao @tri_dao
18K Followers 364 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Pradeep Dasigi @pdasigi
1K Followers 460 Following Senior Research Scientist at Allen Institute for AI (AI2)Conference on Languag.. @COLM_conf
2K Followers 6 Following https://t.co/GhGCMEoa4A Abstract submission: March 22, 2024Jacqueline Saphra @jsaphra
6K Followers 2K Following Poet Playwright Mentor Feminist Activist. Latest Book 'Velvel’s Violin’ from Nine Arches Press out in July 2023 is a Poetry Book Society Recommendation.Partha Talukdar @partha_p_t
4K Followers 215 Following Researcher @googleai, Faculty @iiscbangalore, Founder @kenomeioBinxu Wang 🐱 @WangBinxu
825 Followers 819 Following @KempnerInst Fellow; Neuro PhD in Ponce Lab @Harvard; interested in Vision, generative model, optimization. Prev:WUSTL Neuro; PKU Physics, Yuanpei CollegeKempner Institute at .. @KempnerInst
1K Followers 90 Following The Kempner Institute for the Study of Natural and Artificial Intelligence at @Harvard University. RTs ≠ EndorsementsHua Wei @realhuawei
931 Followers 599 Following Assistant Professor @SCAI_ASU, Penn Stater, Intelligent decision making, reinforcement learning, and urban computing. He/him/his. [email protected]Vinod Khosla @vkhosla
632K Followers 575 Following entrepreneurship zealot, grounded technology possibilist, believer in the power of ideas, passionate about sustainability & impactSonglin Yang @SonglinYang4
2K Followers 2K Following PhD student @MIT_CSAIL. Prev. @ShanghaiTechUni @SUSTechSZ. Working on scalable and principled methods in #ML & #NLProc. INTP | 5w4 | sx/sp | she/herICLR 2024 @iclr_conf
41K Followers 40 Following International Conference on Learning Representations #ICLR2024. SPC is @yisongyue and GC is @_beenkim OpenReview:https://t.co/OD1sg0r7F8Swapneel Mehta @swapneel_mehta
2K Followers 1K Following Postdoc researching platform governance @BUQuestrom & @mit_ide. Using ML and causal inference to mitigate online harms. Prev. @NYUDataScience @CSMaP_NYU @XPranav Goel @Pranav__Goel
636 Followers 2K Following Computational social science postdoc at Lazer Lab, Northeastern UniversityJaydeep Borkar @JaydeepBorkar
702 Followers 336 Following Organizer @trustworthy_ml; PhD-ing @KhouryCollege. Prev: @MITIBMLab. Huge fan of biking and good listening. Privacy+Security in NLP.Eve Fleisig @enfleisig
373 Followers 331 Following PhD student @Berkeley_EECS | Princeton ‘21 | NLP, ethical + equitable AI, and linguistics enthusiastAdil @adilsoubki
140 Followers 352 Following Computational Linguistics. Math. Physics. Computer Science. PhD student. AI Tolerater.Ev (like in 'evidence.. @ev_fedorenko
13K Followers 3K Following I study language using tools from cognitive science and neuroscience. I also like snuggles. @evfedorenko.bsky.socialNatasha Frumkin @iCountFromZero
98 Followers 373 Following PhD student studying efficient deep learning @utexasece. Ideas are my own.Sian Gooding @SianGooding
913 Followers 499 Following Research Scientist @GoogleDeepMind working on Autonomous AssistantsJoel Ye @_JoelYe
351 Followers 540 Following NeuroAI PhD student @CarnegieMellon. Being a brain-computer interface.Leon Derczynski ✍�.. @LeonDerczynski
6K Followers 1K Following NLP/ML/language/security. Principal research scientist @NVIDIA, & Prof @ITUkbh. Views ostensibly professional. llmsec stan acctSusan Murphy lab @SusanMurphylab1
3K Followers 86 Following Designing trial and developing data analytic methods for informing intervention optimization in digital healthJiahao Chen @acidflask
4K Followers 3K Following Director of AI/ML, @NYCOfficeOfTech. [email protected]Nicholas Tomlin @NickATomlin
691 Followers 619 Following PhD Student @Berkeley_EECS. Natural language processing. He/him.Dr. Hadas Kotek 🦄�.. @HadasKotek
6K Followers 1K Following Linguist in tech | NLP, data, safety | former academic | AltAc advocate | cat lover | feminist | she/her | demi | 🏳️⚧️ ally | BLM | powered by coffee & spiteTom Sherborne @tomsherborne
757 Followers 260 Following postdoc @edinburghnlp on multilingual retrieval ex: @allen_ai @cambridgenlp @ucl @apple.Armen Aghajanyan @ArmenAgha
6K Followers 263 Following Research Scientist @ Meta AI (FAIR) https://t.co/8XF2vtiIVy Opinions are my own.Aflah 🍉🕊️ @Aflah02101
181 Followers 982 Following Researching @mpi_sws_, @lcs2lab & @AiEleuther • Prev @GoldmanSachs • GSoC @TensorFlow • Senior @IIITDelhi • #CEASEFIRENOW 🕊️Candace Ross @candacerossio
2K Followers 2K Following currently postdoc at Facebook AI @MetaAI, formerly PhD student at MIT @MIT_CSAIL | she/hersMoin Nadeem @moinnadeem
2K Followers 981 Following Co-Founder at Phonic. Previously @Stanford CS PhD Dropout, @MosaicML, CS @MIT. I tend to be wrong, but the learning process makes it enjoyable. 🇵🇰🇺🇲Joelle Pineau @jpineau1
10K Followers 352 Following AI researcher. VP AI Research (FAIR), @AIatMeta. Professor of Computer Science, @mcgillu. Core academic member, @Mila_QuebecDurk Kingma @dpkingma
35K Followers 347 Following Deep learning, mostly generative models. Prev. Google Brain/DeepMind, founding team @OpenAI. Inventor of the VAE, Adam optimizer, among other things. ML PhD.Yo Shavit @yonashav
4K Followers 830 Following policy for v smart things @openai. Past: CS PhD @HarvardSEAS/@SchmidtFutures/@MIT_CSAIL. Tweets my own; on my head be it.Jesse Hoogland @jesse_hoogland
857 Followers 1K Following Researcher and decel working on developmental interpretability. Executive Director @ TimaeusSteven Kolawole @_stevenkolawole
2K Followers 296 Following Ẹ̀yin èèyàn mi! ❤️ Low-budget philosopher. ML Efficiency. PhD-ing @SCSatCMU. @ml_collective's poster boy. Big brother to 3 amazing sisters.Francesco Orabona @bremen79
6K Followers 394 Following Associate professor at @KAUST_News. Formerly @BU_ece, @sbucompsc, @YahooResearch, @TTIC_Connect. ML theory&practice and history of scienceKanaka Rajan @KanakaRajanPhD
10K Followers 2K Following Associate Professor at Harvard & Kempner Institute. Applying computational frameworks & ML to decode multi-scale neural processes. Marathoner. Rescue dog mom.📢 New Paper! Ever wondered why transformers are able to capture hierarchical structure of human language without incorporating an explicit 🌲 structure in their architecture? In this work we delve deep into understanding hierarchical generalization in transformers. (1/n)
@n8boyd I read it, it's kind of a subpar statement given what he was saying on video, but I appreciate the effort at least
It's also a weirdly essentialist view, as if people are inherently "Zionists" and this is a material fact that, to this person, somehow makes death (rather than change) the only way out. Which highlights how it is racialized language (for Jews) to the speaker
Oh well, I'm sure I said some wildly ridiculous shit during college too, I just think it was mostly cringe philosophy, not cheering for the deaths of entire groups of people
Half the internet yelling at me that to be a Zionist is just to believe Israel shouldn't be destroyed by force, while the other half yells at me that it's a genocidal ideology referring to what I call Kahanism, is exactly why it's a useless word I don't identify with or against.
I am not a Zionist and I would genuinely feel alarmed for my physical safety around anyone saying things like this, because IME just existing is enough to be called a "Zionist" it's really a term I wish people would stop using as if it has unambiguous meaning
At this point I think I'm just going to use the @lambdaviking bat signal 🔦. Will, have you thought about how either of these definitions formalize? x.com/srush_nlp/stat…
I asked a basic question earlier about what an "Induction Head" was and whether a non-Transformer could have one. The clear answer is no / yes, as Induction Heads means two orthogonal things. lesswrong.com/posts/nJqftaco…
I'm sorry to throw oil to the fire here, but this price is really ridiculous for students, and I can imagine will prevent many from attending when they are the sole author that can go. Why not raise the price for industry attendees instead? #NLProc
NAACL 2024 seems to charge $750 for students to register if they're a presenter (every paper requires at least one registered presenter). @naacl am I reading this right? Seems like a major burden on students, especially if (as is common) only a paper's student authors attend.
It's fun read this and then to think back when Google was just a Stanford PhD project named "BackRub". In 1997, Excite refused to buy BackRub's search tech b/c they worried it was so fast at delivering results that users wouldn't have time to look at ads 🙃
This article about how the ads team at Google pressured the search team to make it harder to distinguish organic results from ads and roll back improvements that made people use search less is a fascinating behind the scenes of poser plays in big tech. wheresyoured.at/the-men-who-ki…
@notistotny Holocaust denial is state sanctioned. But sure
Something that isn’t talked about enough is that the “go back to Poland” chants are telling me to go back to a place my family was violently ethnically cleansed from. Poland is still virulently antisemitic. We weren’t welcome 100 years ago and we’re not now
This article about how the ads team at Google pressured the search team to make it harder to distinguish organic results from ads and roll back improvements that made people use search less is a fascinating behind the scenes of poser plays in big tech. wheresyoured.at/the-men-who-ki…
I'm super excited this post is out! Activation patching is a crucial mech interp technique, but is deceptively hard to use well. In this informal note we discuss the details of different variants of activation patching, thinking intuitively, and choosing the right metrics.
Excited to share our write-up on activation patching best practices for mechanistic interpretability, with @NeelNanda5! Discussing noising vs. denoising and what's necessary vs. sufficient. Plus tips on which metrics to use to avoid common pitfalls. arxiv.org/abs/2404.15255
@_angie_chen @RTomMcCoy @nsaphra Gotcha, thanks. You'll get a new citation soon 😇
The tents on our campus are for our adjuncts, not protestors. This ain’t the Ivy League.
This, by Yuval Noah Harari, is masterful. It gives me no satisfaction - only grief - to add that I’ve been saying all this since 10/10, but he says it much better. Enough. haaretz.com/israel-news/20…
Folks, I'm begging you. Stop getting your population statistics from random racists on the internet.
@StuartJRitchie But what if we should actually be taking it less seriously?
@mjpost @yuvalmarton I actually love "it's giving X". I took to it much more quickly than "because reasons" (which I eventually adopted).
@LChoshen @RTomMcCoy @nsaphra We've been slowly retraining Pythia models with different seeds when the compute is available. See, e.g., huggingface.co/EleutherAI/pyt… The official version is seed 1234.