Leo Du @leoduw
Positive semi-nondeterministic PhD student @jhuclsp currently visiting #rycolab ETH | previously @uwcse | math junkie | profile pic reads "the 8th Busy Beaver" Seattle, WA Joined February 2017-
Tweets181
-
Followers168
-
Following202
-
Likes1K
"Let no one ignorant of geometry enter." Finally, a compute can enter Plato's academy.
"Let no one ignorant of geometry enter." Finally, a compute can enter Plato's academy.
wrote a short note on using parallel scans for backprop: justintchiu.com/blog/pscan_dif… turns out there was already a paper on this too! arxiv.org/abs/1907.10134
New paper out w/ @ShriramKMurthi accepted to OOPSLA'24: a psychometric analysis of programming language learning. We added ~200 quiz questions to a popular book on Rust and collected ~1,000,000 answers from ~60,000 people over 1 year. arxiv.org/abs/2401.01257
"Are Emergent Abilities of Large Language Models a Mirage?" is a NeurIPS outstanding paper!🙌🏿 Congrats especially to the students @RylanSchaeffer @BrandoHablando & other awardees. If you want to learn more, check out the oral & poster 👇🏿this afternoon (Dec 14) 1/2
"Are Emergent Abilities of Large Language Models a Mirage?" is a NeurIPS outstanding paper!🙌🏿 Congrats especially to the students @RylanSchaeffer @BrandoHablando & other awardees. If you want to learn more, check out the oral & poster 👇🏿this afternoon (Dec 14) 1/2 https://t.co/bQDdrhGTB9
If you are interested in knowing how you can do energy-based sampling from language models, make sure to check our #NeurIPS23 paper titled “Structured Voronoi Sampling”...🧵 arxiv.org/pdf/2306.03061…
Congratulations to Rylan Schaeffer, Brando Miranda, Sanmi Koyejo for winning a best paper award at NeurIPS for this insightful paper. Are Emergent Abilities of Large Language Models a Mirage? arxiv.org/abs/2304.15004
One of the fundamental problems with probability notation in machine learning is due to the fact that few people really have a firm grasp on conditioning from a measure theoretical perspective. Another issue: random variables versus indexed collections of probability spaces.
Following up a weekend effort by another weekend effort: llama2. rs 🦀 github.com/leo-du/llama2.… In a single Rust file w/ * zero dependencies (i.e. custom rng w/ PCG) * zero lines of `unsafe` code (very 🦀!) * support user prompts * (almost) same performance
Following up a weekend effort by another weekend effort: llama2. rs 🦀 github.com/leo-du/llama2.… In a single Rust file w/ * zero dependencies (i.e. custom rng w/ PCG) * zero lines of `unsafe` code (very 🦀!) * support user prompts * (almost) same performance
We summarized the #acl2023nlp Toronto conference for you with some poster recordings and author interviews! 👇 🎬 youtu.be/-Agcr0nawuk Featuring @s_tworkowski @jasivan_s @kundan_official @ebugliarello @leoduw @_florianmai @franz_nowak @PaulDarm @MoritzPlenz and @JayAlammar 👏
Q: Does my LM leak probability onto infinite strings? A: For RNNs and PFSAs you need to test, but Transformers always generate EOS in finite time (prob=1). 🤔First we need to formalize the question… cs.jhu.edu/~jason/papers/… #ACL2023 poster Tue 11am w/@leoduw @ryandcotterell et al
Is the following question directly answerable? We introduce CREPE -- a new QA task for identifying and correcting false presuppositions (backgrounded assumptions) in questions based on world knowledge. arxiv.org/abs/2211.17257 (also @ 12:15 Tuesday @ Metropolitan East #ACL2023)
Except Rust. Rust is a goddamn miracle x.com/taliaringer/st…
Except Rust. Rust is a goddamn miracle x.com/taliaringer/st…
If I can print this on a tshirt I’m wearing this to ACL
If I can print this on a tshirt I’m wearing this to ACL https://t.co/mSQPmrfkAi
-"What do you call a group of algebraic topologists?" -"The Fundamental Group."
Congrats @xtimv !!!
Tiwa Eisape @tiwa_eisape
1K Followers 1K Following PhD student at @MIT working on NLP and cognitive science - @NSF grfp fellow. Previously with @GoogleAI and @Meta FAIRYahan Li @YLiiiYLiii
52 Followers 90 Following Graduate Student @JHUCompSci. Research Assistant @jhuclsp. Previously CSE Undergrad @UMich. Interested in Clinical NLP and LLM☀️Rais Latif @RaisLatif_Study
39 Followers 5K Following Hi I'm Rais. I'm mainly focussing on Math and Science lifelong. There is a lot to discover in these fields and my mind is always blown by all the cool things.Benjamin Dayan @BenjaminDayan
69 Followers 174 FollowingShiftySloth @DriftySloth
0 Followers 7 FollowingNikola Selic @nikola_selic
182 Followers 1K Following ex SDE intern @ AWS | MSc @TU_Muenchen | I try to make cool stuff, noisy input gang. Opinions expressed entirely my own.nuri @bigrealxx
373 Followers 4K FollowingDhruv Agarwal @dhruvagarwal17
207 Followers 549 Following CS PhD student @UMassAmherst IESL @UMass_NLP, working on ML for NLP. Multi-step reasoning, retrieval augmentation, and automated scientific discovery.Saibo-Creator @SaiboGeng
121 Followers 159 Following CS PhD @ EPFL 🇨🇭 | focusing on Constraining LLMs using Parsing | Incoming intern at MicrosoftNikhil Sharma @nikhilsksharma
235 Followers 617 Following Incoming PhD in HAI @JohnsHopkins | Information Seeking | Disinformation Agents | Copilots for Social Good | PhD @JHUCLSP @JHUMCEH #NLProcEdgeAI Geek @edgeaiguy
1K Followers 5K Following Crafting AI solutions for tiny devices. | Ex-Samsung |Yan Du @duyanbj
21 Followers 91 FollowingWenting Zhao @wzhao_nlp
812 Followers 356 Following PhD student @cornell_tech Food for life, NLP for soul!Tongfei Chen @ctongfei
372 Followers 651 Following Researcher in #NLProc; Functional programmer @scala_lang; {Natural | Programming} language enthusiast; NLP/ML/PL. Tweets are my ownAdyasha Maharana @adyasha10
546 Followers 644 Following PhD Student @uncnlp. Interests: data efficiency, vision+language, causality, AI+health. Previously PRIOR@allen_ai, @AdobeResearch, @sciomellc, @IHME_UW, @IITKgppkms🍕 @PrakamyaMishra
698 Followers 1K Following Applied Research Engineer @AMD | Researcher @ BioNLP Lab @manningcics | Ex-Applied Scientist Intern @AmazonScience | MS CS @manningcicsShruti (they/she) �.. @shrutibkoch
1K Followers 5K Following Indigenous PhD aspirant nagivating thru academic barriers | MH equity, Psych of social inequalities |mitugan @mityabor
104 Followers 1K FollowingYu Zhang @yzhang_cs
93 Followers 366 Following PhD Student @ Soochow University, working on efficient methods for LLMs; a disciple of parallel programming.Dr. Adam Erickson �.. @admercs
1K Followers 5K Following Building @Nervosys. Co-founded @Wingcopter @UBCUAS. Invented hybrid AI land models, AI⋂ESM⋂EO. PGP: D682 515D BB36 AD9AKyle Marieb @kylemarieb
740 Followers 5K Following Profoundly deaf with cochlear implants 🦻🤖 YouTube Backend SWE 📺Holden Lee @oldheneel
140 Followers 54 Following Researcher in math and computer science Writer of science fiction and fantasyleonanor @leonanor237820
29 Followers 159 FollowingMarc Garcia @datapythonista
1K Followers 227 Following #pandas core dev Developer, speaker, trainer, advisor #python #polars #data #rust #rustlang #arrow #apachearrow #linuxCharles Page @CharlesPag95488
76 Followers 1K FollowingWill Merrill @lambdaviking
2K Followers 569 Following Ph.D. student @ NYU🗽 Theoretical aspects of NLP and LMs /nætʃɹəl/🇮🇸 + formal🤵 languages + TCS🧮Lone Striker @LoneStriker1
41 Followers 527 FollowingArcturus 🌥️ 🦇.. @Arcturus_f
651 Followers 3K Following Follow the arc @dClimateDAO 🌥️ @KERNEL0x #kb3 fellow #YFI ~lighut-sarpenSasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzOgnjen Todic @ognjen_todic
495 Followers 2K Following Entrepreneur & Engineer. Building on-device speech recognition solutions @ https://t.co/mrhEaR4f9O and organically growing the business.Sami Nas 👨⚕�.. @digitalhealthxx
8K Followers 9K Following Senior functional/technical consultant to bring added value via #digitalhealth #ai and #datascience based solutions #MedTwitterYuntian Deng @yuntiandeng
3K Followers 3K Following #NLProc Postdoc @ai2_mosaic | Assistant Professor @UWaterloo '24 | Faculty Affiliate @VectorInst '24 | PhD @Harvardharold C @haroldc2022
62 Followers 2K FollowingAnonymous User @peachorangesdae
21 Followers 141 FollowingRaphael Schumann @RaphiRaph_
368 Followers 1K Following Natural Language Processing PhD Student @ Heidelberg University.Vincent Lordier @vlordier
573 Followers 4K FollowingDouglas Drumond Kayam.. @douglasdrumond
1K Followers 5K Following Eng manager and developer BSc in CompSci @unicampoficial MBA in Business Analytics & Big Data @FGV MSc student in CompSci w/ AI @UoY_CS Married to @letochieNathaniel Weir @Nathaniel_Weir
506 Followers 851 Following PhD candidate @jhuclsp working on reasoning. Formerly @ai2_aristo, MS Semantic Machines, @MSFTResearch, @BrownCSDept. On the job market (industry/postdoc)Philipp Hennig @PhilippHennig5
6K Followers 321 Following Professor for the Methods of Machine Learning at the University of Tübingen.Tom Goldstein @tomgoldsteincs
23K Followers 2K Following Professor at UMD. AI security & privacy, algorithmic bias, foundations of ML. Follow me for commentary on state-of-the-art AI.Jonas Geiping @jonasgeiping
2K Followers 612 Following Machine Learning Research at the ELLIS Institute & MPI-IS // Investigating fundamental questions in Safety, Security, Privacy & Efficiency of modern MLYunmo Chen @YunmoChen
311 Followers 296 Following Ph.D. candidate in Computer Science #NLProc at @jhuclsp | Once @Apple | Twice @MSFTResearch | Twice @AmazonWenting Zhao @wzhao_nlp
812 Followers 356 Following PhD student @cornell_tech Food for life, NLP for soul!Tongfei Chen @ctongfei
372 Followers 651 Following Researcher in #NLProc; Functional programmer @scala_lang; {Natural | Programming} language enthusiast; NLP/ML/PL. Tweets are my ownLingfeng Shen @Lingfeng_nlp
290 Followers 525 Following MS student @jhuclsp, Research on #NLP and #ML @MercedesAMGF1 Fan!Guanghui Qin @hiaoxui
79 Followers 56 Following Ph.D. student in Natural Language Processing at Johns Hopkins University.Holden Lee @oldheneel
140 Followers 54 Following Researcher in math and computer science Writer of science fiction and fantasyWill Merrill @lambdaviking
2K Followers 569 Following Ph.D. student @ NYU🗽 Theoretical aspects of NLP and LMs /nætʃɹəl/🇮🇸 + formal🤵 languages + TCS🧮Will Crichton @tonofcrates
7K Followers 164 Following Cognitive engineer, incoming assistant professor @BrownUniversity.AI Coffee Break with .. @AICoffeeBreak
7K Followers 391 Following 📺 ML Youtuber https://t.co/ZDot8670KO 👩🎓 PhD student in Computational Linguistics @ Heidelberg University | Impressum: https://t.co/sKu3Rh0sQ4Csaba Szepesvari @CsabaSzepesvari
8K Followers 704 Following "If there is not folly in the world, then the world itself is folly. You must understand that mistakes are not always regrets." - Paul Tobin, Bandette🤠Jifan Zhang @jifan_zhang
192 Followers 202 Following Ph.D. @WisconsinCS @WIDiscovery | Previously BS/MS @uwcse, internship @Meta @Google | Label-Efficient Learning, Active Learning, Large Pretrained ModelsAmirhossein Kazemneja.. @a_kazemnejad
838 Followers 482 Following Grad student in NLP @Mila_Quebec, @mcgillu, and @rllabmcgill. Working on Transformers and generalizationFreda Shi @fredahshi
2K Followers 674 Following Starting July 2024: Asst. Prof. @UWCheritonCS @VectorInst, #nlproc #compling Now: PhD Student @TTIC_Connect Ex-@PKU1898, @MetaAI, @GoogleDeepMind Feeder of 3 🐈Franz Nowak @franz_nowak
143 Followers 179 Following PhD Student at @CSatETH, Natural Language Processing enthusiastRuixiang Cui @ruixiangcui
271 Followers 454 Following final-year PhD Student at @coastalcph. ex-intern @MSFTResearch. ex-visitor @stanfordnlp. LLM eval, multilinguality, compositional generalization. he/they.Zhaofeng Wu @zhaofeng_wu
1K Followers 171 Following PhD student @MIT_CSAIL | Previously @ai2_allennlp | MS'21 BS'19 BA'19 @uwnlpKarolina Stanczak @karstanczak
515 Followers 446 Following NLP & ML PhD candidate @uni_copenhagen @CopeNLUAaron Jaech @AaronJaech
275 Followers 562 FollowingJulian Eisenschlos @eisenjulian
1K Followers 988 Following Math, NLP, Deep Learning • Google @DeepMind • Previously @ASAPP & @facebook • Co-founder @BotMaker_ioShijie Wu @EzraWu
2K Followers 1K Following LLM Research engineer/scientist at @Bloomberg AI. PhD at @jhuclsp. ex @AIatMeta. He/Him. Opinions are my own. DM open. Threads @ezra_wuVilém Zouhar @zouharvi
2K Followers 2K Following PhD student @ ETH Zürich | all aspects of #NLProc but mostly HCI, evaluation and MT | go #veganAndreas Opedal @OpedalAndreas
208 Followers 189 Following PhD student in NLP and Computational Linguistics at @ETH_enLinxing Preston Jiang @lpjiang97
308 Followers 194 Following PhD student @uwcse interested in theoretical neuroscience.Talia Ringer 🟣 �.. @TaliaRinger
26K Followers 6K Following Professor, @plfmse, @IllinoisCS! Proof Automation. @SigplanM & CCF Founder. Israeli-American for peace, equality, & justice. They/היא, ND, bi. די לכיבושPo-Shen Loh @PoShenLoh
8K Followers 252 Following Social entrepreneur • Intl Math Olymp Fndn VP Advancement • Founder https://t.co/3P8QqOQ9QX NOVID expii • Math Professor @CarnegieMellonKeenan Crane @keenanisalive
27K Followers 452 Following Digital Geometer, Assoc. Prof. of Computer Science & Robotics @CarnegieMellon @SCSatCMU and member of the @GeomCollective. https://t.co/edHwujkFsAXuhui Zhou @nlpxuhui
688 Followers 430 Following PhD student @LTIatCMU. Previously, @GeorgiaTech, @UWNLP, and @Apple. Social Intelligence in language +X. He/Him.🐳Kevin Buzzard @XenaProject
9K Followers 0 Following Mathematician learning Lean and trying to teach it to others. Now gone to Mathstodon (March 2023). No longer reading or replying to mentions.Alfredo Canziani @alfcnz
86K Followers 268 Following Musician, math lover, cook, dancer, 🏳️🌈, and an ass prof of Computer Science at New York UniversityDesh Raj @rdesh26
3K Followers 2K Following Research Scientist @Meta (AI Speech) | Previously: @jhuclsp, @IITGuwahatiYuntian Deng @yuntiandeng
3K Followers 3K Following #NLProc Postdoc @ai2_mosaic | Assistant Professor @UWaterloo '24 | Faculty Affiliate @VectorInst '24 | PhD @HarvardAnej Svete @AnejSvete
125 Followers 85 Following PhD fellow at @ETH_en @ETH_AI_Center, supervised by @ryandcotterell and @val_boeva. Working on understanding language models through formal language theory.Chen Zhao @henryzhao4321
635 Followers 348 Following Assistant Professor NYU Shanghai, Postdoc NYU, PhD @umdclip doing NLP research, bridge playerRyan Adams @ryan_p_adams
34K Followers 1K Following Machine Learning Researcher, CS Professor (@PrincetonCS), Dad, WoodworkerGianni Gastaldi @gian.. @gastaldi_gianni
233 Followers 593 Following Philosopher of (formal) sciences and CS/NLP researcher @ETH. President @HaPoComputing. Formerly Professor @mocontemporain, Philosophy @ENS_ULMMathieu Blondel @mblondel_ml
9K Followers 421 Following Research scientist at Google DeepMind. Current research interests: differentiable programming, LLMs, Transformers.Aki Nishimura @_aki_nishimura
549 Followers 133 Following (Bayesian) Statistician / Data Scientist for Public Health / Statistical Software Developer / Assistant Professor @jhubiostatMolei Tao @MoleiTaoMath
386 Followers 140 Following Associate Professor at Georgia Tech, mathematician, machine learner, physicist, ex-semiprogamerDavid Alvarez Melis @elmelis
2K Followers 2K Following Asst. Prof. @hseas || Researcher @MSRNE || ML + NLP || Previously: @MIT_CSAIL NYU @IBMResearch @ITAM_mxJacob Buckman @jacobmbuckman
5K Followers 372 Following Founder @manifest__ai. PhD candidate @MILAMontreal. Formerly @jhuclsp, @GoogleAI, @SCSatCMU.Peter Shor @PeterShor1
19K Followers 102 Following Discovered Shor's algorithm for prime factorization on quantum computers.Wondering how much progress was delayed by the chinchilla optimality paper, and people assuming that was a “given” because it came from DeeepMind.
Guess who's about to become Rhode Island's #1 Rustacean and #2 document engineer (behind Andy van Dam, ofc). Catch me as an assistant prof at Brown CS in 2025!
@yoavgo While working on (arxiv.org/abs/2403.09636) we discovered that we're able to retain many metrics including perplexity and many downstream tasks for very high compression ratios. Then we evaluated on MMLU and the score was terrible. From that point on our goal changed to getting…
I feel like everyone that followed me after the H1B post will be supremely disappointed when I resume posting long threads about neural transducers 👀
Statistics and ML theory would sound so much cooler if we called everything a power law.
I think conference orgs need to set policy for LLMs in experiments. Is NOT using proprietary systems (GPT, Claude, ...) a major concern? Is ONLY using proprietary systems a major concern? In reviews of my papers, and as AC, I've seen both, and it's super unfair to authors.
Often, when top writers/coders complain they are not getting much out of LLMs, folks like to pile on to tell them they are not using LLMs correctly. But people speak from their realities without understanding others.
On a platform like X, with a large sampling of the tech population, and if statistics hold, there is a larger population of X users not in the top decile of any given thing. So it’s easy to build a “consensus” that a specific application of LLM is useful for *everyone*.
“I’m not able to learn mathematics easily, I have to work. It takes a very long time and I have a terrible memory. I forget things. So I try to work, despite these handicaps, and the way I worked was trying to understand really well the simple things.” newscientist.com/article/242319…
Conformal maps (differentiable over the complex plane) can be defined infinitesimally as circle packings. en.wikipedia.org/wiki/Conformal… en.wikipedia.org/wiki/Holomorph…
My biggest challenge in life has consistently been: what skills that I don’t have a natural talent for are worth honing? Figuring out what’s holding you back is signal, but it’s ambiguous signal: sometimes you should grind a skill out and sometimes you should find a new goal.
My kindest sympathies for the Bayesian Optimization folks
Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, redish colors to hyperparameters for which training diverges.
Another book which I need is "The Parts of Statistical Physics which have Rigorous and Elementary Proofs".
@npparikh I've tried bits and pieces and never found a text which really landed. In a way, I want the book which treats control concepts for an algorithms audience. There are versions of this, but they're often not quite thorough enough on background, or not modern enough on applications.
If the current AI safety scaremongers existed in Bell Labs they would've gagged Shannon from publishing his Information Theory paper since it outlines principles that govern all communications. “𝙒𝙝𝙖𝙩 𝙞𝙛 𝙥𝙚𝙤𝙥𝙡𝙚 𝙪𝙨𝙚 𝙩𝙝𝙚𝙨𝙚 𝙥𝙧𝙞𝙣𝙘𝙞𝙥𝙡𝙚𝙨 𝙩𝙤…
Imagine if Bell Labs kept their research breakthroughs under wraps
Often reviewers see fancy math, sophisticated proof techniques, etc., as an end in itself. In machine learning research, these are sometimes a necessary evil, but hardly desirable. We want to achieve a good outcome as simply as possible! Our goal should be *reducing* complexity.
Slowly realizing that two days ago I successfully defended my PhD! 🤯 I’m extremely grateful to my supervisors, @IAugenstein and @ryandcotterell, my PhD committee, @SergeBelongie, @pascalefung, @licwu, and all of my colleagues and collaborators!
Massive congrats to @karstanczak for passing her PhD defence with flying colours! 🎊🥂🥳 Very proud of you 🤗🥹 Thanks to @SergeBelongie @pascalefung @licwu for serving on the committee. Karolina’s thesis on multilingual gender bias probing: di.ku.dk/english/resear… #NLProc
Student: What does dx in an integral mean? Me (thinking): It’s a smooth section of the exterior power of the cotangent bundle. Me (out loud): It’s a tiny change in x.