Qian Huang @qhwang3
@xai | CS PhD student @StanfordAILab q-hwang.github.io Palo Alto, CA Joined March 2017-
Tweets96
-
Followers2K
-
Following277
-
Likes232
Language models today are trained to reason either 1) generally, imitating online reasoning data or 2) narrowly, self-teaching on their own solutions to specific tasks Can LMs teach themselves to reason generally?🌟Introducing Quiet-STaR, self-teaching via internal monologue!🧵
Announcing Score Entropy Discrete Diffusion (SEDD) w/ @chenlin_meng @StefanoErmon. SEDD challenges the autoregressive language paradigm, beating GPT-2 on perplexity and quality! Arxiv: arxiv.org/abs/2310.16834 Code: github.com/louaaron/Score… Blog: aaronlou.com/blog/discrete-… 🧵1/n
Congrats to all the authors got accepted by #WWW2024 If you already plan a trip to WWW, welcome to submit papers to our WWW Graph Foundation Model Workshop (GFM) . For more details, please visit the official website: www24gfm.com The submission deadline is February 5.
Can deep learning work on small data with far more features than samples? We present PLATO: a method that achieves the state-of-the-art on such datasets by using prior domain information! neurips.cc/virtual/2022/p… 🧵 Published in #NeurIPS2023 with @ren_hongyu @KexinHuang5 @jure
I am at NeuriPS 2023 this week and will have poster at every morning poster session 😝 Happy to chat!
I reverse-engineered AlphaCode2's submission history and manually performed the Codeforces evals. I'm ... again concerned that data leakage is affecting the results. For the DP problem highlighted in the AlphaCode2 release, look at AC2's solution vs. the tutorial. (1/5)
Happy to OSS gpt-fast, a fast and hackable implementation of transformer inference in <1000 lines of native PyTorch with support for quantization, speculative decoding, TP, Nvidia/AMD support, and more! Code: github.com/pytorch-labs/g… Blog: pytorch.org/blog/accelerat… (1/12)
If you want a summary of the major events of the recent OpenAI drama, I made a timeline of the major events plotted on a prediction market of whether Sam Altman will remain CEO. Data taken from @ManifoldMarkets (1/3)
📢 I'm on the faculty job market for 2024! 📢 Grateful for any RTs and pointers! My vision is to develop biomedical AI systems that exhibit generalist capabilities, see e.g. nature.com/articles/s4158… Below a selected overview of my prior & ongoing work:
I’m on the academic job market! I’ll have a PhD from @Stanford CS in 2024. My research develops ML + network science methods to tackle complex societal challenges, from pandemics to polarization to supply chains. See my website + research statement for details! Highlights below:
Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistPetar Veličković @PetarV_93
30K Followers 555 Following Staff Research Scientist @GoogleDeepMind | Affiliated Lecturer @Cambridge_Uni | Associate @clarehall_cam | GDL Scholar @ELLISforEurope. Monoids. 🇷🇸🇲🇪🇧🇦Michael Galkin @michael_galkin
5K Followers 268 Following AI Research Scientist @Intel AI Lab. Prev: Postdoc @Mila_Quebec & McGill. GraphML, Knowledge Graphs, GNNs, NLP. Grandmaster of 80's music (according to Spotify)Learning on Graphs Co.. @LogConference
7K Followers 749 Following LoG is a new annual research conference that covers areas broadly related to machine learning on graphs and geometry, with a special focus on review quality.Horace He @cHHillee
24K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleWeihua Hu @weihua916
6K Followers 1K Following Graphs. Deep Learning. Currently at https://t.co/O6xlgZ1LWi. Previously CS Ph.D. @Stanford RS Intern @Meta @GoogleDeepMind BS/MS University of Tokyo.rishi @RishiBommasani
4K Followers 2K Following Stanford CS PhD @StanfordCRFM @StanfordNLP @StanfordAILab @StanfordHAI Advisers: @percyliang @jurafsky Previous: @CornellCIS @clairecardie #FoundationModelsYuanqi Du @YuanqiD
2K Followers 959 Following Passionate researcher and community builder @AI_for_Science @LogConference; CS PhD @Cornell; Prev @DeepModeling, @AmlabUva, @MSFTResearchMichael Bronstein @mmbronstein
43K Followers 4K Following #DeepMind Professor of #AI @UniofOxford / Fellow @ExeterCollegeOx / ML Lead @ProjectCETI / https://t.co/kZpGpDzYeVZhaocheng Zhu (on the.. @zhu_zhaocheng
2K Followers 286 Following Final-year PhD @Mila_Quebec. BSc @PKU1898. Intern @Google. Reasoning, large language models, knowledge graphs and ML systems. Photographer held back by CS/ML.Emanuele Rossi @emaros96
4K Followers 590 Following ML for Drug Discovery @vant_ai. Previously, research @Twitter and FabulaAI (acquired by Twitter). PhD in Graph ML at @imperialcollege and @Cambridge_Uni alumnusDerek Lim @dereklim_lzh
2K Followers 1K Following ML @MIT_CSAIL & @LiquidAI_ Symmetries in ML @bostonsymmetry Prev @NVIDIA @MetaAI @Cornell.Mengzhou Xia @xiamengzhou
3K Followers 621 Following PhD student @princeton_nlp, MS @CarnegieMellon, Undergrad at Fudan.Miles Brundage @Miles_Brundage
43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.Kexin Huang @KexinHuang5
2K Followers 562 Following PhD Student @Stanford CS with @jure; Machine Learning + BiomedicineThomas Kipf @tkipf
25K Followers 1K Following AI Research at @GoogleDeepMind. Ex-Physicist. Graph Neural Networks & Controllable Generative Models (e.g. GCNs, Structured World Models, Slot Attention).Denny Zhou @denny_zhou
9K Followers 420 Following @GoogleDeepMind founder & lead of Reasoning Team. Build LLMs to reason. Opinions my own.Jeff Dean (@🏡) @JeffDean
296K Followers 6K Following Chief Scientist, Google DeepMind and Google Research. Co-designer/implementor of things like @TensorFlow, MapReduce, Bigtable, Spanner, Gemini .. (he/him)depubohaz1987 @depubohaz152709
18 Followers 35 Followingsadf @7777777777777fs
40 Followers 213 FollowingManling Li @ManlingLi_
3K Followers 421 Following Postdoc @Stanford, Incoming Assistant Professor @Northwestern, PhD @UIUC. Working on Knowledge Foundation Models, especially for Multimodal data (Language + X).keqin @Keqin_Chen
1 Followers 91 FollowingLuli Boubou @Luliboubou
48 Followers 257 Following As product manager, mostly working to get it right and done. For the rest, constantly learning to live.Wu Nickel @wu_nickel99419
1 Followers 31 FollowingJack Reacher @JackReach516
71 Followers 1K FollowingVofey RNeo @VofeyRneo
0 Followers 1K Followingcyk @ychai1224
5 Followers 398 FollowingYtkkk @Monotonik_
15 Followers 816 FollowingBen Schulz @schulzb589
1K Followers 5K Following 3D Geospatial Analyst at Maxar Space Operations. Opinions expressed on this site are my own and do not necessarily represent the views of Maxar Technologies.LouiΞPΞcan 🔩📠.. @louiepecan
918 Followers 5K Following grep 'the loot'👀🤖 Let none ignorant of geometry enter here📠💯 Stack moar GPUs 💎🤌 #aiart #stablediffusion #deadfellazStephen @stphnftz
14 Followers 309 Following artificial intelligence; computer science; mathematics; physics; biology; neuroscience; linguistics; psychology; philosophy; artDJ😼- e(nergy)/acc @sourabhrj
180 Followers 2K Following We must start using (green) energy as money. Energy is the real currency of the Universe #UBE #greenenergydollar #UBI Msc ELE Eng @DTU dk 🇮🇳 ➡️🇩🇰Sal Spina @supasal34
632 Followers 468 FollowingConsuela Rinaldo @ConsuelaR21060
60 Followers 5K FollowingBhargavkc22 @bhargavkc22
249 Followers 4K FollowingVaibhav @vaibhav_pandey
317 Followers 692 Following Building LLM powered tools for jobseekers | Prev: Product/Revenue at Infoedge, Gradeup (acquired by Byjus)Hui CHEN @chchenhui
53 Followers 399 Following Postdoc @ NTU, Ph.D. in #NLProc @sutdsg, B.Eng. in CS @ZJU_china.Ani Vadavatha @AnithaVadavatha
2K Followers 5K Following Partner/Founder/Advisor/Technologist in AI/ Biotech/ Web3 @aiona.ai @abplusventures @CapitalCode @UrthCapitalZirui Cheng @Zirui_Cheng_
96 Followers 460 Following Undergraduate Student @Tsinghua_Uni | Formerly Visiting @CarnegieMellon @UCSanDiegoPECET0X64 @PECET0X64
11 Followers 668 FollowingLe (Lena) Huang @LeHuang9
163 Followers 568 Following Bioinformatics and Computational Biology PhD at @unc in @yunliunc lab | ex Biostatistician Intern @Merck BARDS | Big Fan of Joe Hisaishihenry o @Henry0244
33 Followers 361 Following PhD student @Ajou university specializing in Social Commerce, Blockchain, and Marketing Information Systems.dandan he @DandanH65748
1 Followers 10 FollowingJun Du @dujun001
108 Followers 2K FollowingHadiovski @Hadiovski
11 Followers 410 Followingjovial @grepNstep
40 Followers 2K Following Retweet != endorsement. Trust those who seek the truth, doubt those who find itMrHobbo221 @hobbo221
72 Followers 912 Following Fly on the wall who likes to listen to people smarter than him.Shashank Sangar @ShashankTesla
16 Followers 204 Following Recruiting at Tesla AI for Core Autonomy (Autopilot & Optimus)Zory Zhang @zory_zhang
67 Followers 624 Following @IllinoisCS Reason2Learn: sample-efficient human-like learning (via explantion and abstraction) + persuasive and generalizable inference (via analogy reasoning)joao @jay_wooow
7K Followers 3K Following CPO @catena_labs | don’t decelerate, decentralize | exploring inner, outer, and latent space | prev: @jump_, @protocollabs, @GoogleAI | @oiioxfordssteevens @Steevens43
159 Followers 5K FollowingSankeerth Rao Karingu.. @sankeerth1729
569 Followers 2K Following Founder, CEO @Stealth ex-Research Scientist @Google Research Ph.D. in ML @ UC San Diego Undergrad in EE @ IIT BombayWeiyan Shi @shi_weiyan
3K Followers 696 Following Postdoc @StanfordNLP, incoming assistant professor @Northeastern, PhD @Columbia| Prev Intern @MetaAI |Co-created CICERO | persuasive chatbots + privacy #nlprocEjafa Bassam @EjafaBassam
12 Followers 229 Following Graduate Student @PKU1898 Researching DL/RL and applications of AI @PKUCS1978Varun Talwar @vt_65
22 Followers 5K FollowingFANVince @FANVince
76 Followers 1K FollowingXin Xu @XinXuNLPer
65 Followers 315 Following CS M.E.@ZJU_China Ex-intern @MSFTResearch Asia Incoming CSE Ph.D. Student @ucsd_cse Model Editing, Debiasing, IE, Music AIDavis Brown @davisbrownr
352 Followers 982 Following Research in interpretability, science of deep learning, safety and security @pnnlab. Opinions my own.AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxYann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Andrej Karpathy @karpathy
980K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistPetar Veličković @PetarV_93
30K Followers 555 Following Staff Research Scientist @GoogleDeepMind | Affiliated Lecturer @Cambridge_Uni | Associate @clarehall_cam | GDL Scholar @ELLISforEurope. Monoids. 🇷🇸🇲🇪🇧🇦Google DeepMind @GoogleDeepMind
944K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.François Chollet @fchollet
470K Followers 769 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.Horace He @cHHillee
24K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemaleWeihua Hu @weihua916
6K Followers 1K Following Graphs. Deep Learning. Currently at https://t.co/O6xlgZ1LWi. Previously CS Ph.D. @Stanford RS Intern @Meta @GoogleDeepMind BS/MS University of Tokyo.Anthropic @AnthropicAI
262K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.rishi @RishiBommasani
4K Followers 2K Following Stanford CS PhD @StanfordCRFM @StanfordNLP @StanfordAILab @StanfordHAI Advisers: @percyliang @jurafsky Previous: @CornellCIS @clairecardie #FoundationModelsAndrew Ng @AndrewYNg
1.0M Followers 913 Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCsYi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼Michael Bronstein @mmbronstein
43K Followers 4K Following #DeepMind Professor of #AI @UniofOxford / Fellow @ExeterCollegeOx / ML Lead @ProjectCETI / https://t.co/kZpGpDzYeVSasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzZhaocheng Zhu (on the.. @zhu_zhaocheng
2K Followers 286 Following Final-year PhD @Mila_Quebec. BSc @PKU1898. Intern @Google. Reasoning, large language models, knowledge graphs and ML systems. Photographer held back by CS/ML.Shunyu Yao @ShunyuYao12
7K Followers 858 Following Language agents (ReAct, Reflexion, Tree of Thoughts) for digital automation (WebShop, SWE-bench, SWE-agent)xAI @xai
997K Followers 36 FollowingKen Liu @kenziyuliu
448 Followers 778 Following CS PhD @StanfordAILab. Thinks about ML privacy, security, localization, trustworthiness. Prev @SCSatCMU, @GoogleAI, @Sydney_Uni 🇦🇺Keiran Paster @keirp1
1K Followers 638 Following Currently PhD at the University of Toronto. Fall 2023 student researcher at Google. Training sequence models. Recent: APE, STEVE-1, OpenWebMath, Llemma.Hongyi Wang @HongyiWang10
1K Followers 1K Following Senior Project Scientist @mldcmu @CarnegieMellon; MLSys researcher; Member @llm360; Ph.D. @WisconsinCS; On the academic job market NOW!Miles Brundage @Miles_Brundage
43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.Omar Shaikh @oshaikh13
579 Followers 798 Following CS Ph.D. student @Stanford - previously @GeorgiaTech - also @[email protected]Jie Huang @jefffhj
4K Followers 569 Following Ph.D. Candidate at UIUC🌽; Formerly @GoogleDeepmind @NVIDIAAI @AmazonScience. #NLProc Large Language ModelsBingbing Wen @bingbingwen1
135 Followers 375 Following PhD student @UW_iSchool LLMs in expertise domain, multi-modality | prev intern @MSFTResearch | #NLProcChuang Gan @gan_chuang
4K Followers 457 Following Faculty Member at UMass Amherst; Principal researcher at MIT-IBM Watson AI Lab; Homepage: https://t.co/oXP6pqXCpoJuntao Ren @JuntaoRen
53 Followers 206 FollowingXindi Wu @cindy_x_wu
938 Followers 807 Following PhD student @PrincetonCS | Data-centric multimodal ml | prev @RealityLabs @roboVisionCMU @CMU_Robotics @SnapchatChris Yao Du @yao53513502
435 Followers 3K Following PhD@HKUST Computer Vision, Medical Image AnalysisPeiyang Song @p_song1
205 Followers 545 Following Honors CS Undergrad @UCSB_CCS. SURF fellow at Anima AI+Science Lab @caltech. Researcher @UCSBArchLab. Fmr @Tsinghua_Uni @NKU1919.Shuyan Zhou @shuyanzhxyc
2K Followers 594 Following Ph.D. student @LTIatCMU working on agents | she/theyZhiwei Deng @ZhiweiDeng8
32 Followers 47 Following Memory Research Scientist @ Google Research Postdoc @ Princeton University, CSOfir Press 🖋 @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Stanford NLP Group @stanfordnlp
145K Followers 179 Following Computational Linguists—Natural Language—Machine Learning @chrmanning @jurafsky @percyliang @ChrisGPotts @tatsu_hashimoto @MonicaSLam @Diyi_Yang @StanfordAILabRylan Schaeffer @RylanSchaeffer
3K Followers 979 Following CS PhD student with @sanmikoyejo at @stai_research @StanfordAILabDenny Zhou @denny_zhou
9K Followers 420 Following @GoogleDeepMind founder & lead of Reasoning Team. Build LLMs to reason. Opinions my own.Yunyi Shen/申云逸 .. @ShenRaphael
395 Followers 188 Following PhD student @MIT EECS: Bayesian statistics/kinda machine learning/carnivore. Wildlife photographer & HAM (KD9TZJ). once UWMadison/PKU, MSc statistics/ecologySeungone Kim @seungonekim
928 Followers 832 Following Incoming Ph.D. student @LTIatCMU, M.S. student @kaist_ai working on LLM Evaluation & Systems that Improve with (Human) Feedback | Prev: @yonsei_u @NAVER_AI_LabShangbin Feng @shangbinfeng
1K Followers 1K Following PhD student @uwcse @uwnlp. Understanding and expanding the knowledge abilities of LMs, social NLP, networks and structures. he/him. #水文学家near @nearcyan
45K Followers 882 Following https://t.co/IdaJwZJCXm partner @ https://t.co/9g1MIgjiqc dms openAlex Gu @minimario1729
2K Followers 2K Following phd @MIT_CSAIL, llm for math and code. intern @MetaAI and analyst @pillar_vc. prev @BigCodeProject, @MITIBMLab, @JaneStreetGroup, @PonyAI_techTom Lieberum @lieberum_t
948 Followers 178 Following Trying to reduce AGI x-risk by understanding NNs Interpretability RE @DeepMind BSc Physics from @RWTH GWWC pledgee @ https://t.co/Vh2bvwhuwdXuechen Li @lxuechen
2K Followers 900 Following Building intelligence @xai. PhD @Stanford. Undergrad @UofT. Worked at @GoogleAI @MSFTResearch @Vectorinst. I go by Chen.Hamed Nilforoshan @h_nilforoshan
387 Followers 52 Following CS PhD student at Stanford, advised by @jure. ML/Data Science for inequality+health | Formerly ML Data Scientist @Airbnb, and @Columbia CS w/ @sirriceYusuf Roohani @yusufroohani
605 Followers 382 Following Machine Learning & Systems Biology. Currently PhD @StanfordAILab, prev. @GSK, @CarnegieMellon.John Hewitt @johnhewtt
4K Followers 22 Following CS PhD @stanford with @stanfordnlp. Frmr. @penn, intern @deepmind, @googleai, ++. Understanding and improving neural learning from language. Co-teach CS 224n.Evan Hernandez @evanqed
333 Followers 138 Following ph.d. student @mit | building @evidenceopen | formerly @google @uwmadison | #nlproc and loud musicChristopher Manning @chrmanning
127K Followers 116 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋ML4H @SymposiumML4H
2K Followers 59 Following Machine Learning for Health (ML4H) • New Orleans 2023• #ml4h2023 • Contact: [email protected]Ananya Kumar @ananyaku
4K Followers 472 Following Researcher at @openai Previously PhD at Stanford University (@StanfordAILab) advised by Percy Liang and Tengyu MaBinghao Huang @binghao_huang
670 Followers 946 Following CS PhD student @UIUC, advised by @YunzhuLiYZ. Previous Mechanical and Aerospace Engineering @UCSD, advised by @xiaolonw. Embodied AI/Robot LearningSerina Chang @serinachang5
2K Followers 381 Following On the 2023-2024 academic job market | CS PhD candidate at Stanford, advised by @jure and @jugander. Networks, ML, public health, computational social science.Ben Prystawski @BenPrystawski
474 Followers 387 Following Cognitive science PhD student @Stanford, studying cultural evolution and reasoning. Previously undergrad @UofT.sarah guo // convicti.. @saranormous
91K Followers 3K Following startup investor and builder, founder @w_conviction. accelerating AI adoption, interested in progress. tech podcast: @nopriorspodMaddy Bowers @mattlbowers
322 Followers 616 Following PhD Student at MIT working in program synthesis and ML. Interested in abstraction learning and applying PL methods to AI. they/she@jure @james_y_zou @michiyasunaga @qhwang3 @KexinHuang5 @kaidicao It has been a great pleasure working on this project with the following amazing people: My advisors @jure and @james_y_zou Collaborators from Amazon: Vassilis N. Ioannidis, Karthik Subbian Labmates at Stanford: Shiyu Zhao*, @michiyasunaga, @KexinHuang5 , @kaidicao , @qhwang3…
Thrilled to release 🌟STaRK 🌟 - A large-scale LLM retrieval benchmark on semi-structured knowledge bases. While LLMs excel at reasoning and semantic retrieval, they struggle with more complex tasks. Especially when real-world user queries require a combination of unstructured…
Language models today are trained to reason either 1) generally, imitating online reasoning data or 2) narrowly, self-teaching on their own solutions to specific tasks Can LMs teach themselves to reason generally?🌟Introducing Quiet-STaR, self-teaching via internal monologue!🧵
Announcing Score Entropy Discrete Diffusion (SEDD) w/ @chenlin_meng @StefanoErmon. SEDD challenges the autoregressive language paradigm, beating GPT-2 on perplexity and quality! Arxiv: arxiv.org/abs/2310.16834 Code: github.com/louaaron/Score… Blog: aaronlou.com/blog/discrete-… 🧵1/n
Announcing it 2 months after the work was done, but gpt-fast now supports Mixtral + MoE models! Featuring: - faster decoding than any (non-Groq) API endpoint, at up to 220 tok/s/user. - no custom kernels - int8/TP - still simple! How do we do it? Well, torch.compile :) (1/5)
i wish we had more of these types of papers - ones that almost propose a line of thought experiments rather than a set of new tricks - yet remain great reads nonetheless. can you train a language model without fixed token embeddings? arxiv.org/abs/2305.16349
@WeizmannScience We encapsulate these properties in our swebench.com benchmark I think @qhwang3's MLAgentBench is another great example of a good way to benchmark LMs, and @jyangballin's InterCode is also great. These benchmarks are great interactive envs to push LMs to the limit
Congrats to all the authors got accepted by #WWW2024 If you already plan a trip to WWW, welcome to submit papers to our WWW Graph Foundation Model Workshop (GFM) . For more details, please visit the official website: www24gfm.com The submission deadline is February 5.
Thank you to the ML4H 2023 Program Committee! @serinachang5 @dr_nyamewaa @bonadossou @qhwang3
Honored to win Poland's best CS master thesis prize for my work on long context LLM w/ @PiotrRMilos🎉 Can't make it to #NeurIPS2023😭, but @CStanKonrad will present LongLLaMA paper tmr! Thu 10:45, Poster #326, Session 5 Interested in extending context to 256K? Come and say hi!
Can deep learning work on small data with far more features than samples? We present PLATO: a method that achieves the state-of-the-art on such datasets by using prior domain information! neurips.cc/virtual/2022/p… 🧵 Published in #NeurIPS2023 with @ren_hongyu @KexinHuang5 @jure
We had a splendid poster session yesterday! Thanks for stopping by and the great discussions! 🤩 Loved how one person who went: "Zero-shot? That's impossible!🤯" #NeurIPS2023 @h_nilforoshan
I’ll be at #NeurIPS2023 Wed-Sat. DM if you’d like to chat! Come see our spotlight paper Thurs morning with @ericzelikman @qhwang3 @GabrielPoesia @noahdgoodman. Also come see the amazing work of people in the lab at @aloeworkshop @IMOLNeurIPS2023 zelikman.me/parselpaper/
I’m presenting 2 papers at #NeurIPS2023 on data-centric ML for large language models: DSIR (targeted data selection): Wed Dec 13 @ 5pm DoReMi (pretraining data mixtures): Thu Dec 14 @ 10:45am Excited to chat about large language models, data, pretraining/adaptation, and more!
What an amazing, insightful panel discussion at @SymposiumML4H morning session on “Global Health and Health Equity”! I’ll be moderating the afternoon session on “Generative AI: the road ahead” - be sure to check it out!
I’m at Neurips 2023 all week. Happy to talk with anyone about PyTorch, ML compilers, LLM inference, etc. I’d especially encourage folks to reach out if you’re just starting to get into ML systems.
As mentioned previously, I found AlphaCode2 accounts, and through stalking their submission history, I manually performed the AlphaCode2 Codeforces evals. Overall, very impressive! I arrive at a rating of ~1650, which is the 85-90th percentile of CF users. (1/19)
I reverse-engineered AlphaCode2's submission history and manually performed the Codeforces evals. I'm ... again concerned that data leakage is affecting the results. For the DP problem highlighted in the AlphaCode2 release, look at AC2's solution vs. the tutorial. (1/5)