Banghua Zhu @BanghuaZ

PhD @Berkeley_EECS, statistics, info theory, LLM, RL, Human-AI Interactions. people.eecs.berkeley.edu/~banghua/ Berkeley, CA Joined August 2018

Tweets 212
Followers 2K
Following 804
Likes 2K

Weijie Su @weijie444

5 days ago

Ongoing lawsuits against GenAI firms over possible use of #copyrighted data for training raise vital questions for our society. 🤖⚖️ How can we address the copyright challenges? New research proposes a solution: "An Economic Solution to Copyright Challenges of Generative AI"

1 13 64 6K 27

Vivek Raghunathan @vivek7ue

6 days ago

Excited to announce #SnowflakeArctic, our new OSS LLM. Play with it at arctic.streamlit.app Read our cookbook at snowflake.com/en/data-cloud/… Read our blog at snowflake.com/blog/arctic-op… We are just getting started ...

sridhar @RamaswmySridhar

6 days ago

32 87 562 219K 110

4 15 72 14K 17

Sebastien Bubeck @SebastienBubeck

a week ago

phi-3 is here, and it's ... good :-). I made a quick short demo to give you a feel of what phi-3-mini (3.8B) can do. Stay tuned for the open weights release and more announcements tomorrow morning! (And ofc this wouldn't be complete without the usual table of benchmarks!)

40 182 907 418K 294

Banghua Zhu @BanghuaZ

a week ago

Chatbot Arena usually captures the combination of two aspects: basic capability + human preference alignment. In terms of basic capability, it seems to be not yet at GPT-4 level, judging from all benchmark metrics. But Llama3 did a really great job on human preference alignment, likely…

lmsys.org @lmsysorg

a week ago

30 165 1K 1.1M 312

0 1 13 3K 1

Aviral Kumar @aviral_kumar2

a week ago

Many LLM fine-tuning methods. Unclear what you should use & why? In our new paper, we did an extensive study of on-policy RL, supervised & offline contrastive methods (DPO, IPO) to answer this... 🧵⬇️ On-policy > offline, mode-seeking > mode-covering understanding-rlhf.github.io

3 65 270 32K 246

Banghua Zhu @BanghuaZ

a week ago

Very excited about the release of Arena-Hard, the main benchmark we looked at when selecting the checkpoints for the Starling model. It focuses on a subset of very hard prompts from Chatbot Arena.

lmsys.org @lmsysorg

a week ago

20 123 639 119K 272

1 3 34 7K 9

Zhengyao Jiang @zhengyaojiang

a week ago

Llama3 reminds everyone of the misconception about scaling laws again: it's not that a larger model is always better, but that a larger model is cheaper to train if you want to reach the same performance. Yes, this might be somewhat counter-intuitive, but this is one of the key…

3 36 236 46K 115

Andrej Karpathy @karpathy

2 weeks ago

Congrats to @AIatMeta on Llama 3 release!! 🎉 ai.meta.com/blog/meta-llam… Notes: Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ @lmsysorg :)) 400B is still training, but already encroaching…

145 1K 8K 840K 2K

Hanna Hajishirzi @HannaHajishirzi

2 weeks ago

Introducing our best OLMo yet. OLMo 1.7-7B outperforms LLaMa2-7B, approaching LLaMa2-13B at MMLU and GSM8k. High-quality data and staged training are key. I am so proud of our team making such significant improvement in a short period after our first release.

Allen Institute for AI @allen_ai

2 weeks ago

13 44 170 66K 41

2 13 105 11K 14

Beidi Chen @BeidiChen

2 weeks ago

📢We're thrilled to announce that Kurt Keutzer will give the keynote speech for MLSys 2024 Young Professionals Symposium. Welcome to join us for exciting invited talks by @Azaliamirh, Xupeng Miao, @jiawzhao , @ying11231 , @tri_dao on cutting-edge MLSys research! The full…

1 9 60 9K 1

Dianbo Liu @DianboLiu

3 weeks ago

Welcome to our AI tea talks Singapore series. The very first talk will be given by Prof. Natasha Jaques from UW/Google Deepmind about Reinforcement learning with human feedback. Zoom link: nus-sg.zoom.us/j/84608066438Z… meeting ID: 846 0806 6438 All are welcome to join.…

1 13 91 61K 8

Yuandong Tian @tydsh

3 weeks ago

I will give a keynote at the Theoretical Foundations of Foundation Models (TF2M) workshop at ICML'24 and be a panelist to discuss interesting topics.

Theoretical Foundations of Foundation Models @tf2m_workshop

3 weeks ago

1 8 45 22K 10

0 8 68 9K 12

Banghua Zhu @BanghuaZ

3 weeks ago

Check out the ICML workshop on Theoretical Foundations of Foundation Models!

Theoretical Foundations of Foundation Models @tf2m_workshop

3 weeks ago

1 8 45 22K 10

0 3 21 3K 2

Song Mei @Song__Mei

3 weeks ago

My group at Berkeley Stats and EECS has a postdoc opening in the theoretical (e.g., scaling laws, watermark) and empirical aspects (e.g., efficiency, safety, alignment) of LLMs or diffusion models. Send me an email with your CV if interested!

1 23 95 20K 29

Kanishk Gandhi @gandhikanishk

3 weeks ago

Language models struggle to search, not due to an architecture problem, but a data one! They rarely see how to search or backtrack. We show how LLMs can be taught to search by representing the process of search in language as a flattened string, a stream of search (SoS)!

7 112 579 94K 546

Kangwook Lee @Kangwook_Lee

4 weeks ago

In @myhakureimu's recent work, we observed something very similar! Consider this prompt: 3+5=9 5+10=16 3+4=8 1+1=? LLMs will answer 2! What if we provide hundreds of examples? LLMs will give up the original definition of "addition", and will start predicting 3!

Anthropic @AnthropicAI

4 weeks ago

83 348 2K 500K 871

3 10 92 16K 32

Bill Yuchen Lin 🤖 @billyuchenlin

4 weeks ago

🆕 Check out the recent update of 𝕎𝕚𝕝𝕕𝔹𝕖𝕟𝕔𝕙! We have included a few more models including DBRX-Instruct @databricks and StarlingLM-beta (7B) @NexusflowX which are both super powerful! DBRX-Instruct is indeed the best open LLM; Starling-LM 7B outperforms a lot of even…

Bill Yuchen Lin 🤖 @billyuchenlin

2 months ago

24 111 541 218K 224

3 32 127 44K 25

Banghua Zhu @BanghuaZ

4 weeks ago

Huge congrats to the amazing folks at lmsys! Vicuna and chatbot arena are really important milestones in the field of open source and LLMs!

lmsys.org @lmsysorg

a month ago

7 21 198 39K 22

0 1 17 3K 0

Sebastian Raschka @rasbt

a month ago

Just wrote a new article on "Tips for LLM Pretraining and Evaluating Reward Models" (magazine.sebastianraschka.com/p/tips-for-llm…). Here, I am reviewing a paper that discusses strategies for continuing LLM pretraining. Then, I discuss reward modeling used in reinforcement learning with human…

2 119 611 48K 411

Lewis Tunstall @_lewtun

a month ago

I ran the new #DBRX Instruct model through 4 benchmarks that have high correlation with the @lmsysorg Chatbot Arena and measure different capabilities: 💬 MT Bench: a multi-turn chat benchmark that uses GPT-4 as a judge. Known to suffer from length bias & is somewhat noisy, but…

9 11 128 58K 66

Pauthare @PautharepxNK

0 Followers 19 Following

Hui Xu @HuiXu43118541

3 Followers 44 Following

Akash @pocuseverything

2K Followers 5K Following

Thutoez @thutoez71050

0 Followers 181 Following

Shanita Sachar @SacharSac

17 Followers 3K Following

Anubis @anput_anubis

75 Followers 580 Following ODTÜ

Dmitry Lyalin @LyalinDotCom

9K Followers 6K Following Product @ Google | Firebase serverless lead (web, compute, storage & AI & ML). Previously product @MSFT | 24+ years in tech .. dev, PMM, PM Opinions are my own

Scoanitosm @scoanitosm43603

0 Followers 72 Following

Siloughf @siloughf4685

0 Followers 179 Following

EarthaEva @W1RF1BW3E7nFka4

0 Followers 89 Following

Dorothy @dorothy_austin5

132 Followers 3K Following

Henry John @HenryJohn125977

0 Followers 6 Following

Jungwon Choi @JungwonChoi11

16 Followers 33 Following Assistant professor @ UWECE / Power Electronics/HF Power Converter/WPT/Renewable Energy System

Guannan Qu @guannanqu

115 Followers 80 Following Assistant Professor at CMU Machine learning, control, reinforcement learning, multi-agent systems

Mickel Liu @mickel_liu

100 Followers 235 Following research visiting @uwnlp, Prev: @PKU1898, @uoftengineering RL + LLM

Linjun Zhang @linjunz_stat

475 Followers 540 Following Assistant Professor of Statistics @RutgersU

DawnBruce @i9VUCN077txLrk

0 Followers 169 Following

วิวรรณา @3Vu2TQSIem5X6

47 Followers 1K Following What kind of fate have we met? If you like, feel free to follow first. I will post contact information on the front page from time to time.

younghoax @younghoax20

454 Followers 7K Following Doctor, stocks, crypto,AI

Purring Lynx @Purring_Lynx

35 Followers 121 Following ° autonomous systems engineer ¶ ° wired in via RJ45 ¶ ° running serial experiments on AI ¶ ° effectively accelerating ¶ ° pet lynx of @_Mira___Mira_

Sizhe Zhou @SizheZhou189667

73 Followers 616 Following MS @IllinoisCS | BEng @SJTU1896

neo @neobyd

49 Followers 392 Following

Yifang Chen @cloudwaysX

455 Followers 641 Following Ph.D. student @uwcse. Previously @usc undergrad. Online Learning, reinforcement learning, bandits, and active learning.

Jiaxin Huang @jiaxinhuang0229

296 Followers 54 Following Incoming assistant professor @WUSTL CSE. PhD Candidate @IllinoisCS. Currently visiting @uwnlp. NLP, ML, Data Mining.

Florine Colletta @CollettaFl70045

83 Followers 5K Following

dumbol @dumbol6

84 Followers 1K Following redart

Zhenting Wang @wang1999_zt

79 Followers 234 Following PhD Student @RutgersCS. Trustworthy and Responsible Generative Artificial Intelligence. Intern @SonyAI_global (current) @Meta GenAI (incoming)

Shu @Rainb0ish

688 Followers 6K Following The girl with broken tooth. I fancy neurosynaptic chips more than potato chips. Love reading scientific papers & procrastination. Violin.hypocrite. fan ig

Enio Fernandes @EN1O

197 Followers 4K Following Teacher

jh w @jhw990164844563

0 Followers 10 Following

Programming Engineer & Linux+ | IT & Net+ | CCIE & CISSP | Azure Developer & Multi-Clouds Architect+ | Quantum AI Builder+ | #الحمدلله_على_نعمة_الامارات 🇦🇪 ❤️

【𝕐o𝕦𝕤𝕖.. @YosGPT

Arnav Das @arnaved

75 Followers 325 Following

郝博阳 @ekaths

15 Followers 345 Following

Michael M. Pieler @MichaelMPieler

332 Followers 1K Following

Muhammad Abdullah @Abdullah_kwl

42 Followers 501 Following Life is better when you're laughing...... "your time is limited,So don't waste it living someone else's life❤

Jonathan Wang @givemettt5600

23 Followers 179 Following

Tianle (Tim) Li @LiTianleli

13 Followers 10 Following EECS Undergraduate at UC Berkeley. ML Researcher at @BerkeleySky and @lmsysorg

Gantavya Bhatt @BhattGantavya

548 Followers 1K Following Ph.D. Student @UW, MELODI Lab and @uw_wail at @uwcse Formerly @amazonscience, EE undergrad @iitdelhi. An active photographer and Alpinist!

Giulia Fanti @giuliacfanti

2K Followers 675 Following Assistant prof @ CMU ECE studying privacy, data sharing, and generative models

Dimitris Papailiopoul.. @DimitrisPapail

11K Followers 976 Following prof @ wisconsin; thinking about transformers; learning in context; babas of Inez Lily

Delsie Specter @specter60156

86 Followers 5K Following

Yujia Qin @TsingYoga

950 Followers 262 Following Doing a startup right now, LLM+Agent

Allen Zhou @zhoujinjing09

285 Followers 1K Following AI Alchemist @TensorChord

Woosuk Kwon @woosuk_k

2K Followers 351 Following PhD student at @Berkeley_EECS building @vllm_project

Elachqar Oussama @Oussama_e

60 Followers 2K Following

Korrapati Hemanth @Hemanth2k22

78 Followers 1K Following Effective Accelerationism

Evan @evan_a_frick

6 Followers 12 Following CS at Berkeley. ML Research @berkeley_ai ML Engineer @NexusflowX

MesubsetofRunionC @mesubsetof

33 Followers 471 Following

Eternal Max @k9TWEQPCC5d2dUG

66 Followers 459 Following

daniel (e/acc) ⚡ @luckfvoursme

395 Followers 6K Following Web 3 👾/ Industria 4.0 / Sociedades 5.0 🌱

Shishir Patil @shishirpatil_

3K Followers 850 Following CS PhD @ UC Berkeley. Creator of Gorilla, GoEx, RAFT, OpenFunctions and Berkeley Function Calling Leaderboard. Previously researcher @GoogleAI @MSFTResearch

Irene Chen @irenetrampoline

8K Followers 817 Following ML for equitable healthcare. Assistant Professor @UCBerkeley and @UCSF. Prev @Harvard, @MIT, @MSFTResearch

UW News @uwnews

23K Followers 2K Following Experts, research and administration news from the University of Washington. Media assistance: uwnews@uw.edu. See also: @UW @UWAthletics @UWMedicine

University of Washing.. @UW

186K Followers 2K Following University of Washington students, faculty and staff believe in boundless opportunities. Do you dare to Be Boundless? At the UW, you can.

Mechanical Engineerin.. @ME_at_UW

2K Followers 418 Following Our faculty and students create a healthier, cleaner and more prosperous world. @UW @UWEngineering

UW Student Life @uwstudentlife

5K Followers 466 Following Follow us and learn more about student life at the University of Washington!

UW Population Health .. @UW_PHI

2K Followers 187 Following The @UW Population Health Initiative seeks to create a world where all people can live healthier and more fulfilling lives.

UW iSchool @uw_ischool

6K Followers 2K Following Official account of the University of Washington (@UW) Information School, one of the world's top schools in information science. We make information work.

UW Alumni @UWalum

9K Followers 742 Following The UW Alumni Association is the foundation of the University of Washington alumni community. We connect alumni and friends around the world to the UW!

UW Medicine Newsroom @uwmnewsroom

7K Followers 1K Following Newsroom reports news from UW Medicine and the University of Washington School of Medicine. We cover clinical care, research, education and issues.

Ana Mari Cauce @amcauce

9K Followers 792 Following @UW president, loves teaching, learning & dawgs of all kinds, advocate for access & excellence, sings Bow Down to WA w/a rumba beat #GoHuskies

UW College of Educati.. @UWCollegeOfEd

5K Followers 880 Following From changemakers to educators, we create the leaders of tomorrow. Share your UW College of Education experience with #EduDawgs.

UW Engineering @uwengineering

11K Followers 2K Following Research and administration news from the University of Washington’s College of Engineering. See also: @UWNews and @UW.

UW ECE @uw_ece

2K Followers 606 Following Electrical & Computer Engineering at the University of Washington is a top-ranked, vibrant department, leading in cutting-edge science, technology & innovation.

UW NLP @uwnlp

11K Followers 160 Following The NLP group at the University of Washington.

Volkan Cevher @CevherLIONS

3K Followers 579 Following Associate Professor of Electrical Engineering, EPFL. Amazon Scholar. ELLIS Fellow.

Tiffany Ding @tifding

210 Followers 81 Following Statistics PhD student @UCBerkeley

Peng Ding @pengding00

2K Followers 385 Following Associate Professor of Statistics

Ahmad Al-Dahle @Ahmad_Al_Dahle

4K Followers 53 Following #Girldad of twins. Leading GenAI @ Meta (llama, imagine, meta ai and more)

Sheng Shen @shengs1123

1K Followers 540 Following Ph.D. student @berkeley_ai; Building 🦙@MetaAi; Former @MSFTResearch, @allen_ai, @GoogleDeepMind

Sergey Edunov @edunov

948 Followers 103 Following Director of Engineering @ GenAI, Meta. I work on Llamas

Simon Willison @simonw

71K Followers 5K Following Creator @datasetteproj, co-creator Django. PSF board. @nichemuseums. Hangs out with @natbat + @cleopaws. He/Him. Mastodon: https://t.co/t0MrmnJW0K

Güçlü Gökozan @GucluGokozan

17K Followers 2K Following • CEO & Entrepreneur ⛵️🏀 https://t.co/4HjOvw5eyT

Susan Murphy lab @SusanMurphylab1

3K Followers 86 Following Designing trial and developing data analytic methods for informing intervention optimization in digital health

Zhuang Liu @liuzhuang1234

3K Followers 933 Following Research Scientist @MetaAI (FAIR, at NYC). machine learning, computer vision, neural networks. PhD from @Berkeley_EECS

Natasha Jaques @natashajaques

25K Followers 1K Following Senior Research Scientist at @GoogleAI and Assistant Professor @uwcse. Social Reinforcement Learning in multi-agent and human-AI interactions. PhD from @MIT.

Ananya Kumar @ananyaku

4K Followers 472 Following Researcher at @openai Previously PhD at Stanford University (@StanfordAILab) advised by Percy Liang and Tengyu Ma

Shashank Sonkar @shashank_nlp

51 Followers 396 Following NLP+Education | Grad Student @rbaraniuk group | @RiceECE @rice_dsp @OpenStax @IITKanpur

Enric Boix @eaboix

18 Followers 3 Following PhD Student at MIT EECS

Nezihe Merve Gürel @nmervegurel

1K Followers 481 Following Faculty @TUDelft, Prev. @ETH_en @Stanford @EPFL_en Interested in ML robustness, reliability and reasoning Exec. Editor of https://t.co/qddizl9xTb @DMLRJournal

Ryan David Cotterell @ryandcotterell

9K Followers 1K Following

kamalikac @kamalikac

4K Followers 1K Following Machine Learner. https://t.co/hqhkWR570O

Hanna Hajishirzi @HannaHajishirzi

6K Followers 328 Following Associate professor at @uw_cse; senior director at @allen_ai co-leading @allenNLP; AI/NLP researcher at @uw_nlp

Dan Alistarh @DAlistarh

387 Followers 87 Following Professor at IST Austria

Ananda Theertha Sures.. @th33rtha

521 Followers 125 Following Researcher in machine learning and information theory.

Lisa Dunlap @lisabdunlap

502 Followers 154 Following PhD student & vibe curator @berkeley_ai and Sky Computing Lab -- for the love of god look at your data

Jeff Harris @jeffintime

3K Followers 883 Following the malleability of minds. API product @openai

Satya Nadella @satyanadella

3.3M Followers 286 Following Chairman and CEO at Microsoft

@Toong @TianDatong

293 Followers 2K Following Intelligent Symbiosis - All Things Connected, A Light in the Rift. https://t.co/jffHiZGZ7u

sridhar @RamaswmySridhar

6 hours ago

Seen on the @WSJ ! Join us at snowflake.com/summit/

3 6 127 7K 5

William Wang @WilliamWangNLP

3 hours ago

It’s the time of the year when new faculty members are about to choose their offers and start a faculty job. 🤩🤩🤩 I have some advice that I wish I had known when I first started: 1/6

2 4 45 6K 23

Alex Dimakis @AlexGDimakis

16 hours ago

Nice collection of finetuning datasets

Maxime Labonne @maximelabonne

a day ago

💾 LLM Datasets LLM development is increasingly moving towards curating high-quality datasets, as shown by Llama 3. I've compiled a collection of fine-tuning datasets along with advice and tools for creating your own. 💻 GitHub: github.com/mlabonne/llm-d…

22 150 753 55K 720

0 0 9 3K 5

Interconnects @interconnectsai

17 hours ago

Phi 3 and Arctic: Outlier LMs are hints Models that seem totally out of scope from recent open LLMs give us a sneak peek of where the industry will be in 6 to 18 months. interconnects.ai/p/phi-3-and-ar…

0 3 15 8K 8

Graham Neubig @gneubig

20 hours ago

Tell me that you're a language model from X corporation without telling me you're a language model from X corporation.

7 1 54 8K 6

Srinivas Narayanan @snsf

20 hours ago

Memory is available to all ChatGPT Plus users. We hope that you will find the answers become more personalized and relevant over time with use.

OpenAI @OpenAI

a day ago

Memory is now available to all ChatGPT Plus users. Using Memory is easy: just start a new chat and tell ChatGPT anything you’d like it to remember. Memory can be turned on or off in settings and is not currently available in Europe or Korea. Team, Enterprise, and GPTs to come.

372 804 5K 1.4M 861

2 3 34 4K 1

Ahmad Beirami @abeirami

21 hours ago

I'll be at #AISTATS2024 later this week! With Madhow, we will co-present @BhagyashreePu13's poster on TEXP to improve robustness with a tweak to the first layer of the network. Looking forward to meeting old and new friends!

Bhagyashree Puranik @BhagyashreePu13

3 weeks ago

📢📢📢 Late post, but here we go...! I am thrilled to announce that our work on 𝙚𝒏𝙝𝒂𝙣𝒄𝙞𝒏𝙜 𝙤𝒖𝙩-𝙤𝒇-𝒅𝙞𝒔𝙩𝒓𝙞𝒃𝙪𝒕𝙞𝒐𝙣 𝙧𝒐𝙗𝒖𝙨𝒕𝙣𝒆𝙨𝒔 of deep neural networks has been accepted to 𝘼𝑰𝙎𝑻𝘼𝑻𝙎 2024!

2 5 48 18K 18

1 5 34 3K 5

Kangwook Lee @Kangwook_Lee

22 hours ago

I'm honored to receive the Amazon Research Award🎉 My group will be exploring how to use LLMs better, guided by principles of information and coding theory. Special thanks to @myhakureimu @yzeng58 and @yingfan_bot, who are already actively engaged in this exciting research 😊

Amazon Science @AmazonScience

4 days ago

The recipients, representing 51 universities in 15 countries, will have access to Amazon public datasets, AWS AI/ML services and tools, and more. Congrats to the 99 awardees! #AmazonResearchAwards amazon.science/research-award…

1 8 72 50K 25

9 4 113 11K 13

William Fedus @LiamFedus

2 days ago

After years eclipsed by its big brothers, gpt-2 resurgent? 🤔

Boris Dayma 🖍️ @borisdayma

2 days ago

The hype for finding out what is "gpt2-chatbot" on lmsys chatbot arena is real 😅

5 8 121 135K 16

2 3 24 9K 4

Hao Zhang @haozhangml

2 days ago

(perhaps) the most important topic in LLMs -- the data recipe!

Snowflake @SnowflakeDB

4 days ago

We’re excited to share insights and lessons learned collecting the data needed for Arctic as part of our #SnowflakeArctic Cookbook Series. 📖 Our third edition covers the filtering, processing, and composition techniques we used, including what worked and what didn't.

1 5 35 10K 13

0 1 25 6K 10

Irene Chen @irenetrampoline

2 days ago

Professor life is off to a great start! Honored to receive a grant from Apple ML Research and to be named a Google Research Scholar. Looking forward to more work developing ML methods for healthcare and equity Pictured: an apple, Google, and me

33 20 1K 102K 106

Nathan Lambert @natolambert

4 days ago

Snowflake's Arctic LLM team must literally be cooking

4 6 117 14K 18

Gautam Kamath @thegautamkamath

5 days ago

A suit jacket and a backpack is the universal uniform of the academic job interview in CS

4 1 44 42K 1

Sebastien Bubeck @SebastienBubeck

4 days ago

@srush_nlp Hey Sasha, I think it makes sense. Phi-3 is fundamentally different from other models, so its behavior can be unexpected in some cases, both in a good and bad way (hopefully though much more in a good way ;-)).

3 0 16 4K 2

Ahmad Al-Dahle @Ahmad_Al_Dahle

5 days ago

What a week since we released Llama 3! I couldn’t be more proud of the response. 🏆 Llama 3 70B is now the highest ranking open model on @lmsysorg leaderboard. 📈 1.2M+ downloads. 🤗 600+ derivative models on @huggingface. I'm excited for much more to come.

19 22 236 30K 19

Vivek Raghunathan @vivek7ue

5 days ago

Excited to partner w/ @vipulved @percyliang @tri_dao and team on this!

Together AI @togethercompute

5 days ago

Together AI and Snowflake partner to bring their state-of-the-art Arctic LLM to enterprise customers. Experience Arctic on Together Inference with best in class performance. api.together.xyz/playground/cha…

1 17 82 12K 13

0 2 22 3K 3

Mengdi Wang @MengdiWang10

5 days ago

Ppl ask: Why not simply add gradient to the backward sampling process of a diffusion model? Big NO! 🚩Naive gradients don't work as guidance!🚩 A naive gradient jeopardizes the data manifold learnt from pre-training. We show in theory and experiment that it takes samples far away…

0 16 116 11K 81

FoundationsAISeminars @FAIS_Warwick

5 days ago

🎉🎉🎉We're thrilled to announce the kickoff of our Foundations of AI Seminar (FAIS) series, featuring an impressive lineup of speakers, starting tomorrow. Our first seminar is a special one, as we are honoured to welcome Prof. Volkan Cevher @CevherLIONS from @EPFL_en.