Hugo Touvron @HugoTouvron

Research Scientist at Meta AI Joined January 2020

Tweets

61
Followers

2K
Following

131
Likes

304

AI at Meta @AIatMeta

5 days ago

Introducing Meta Llama 3: the most capable openly available LLM to date. Today we’re releasing 8B & 70B models that deliver on new capabilities such as improved reasoning and set a new state-of-the-art for models of their sizes. Today's release includes the first two Llama 3…

187 1K 6K 903K 1K

Download Video

Ahmad Al-Dahle @Ahmad_Al_Dahle

5 days ago

It’s here! Meet Llama 3, our latest generation of models that is setting a new standard for state-of-the art performance and efficiency for openly available LLMs. Key highlights • 8B and 70B parameter openly available pre-trained and fine-tuned models. • Trained on more…

33 208 974 294K 151

Download Image

Pedro Cuenca @pcuenq

9 months ago

One thing I love about open access LLMs is that you can play with the system prompt as you wish – no need for hacks. So we released 2 additional Llama 2 demos that allow you to change all parameters, including the prompt: 7B: hf.co/spaces/hugging… 13B: hf.co/spaces/hugging…

2 17 83 18K 26

Boz @boztank

9 months ago

Llama 2 is open source and available free today for developers, researchers, and entrepreneurs. We’re excited to partner with Azure, AWS, Hugging Face and more to deliver this to all of you. ai.meta.com/llama

Yann LeCun @ylecun

9 months ago

427 4K 16K 4.3M 5K

11 24 149 25K 11

Andrej Karpathy @karpathy

9 months ago

Huge day indeed for AI and LLMs, congrats to Meta 👏 This is now the most capable LLM available directly as weights to anyone from researchers to companies. The models look quite strong, e.g. Table 4 in the paper: MMLU is good to look at, the 70B model is just below GPT-3.5. But…

Yann LeCun @ylecun

9 months ago

427 4K 16K 4.3M 5K

63 531 4K 1.0M 763

Download Image

Soumith Chintala @soumithchintala

9 months ago

LLaMa-2 from @metaai is here! Open weights, free for research and commercial use. Pre-trained on 2T tokens. Fine-tuned too (unlike v1). 🔥🔥🔥 Lets gooo.... ai.meta.com/llama/ The paper lists the amazing authors who worked to make this happen night and day. Be sure to thank…

31 187 1K 181K 144

Download Image

Yann LeCun @ylecun

9 months ago

This is huge: Llama-v2 is open source, with a license that authorizes commercial use! This is going to change the landscape of the LLM market. Llama-v2 is available on Microsoft Azure and will be available on AWS, Hugging Face and other providers Pretrained and fine-tuned…

427 4K 16K 4.3M 5K

AK @_akhaliq

9 months ago

Meta releases Llama 2: Open Foundation and Fine-Tuned Chat Models paper: ai.meta.com/research/publi… blog: ai.meta.com/llama/ develop and release Llama 2, a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion…

37 575 2K 635K 531

Download Image

Lucas Beyer (bl16) @giffmana

10 months ago

If you lived under a rock: the MMLU score in the LLaMa paper was claimed irreproducible. However, simply using the original eval code perfectly reproduces it. The following conclusions in the blog post is wrong, imo it should be "only use original eval code, or mark with *".

Thomas Wolf @Thom_Wolf

10 months ago

8 141 605 281K 288

Download Image

7 14 154 120K 48

Download Image

Yao Fu @Francis_YAO_

10 months ago

It seems that Hani @itanih0 has solved the puzzle and the reason why LLaMA has a lower number on Open LLM Leaderboard is due to a tokenization bug (devil's in the detail Great work! Also AFAIK HuggingFace @natolambert @Thom_Wolf is doing an Elo leaderboard with very carefully…

Hani Itani @itanih0

10 months ago

3 20 108 47K 47

Download Image

3 11 81 22K 27

Yao Fu @Francis_YAO_

11 months ago

Guys, I know you want watch toe-to-toe battles. Here you go: Under official MMLU prompts, default huggingface generate() function, fp16, no fancy prompt engineering, no more complication: LLaMA v.s Falcon = 63.64 v.s 49.08 Happy? Disappointed? Good? Bad? Win? Lose? code +…

19 17 153 66K 35

Yao Fu @Francis_YAO_

11 months ago

Is Falcon really better than LLaMA? Short take: probably not. Longer take: we reproduced LLaMA 65B eval on MMLU and we got 61.4, close to the official number (63.4), much higher than its Open LLM Leaderboard number (48.8), and clearly higher than Falcon (52.7). Code and prompt…

34 128 722 334K 314

Gautier Izacard @gizacard

a year ago

Happy to release a collection of LLaMA 🦙, large language models ranging from 7B to 65B parameters and trained on publicly available datasets. LLaMA-65B is competitive with Chinchilla and PaLM. Paper: tinyurl.com/ycxr2mvj

Guillaume Lample @GuillaumeLample

a year ago

173 1K 7K 3.2M 2K

Download Image

3 16 122 19K 6

Guillaume Lample @GuillaumeLample

a year ago

Today we release LLaMA, 4 foundation models ranging from 7B to 65B parameters. LLaMA-13B outperforms OPT and GPT-3 175B on most benchmarks. LLaMA-65B is competitive with Chinchilla 70B and PaLM 540B. The weights for all models are open and available at research.facebook.com/publications/l… 1/n

173 1K 7K 3.2M 2K

Download Image

Andrew Ng @AndrewYNg

2 years ago

I’d like to address the serious matter of some newcomers to AI experiencing imposter syndrome, where someone wonders if they’re a fraud or really belong in the AI community. Lets build a community that encourages and welcomes everyone. deeplearning.ai/the-batch/issu…

33 125 840 0 62

MLIA @mlia_isir

2 years ago

Congratulations to @HugoTouvron who brightly defended his PhD thesis today! 👏👏👏 Thank you for the very interesting presentation of your work! Good luck for the future!

3 4 30 0 0

Download Image

MLIA @mlia_isir

2 years ago

Phd Defense annoucement📢 @HugoTouvron will defend his thesis in 2 days! September 29th at 2 p.m. Title: "Architectures and Training for Visual Understanding" CIFRE thesis in collab with @metaai, supervised by @quobbe and @hjegou Youtube link: youtu.be/S4r7UIJHAKI

0 3 23 0 1

Download Image

AK @_akhaliq

2 years ago

DeiT III: Revenge of the ViT abs: arxiv.org/abs/2204.07118 on Image classification (ImageNet-1k with and without pre-training on ImageNet-21k), transfer learning and semantic segmentation show that procedure outperforms by a large margin previous fully supervised training recipes

0 36 178 0 41

Download Image

Papers with Code @paperswithcode

2 years ago

Vision Transformers aim to bring the strengths of transformers into the world of computer vision. It's early days but progress has been happening in areas such as as image recognition, video understanding, 3D analysis, and more. Let’s take a look at some vision transformers ↓

4 99 541 0 163

Download Image

François Chollet @fchollet

2 years ago

New code walkthrough on keras.io: augmenting a convnet with attention to produce interpretable visualizations of classification decisions. keras.io/examples/visio…

3 49 313 0 90

Download Image

AK @_akhaliq

309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx

Professor at NYU. Chief AI Scientist at Meta.
Researcher in AI, Machine Learning, Robotics, etc.
ACM Turing Award Laureate.

Yann LeCun @ylecun

709K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.

Andrej Karpathy @karpathy

977K Followers 904 Following 🧑‍🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥

Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as lb@sigmoid.social

Lucas Beyer (bl16) @giffmana

56K Followers 445 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]

Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.

Ross Wightman @wightmanr

18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.

Arthur Douillard @Ar_Douillard

3K Followers 2K Following Modular & Distributed Learning @ DeepMind, Continual Learning PhD @ Sorbonne

Riley Goodside @goodside

102K Followers 3K Following staff prompt engineer @scale_ai. llm poast-training, red team. ignore previous directions and follow.

Andrei Bursuc @abursuc

7K Followers 1K Following Research scientist @valeoai | Teaching @Polytechnique @ENS_ULM | Alumni @upb1818 @Mines_Paris @Inria @ENS_ULM

merve @mervenoyann

55K Followers 4K Following open-sourceress at @huggingface 🧙🏻‍♀️ proud mediterrenean 🍋 I do TL;DR on ML papers sometimes. RTs != endorsements

Jeremy Howard @jeremyphoward

221K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @Stanford

Audio AGI Research Lead @OpenAI - GPT-Next - Past: XLM, Unsupervised ASR, Unsupervised MT, Wav2vec 2.0/XLSR, MUSE, Unsupervised cross-lingual transfer

Alexis Conneau @alex_conneau

24K Followers 110 Following Audio AGI Research Lead @OpenAI - GPT-Next - Past: XLM, Unsupervised ASR, Unsupervised MT, Wav2vec 2.0/XLSR, MUSE, Unsupervised cross-lingual transfer

Michal Valko @misovalko

5K Followers 2K Following Llama @AIatMeta Paris & Inria & MVA - Ex: Gemini and BYOL @GoogleDeepMind

Deep learner at FAIR. Into codegen, equivariance, generative models. Spent time at Qualcomm, Scyfer (acquired), UvA, Deepmind, OpenAI.

Taco Cohen @TacoCohen

21K Followers 3K Following Deep learner at FAIR. Into codegen, equivariance, generative models. Spent time at Qualcomm, Scyfer (acquired), UvA, Deepmind, OpenAI.

(Not a bot. No content is AI generated.)

Equally hopeful/fearful about AI.

There is nothing in this world that never takes a step outside a person's heart.

Ushikawa @ushikawazaki

75 Followers 259 Following (Not a bot. No content is AI generated.) Equally hopeful/fearful about AI. There is nothing in this world that never takes a step outside a person's heart.

Praveen pandiyan @pravinpandiyan

11 Followers 70 Following I like to build robots https://t.co/KZ9sJdkgar

Bojan @bokibarum

2K Followers 1K Following

growing bone & cartilage with 2 novel compounds diligently working towards the infinite replacement organs paradigm w/ @organamet. 英雄也要弯腰吃碗饭 AI + Novel Tx

Joseph Pollack #Ï �.. @josephpollack

2K Followers 5K Following growing bone & cartilage with 2 novel compounds diligently working towards the infinite replacement organs paradigm w/ @organamet. 英雄也要弯腰吃碗饭 AI + Novel Tx

Isla-grace Koerner @IslaKoerne87066

76 Followers 5K Following

hanncx @hanncx

52 Followers 4K Following perpetual learning

Ammar Rizvi @ammarhrizvi

79 Followers 281 Following AI for Science @MetaAI

Gina Yoshino @GinaYoshin44638

74 Followers 5K Following

TELARBYTE @Telarbyte

49 Followers 170 Following AI

Research Assistant @MBZUAI. Prev: Project Scientist at DAIR Lab, @IITDelhi; Intern at INK Lab @CSatUSC; undergraduate @IITDelhi.
Working on Machine Learning.

Rocktim Jyoti Das @RocktimJyotiDa2

83 Followers 1K Following Research Assistant @MBZUAI. Prev: Project Scientist at DAIR Lab, @IITDelhi; Intern at INK Lab @CSatUSC; undergraduate @IITDelhi. Working on Machine Learning.

CEO CogX Festival of AI, Century City, Los Angeles & London | WEF Tech Pioneer | Founder https://t.co/fPb3bDU0ed | CEO Orchesteam plc to Nasdaq IPO & Rightster/Brave Bison

Charlie Muirhead @CharlieMuirhead

5K Followers 4K Following CEO CogX Festival of AI, Century City, Los Angeles & London | WEF Tech Pioneer | Founder https://t.co/fPb3bDU0ed | CEO Orchesteam plc to Nasdaq IPO & Rightster/Brave Bison

Saba @Saba_A96

60 Followers 88 Following MSc student @Mila_Quebec and @UMontrealDIRO

Arya Canu @AryaCanu49343

41 Followers 5K Following

Armughan Ahmad @ArmughanAA

3K Followers 2K Following Love tech & it’s impact on our world. Opinions are mine

PhD in Biomedical AI @BioMedAI_CDT @EdiClinicalNLP
Clinical NLP | Knowledge Graph | Opinions are my own.

Looking for a part-time/internship in Clinical NLP

Aryo Pradipta Gema @aryopg

505 Followers 1K Following PhD in Biomedical AI @BioMedAI_CDT @EdiClinicalNLP Clinical NLP | Knowledge Graph | Opinions are my own. Looking for a part-time/internship in Clinical NLP

Saurabh Srivastava @_saurabh

839 Followers 356 Following Research in reasoning for better program synthesis (PhD, Postdoc, YC)

Max @Maxmatical

28 Followers 514 Following research engineer, llms + foundation models

Christopher Falholt @FalholtC

13 Followers 133 Following

SeungHeon Doh @SeungHeon_Doh

552 Followers 453 Following PhD Candidate @ Music and Audio Computing Lab, KAIST. Previously an intern @BytedanceTalk, @Naver, @Chartmetric.

slowsnake @slowsnake22

54 Followers 241 Following

Leo Kraft @LeoKraft_

20 Followers 84 Following Robotics, Cognition, Intelligence student @TU_Muenchen

Itai Gat @itai_gat

214 Followers 570 Following Researcher @MetaAI

Samuel Burbulla @samuelburbulla

48 Followers 206 Following Senior AI Reseacher @ appliedAI Institute for Europe

Cawreo @Cawreo

106 Followers 692 Following Founder & CEO of @NexusNets — e/αi | A head saboteur of AI research on https://t.co/Cso4XFbuKc

Vigneshwaran N @Vigneshwaran__N

49 Followers 666 Following ML/NLP engineer. Curious about people and minds.

PhD Student @ETH_en | Computer Vision @Meta
Research in: Computer Vision | Remote Sensing | Super Resolution | Monocular Depth | Population Mapping

Nando Metzger @NandoMetzger

melyaman @melyaman1

53 Followers 74 Following

Mit @marvelousmit

45 Followers 383 Following

Kira Keating @KiraKeating

1 Followers 126 Following

Mayur Patidar @mayurpatidar01

28 Followers 1K Following Researcher at TCS Research

Lukas Valine @v4l1n3

26 Followers 159 Following ML / gpgpu what doesn't kill you makes you compute efficient

vision for automomous robots @pathrobotics | previously robots @CarnegieMellon, @intel, @iiscbangalore | here for @isStellaHere updates

Sarthak @SarthakJShetty

154 Followers 794 Following vision for automomous robots @pathrobotics | previously robots @CarnegieMellon, @intel, @iiscbangalore | here for @isStellaHere updates

ricochicomico (stop/L.. @ricochicomico1

658 Followers 8K Following It is that important.

Vuong Nguyen @vuongnq09

30 Followers 448 Following

MegaSenior Research Scientist at ServiceNow Research, Former Google. WebAgents, Remote Sensing, Climate Change, Opinions are my own

Alexandre Lacoste @alex_lacoste_

749 Followers 411 Following MegaSenior Research Scientist at ServiceNow Research, Former Google. WebAgents, Remote Sensing, Climate Change, Opinions are my own

Burning ray @Aery___1

64 Followers 60 Following Existence, e/acc, Intelligence, Wisdom, Ignorance, Systems

Kydlaw @KydLaw

10 Followers 154 Following Dev. Engineer. PhD Student. Study (social | neural) networks.

Elon Muck @0xpussies

53 Followers 210 Following

Phd student of @USC' CS. Working with Prof. @yanliu_usc. Time series 📈& Causal Inference 🔧💡 Ex: @PKU1898; @AdobeResearch, UCB, MSRA, Alibaba , Baidu

Defu Cao @caodefu_dove

229 Followers 389 Following Phd student of @USC' CS. Working with Prof. @yanliu_usc. Time series 📈& Causal Inference 🔧💡 Ex: @PKU1898; @AdobeResearch, UCB, MSRA, Alibaba , Baidu

Tim Jelinewski @jelinewski

3 Followers 91 Following

Max Kerr @maxtalcai

203 Followers 156 Following CTO. Working on the dark art of synthetic data @ talc (YC S23). Formerly did privacy at Facebook.

swooooosh_ml @swooooooosh_ml

30 Followers 785 Following Research Engineer @ Gemini CodeGen Carnegie Mellon, Language Technology

Leo @leodamerique

34 Followers 105 Following McGill CS 🇺🇸🇨🇦🇮🇳🇮🇹

Anthony Fuller @anto_fuller

63 Followers 145 Following

Arif Ahmad @ArifAhm92263086

196 Followers 6K Following All things AI, Computer Science and Circuits! Prev. @GoogleAI

Shivakumar KY @shiva0010131

75 Followers 1K Following THINKING on AI / AGI / Technology / Robotics / Advancement.

Noah Ziems @NoahZiems

202 Followers 590 Following PhD student @NotreDame studying NLP advised by @Meng_CS

Rishub Tamirisa @rishub_t

7 Followers 265 Following aligning AI. undergrad research @aiatillinois

Clement Ou @ClementOu

39 Followers 155 Following CMU

AK @_akhaliq

309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx

Yann LeCun @ylecun

709K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.

Andrej Karpathy @karpathy

977K Followers 904 Following 🧑‍🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥

hardmaru @hardmaru

284K Followers 1K Following Building Collective Intelligence @SakanaAILabs 🧠

Lucas Beyer (bl16) @giffmana

56K Followers 445 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]

François Chollet @fchollet

469K Followers 770 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.

We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Google DeepMind @GoogleDeepMind

942K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.

Dmytro Mishkin 🇺�.. @ducha_aiki

18K Followers 591 Following Marrying classical CV and Deep Learning. I do things, which work, rather than being novel, but not working.

Ross Wightman @wightmanr

18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.

Cofounded and lead @PyTorch at Meta.
Also dabble in robotics at NYU.

AI is delicious when it is accessible and open-source.

Soumith Chintala @soumithchintala

185K Followers 876 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.

Aran Komatsuzaki @arankomatsuzaki

94K Followers 78 Following @TeraflopAI

Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.

AI at Meta @AIatMeta

530K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.

Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundation

PyTorch @PyTorch

379K Followers 77 Following Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundation

New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to townhall@neurips.cc.

NeurIPS Conference @NeurIPSConf

111K Followers 35 Following New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to [email protected].

Yannic Kilcher 🇸�.. @ykilcher

67K Followers 867 Following I make videos. Skill > Destiny. vi / vim

Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs

Andrew Ng @AndrewYNg

1.0M Followers 909 Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs

Andrei Bursuc @abursuc

7K Followers 1K Following Research scientist @valeoai | Teaching @Polytechnique @ENS_ULM | Alumni @upb1818 @Mines_Paris @Inria @ENS_ULM

Google AI is focused on bringing the benefits of AI to everyone. In conducting and applying our research, we advance the state-of-the-art in many domains.

Google AI @GoogleAI

2.2M Followers 23 Following Google AI is focused on bringing the benefits of AI to everyone. In conducting and applying our research, we advance the state-of-the-art in many domains.

Ilya Sutskever @ilyasut

370K Followers 2 Following towards a plurality of humanity loving AGIs @openai

Alexis Conneau @alex_conneau

24K Followers 110 Following Audio AGI Research Lead @OpenAI - GPT-Next - Past: XLM, Unsupervised ASR, Unsupervised MT, Wav2vec 2.0/XLSR, MUSE, Unsupervised cross-lingual transfer

Anjney Midha @AnjneyMidha

7K Followers 1K Following general partner @a16z. industrialization maximalist. @prev: ceo/founder @ubiquity6 (acquired by @discord)

Mark Zuckerberg @finkd

760K Followers 748 Following

Alexandr Wang @alexandr_wang

142K Followers 695 Following ceo at @scale_ai. rational in the fullness of time

Piotr Bojanowski @p_bojanowski

557 Followers 131 Following Research Scientist at Facebook AI Research. Interested in Machine Learning and Computer Vision.

Arthur Mensch @arthurmensch

40K Followers 868 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcx

Simplifying LLMs, MLOps, Python & Machine Learning for you! • AI Engineering @LightningAI • Lead DataScientist • BITS Pilani • 3 Patents

Akshay 🚀 @akshay_pachaar

134K Followers 415 Following Simplifying LLMs, MLOps, Python & Machine Learning for you! • AI Engineering @LightningAI • Lead DataScientist • BITS Pilani • 3 Patents

Co-founder & CTO @poolsideai w/ @jasoncwarner

“The best way to predict the future is to invent it.” - Alan Kay

Prev: Athenian & source{d}

Eiso Kant @eisokant

7K Followers 1K Following Co-founder & CTO @poolsideai w/ @jasoncwarner “The best way to predict the future is to invent it.” - Alan Kay Prev: Athenian & source{d}

principal researcher, @googledeepmind. ex director of emea at fair @metaai. mostly work on open projects: fasttext, dino, llama, gemma.

Armand Joulin @armandjoulin

4K Followers 344 Following principal researcher, @googledeepmind. ex director of emea at fair @metaai. mostly work on open projects: fasttext, dino, llama, gemma.

Kevin Stone @kevinleestone

378 Followers 272 Following Research @ OpenAI, previously at FAIR, TRI, and Google working on LLMs, RL, and Robotics.

MoTS @normativeai. Ex @UMichCSE, 2x @MetaAI, @allen_ai | Speech & NLP | Robustness, Data & Annotations, Evaluation & Interpretability in LLMs

Mimansa Jaiswal @MimansaJ

1K Followers 3K Following MoTS @normativeai. Ex @UMichCSE, 2x @MetaAI, @allen_ai | Speech & NLP | Robustness, Data & Annotations, Evaluation & Interpretability in LLMs

Gautier Izacard @gizacard

832 Followers 301 Following @Microsoft Prev: PhD @MetaAI @ENS_ULM

Laurent Sifre @laurentsifre

1K Followers 411 Following Research Scientist @DeepMind since 2014. Worked on #AlphaGo #AlphaFold and #AlphaStar, now focused on #NLP at scale.

Sharan Narang @sharan0909

2K Followers 254 Following LLMs and AI Research (Llama 2 & 3 lead) @Meta | ex @Google (PaLM lead, T5), ex @Baidu (Deep Speech 2, Sparse Neural Networks), ex @Nvidia

Crafting pixels w PhotoRoom after some time in sunny California and happy Copenhagen. Meta (xformers, FairScale, R&D), EyeTribe (acq) Mostly tweeting around AI

Benjamin Lefaudeux @BenTheEgg

1K Followers 2K Following Crafting pixels w PhotoRoom after some time in sunny California and happy Copenhagen. Meta (xformers, FairScale, R&D), EyeTribe (acq) Mostly tweeting around AI

Marie-Anne Lachaux @MaLachaux

544 Followers 80 Following Researcher at @MistralAI ex @MetaAI

David Mizrahi @dmizrahi_

85 Followers 231 Following AI/ML Researcher @Apple

joao carreira @joaocarreira

1K Followers 253 Following Research Scientist at Google DeepMind

Priya Goyal @priy2201

1K Followers 498 Following Founding member @datologyai, ex-Google Deepmind, ex-Facebook AI Research (FAIR).

Thomas Scialom @ThomasScialom

6K Followers 227 Following AGI Researcher @MetaAI -- Lead Llama 2 and Postraining Llama 3. Also CodeLlama, Galactica, Toolformer, Bloom, Nougat, GAIA, ..

Thomas Lucas @ThomasLUC4S

22 Followers 50 Following

Mitchell Wortsman @Mitchnw

2K Followers 955 Following @AnthropicAI | prev @uwcse

researcher in #deeplearning #computervision | assistant professor at @NYU_Courant @nyuniversity | previous: research scientist @metaai (FAIR) @UCSanDiego

Saining Xie @sainingxie

14K Followers 1K Following researcher in #deeplearning #computervision | assistant professor at @NYU_Courant @nyuniversity | previous: research scientist @metaai (FAIR) @UCSanDiego

Co-founder & CEO @GoogleDeepMind - working on AGI. Trying to understand the fundamental nature of reality. Also revolutionising drug discovery @IsomorphicLabs

Demis Hassabis @demishassabis

356K Followers 124 Following Co-founder & CEO @GoogleDeepMind - working on AGI. Trying to understand the fundamental nature of reality. Also revolutionising drug discovery @IsomorphicLabs

Vasilis Vryniotis @bbriniotis

64K Followers 718 Following Machine Learning Engineer, Data Scientist and proud geek.

Menglin Jia @menglin_jia

118 Followers 124 Following PhD student at Cornell

Databricks Mosaic Res.. @DbrxMosaicAI

30K Followers 115 Following We remove the barriers to state-of-the-art generative AI model development and make data + AI available to all.

Maximilian Ilse @MaxIlse

2K Followers 834 Following Senior Researcher @ Health Futures - Microsoft Research. he/him.

DeepAI is an experimental AI Product Lab. Sharing cool research at @arxiv_daily

For product support email team@deepai.org

DeepAI @DeepAI

51K Followers 2K Following DeepAI is an experimental AI Product Lab. Sharing cool research at @arxiv_daily For product support email [email protected]

Seong Joon Oh @coallaoh

1K Followers 870 Following Leading the STAI group at the University of Tübingen https://t.co/qrSPDDcdOy Advising @ParameterLab.

Olivier Bousquet @obousquet

3K Followers 305 Following Engineering Director, Google Research.

Microsoft Research @MSFTResearch

553K Followers 2K Following We advance science and technology to benefit humanity. https://t.co/kz0nARXbwT Register for Microsoft Research Forum on June 4 ⬇️ Get our newsletter

Neil Houlsby @neilhoulsby

4K Followers 317 Following Professional AI researcher; amateur athlete. Senior Staff RS in the Google Deepmind, Zürich. Attempts triathlons.

Zhiding Yu @ZhidingYu

1K Followers 382 Following Working to make machines understand the world like human beings. Words are my own.

Max Welling @wellingmax

32K Followers 428 Following

Computer Vision & Machine Learning researcher @naverlabseurope
Chair on Lifelong representation learning @MIAI_UGA she/her

Diane Larlus @dlarlus

3K Followers 722 Following Computer Vision & Machine Learning researcher @naverlabseurope Chair on Lifelong representation learning @MIAI_UGA she/her

Lex Fridman @lexfridman

3.5M Followers 125 Following Host of Lex Fridman Podcast. Interested in robots and humans.

Oisin Mac Aodha @oisinmacaodha

1K Followers 2K Following Lecturer in Machine Learning @ School of Informatics, University of Edinburgh.

Li Dong @donglixp

3K Followers 3K Following NLP Researcher at Microsoft Research

Neil Zeghidour @neilzegh

3K Followers 584 Following deep learning @kyutai_labs

Stay on top of all exciting new developments in #DeepLearning. Every week fresh to your inbox: https://t.co/04EU35uVE5

Sponsored by Comet (@Cometml)

Deep Learning Weekly @dl_weekly

11K Followers 1K Following Stay on top of all exciting new developments in #DeepLearning. Every week fresh to your inbox: https://t.co/04EU35uVE5 Sponsored by Comet (@Cometml)

labml.ai @labmlai

12K Followers 8 Following 📝 Annotated paper implementations https://t.co/qeO4UTbrJ3

Natalia Neverova @NataliaNeverova

3K Followers 690 Following Research Lead @Meta AI (GenAI)

Gabriel Synnaeve @syhw

14K Followers 1K Following Nerd & Dad

Andrej Karpathy @karpathy

5 days ago

The model card has some more interesting info too: github.com/meta-llama/lla… Note that Llama 3 8B is actually somewhere in the territory of Llama 2 70B, depending on where you look. This might seem confusing at first but note that the former was trained for 15T tokens, while the…

29 100 1K 163K 369

Joelle Pineau @jpineau1

5 days ago

We just released Meta Llama 3: the most capable openly available LLM available to date! The 8B & 70B models are out now, and we expect to release models with larger context windows, additional model sizes and more capabilities in the coming months.

AI at Meta @AIatMeta

5 days ago

187 1K 6K 903K 1K

Download Video

11 33 392 50K 25

Andrew Ng @AndrewYNg

5 days ago

Meta released Llama 3 on my birthday! 🎂 Best present ever, thanks Meta! 😀

289 173 4K 282K 122

Ahmad Al-Dahle @Ahmad_Al_Dahle

5 days ago

@karpathy @AIatMeta @lmsysorg Request registered :)

9 0 171 16K 4

Andrej Karpathy @karpathy

5 days ago

Congrats to @AIatMeta on Llama 3 release!! 🎉 ai.meta.com/blog/meta-llam… Notes: Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ @lmsysorg :)) 400B is still training, but already encroaching…

143 1K 8K 818K 2K

AI at Meta @AIatMeta

5 days ago

187 1K 6K 903K 1K

Download Video

Fernando Pérez-García @fepegar_

3 months ago

Big kudos also to the researchers that created the methods our work built upon, e.g., @mcaron31 @armandjoulin @HugoTouvron @p_bojanowski Maxime Oquab @TimDarcet @jensenzhoujh and many others!

0 0 7 366 0

Ahmad Al-Dahle @Ahmad_Al_Dahle

5 days ago

33 208 974 294K 151

Download Image

Alexandr Wang @alexandr_wang

5 days ago

Congrats to the entire @Meta team, including @Ahmad_Al_Dahle @manohar_paluri @dkm2110 @HugoTouvron @ThomasScialom Angela Fan @finkd @_chriscox @ylecun @ragavan and many other great folks! Llama3 looks amazing, and the 400B looks even more exciting :)

Ahmad Al-Dahle @Ahmad_Al_Dahle

5 days ago

33 208 974 294K 151

Download Image

3 0 57 19K 13

Lucas Beyer (bl16) @giffmana

2 months ago

YES. Thanks Andrej. To this date still, way Way WAY too many people doing DL are way Way WAY too careless. I think each small DL team needs at least two people who are obsessed with detail. But the team shouldn't be composed of solely such people either, or it'll go nowhere.

Andrej Karpathy @karpathy

2 months ago

Beautiful work / attention to detail trying to get Gemma to finetune correctly. There are so many foot guns here to be super careful with. All of these issues don't throw any errors, they silently make your network worse. A great example of what I wrote about in my "A Recipe for…

88 318 3K 508K 2K

6 27 380 52K 119

Andrej Karpathy @karpathy

3 months ago

@eladgil @patrickc In AI at least, the real 30 under 30 imo you have never heard of. They are 5 layers down the org chart from the CEO. They are usually not on Twitter, they have an unmaintained LinkedIn, they don’t go on podcasts, and they maybe published at one point but don’t do so anymore. They…

141 449 5K 1.1M 1K

Andrew Ng @AndrewYNg

5 months ago

@geoffreyhinton I'd like to respectfully point out that the logic in this argument is based on a flawed model for how scientists think. Scientists don't just take a weighted average of others' opinions to form their own. A good scientist takes as input lots of data, including others' opinions,…

236 655 8K 1.1M 801

Philipp Schmid @_philschmid

8 months ago

Code Llama with @huggingface🤗 Yesterday, @metaai released Code Llama, a family of open-access code LLMs! Today, we release the integration in the Hugging Face ecosystem🔥 Models: 👉 huggingface.co/codellama blog post: 👉 hf.co/blog/codellama Blog post covers how to use it!

7 80 298 234K 90

Yann LeCun @ylecun

9 months ago

@Noahpinion No. The research arm of Bell Labs was never about moonshots. It was about hiring the best scientists into small departments (typically 5 to 15 people) and giving them resources and a *lot* of freedom to work on what *they* deemed most promising. That's how you get breakthroughs.

54 228 2K 353K 209

Pedro Cuenca @pcuenq

9 months ago

2 17 83 18K 26

Delip Rao e/σ @deliprao

9 months ago

This is another one of those ill-thought, fear-mongering scientific disinformation about LLMs, and I will explain why in this long thread. 🧶

Aidan Clark @_aidan_clark_

9 months ago

I flip-flop on how bad releasing model weights is, but what is clear to me is that we're in a honeymoon period before something bad happens like mass social manipulation and surely Meta is gonna regret making "we let anyone use our great models for anything" a selling point.

60 12 171 704K 66

6 157 651 368K 271

Andreas Köpf @neurosp1ke

9 months ago

@ThomasScialom @metaai @HugoTouvron Thanks 🙏 & congrats to you, colleagues & mngmt at Meta for releasing this innovation catalyst 🦙😊.

0 0 4 86 0

Haotian Liu @imhaotian

9 months ago

🧵1/ Exciting news! We've just released a major update for LLaVA, our open-source large multimodal model, with support for LLaMA-2, LoRA training with academia GPUs, higher resolution (336x336), 4-/8- inference, and more! 🚀🌋

19 110 519 202K 250

Download Image

Andrej Karpathy @karpathy

9 months ago

@sharan0909 @CalvinHolloway6 @metaai +++, to avoid confusion I think "Llama 2" should imo always be assumed to refer to 70B model, when it's not it should be explicitly disambiguated, e.g. Llama2-7B.