Jerry Wei @JerryWeiAI

🧐 Improving and aligning large language models 🧠 Research Engineer @GoogleDeepMind ⏰ Past: @Stanford, @Google Brain jerrywei.net Stanford, California Joined June 2015

Tweets

177
Followers

5K
Following

252
Likes

125

Jerry Wei @JerryWeiAI

6 days ago

Cool piece from the Financial Times comparing hallucinations in LLMs to hallucinations in humans! People often complain about how LLMs frequently hallucinate, but it’s easy to forget that humans hallucinate a lot as well. For example, if you read some article and then later tell…

6 12 77 21K 61

lmsys.org @lmsysorg

a week ago

More exciting news today -- Gemini 1.5 Pro result is out! Gemini 1.5 Pro API-0409-preview now achieves #2 on the leaderboard, surpassing #3 GPT4-0125-preview to almost top-1! Gemini shows even stronger performance on longer prompts, in which it ranks joint #1 with the latest…

lmsys.org @lmsysorg

a week ago

7 50 377 263K 61

36 187 927 443K 170

Download Image

Jerry Wei @JerryWeiAI

2 weeks ago

torch/tf.einsum() is life-changing.

1 2 17 3K 4

Jerry Wei @JerryWeiAI

2 weeks ago

Huge congrats to @YiTayML and the rest of the Reka team for this launch! Personally, I'm super impressed with how Reka-Core can match/beat GPT-4-Turbo and Claude-3-Opus on many benchmarks despite Reka being a much smaller team. Also "as for Belebele, we hit our credit threshold…

Yi Tay @YiTayML

2 weeks ago

11 56 416 48K 205

Download Image

2 1 33 5K 5

Yi Tay @YiTayML

2 weeks ago

Our @RekaAILabs Tech Report / Paper is out! 🔥 Tech reports with completely no information are kinda boring so we’re revealing some interesting information on how we train our series of Reka models including tokens, architecture, data & human evaluation workflows. 😃 We tried…

11 56 416 48K 205

Download Image

Andrew M. Dai @iamandrewdai

3 weeks ago

We present a survey of synthetic data approaches for LLMs, highlighting both where it's needed and its potential pitfalls! It'll make a great weekend read. Thanks to: @RuiboLiu @JerryWeiAI @denny_zhou and our other collaborators.

Ruibo Liu @RuiboLiu

3 weeks ago

1 19 113 34K 68

1 4 29 5K 10

elvis @omarsar0

3 weeks ago

Best Practices and Lessons Learned on Synthetic Data for Language Models Great overview by Google DeepMind on synthetic data research. It covers applications, challenges, and future directions This is an important paper given the significant advancements we are seeing from the…

3 79 306 25K 229

Download Image

Jerry Wei @JerryWeiAI

3 weeks ago

Fun fact: our paper was put on hold by arxiv for a while because arxiv detected that we used the phrase "time travel," which is a topic that arxiv frequently gets bad submissions for. When we Ctrl-F'd "time travel" in our paper, we had actually just cited a paper called "Time…

Aran Komatsuzaki @arankomatsuzaki

3 weeks ago

6 137 698 154K 706

Download Image

19 23 284 94K 169

AK @_akhaliq

3 weeks ago

Best Practices and Lessons Learned on Synthetic Data for Language Models The success of AI models relies on the availability of large, diverse, and high-quality datasets, which can be challenging to obtain due to data scarcity, privacy concerns, and high costs.

4 79 311 47K 230

Download Image

Jerry Wei @JerryWeiAI

3 weeks ago

There has been growing concerns about running out of high-quality training data for LLMs, and naturally many turn towards synthetic data to help remedy this issue. Indeed, synthetic data can be generated at large scales and is thus a valuable resource for training/evaluating…

Ruibo Liu @RuiboLiu

3 weeks ago

1 19 113 34K 68

1 8 49 11K 24

Aran Komatsuzaki @arankomatsuzaki

3 weeks ago

Google presents Best Practices and Lessons Learned on Synthetic Data for Language Models Provides an overview of synthetic data research, discussing its applications, challenges, and future directions arxiv.org/abs/2404.07503

6 137 698 154K 706

Download Image

Jerry Wei @JerryWeiAI

4 weeks ago

Thanks for this insightful feedback! Clarified these points in a revision: 1. Replaced "superhuman" with "outperforms crowdsourced human annotators" to not imply beating expert humans 2. Added FAQ sec. discussing this distinction 3. Updated related work/SAFE with prior methods

Greg Durrett @gregd_nlp

a month ago

3 26 183 56K 92

0 1 31 5K 4

Jerry Wei @JerryWeiAI

a month ago

This is one of the core surprising findings of our paper - previous efforts in using LLMs for evaluation primarily seek to achieve high correlation with human annotations. But we took a closer look at the data and noticed that human raters were not super reliable in fact…

John Nay @johnjnay

a month ago

11 95 413 49K 284

Download Image

0 5 66 11K 41

Chengrun Yang @chengrun_yang

a month ago

We focus on long-form factuality in open domain, and so we show an entire evaluation pipeline with dataset + autorater + metric. The dataset was generated with LLMs and the autorater is an LLM agent with Google Search, demonstrating LLMs can rate themselves better than humans!

Jerry Wei @JerryWeiAI

a month ago

9 77 369 143K 291

Download Image

0 3 16 2K 1

Yifeng Lu @yifenglou

a month ago

Our new efforts is trying to address an elephant in the room for LLM: Given factuality/hallucination is so critical to the success of LLM, is there a quantitive evaluation to benchmark all existing LLMs in general? Hope our benchmark would be adopted and benchmarked as part of…

Jerry Wei @JerryWeiAI

a month ago

9 77 369 143K 291

Download Image

0 3 20 6K 16

Ruibo Liu @RuiboLiu

a month ago

New factuality research! We use LMs as annotators & search engines for grounding to create a realistic benchmark for evaluating long-form factuality. Simulating your daily queries to LMs about knowledge & truth. 🔍📊 #NLProc #FactChecking Check this out! 👇

Jerry Wei @JerryWeiAI

a month ago

9 77 369 143K 291

Download Image

0 4 29 5K 8

Nathan Hu @NathanHu12

a month ago

New work on evaluating long form factuality 🎉. Our method SAFE combines google search and LLM queries to extract and verify individual claims in responses. Most excitingly, we show SAFE is cheaper💰 and more reliable ✅ than human annotators.

Jerry Wei @JerryWeiAI

a month ago

9 77 369 143K 291

Download Image

0 3 13 2K 1

JOURNEY OF IDEAS @ONLYWORK0

93 Followers 3K Following

Jayashri @jayashri94

63 Followers 2K Following Likes to read about technology, AI and almost everything. #Python,#React,#Typescript.

Kartik Natarajan @kartiknatarajan

209 Followers 1K Following

Kai Xiang @xiang_kai_MIT

30 Followers 284 Following deeply curious about human and machine

kobciye films @KobciyeF13403

41 Followers 170 Following

PhD student @HKUSTKnowComp | Supervised by @yqsong | Intern @NVlDlAAl | Linguistic Entailment & Abstraction | Causal Inference

Zhaowei Wang @ZhaoweiWang4

726 Followers 673 Following PhD student @HKUSTKnowComp | Supervised by @yqsong | Intern @NVlDlAAl | Linguistic Entailment & Abstraction | Causal Inference

HolyDifficult @HolyDifficult

10 Followers 155 Following

Block Chen @GPT327

377 Followers 3K Following @RUCerofChina. Founder&CEO of DeepWond AI. AI Research&Safety. AI game&Metaverse.

ALBIV⚡ @valdra76

2K Followers 5K Following ALII NESCIO QUO PACTO OBDURUERUNT

Anish Mudide @AMudide

28 Followers 148 Following lucid dreaming @mit

Researcher @GoogleDeepmind. Phd from @Cornell. Working on contextual bandits, reinforcement learning and their applications in user interactive systems.

Yi Su @YiSu37328759

387 Followers 618 Following Researcher @GoogleDeepmind. Phd from @Cornell. Working on contextual bandits, reinforcement learning and their applications in user interactive systems.

acidoom @acidoom

93 Followers 964 Following

renAI Lab @renAI_Lab101

1 Followers 15 Following renAI Lab

Winnie Yeung @mimo90918

2 Followers 52 Following MLE @ Square

Rongduan Zhu @RongduanZhu

33 Followers 326 Following

Siva Worajitwanakul @Champiionnii

11 Followers 61 Following

Ittseta @IssEossda

79 Followers 674 Following

luckyyy0317 @Luckyyy_meow

84 Followers 771 Following PhD candidate, Computer Science

simrat hanspal @simsimsandy

84 Followers 454 Following Exploring LLMs | Data scientist with a curious engineering mind

Redmond @BetaTomorrow

216 Followers 658 Following working class

Yichen Gong @YichenGong123

2 Followers 29 Following

waoirk @waoirk

3 Followers 122 Following

Bo FENG @BoFeng26809821

141 Followers 1K Following Economics Predoc @HarvardHBS.

Kate @stateof_kate

1K Followers 620 Following building something new @deepunitai

Charleno Pires @charlenopires

2K Followers 5K Following Creative Man

Taylor Blake @taylorblake

380 Followers 359 Following Learning, product, strategy.

Phillip Lindsay @EastLAPinche

61 Followers 421 Following

K @karimedl

0 Followers 402 Following

Sehyun Kwon @sehyunkwon22

224 Followers 506 Following Ph.D student @ Seoul National University

Nikhil Khandekar @nsk7153

0 Followers 155 Following

PhD @UniofOxford researching on NLP for Automated Fact Checking and Factuality in LLMs. Previously shipped code 🚀@TwigaFoods. Happens to run fast sometimes.🏃

Jabez Magomere @jabez_magomere

74 Followers 400 Following PhD @UniofOxford researching on NLP for Automated Fact Checking and Factuality in LLMs. Previously shipped code 🚀@TwigaFoods. Happens to run fast sometimes.🏃

JonnieLewandoski @JonnieL29718

42 Followers 1K Following

soonwoo @JohnSwkwon

1 Followers 40 Following

Nandan Shettigar @mfflnando

231 Followers 1K Following #TAMU2020

Nate Boyd @n8boyd

695 Followers 2K Following Invest in and help build deep tech & AI startups ~ dad & partner ~ curious & skeptical

Final-year PhD student @Yale, #NLProc, LLM for Code. (ex-)intern @GoogleDeepMind, @MetaAI, @MSFTResearch, @allen_ai. MS from @SCSatCMU. Opinions are my own.

Ansong Ni @AnsongNi

1K Followers 384 Following Final-year PhD student @Yale, #NLProc, LLM for Code. (ex-)intern @GoogleDeepMind, @MetaAI, @MSFTResearch, @allen_ai. MS from @SCSatCMU. Opinions are my own.

neurolicious @neuro__licious

87 Followers 251 Following ML | AI

TRB ttigers @_ttigers

4K Followers 383 Following wildrift player for @tribegaming

Fabian @schimpffabian

74 Followers 708 Following Futures in which AI is the greatest thing ever are possible but not unavoidable.

I'm a Data Scientist with a degree in Computational Data Science from Indian Institute of Science, Bangalore. I'm passionate about social change, animal welfare

Shreya Roy @ShreyaR54107751

2 Followers 59 Following I'm a Data Scientist with a degree in Computational Data Science from Indian Institute of Science, Bangalore. I'm passionate about social change, animal welfare

Supriya Naik @supriyan_iima

32 Followers 136 Following IIM Ahmedabad, IIT Roorkee

Sid Kapur @sidkap_

1K Followers 1K Following ML engineer, interested in housing/econ/history stuff

Segmond Yunsai @ysegmond

324 Followers 611 Following Interests:- wrenching old bmws vROOOM, programming vRAAAM

sachosdev @sachoslks

43 Followers 1K Following Trying to be a game developer. sachosdev on Google Play Store. Go check some of my games out!

Cecilia-Z @zzzfffc

1 Followers 72 Following

Saul @Saul31962301

23 Followers 111 Following

carlos daniel @animex400

4 Followers 50 Following

I don’t really know what to put here. I mostly research AI/technology and spend time with my husband/pets. Pics are 7-8 yrs old, I don’t rly take selfies. Sry.

SecrtAgntSquirl @SecrtAgntSquirl

586 Followers 3K Following I don’t really know what to put here. I mostly research AI/technology and spend time with my husband/pets. Pics are 7-8 yrs old, I don’t rly take selfies. Sry.

James 🐉 @jbelevate

196 Followers 336 Following Human optimisation is the goal of our species

Note Able @curiousgangsta

372 Followers 3K Following

Corey Lynch @coreylynch

10K Followers 1K Following AI at @figure_robot, previously research scientist at @GoogleDeepMind.

Melody Guan ʕᵔᴥ�.. @MelodyGuan

3K Followers 780 Following

Jacob Pfau @jacob_pfau

898 Followers 1K Following Mostly AI alignment. PhD student at NYU

Greg Diamos @GregoryDiamos

3K Followers 99 Following Lamini | I build AI supercomputers

Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtm

lmsys.org @lmsysorg

38K Followers 172 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtm

François Chollet @fchollet

470K Followers 769 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.

Working on AI safety, past on robotics, applied research @OpenAI; Writing ML blogs to help myself & others to learn; Ideas my own.

Lilian Weng @lilianweng

95K Followers 148 Following Working on AI safety, past on robotics, applied research @OpenAI; Writing ML blogs to help myself & others to learn; Ideas my own.

bilal2vec @bilaltwovec

2K Followers 781 Following ✨ research engineer • prev @googlebrain @cohere @dbrxmosaicai • se @uwaterloo

Dani Yogatama @DaniYogatama

4K Followers 196 Following CEO @RekaAILabs, Associate Professor @CSatUSC

Nat McAleese @nmca

3K Followers 306 Following Superalignment by models helping humans help models help humans at OpenAI. Previously @DeepMind. Views my own.

• Director of the Center for AI Safety (https://t.co/ahs3LYCpqv)
• GELU/ImageNet-C/MMLU/safety groundwork
• PhD in AI from UC Berkeley
https://t.co/rgXHAnYAsQ
https://t.co/YtGtDh1aAV

Dan Hendrycks @DanHendrycks

17K Followers 81 Following • Director of the Center for AI Safety (https://t.co/ahs3LYCpqv) • GELU/ImageNet-C/MMLU/safety groundwork • PhD in AI from UC Berkeley https://t.co/rgXHAnYAsQ https://t.co/YtGtDh1aAV

David Krueger @DavidSKrueger

13K Followers 4K Following Cambridge faculty - AI alignment, deep learning, and existential safety. Formerly Mila, FHI, DeepMind, ElementAI, AISI.

Chief Evangelist Officer of Qwen Team & OpenDevin, building LLM and LMM. Now @Alibaba_Qwen . Previously @PKU1898 LANCO group. ❤️ 🍵 ☕️ 🍷 🥃

Junyang Lin @JustinLin610

5K Followers 1K Following Chief Evangelist Officer of Qwen Team & OpenDevin, building LLM and LMM. Now @Alibaba_Qwen . Previously @PKU1898 LANCO group. ❤️ 🍵 ☕️ 🍷 🥃

#SalesforceAI advances state-of-the-art #AI techniques that pave the path for innovative products at Salesforce. Focus areas include #ML, #NLP, #AIforGood

Salesforce AI Researc.. @SFResearch

13K Followers 118 Following #SalesforceAI advances state-of-the-art #AI techniques that pave the path for innovative products at Salesforce. Focus areas include #ML, #NLP, #AIforGood

Principal Engineer, @GoogleDeepMind Gemini. prev PaLM-2. Tinkering with optimization and distributed systems. opinions are my own.

rohan anil @_arohan_

12K Followers 2K Following Principal Engineer, @GoogleDeepMind Gemini. prev PaLM-2. Tinkering with optimization and distributed systems. opinions are my own.

Eric Zelikman @ericzelikman

5K Followers 1K Following studying why @xAI // was phd-ing @stanford

Machel Reid @machelreid

2K Followers 1K Following Research Scientist @GoogleDeepMind Working on LLMs on the Gemini Team; did gemini 1.5 pro

Jan Leike @janleike

44K Followers 322 Following ML Researcher, co-leading Superalignment @OpenAI. Optimizing for a post-AGI future where humanity flourishes.

Steven Zheng @HuaixiuZheng

172 Followers 60 Following Trained in quantum computing and quantum physics, LLM research in Google DeepMind

Robert Dadashi @robdadashi

2K Followers 388 Following reinforcement learning research @GoogleDeepMind, built RLHF layer of Bard and Gemma

Maarten Sap (he/him) @MaartenSap

5K Followers 645 Following Working on #NLProc for social good. Currently at @LTIatCMU, previously at @UWNLP, @MSFTResearch, and @allen_ai. 🏳‍🌈

Sharan Narang @sharan0909

2K Followers 254 Following LLMs and AI Research (Llama 2 & 3 lead) @Meta | ex @Google (PaLM lead, T5), ex @Baidu (Deep Speech 2, Sparse Neural Networks), ex @Nvidia

Kevin Liu @kliu128

7K Followers 629 Following Preparedness at @openai

Stanford CS PhD working on RL, Education, and NLP. Advised by Emma Brunskill and Chris Piech. Ex @stanfordsymsys. @StanfordAILab 2023 summer @MSFTResearch

Allen Nie (🇺🇦�.. @Allen_A_N

1K Followers 1K Following Stanford CS PhD working on RL, Education, and NLP. Advised by Emma Brunskill and Chris Piech. Ex @stanfordsymsys. @StanfordAILab 2023 summer @MSFTResearch

Desh Raj @rdesh26

3K Followers 2K Following Research Scientist @Meta (AI Speech) | Previously: @jhuclsp, @IITGuwahati

Josh Woodward @joshtwoodward

789 Followers 517 Following VP, @Google Labs

Marc Andreessen 🇺�.. @pmarca

1.4M Followers 24K Following Techno-optimist. E/acc. Technology brother. Move Fast and Make Things. p(Doom) = 0; p(“1984”) = not 0.

Chengrun Yang @chengrun_yang

243 Followers 71 Following Research Scientist at @GoogleDeepMind

Chip hog @GoogleDeepMind. Bard Research Tool Use lead. Prev: TPU hoarder at @YouTube, recommender systems @VEVO. Opinions my own.

Jarrod Kahn @kahnvex

346 Followers 174 Following Chip hog @GoogleDeepMind. Bard Research Tool Use lead. Prev: TPU hoarder at @YouTube, recommender systems @VEVO. Opinions my own.

Greg Durrett @gregd_nlp

6K Followers 752 Following CS professor at UT Austin. I do NLP most of the time. he/him

trieu @thtrieu_

2K Followers 241 Following thinking about thinking. created alphageometry, darkflow. prev: nyu, google brain/deepmind

Covariant @CovariantAI

11K Followers 158 Following Empowering robots to see, think, and act.

Language Technologies.. @LTIatCMU

9K Followers 233 Following The Language Technologies Institute in Carnegie Mellon University's @SCSatCMU

SF AI Studio Lead @Accenture, partnering with @OpenAI @Google @Microsoft. Pianist. German Quantum Physicist. Creator of the Nth Floor. Views are my own. x/acc.

Ben Holfeld @BenHolfeld

89K Followers 32K Following SF AI Studio Lead @Accenture, partnering with @OpenAI @Google @Microsoft. Pianist. German Quantum Physicist. Creator of the Nth Floor. Views are my own. x/acc.

Ellen Wu @zeqiuwu1

593 Followers 430 Following PhD student at UWNLP

Tri Dao @tri_dao

19K Followers 365 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.

4th Year PhD Student at @wing_nus @nuscomputing. Studying discourse and emojis with a focus on interpretability of LMs . @Charles_Leclerc will win WDC.

Yisong Miao @YisongMiao

585 Followers 1K Following 4th Year PhD Student at @wing_nus @nuscomputing. Studying discourse and emojis with a focus on interpretability of LMs . @Charles_Leclerc will win WDC.

Research Scientist @allen_ai / @ai2_mosaic, PhD in NLP/Dialogue 🤖 UofA. Ex Visiting researcher @Mila_Quebec Ex Research intern at @GoogleDeepMind @MSFTResearch

Nouha Dziri @nouhadziri

3K Followers 676 Following Research Scientist @allen_ai / @ai2_mosaic, PhD in NLP/Dialogue 🤖 UofA. Ex Visiting researcher @Mila_Quebec Ex Research intern at @GoogleDeepMind @MSFTResearch

Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as lb@sigmoid.social

Lucas Beyer (bl16) @giffmana

56K Followers 446 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]

Arthur Mensch @arthurmensch

40K Followers 874 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcx

Horace He @cHHillee

24K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemale

Chelsea Finn @chelseabfinn

69K Followers 384 Following Asst Prof of CS & EE @Stanford. PhD from @Berkeley_EECS, EECS BS from @MIT

Reid Hoffman @reidhoffman

707K Followers 624 Following Entrepreneur. Investor. Strategist.

PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep running

Yao Fu @Francis_YAO_

14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep running

Satya Nadella @satyanadella

3.3M Followers 286 Following Chairman and CEO at Microsoft

Peter Welinder @npew

34K Followers 754 Following VP Product @OpenAI

Han @hhua_

3K Followers 4K Following Invest @GVteam during 🌞 and hacker at 🌒. Investing in AI, infra, deep tech, fintech/crypto ⚡️🤖🧠. Views are my own.

Peter Battaglia @PeterWBattaglia

14K Followers 349 Following Research Scientist, DeepMind

Reverse engineering neural networks at @AnthropicAI. DMs open! Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

Chris Olah @ch402

91K Followers 173 Following Reverse engineering neural networks at @AnthropicAI. DMs open! Previously @distillpub, OpenAI Clarity Team, Google Brain. Personal account.

AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.

Wenhu Chen @WenhuChen

11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.

elvis @omarsar0

5 days ago

@JerryWeiAI A bit different but I found the inspiration behind this paper interesting: x.com/omarsar0/statu…

elvis @omarsar0

4 weeks ago

Visualization-of-Thought Elicits Spatial Reasoning in LLMs Inspired by a human cognitive capacity to imagine unseen worlds, this new work proposes Visualization-of-Thought (VoT) prompting to elicit spatial reasoning in LLMs. VoT enables LLMs to "visualize" their reasoning…

4 119 425 93K 303

Download Image

1 0 1 345 2

elvis @omarsar0

5 days ago

@JerryWeiAI Nice share! A deeper understanding of the two might lead to better ideas to improve LLMs. I have seen several papers (even simple prompting ones) borrowing ideas from cognition. I really enjoy this type of research. And of course, your paper on long-form factuality was quite…

1 0 2 2K 0

Jerry Wei @JerryWeiAI

6 days ago

6 12 77 21K 61

Jerry Wei @JerryWeiAI

2 weeks ago

@_jasonwei

0 1 58 4K 1

Download Image

Jason Wei @_jasonwei

2 weeks ago

nothing gets my heart rate up like waiting for eval results on new models to come in

20 27 419 82K 34

Yi Tay @YiTayML

2 weeks ago

@hyhieu226 Thanks Hieu. Whenever GPUs died we took out stashes of paper and did backprop by hand. Works very well!

1 0 5 649 0

Jerry Wei @JerryWeiAI

2 weeks ago

Yi Tay @YiTayML

2 weeks ago

11 56 416 48K 205

Download Image

2 1 33 5K 5

Yi Tay @YiTayML

2 weeks ago

@JerryWeiAI thanks Jerry!

0 0 2 201 0

Yi Tay @YiTayML

2 weeks ago

11 56 416 48K 205

Download Image

Andrew M. Dai @iamandrewdai

3 weeks ago

Ruibo Liu @RuiboLiu

3 weeks ago

Thanks Aran for sharing our work! This is a survey paper I’ve been thinking about for a long time, as we have seen an increasing need for synthetic data. As we will probably run out of fresh tokens soon, the audience of this paper should be everyone who cares about AI progress.

1 19 113 34K 68

1 4 29 5K 10

Jarrod Kahn @kahnvex

3 weeks ago

@JerryWeiAI Style nit: def should_put_on_hold(paper: str) -> bool: return 'time travel' in paper

2 0 17 3K 2

Ruibo Liu @RuiboLiu

3 weeks ago

This is true haha. So for anyone who has an ArXiv submission "on-hold" for unclear reasons: Please double check whether you have keywords such as "time travel" in your text. This is another lesson we have learned. 😆😆😆

Jerry Wei @JerryWeiAI

3 weeks ago

19 23 284 94K 169

0 0 15 3K 3

Jerry Wei @JerryWeiAI

3 weeks ago

Aran Komatsuzaki @arankomatsuzaki

3 weeks ago

6 137 698 154K 706

Download Image

19 23 284 94K 169

elvis @omarsar0

3 weeks ago

3 79 306 25K 229

Download Image

Jerry Wei @JerryWeiAI

3 weeks ago

Ruibo Liu @RuiboLiu

3 weeks ago

1 19 113 34K 68

1 8 49 11K 24

AK @_akhaliq

3 weeks ago

4 79 311 47K 230

Download Image

Ruibo Liu @RuiboLiu

3 weeks ago

Aran Komatsuzaki @arankomatsuzaki

3 weeks ago

6 137 698 154K 706

Download Image

1 19 113 34K 68

Aran Komatsuzaki @arankomatsuzaki

3 weeks ago

6 137 698 154K 706

Download Image

Hyung Won Chung @hwchung27

3 weeks ago

Flan2 paper is now on JMLR, 1.5 years after the initial arXiv release. It already feels quite dated, reflecting how fast the field is moving. That said Flan-T5 series is still going strong, with an astonishing 52M cumulative downloads 🤯 How are people using these models?

5 5 66 14K 11

Jerry Wei @JerryWeiAI

4 weeks ago

Greg Durrett @gregd_nlp

a month ago

This is a cool method, but "superhuman" is an overclaim based on the data shown. There are better datasets than FActScore for evaluating this: ExpertQA arxiv.org/abs/2309.07852 by @cmalaviya11 +al Factcheck-GPT arxiv.org/abs/2311.09000 by Yuxia Wang +al (+ same methodology) 🧵