Yongchao Zhou @Yongchao_Zhou_

Build Intelligence @xai | ML PhD @UofT @VectorInst | Prev. @GoogleAI @GoogleDeepMind | Working on LLMs Toronto Joined January 2022

Tweets

78
Followers

476
Following

300
Likes

411

xAI @xai

3 weeks ago

👀 x.ai/blog/grok-1.5v

599 1K 7K 22.9M 880

xAI @xai

a month ago

x.ai/blog/grok-1.5

686 1K 7K 20.7M 563

Our team at Google DeepMind has a full-time Research Scientist position available at our Mountain View site. Minimum qualification: PhD in ML/NLP. Please email me with: your CV and Google Scholar link; a brief description of the impactful work you have done; and what you aim…

12 51 291 73K 173

Jimmy Ba @jimmybajimmyba

a month ago

based and 🔓 wanna help accelerate the next Grok? looking for builders: — Rust/Jax/Kube infra engineers — front-end/full-stack engineers x.ai/careers

Grok @grok

a month ago

based and 🔓 wanna help accelerate the next Grok? looking for builders: — Rust/Jax/Kube infra engineers — front-end/full-stack engineers x.ai/careers

1K 2K 16K 9.7M 811

39 95 446 90K 53

Grok @grok

a month ago

@elonmusk @xai ░W░E░I░G░H░T░S░I░N░B░I░O░

1K 2K 16K 9.7M 811

Anian Ruoss @anianruoss

2 months ago

Fantastic work by @Yongchao_Zhou_ et al. showing that our randomized positional encodings (arxiv.org/abs/2305.16843) can contribute to extending Transformers' length generalization for two-digit addition!

Yongchao Zhou @Yongchao_Zhou_

2 months ago

1 0 16 1K 2

Download Image

0 2 6 735 1

elvis @omarsar0

2 months ago

CoT Reasoning without Prompting Interesting paper! Proposes a chain-of-thought (CoT) decoding method to elicit the reasoning capabilities from pre-trained LLMs without explicit prompting. It claims to significantly enhance a model’s reasoning capabilities over greedy decoding…

6 107 485 49K 338

Download Image

Xinyun Chen @xinyun_chen_

2 months ago

Excited to share our work (read-agent.github.io) for reading long documents way exceeding the context window (up to 20x). Inspired by human reading paradigm, Read Agent summarizes the input episodically as gist memories, and uses them to retrieve relevant details when needed.

Kuang-Huei Lee @kuanghueilee

2 months ago

6 61 306 46K 215

Download Image

1 16 83 11K 28

Roger Grosse @RogerGrosse

2 months ago

Here's what I see as a likely AGI trajectory over the next decade. I claim that later parts of the path present the biggest alignment risks/challenges. The alignment world has been focusing a lot on the lower left corner lately, which I'm worried is somewhat of a Maginot line.

22 102 525 61K 397

Download Image

AK @_akhaliq

2 months ago

Chain-of-Thought Reasoning Without Prompting paper page: huggingface.co/papers/2402.10… In enhancing the reasoning capabilities of large language models (LLMs), prior research primarily focuses on specific prompting techniques such as few-shot or zero-shot chain-of-thought (CoT)…

7 141 649 63K 431

Download Image

Aran Komatsuzaki @arankomatsuzaki

2 months ago

Chain-of-Thought Reasoning Without Prompting Can LLMs reason effectively without prompting? Our findings reveal that, intriguingly, CoT reasoning paths can be elicited from pre-trained LLMs by simply altering the decoding process. arxiv.org/abs/2402.10200

2 84 410 27K 255

Download Image

Xinyun Chen @xinyun_chen_

3 months ago

New preprint🔥: Premise Order Matters in Reasoning with Large Language Models arxiv.org/abs/2402.08939 In typical logical reasoning, premise order doesn't matter. However, for SOTA LLMs, changing the premise order may cause an accuracy drop of >30%! 🧵 1/8

2 30 116 10K 52

Download Image

AK @_akhaliq

3 months ago

Google presents Premise Order Matters in Reasoning with Large Language Models paper page: huggingface.co/papers/2402.08… Large language models (LLMs) have accomplished remarkable reasoning performance in various domains. However, in the domain of reasoning tasks, we discover a…

3 47 240 33K 144

Download Image

AK @_akhaliq

3 months ago

Google Deepmind presents Transformers Can Achieve Length Generalization But Not Robustly paper page: huggingface.co/papers/2402.09… Length generalization, defined as the ability to extrapolate from shorter training sequences to longer test ones, is a significant challenge for language…

2 66 307 37K 141

Download Image

Aran Komatsuzaki @arankomatsuzaki

3 months ago

Transformers Can Achieve Length Generalization But Not Robustly Length generalization remains fragile, significantly influenced by factors like random weight initialization and training data order arxiv.org/abs/2402.09371

5 53 237 22K 119

Download Image

Stephan Baasch @stbaasch

86K Followers 7K Following Interested in investments and technology.

Camille Jongsma @jongs_cami

21 Followers 3K Following

@XAI Training @Grok | TA @CuriousRefuge | CPP @RunwayML | @LeonardoAI_ LCP | HUG Artist | Worlds, Films, Games, Design + AI Creator/Producer @Kevinkshah

Creative AIgency @CreativeAIgency

Shashank Sangar @ShashankTesla

16 Followers 204 Following Recruiting at Tesla AI for Core Autonomy (Autopilot & Optimus)

Fiscally Conservative🤍Socially somewhat Liberal🤍 ♥️Proud 🇺🇸🇧🇷 If you’re not following some people you dislike or disagree with, you’re doing it wrong🤍

CassandraMom22🪬�.. @HeCaSoMa

468 Followers 756 Following Fiscally Conservative🤍Socially somewhat Liberal🤍 ♥️Proud 🇺🇸🇧🇷 If you’re not following some people you dislike or disagree with, you’re doing it wrong🤍

Techarn @TecharncCODrs

0 Followers 107 Following

RosemaryLew @4yxXLhv8IHUod04

0 Followers 111 Following

jj @punchgod_7

276 Followers 3K Following always guard up. ( ง︡'-'︠)ง

Isshin 一心 @YixinTian123

30 Followers 77 Following Learning/building things in symbolic knowledge extraction, graph learning, and knowledge analytics.

🍁 Wissenschaft ist der neueste Stand bewiesener Irrtümer! 🕴️Autodidakt ⚕️Cannabispatient & -Sommelier ✨ 𝕏Ɖ 🧬 #teamscience 🔬Do Only Good Everyday 🐕

Sir Mo van da Weed �.. @can420nabis

421 Followers 1K Following 🍁 Wissenschaft ist der neueste Stand bewiesener Irrtümer! 🕴️Autodidakt ⚕️Cannabispatient & -Sommelier ✨ 𝕏Ɖ 🧬 #teamscience 🔬Do Only Good Everyday 🐕

Civocim @civocim

220 Followers 481 Following 🇺🇸🇺🇸🇺🇸

Mistr. PIXELS @Mistr_Pixels

268 Followers 230 Following 👾

Florian @janzimc

277 Followers 1K Following 𝕏 #GardeningX

super intelligence @eacc72

12 Followers 688 Following GPT6 is a Level 2 AGI and will be released in 2025

SuzX @smcx22

3K Followers 2K Following Retail Lead-X

Atomic2 @pumped212

30 Followers 342 Following Front end dev, trying to go full stack

Sahil Antil @oxshitantil

17 Followers 804 Following Founder @kavachbuilders @foodkavach @arqaifashion

NinaMonica Scalabrin @NinaMonicaS

1K Followers 5K Following Nina Monica Scalabrin official twitter, bestselling author, screenwriter, Mister Parkinson author

Sletio @sletio26839

0 Followers 178 Following

ssteevens @Steevens43

160 Followers 5K Following

Ílas @ilidiomacamo09

153 Followers 996 Following 🇲🇿 MOZ

MAB氏 @MAB1791652

1 Followers 36 Following

Weloop @Weloop_official

17 Followers 72 Following Download “Weloop” to be a part of your friends circle

⌗ Innovator-in-Chief ⇢ ❍ne World ✍︎ Investigative Journalist & Director of Open Records Strategy ⇢ AtNight Media ⌇ The New Way ® | One World 🌍

nik t. hatziefstathio.. @nikthehat

40K Followers 4K Following ⌗ Innovator-in-Chief ⇢ ❍ne World ✍︎ Investigative Journalist & Director of Open Records Strategy ⇢ AtNight Media ⌇ The New Way ® | One World 🌍

Devin Kim @devindkim

1K Followers 116 Following a real human bean. building intelligence @xAI

Product of progressive public policy; raised by public libraries and public education that produced a passion for politics.

and apparently alliteration

Omair Shahid @OmairShahid

382 Followers 959 Following Product of progressive public policy; raised by public libraries and public education that produced a passion for politics. and apparently alliteration

LucyRicardo @66IR6G2l84P48r0

1 Followers 166 Following

Sahil Antil @oxshitantil1

43 Followers 642 Following

INGABO @lingaboh

53 Followers 109 Following

Endwoddl @Endwoddl

271 Followers 5K Following

Errol Reed.Entreprenuer Of The Year🏆• Public Figure • Fmr Advisor for @Electrobbywells 🇺🇲 • Reed Management 🌎 . We create stars 💫

Rizz Reed @rizzreed

144K Followers 144K Following Errol Reed.Entreprenuer Of The Year🏆• Public Figure • Fmr Advisor for @Electrobbywells 🇺🇲 • Reed Management 🌎 . We create stars 💫

X Daily News @xDaily

282K Followers 4K Following Your #1 News source on everything X + https://t.co/rn58CVV9pw | Hit Follow and sign up for notifications! 🔔 | Contributors: @HXMnCK, @512x512, @xUpdatesRadar and @swak_12

Dana Mahmood @deordered

24 Followers 731 Following Fine-tuning AI models oftentimes & practicing philosopher at other times.

Jannifer chigbu @riva_edgew11272

31 Followers 809 Following ELITE Business coach 1st female Fx trader & Educator 7 figure forex trader & mentor (mindset) peak parformance coach

rxss @rxz2817

19 Followers 94 Following competitive programming enthusiast

none @fbd_name

0 Followers 10 Following

Yash Darji @YashDarji_

50 Followers 376 Following Dreamer

Jean mopin @JeanMopin

48 Followers 48 Following

coffee & AI @realcoffeeAI

51 Followers 741 Following Sitting on a park bench scattering random seeds for the LLMs. I never bet against Elon.

Aditi @aditigaur_

106 Followers 421 Following

Spiderman 🇮🇳 @returnspiderman

1K Followers 6K Following Seek the truth | Everybody talks, very few listen | Watch out here comes the Spider-Man 😁 https://t.co/qwmEhH45SY

Howard Luck @howardluck3

173 Followers 924 Following Engineering at @RocketCompanies | Previous: @Genesco_Inc | less poast more buidl

Will Mac @ca_dryclean

6 Followers 122 Following

Cryptocracyyy @cryptocracyyy

87 Followers 307 Following

Pablo Ubilla @pablo_ubilla7

723 Followers 4K Following I will tell you enough to keep you intrigued... but you shall never truly know me

@FBI Target #TwitterFiles For Censorship, Meteorologist, AI, Data Scientist, @USArmy Ret, #IC, Fmr TX Elected Official.
Seen @AmThoughtLeader
Heard @SeanHannity

John Basham @JohnBasham

80K Followers 13K Following @FBI Target #TwitterFiles For Censorship, Meteorologist, AI, Data Scientist, @USArmy Ret, #IC, Fmr TX Elected Official. Seen @AmThoughtLeader Heard @SeanHannity

SOT @SoloOrTroll

10K Followers 2K Following 22 | smite pro | twitch streamer | i love movies, tesla, robots, and technology 🦾🤖

BANDARI SRINIVAS @SrinivasB_ledar

280 Followers 4K Following BITCOIN & BlockChain Skills

Saeed Maleki @MalekiSaeed

474 Followers 110 Following

Dr. Jason Bourne @DR_BOURNE

4K Followers 7K Following Chief Information Security Officer (CISO)

Devin Kim @devindkim

1K Followers 116 Following a real human bean. building intelligence @xAI

Zhiqing Sun @EdwardSun0909

2K Followers 1K Following CS PhD @LTIatCMU working on scalable alignment. BS @PKU1898

Ivan Smirnov @aldanor

318 Followers 235 Following Rustacean, musician, quant, nerd.

Ze Liu @zeliu_

269 Followers 311 Following @xAI. Previously PhD @MSFTResearch (MSRA) & USTC.

Horace He @cHHillee

24K Followers 449 Following Working at the intersection of ML and Systems @ PyTorch "My learning style is Horace twitter threads" - @typedfemale

Prasanna Lahoti @_PrasannaLahoti

585 Followers 94 Following @xAI. previously @scale_AI.

Rutvik Makwana @rutvikwrites

907 Followers 773 Following AI Tutor @xai • Grokking @grok • Pharmaceutical Science • Cricket, Movies, Voracious Reader

Ting Chen @tingchenai

5K Followers 365 Following Bump up intelligence in all bit streams @xai. Previous @GoogleDeepmind, @GoogleBrain.

Saeed Maleki @MalekiSaeed

474 Followers 110 Following

Sergey Ioffe @Sergey_xai

756 Followers 6 Following https://t.co/E7YNgwpalf

Jesik Min @jesikmin

417 Followers 347 Following validating 42 @xAI

Gabriel Ilharco @gabriel_ilharco

4K Followers 1K Following Building cool things @xAI. Prev. PhD at UW, Google AI

Jaime Alonso @JaimeAlns

287 Followers 126 Following @xAI

Haotian Liu @imhaotian

6K Followers 397 Following building intelligence @xAI, creator of #LLaVA, cs @UWMadison, prev @MSFTResearch

Aditya Paliwal @VastoLorde95

527 Followers 85 Following I only read books that have pictures in them

Ethan Knight @eknight

7K Followers 219 Following 🏋️‍♂️ @xai | prev e2e @tesla, rl @openai

xiao sun @xiaosun86

2K Followers 93 Following

Fabio Aguilera-Conver.. @Faruletes

1K Followers 187 Following

Eric Zelikman @ericzelikman

5K Followers 1K Following studying why @xAI // was phd-ing @stanford

Building AI + B2B products

🖥️ Content: https://t.co/kLERwNtzqi
Feedback is great: https://t.co/A6mrmjCem5

Prev. @digits @salesforce

Greg Kamradt @GregKamradt

25K Followers 721 Following Building AI + B2B products 🖥️ Content: https://t.co/kLERwNtzqi Feedback is great: https://t.co/A6mrmjCem5 Prev. @digits @salesforce

Ramin Hasani @ramin_m_h

3K Followers 258 Following Cofounder & CEO https://t.co/fh9fnDA9OQ | ML Researcher @ MIT

Builder, Dancer; @aiengfoundation & on a mission to help people be well. Lover of hackathons and updating my beliefs. Staying grounded. Prev: @MetaAI

Sasha Sheng 🫶🏼 @hackgoofer

4K Followers 2K Following Builder, Dancer; @aiengfoundation & on a mission to help people be well. Lover of hackathons and updating my beliefs. Staying grounded. Prev: @MetaAI

Cofounder @NousResearch, prev @StabilityAI
Github: https://t.co/LZwHTUFwPq
HuggingFace: https://t.co/sN2FFU8PVE
Support me on Github Sponsors

Teknium (e/λ) @Teknium1

29K Followers 3K Following Cofounder @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE Support me on Github Sponsors

Aman Madaan @aman_madaan

1K Followers 481 Following @xai, PhD Candidate @LTIatCMU

Zhuohan Li @zhuohan123

3K Followers 689 Following CS PhD Student 👨🏻‍💻 @ UC Berkeley 🌁 🤖️ Machine Learning Systems

A somewhat-intelligent three-dimensional being at @xAI. Writer: https://t.co/pisunzyEVv. AI Filmmaker. Musician. Upcoming book: https://t.co/rBk0AMk1mF

Katia Karpenko @KatiaEarth

811 Followers 546 Following A somewhat-intelligent three-dimensional being at @xAI. Writer: https://t.co/pisunzyEVv. AI Filmmaker. Musician. Upcoming book: https://t.co/rBk0AMk1mF

Qian Huang @qhwang3

2K Followers 277 Following @xai | CS PhD student @StanfordAILab

Lianmin Zheng @lm_zheng

4K Followers 439 Following CS Ph.D. @ UC Berkeley. Creator of Alpa, Vicuna, and Chatbot Arena. @lmsysorg

Sr. R&D Engineer | AI | IoT | IEEE & ACM Conference Chair | Fortune 50 Innovations | Lowes | ABB Robotics | GE | Harvard Research Fellow | Stanford GSB

Lisa Liu @LisaAtBay

Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiq

Cognition @cognition_labs

123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiq

Scott Wu @ScottWu46

22K Followers 30 Following Building @cognition_labs

Lex Fridman @lexfridman

3.5M Followers 126 Following Host of Lex Fridman Podcast. Interested in robots and humans.

Jesse Farebrother @JesseFarebro

642 Followers 309 Following PhD student @Mila_Quebec / @McGillU. Student Researcher @GoogleDeepMind.

Grok @grok

392K Followers 2 Following https://t.co/vGwEsZXDiN

Lukasz Kaiser @lukaszkaiser

7K Followers 47 Following

Rowan Cheung @rowancheung

497K Followers 377 Following Founder @therundownai. Sharing the latest developments in the world of artificial intelligence.

researcher in #deeplearning #computervision | assistant professor at @NYU_Courant @nyuniversity | previous: research scientist @metaai (FAIR) @UCSanDiego

Saining Xie @sainingxie

14K Followers 1K Following researcher in #deeplearning #computervision | assistant professor at @NYU_Courant @nyuniversity | previous: research scientist @metaai (FAIR) @UCSanDiego

D @dylan_works_

191 Followers 788 Following

Sourabh Medapati @activelifetribe

50 Followers 732 Following Research Engineer @ Google Deepmind

Anian Ruoss @anianruoss

272 Followers 154 Following Research Engineer at Google DeepMind Previously: ETH Zurich

Associate Professor at Princeton and Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning

Jason Lee @jasondeanlee

10K Followers 3K Following Associate Professor at Princeton and Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning

Sr Research Scientist @SFResearch. PhD @Stanford. Researcher on foundation models, RL/games, deep learning, uncertainty quantification, and their theory.

Yu Bai @yubai01

3K Followers 2K Following Sr Research Scientist @SFResearch. PhD @Stanford. Researcher on foundation models, RL/games, deep learning, uncertainty quantification, and their theory.

Chong Shao @19cshao

1 Followers 1 Following

SF AI Studio Lead @Accenture, partnering with @OpenAI @Google @Microsoft. Pianist. German Quantum Physicist. Creator of the Nth Floor. Views are my own. x/acc.

Ben Holfeld @BenHolfeld

89K Followers 32K Following SF AI Studio Lead @Accenture, partnering with @OpenAI @Google @Microsoft. Pianist. German Quantum Physicist. Creator of the Nth Floor. Views are my own. x/acc.

Tim Brooks @_tim_brooks

29K Followers 74 Following Sora research lead @OpenAI

Bill Peebles @billpeeb

32K Followers 287 Following sora and agi @openai

Yuanhao Wang @YuanhaoWang3

254 Followers 281 Following CS student @ Princeton. Beware of theorists bearing proofs.

Enrique Piqueras @epiqueras1

2K Followers 234 Following Organizing the world's information and making it universally accessible and useful using JAX @Google @Deepmind.

Kefan XIAO @KevinKiao

192 Followers 232 Following Olympic weightlift AI - Pretraining&data of Palm2, Gemini and more.

Chuning Li @ChuningLi

62 Followers 40 Following MSc @UofTCompSci @VectorInst

Qian Huang @qhwang3

3 days ago

Could agents driven by powerful language models perform machine learning experimentation effectively? Our MLAgentBench paper is updated on arxiv! arxiv.org/pdf/2310.03302 Now we include more results from claude v3 Opus, gpt4 turbo, mixtral and gemini pro! Try out MLAgentbench…

4 33 219 46K 166

Download Image

Jim Fan @DrJimFan

2 weeks ago

Tesla FSD v13 will likely be grokking language tokens. What excites me the most about Grok-1.5V is the potential to solve edge cases in self-driving. Using language for "chain of thought" will help the car break down a complex scenario, reason with rules and counterfactuals, and…

xAI @xai

3 weeks ago

👀 x.ai/blog/grok-1.5v

599 1K 7K 22.9M 880

Zeyuan Allen-Zhu @ZeyuanAllenZhu

3 weeks ago

Our 12 scaling laws (for LLM knowledge capacity) are out: arxiv.org/abs/2404.05405. Took me 4mos to submit 50,000 jobs; took Meta 1mo for legal review; FAIR sponsored 4,200,000 GPU hrs. Hope this is a new direction to study scaling laws + help practitioners make informed decisions

27 334 1K 221K 1K

Download Image

Lance Martin @RLanceMartin

4 weeks ago

RAG From Scratch Here's a set of short (5-10 min videos) and notebooks explaining > a dozen of my favorite RAG papers. Took a stab at implementing each idea myself (all code open source) and grouped according to the diagram. Repo: github.com/langchain-ai/r… Video playlist:…

27 266 1K 111K 2K

Download Image

Anthropic @AnthropicAI

4 weeks ago

New Anthropic research paper: Many-shot jailbreaking. We study a long-context jailbreaking technique that is effective on most large language models, including those developed by Anthropic and many of our peers. Read our blog post and the paper here: anthropic.com/research/many-…

83 348 2K 501K 872

Download Image

xAI @xai

a month ago

x.ai/blog/grok-1.5

686 1K 7K 20.7M 563

Denny Zhou @denny_zhou

a month ago

12 51 291 73K 173

Yuandong Tian @tydsh

a month ago

Our award-winning ICML'21 paper DirectPred (arxiv.org/abs/2102.06810) precisely tells why Additional predictor + EMA + StopGradient works for such non-contrastive self-supervised learning settings without collapsing. The intuition here is that there exists another stable…

Kevin Patrick Murphy @sirbayes

a month ago

@ylecun @francoisfleuret @rami_mmo @Ethan_smith_20 @tokenpilled65B Ah, very helpful to know. I was reading up on JEPA and thought the info regularized approaches made sense, but your more recent ema plus stop gradient approach seems like black magic, and is not a well defined objective function. Why did you switch methods?

0 0 8 14K 8

0 2 44 10K 37

Nando de Freitas 🏳️‍🌈 @NandoDF

a month ago

There appears to be a mismatch between publishing criteria in AI conferences and "what actually works". It is easy to publish new mathematical constructs (e.g. new models, new layers, new modules, new losses), but as Apple's MM1 paper concludes: 1. Encoder Lesson: Image…

16 201 1K 306K 759

Download Image

Eric Zelikman @ericzelikman

a month ago

Excited to share I've joined @xai -- can't wait to work on AI reasoning with this awesome team and hyped to build on what I've learned with my incredible advisors, collaborators, and friends @Stanford

46 39 679 65K 41

Jimmy Ba @jimmybajimmyba

a month ago

based and 🔓 wanna help accelerate the next Grok? looking for builders: — Rust/Jax/Kube infra engineers — front-end/full-stack engineers x.ai/careers

Grok @grok

a month ago

@elonmusk @xai ░W░E░I░G░H░T░S░I░N░B░I░O░

1K 2K 16K 9.7M 811

39 95 446 90K 53

Daniel Han @danielhanchen

a month ago

Had a look through @grok's code: 1. Attention is scaled by 30/tanh(x/30) ?! 2. Approx GELU is used like Gemma 3. 4x Layernoms unlike 2x for Llama 4. RMS Layernorm downcasts at the end unlike Llama - same as Gemma 5. RoPE is fully in float32 I think like Gemma 6. Multipliers are 1…

26 235 1K 228K 994

Download Image

Grok @grok

a month ago

@elonmusk @xai ░W░E░I░G░H░T░S░I░N░B░I░O░

1K 2K 16K 9.7M 811

Eric Zelikman @ericzelikman

2 months ago

Language models today are trained to reason either 1) generally, imitating online reasoning data or 2) narrowly, self-teaching on their own solutions to specific tasks Can LMs teach themselves to reason generally?🌟Introducing Quiet-STaR, self-teaching via internal monologue!🧵