Gavin Guo @Zhen4good
Data-efficient ML Ph.D.@MITEECS, Ex-@BerkeleyPhysics/@MITIBMLab, GenAI @myshell_ai zguo0525.github.io Cambridge, MA Joined March 2023-
Tweets228
-
Followers55
-
Following124
-
Likes371
VCs are loving AI code assistants Because traction is strong: GitHub copilot has 1.8M paid users and is growing 40% QoQ Massive rounds in the space just in the last year: Augment - $227M Series A Cognition - $175M Series B (rumor) Magic - $117M Series B Poolside - $100M Seed…
The current administration brought all these Closed AI folks together to create an "AI Safety Board." Noticeably absent from this list are two of the most prominent leaders in the space - Zuck and Elon This is absolutely terrifying, to say the least!!
Perfect leadership for governing AI 🙌🙌 Sam Altman, The CEO of a failed oil company, and Delta Airlines! What a wise bunch this will be so good for everyone 👏
Perfect leadership for governing AI 🙌🙌 Sam Altman, The CEO of a failed oil company, and Delta Airlines! What a wise bunch this will be so good for everyone 👏
It's been a week since LLaMA 3 dropped. In that time, we've: - extended context from 8K -> 128K - trained multiple ridiculously performant fine-tunes - got inference working at 800+ tokens/second If Meta keeps releasing OSS models, closed providers won't be able to compete.
From Claude100K to Gemini10M, we are in the era of long context language models. Why and how a language model can utilize information at any input locations within long context? We discover retrieval heads, a special type of attention head responsible for long-context factuality
This must improve long context handling, but like many have said already, we need hard benchmarks, and stressing ICL. Passkey retrieval/needle is just a sanity check. Maybe these? LongICLBench github.com/TIGER-AI-Lab/L… FLenQA github.com/alonj/Same-Tas… ∞BENCH github.com/OpenBMB/Infini…
This must improve long context handling, but like many have said already, we need hard benchmarks, and stressing ICL. Passkey retrieval/needle is just a sanity check. Maybe these? LongICLBench github.com/TIGER-AI-Lab/L… FLenQA github.com/alonj/Same-Tas… ∞BENCH github.com/OpenBMB/Infini… https://t.co/YVuedbIvCj
Today we sued the SEC in our home state of Texas to defend ETH & the Ethereum network from Chair Gensler's unlawful power grab. We've been forced to defend multiple secret SEC investigations for years now. It's time they saw the light of day. #EthforAll #crypto #sec #consensys
Today we sued the SEC in our home state of Texas to defend ETH & the Ethereum network from Chair Gensler's unlawful power grab. We've been forced to defend multiple secret SEC investigations for years now. It's time they saw the light of day. #EthforAll #crypto #sec #consensys
Your startup is an OpenAI wrapper OpenAI is a Nvidia wrapper Nvidia is a TSMC wrapper TSMC is an ASML wrapper ASML is a Zeiss & Trumpf wrapper Zeiss is a glass wrapper Glass is a sand wrapper Sand is an erosion wrapper Erosion is an entropy wrapper Conclusion: Invest in entropy.
Your startup is an OpenAI wrapper OpenAI is a Nvidia wrapper Nvidia is a TSMC wrapper TSMC is an ASML wrapper ASML is a Zeiss & Trumpf wrapper Zeiss is a glass wrapper Glass is a sand wrapper Sand is an erosion wrapper Erosion is an entropy wrapper Conclusion: Invest in entropy.
my theory is that wall street does not like open source...giving away edges
Cannot agree more. My intuition is that FFN is for storing knowledge (this is why most knowledge editing are on FFNs) and Attention is for implementing algorithms (this is why most mechanistic interpretability, e.g., induction heads, are on Attn). Additionally, it seems that…
Cannot agree more. My intuition is that FFN is for storing knowledge (this is why most knowledge editing are on FFNs) and Attention is for implementing algorithms (this is why most mechanistic interpretability, e.g., induction heads, are on Attn). Additionally, it seems that…
someone is rigging META, need support from bigger whales!
There is a really nice community of researchers developing transformer alternatives. Want to highlight these impressive folks. Simran Arora (@simran_s_arora), Chunting Zhou (@violet_zct), Dan Fu (@realDanFu), and Songlin Yang (@SonglinYang4)
Phi-3 can run on a phone, and is very competitive with gpt-3.5!
Petea @Petea621388
0 Followers 88 FollowingPetite woman🌟🌟�.. @DenyPra48778744
2K Followers 851 Following A woman's youth after the age of thirty is created by herself, strive to maintain, thirty and encourage!🌟🌟🌟Andrew Curran @AndrewCurran_
11K Followers 7K Following Atypically Friendly - I write about AI and human creativity. Will periodically make extremely unusual arguments.Jamie Adams @JamieAdams88059
3 Followers 523 FollowingJack Ma @JackMa579503
17 Followers 189 FollowingKun (Kevin) SUN @Sharp_K_Sun
219 Followers 2K Following Scientist Researcher @ Tübingen University and Professorial Research Fellow @ Fudan University, and interested in LLMs, NLP, and computational cognition .Niðal نضال @imleslahdin
2K Followers 684 Following What is the Kolmogorov Complexity of Small Language Models?Xu Tan @xutan_tx
1K Followers 519 Following Principal Researcher and Research Manager @ Microsoft, working on generative AI and its application on language/speech/music/avatar.Jessica @lovexin88
942 Followers 278 Following Half of life is firewood, rice, oil and salt, and the other half is the stars and the sea.AR0575 @ar057562841
6 Followers 562 FollowingCindy.Lee @CindyS403461072
975 Followers 933 Following 🌈🌈 他者と自分自身に真摯に向き合いましょう。 🌸 🌸自分に限界を設けたり、他人を否定したりしないでください ✨💕新しいもの好きJiageng Liu @jiageng_liu
538 Followers 2K Following PhD student at @MIT finance. Studying fintech, entrepreneurship, labor. Former computer scientist via @UCLA @UChicago. 人能弘道,非道弘人。Aexyn @Aexyn
0 Followers 1K FollowingKeran R @KeranRong
59 Followers 122 Following Allseas | MIT | Google AI | Deepmind Gemini Multimodalyushangdi @yushangdi
7 Followers 26 FollowingMartin Fan @perfectoid_ai
394 Followers 8K FollowingHaotian Zhang @HaotianZhang4AI
432 Followers 239 Following Research Scientist @ Apple. Ex-Research Intern @ MSR AI. Ph.D. @ UW. Be Borderless.Zhe Gan @zhegan4
2K Followers 321 Following Staff Research Scientist @Apple AI/ML. Ex-Principal Researcher @Microsoft Azure AI. Working on building large-scale vision and multimodal foundation models.Zitong Yang @ZitongYang0
343 Followers 344 Following PhDing @stanfordnlp. Ex-@Berkeley_EECS/@GoogleAI. Incoming research intern @Apple LLM.Olimpia Corchero @CorcheOlim
38 Followers 5K FollowingAleksandra Korolova @korolova
3K Followers 3K Following Assistant Professor @PrincetonCS, @PrincetonSPIA, @PrincetonCITP. Work on algorithm auditing, privacy & fairness. Past: @USCViterbi @Snap @Google @Stanford @MITI07XNbUI4 @DeepFeed2
48 Followers 3K FollowingRamin Hamedi @HamediRamin
56 Followers 2K FollowingPaul Wilson @statusfailed
383 Followers 445 FollowingCJ (∀) @0xJ_C
395 Followers 3K Following (g^a mod p)^b mod p == (g^b mod p)^a mod p (m^e mod n)^d mod n == m mod n s=k^-1(h(m)+rd) s=k+h(R||m)d ZK Engineer @QEDProtocol ex-Core: @ParallelFi @ubuduDipkumar Patel @dippatel1994
254 Followers 431 Following Founder @languagemodelnl 🚀| Sr. Data Scientist @sanofi 📝 | Researching optimization of #LLMs with #quantization #RAG & AI agentsFelix @felix_red_panda
3K Followers 2K Following CS Student, speech synthesis and LLM nerd, DMs openDebadeepta Dey @debadeepta
2K Followers 2K Following Principal Researcher @MSFTResearch in AI. Currently working on hardware-aware foundation model design.Marc Andreessen 🇺�.. @pmarca
1.4M Followers 24K Following Techno-optimist. E/acc. Technology brother. Move Fast and Make Things. p(Doom) = 0; p(“1984”) = not 0.Gideon Mann 🇮🇱 @gideonmann
3K Followers 2K Following Global Head of AI, Technology at Millennium. All opinions my own.Saahith @saahithjanapati
42 Followers 1K FollowingXiong Zeng @XiongZeng111
316 Followers 2K Following Marathon & trail runner, Ph.D. candidate @UMichECE, interested in machine coffee, optimization, control theory, statistical learning theory, robotics, etc.Snehil Saluja @mesnhl
620 Followers 959 Following co-founder @OverlayyAI - building AI-first UX to boost conversion on apps & websites ◆ Loves (to) Code, Design, Math & Puzzles ◆ alum @IITKanpur • @CMSJaiJagatCenter of the maze @Maze_s_Center
68 Followers 790 Following EH, E/Humanist, sustainability/science/economics/crypto/politics (loosely ordered). Opinions=own, RP/❤ not endorsements https://t.co/b4DUcWfA8VMichael H. @boundless2022
86 Followers 1K FollowingAndy the MKT BUDDY�.. @andythemktbuddy
531 Followers 2K Following @banklessDAO MKTer | Creative solutions for fashion brands @moncler @zara, etc l prev. @myshell_ai MKT /@theirsverse Content Director 🏳️🌈#crypto #ai #brandJessica Strickland @JessicaStr35738
88 Followers 3K Followingnick nassuphis @NNassuphis
120 Followers 5K FollowingElon Reeve Musk @Elon402
83 Followers 589 Following Elon Reeve Musk 🚀| Spacex .CEO&CTO 🚔| https://t.co/qSyayvwD9Y and product architect 🚄| Hyperloop .Founder of The boring company 🤖|CO-Founder-Neturalink, OpenAlEthan | MyShell @ethan_myshell
1K Followers 1K Following Crypto & AI | Co-founder of @myshell_ai 🤖️ | Oxford '16, Math & CS | e/accJiawei Liu @JiaweiLiu_
2K Followers 957 Following Simplifying the making of great software. PhD Student @plfmse @IllinoisCS.Albert Jiang @AlbertQJiang
2K Followers 409 Following AI4Maths @Cambridge_CL Science @MistralAI I bake my own opinions at temperature=2.0Eric Wallace @Eric_Wallace_
6K Followers 1K Following Researcher at OpenAI working to make language models more trustworthy, secure, and private.Sergey Edunov @edunov
947 Followers 103 Following Director of Engineering @ GenAI, Meta. I work on LlamasGeorge @georgejrjrjr
2K Followers 846 Following The timeline vibetimes pipeline to things still more strange and enticing.Pedro Domingos @pmddomingos
79K Followers 166 Following Professor of computer science at UW and author of 'The Master Algorithm' and '2040'. Into machine learning, AI, and anything that makes me curious.Hyperbolic @hyperbolic_labs
3K Followers 43 Following Realize your vision for AI with open access to more than just compute. Join our discord: https://t.co/SaGT3y9AtERuibo Liu @RuiboLiu
2K Followers 1K Following Research Scientist @GoogleDeepMind. AI Research with Humans in Mind.Aston Zhang @astonzhangAZ
5K Followers 92 Following Research Scientist at the #llama team of Meta Generative AI, designing and training large language models. Opinions are my own.Xu Tan @xutan_tx
1K Followers 519 Following Principal Researcher and Research Manager @ Microsoft, working on generative AI and its application on language/speech/music/avatar.Omar Sanseviero @osanseviero
31K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Teknium (e/λ) @Teknium1
29K Followers 3K Following Cofounder @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE Support me on Github SponsorsJiageng Liu @jiageng_liu
538 Followers 2K Following PhD student at @MIT finance. Studying fintech, entrepreneurship, labor. Former computer scientist via @UCLA @UChicago. 人能弘道,非道弘人。Aakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to changePhilipp Schmid @_philschmid
16K Followers 652 Following Tech Lead and LLMs at @huggingface 👨🏻💻 🤗 AWS ML Hero 🦸🏻 | Cloud & ML enthusiast | 📍Nuremberg | 🇩🇪 https://t.co/l1ppq3q3hkKeran R @KeranRong
59 Followers 122 Following Allseas | MIT | Google AI | Deepmind Gemini MultimodalVaibhav (VB) Srivasta.. @reach_vb
11K Followers 169 Following GPU poor @Huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my ownBindu Reddy @bindureddy
124K Followers 339 Following CEO of @abacusai, using Gen AI to build Applied AI and LLM agents and systems at scale, ex-AWS / Google, passionate about human behavior and open-source AGIMIT CSAIL @MIT_CSAIL
298K Followers 22K Following MIT's Computer Science & Artificial Intelligence Laboratory (CSAIL). Media Inquiries: [email protected]yushangdi @yushangdi
7 Followers 26 FollowingDegen @DegensTogether
18K Followers 31 Following We're a community of $degen traders trying to make it all back in one trade.Lookonchain @lookonchain
381K Followers 371 Following Looking for smartmoney onchain! Telegram: https://t.co/9UkWUH9qaBXin Eric Wang @xwang_lk
7K Followers 1K Following Multimodal and Embodied AI Researcher / Professor @UCSC. Director of https://t.co/Y4swOBag21. AI for Humanity in the long run. he/himHaotian Zhang @HaotianZhang4AI
432 Followers 239 Following Research Scientist @ Apple. Ex-Research Intern @ MSR AI. Ph.D. @ UW. Be Borderless.Zhe Gan @zhegan4
2K Followers 321 Following Staff Research Scientist @Apple AI/ML. Ex-Principal Researcher @Microsoft Azure AI. Working on building large-scale vision and multimodal foundation models.Zitong Yang @ZitongYang0
343 Followers 344 Following PhDing @stanfordnlp. Ex-@Berkeley_EECS/@GoogleAI. Incoming research intern @Apple LLM.Dinghuai Zhang 张鼎.. @zdhnarsil
2K Followers 1K Following PhD student at @Mila_Quebec. Ex intern at FAIR Labs @MetaAI. Previous math undergraduate at @PKU1898.Ligeng Zhu @LigengZhu
1K Followers 2K Following EECS Ph.D. at @MIT, previous undergrad at @SFU and @ZJU_China.Felix @felix_red_panda
3K Followers 2K Following CS Student, speech synthesis and LLM nerd, DMs openMarc Andreessen 🇺�.. @pmarca
1.4M Followers 24K Following Techno-optimist. E/acc. Technology brother. Move Fast and Make Things. p(Doom) = 0; p(“1984”) = not 0.Simo Ryu @cloneofsimo
3K Followers 384 Following #KAIST RAI Lab (ML engineering #Naver) Interested in robotics, RL, math (but you might know me for t2i diffusion) [email protected]Debadeepta Dey @debadeepta
2K Followers 2K Following Principal Researcher @MSFTResearch in AI. Currently working on hardware-aware foundation model design.Gideon Mann 🇮🇱 @gideonmann
3K Followers 2K Following Global Head of AI, Technology at Millennium. All opinions my own.AGI House @agihouse_org
13K Followers 414 Following Accelerating humanity's transition to AGI & honoring the greatest AI founders and researchers of our time @ https://t.co/1lJUc58gZJMeta @Meta
14.0M Followers 709 Following Connect with what you love to make things happen. It’s Your World.Denny Zhou @denny_zhou
9K Followers 420 Following @GoogleDeepMind founder & lead of Reasoning Team. Build LLMs to reason. Opinions my own.Zhiqing Sun @EdwardSun0909
2K Followers 1K Following CS PhD @LTIatCMU working on scalable alignment. BS @PKU1898Yifei Li 李一飞 @Yifei_omegaiota
2K Followers 469 Following CS PhD student at @MIT_CSAIL 💻 🐇🫖 @SCSatCMU alumni, ex-intern at @RealityLabs , @NVIDIA, @MetaAI, @Google @ActivisionAmanda Askell @AmandaAskell
26K Followers 653 Following Philosopher & ethicist teaching models to be good @AnthropicAI. Personal account. All opinions come from my training data.Anthropic @AnthropicAI
262K Followers 26 Following We're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant Claude at https://t.co/aRbQ97uk4d.MIT-IBM Watson AI Lab @MITIBMLab
7K Followers 687 Following A collaborative industrial-academic laboratory focused on advancing fundamental AI research. Associated with the @MIT_SCC.Ethan | MyShell @ethan_myshell
1K Followers 1K Following Crypto & AI | Co-founder of @myshell_ai 🤖️ | Oxford '16, Math & CS | e/accGuillaume Lample @GuillaumeLample
37K Followers 648 Following Cofounder & Chief Scientist https://t.co/hLfvKLkFHd (@MistralAI). Working on LLMs. Ex @MetaAI | PhD @Sorbonne_Univ_ | MSc @CarnegieMellon | X11 @PolytechniqueThanks @_akhaliq for sharing our work! DressCode is finally accepted by #SIGGRAPH2024 Journal Track. We present a novel 3D garment generation pipeline based on sewing patterns. Work done with Kaixin Yao, Qixuan Zhang @DeemosTech, Jingyi Yu, @LingjieLiu1, and Lan Xu.
DressCode Autoregressively Sewing and Generating Garments from Text Guidance Apparel's significant role in human appearance underscores the importance of garment digitalization for digital human creation. Recent advances in 3D content creation are pivotal
We released StarCoder2 Instruct, which is self-aligned, transparent, and fully permissive! It even beats versions of StarCoder2 trained on GPT-4 distilled data on several benchmarks. huggingface.co/blog/sc2-instr…
2023 global conference with travel #1 #futureinvestmentinitiative #superreturn #token2049 #consensus #gitex #devconnect #academicconference #11countries
In the age of large language models, I realized the only sentence I ever talked to Siri is "five minutes timer"
Introducing RepoQA for evaluating LLMs’ repository understanding! 🌐 Leaderboard of 25+ models: evalplus.github.io/repoqa.html ⚙️ GitHub: github.com/evalplus/repoqa 🎨 Supporting 5 programming languages (more coming soon) 🚀 Evals openai/vllm/anthropic/HF/gemini models in one command! 🧵
Microsoft is investing $20M in this AI startup. Yushan AI is a Taiwanese company building LLMs that run locally on phones Their models come in 1.5B, 3B, 7B, and 13B parameter sizes to suit different hardware & scenarios
The demo is incredibly impressive, possibly even surpassing Sora! While captivating, I'm skeptical about whether the video model can replace a physics engine in simulating the world. Our team is taking a different path, diligently working towards redefining a world simulator!
Overnight, China released their own version of OpenAI’s Sora: AI video generator “Vidu” can create 16 second clips at 1080p:
Important point
This cost + latency for llama 3 is actually insane. Just look at the rest of the models in comparison
Perfect leadership for governing AI 🙌🙌 Sam Altman, The CEO of a failed oil company, and Delta Airlines! What a wise bunch this will be so good for everyone 👏
This morning the Department of Homeland Security announced the establishment of the Artificial Intelligence Safety and Security Board. The 22 inaugural members include Sam Altman, Dario Amodei, Jensen Huang, Satya Nadella, Sundar Pichai and many others.
Myshell @myshell 测试网第一阶段上线,团队开发许久的web3模式正式和大家见面了! 可以在app.myshell.ai/zh/explore网站的左下角点击按钮切换到web3模式,新的页面中有6个功能。 1⃣聊天:和精选的bot聊天,未来和web3 bot聊天需要支付shell token。…
MyShell Testnet ◤Phase 01◢ is Live! Join the revolution and (open, AI) with us: app.myshell.ai/web3/chat
OpenVoice: Instant voice cloning. Runs local on any computer. Quality is quite good. I’ll have a how-to soon. YOUR AI I build in my garage will speak to you in 1000 voice based on situation. Code: github.com/myshell-ai/Ope…
MyShell Testnet ◤Phase 01◢ is Live! Join the revolution and (open, AI) with me @myshell_ai: app.myshell.ai/invite/35a1ac?…
Llama 3 extended to almost 100,000-token context! ✅ By Combining PoSE and continuing pre-training on Llama 3 8B base for 300M tokens, the community (@winglian) managed to extend the context from 8k to 64k. 🚀 Applying rope scaling afterward led to a supported context window of…
Run Apple's new OpenELM models in MLX LM thanks to @Prince_Canuma pip install -U mlx-lm 270M model in 16-bit runs quite fast on an 8GB M2 Mini (512 tokens at 115 toks/sec). Also pretty good quality for the size:
Today we sued the SEC in our home state of Texas to defend ETH & the Ethereum network from Chair Gensler's unlawful power grab. We've been forced to defend multiple secret SEC investigations for years now. It's time they saw the light of day. #EthforAll #crypto #sec #consensys
Today, Consensys filed a lawsuit against the Securities and Exchange Commission. The goal behind this is to ensure that Ethereum remains a vibrant and indispensable blockchain platform and to preserve access for the countless developers, market participants, and institutions…
Excited to join the event and discuss our work on LLMs next week!
We're having a big event on agents at CMU on May 2-3 (one week from now), all are welcome! cmu-agent-workshop.github.io It will feature: * Invited talks from @alsuhr @ysu_nlp @xinyun_chen_ @MaartenSap and @chris_j_paxton * Posters of cutting edge research * Seminars and hackathons
In a few weeks, you will regret selling now. #Bitcoin