Kaizhao Liang @KyleLiang5
ML @ SambaNova Systems kyleliang919.github.io Austin, Texas Joined December 2018-
Tweets2K
-
Followers213
-
Following55
-
Likes7K
Should you do research in a AI startup? Does it burn someone else's money and your equity? Or is it the key to success? If you do it, how do you manage it? Drawing on experiences at Xerox PARC, Amazon, Body Labs, & @meshcapade, I try to shed some light: medium.com/@black_51980/s…
Exclusive: OpenAI has been developing a web search service in its latest challenge to Google. theinformation.com/articles/opena… By @aaronpholmes
@AndrewYNg Thanks for the shout out, Andrew. We are focused on improving inference throughput in tokens per second for the reasons you describe. Also pushing hard to make training and fine-tuning significantly more efficient. Same platform - training, fine-tuning and high speed…
Much has been said about many companies’ desire for more compute (as well as data) to train larger foundation models. I think it’s under-appreciated that we have nowhere near enough compute available for inference on foundation models as well. Years ago, when I was leading teams…
People haven't learned yet that they shouldn't trust cherry-picked demos in AI (both fake or engineered)? x.com/ClementDelangu…
People haven't learned yet that they shouldn't trust cherry-picked demos in AI (both fake or engineered)? x.com/ClementDelangu…
Every single pretraining run is a leap of faith into the hyperspace. We are the navigator 🫠
❓Wanna host a Llama2-7B-128K (14GB weight + 64GB KV cache) at home🤔 📢 Introducing TriForce! 🚀Lossless Ultra-Fast Long Seq Generation — training-free Spec Dec! 🌟 🔥 TriForce serves with 0.1s/token on 2 RTX4090s + CPU – only 2x slower on an A100 (~55ms on chip), 8x faster…
Come work with us for 6 months on synthetic data research! Reach out to amazing @ecats_ if you are interested 🔥
Glad to join the startup panel at @UCBerkeley! Grateful for the invite @ASUC_Berkeley @ACE_Berkeley. It was an enriching experience to share my startup journey with DeepMusic and @SambaNovaAI, and learn from other panelists and the audience 🚀 #BerkeleyInnovates #LLM
" LLAMA3 still suffers non-negligent degradation in these scenarios, especially in ultra-low bit-width. " Very interesting paper in the Large Language Model space named "How Good Are Low-bit Quantized LLAMA3 Models? An Empirical Study" 📌 This research dives deep into…
not true, especially for language. if you trained a large & deep MLP language model with no self-attention, no matter how much data you'll feed it you'll still be lacking behind a transformer (with much less data). will it get to the same point? i don't think so. your tokens…
not true, especially for language. if you trained a large & deep MLP language model with no self-attention, no matter how much data you'll feed it you'll still be lacking behind a transformer (with much less data). will it get to the same point? i don't think so. your tokens…
Today is the beginning of our moonshot to solve embodied AGI in the physical world. I’m so excited to announce Project GR00T, our new initiative to create a general-purpose foundation model for humanoid robot learning. The GR00T model will enable a robot to understand multimodal…
How Good Are Low-bit Quantized LLaMA3 Models? Meta's LLaMA family has become one of the most powerful open-source Large Language Model (LLM) series. Notably, LLaMA3 models have recently been released and achieve impressive performance across various with super-large scale
ID-Animator: Zero-Shot Identity-Preserving Human Video Generation Presents a zero-shot human-video generation approach that can perform personalized video generation given single reference facial image without further training proj: id-animator.github.io abs:…
Don’t blindly base your decision on which LLM to use on broken benchmarks like MMLU... If you are serious about choosing the right LLM for your use case, you NEED to create an eval of your own. Let’s talk about how you can make one 🧵 x.com/nearcyan/statu…
Don’t blindly base your decision on which LLM to use on broken benchmarks like MMLU... If you are serious about choosing the right LLM for your use case, you NEED to create an eval of your own. Let’s talk about how you can make one 🧵 x.com/nearcyan/statu…
Nicolas Keller @Nicolas_Keller
837 Followers 5K Following Interested in science-based startups. Having the time of my life @meshcapade; angel investor; ex Vsquared Ventures, ex @FRANKAROBOTICS; @iGEM alumnusVisiting Fellow, Ph.D.. @jackiefloyd
1K Followers 2K Following AI and geophysics. Earth foundation models. Accused of having big ideas. Also, cats. Be fearless. UT-Austin BS, Columbia PhD. Past: @UTGeophysics @LamontEarthnvbkdw @nvbkdw
89 Followers 662 Following random tweets about tech and software engineering Building Next-gen AI inference infrastructure ex @AWS, @Apple, @Uber, etc.Ajay Jain @ajayj_
6K Followers 3K Following Co-founder @genmoai. Co-created denoising diffusion (DDPM), DreamFusion, Dream Fields. Ex Ph.D. @berkeley_ai, @googleai, @facebookai, @nvidiaai, @mitEmilyHarper @CwvkngPtg40cdq
0 Followers 35 FollowingKiran Ranganath @Kiran_Ranganath
140 Followers 1K Following PhD @ucr_ece | ML Systems researcher @sambaNovaAI | Classical Liberalism, Public Policy, Literature, ಕನ್ನಡ, संस्कृतम्, and 🌱 food.Eddy Emmanuel @youngboi_eddy
110 Followers 431 Following Machine learning //Artificial intelligence//crypto enthusiast. GitHub: https://t.co/pLyM6JSfh5 LinkedIn:https://t.co/iLoDYlwIXDDawei Huang @Dawei_Huang
23 Followers 78 FollowingSumti Jairath @SumtiJairath
21 Followers 60 Followingaiforcodeandproofs @solvay_1927
130 Followers 275 Following NLP, SAT/SMT solving, Automated theorem provingbun.bun.🐽 @DS_Bun_
19K Followers 15K Following love #pugs, senior data scientist @datafying my tweet = data science, machine learning, ai, deep learning and pug as well.amit ⚡️ @gravicle
7K Followers 4K Following ceo @LumaLabsAI | prev: built Vision Pro at | everything is figureoutableElon @Elonmk002
23 Followers 208 FollowingKevin Hu 🤖 @OldGunix
327 Followers 1K Following Am I a human dreaming to be a robot? Or am I a robot dreaming to be a human?Jaideep Sarkar @thisisjaidsar
34 Followers 268 Following Builder. Head of Software and ML @ Sambanova Systems. Passionate about solving business problems with AI. Opinions are solely mine.Haige Bo @HaigeBo2819
1 Followers 14 FollowingMatthew Povey @mattpovey
103 Followers 307 Following It’s a new day, it’s a new Twitter account. Yorkshireman in Amsterdam.Fanchao Chen @FanchaoChen
219 Followers 1K Following PhD Student @WisconsinCS | Prev. @FudanUni, @NTUsg, @ucbrise, @MSFTResearch, Moonshot AI | Machine Learning SystemsVaibhav @vaibhav_p1234
428 Followers 897 Following Unraveling AI complexities, crafting user-friendly innovations. Bridging the gap between intricate tech and practical applications.Ameen Patel @Ameen_ml
103 Followers 714 Following Staff ML Engineer @togethercompute Interested in deep learning and distributed systemsChaowei Xiao @ChaoweiX
2K Followers 454 Following Assistant Professor @University of Wisconsin, Madison Researcher@NVIDIA| Researcher on AI Safety/SecurityRichard Halkett @RichardHalkett
462 Followers 643 Following Oakham born, Wigan bred, LA living. Chief Customer Officer at SambaNova Systems. Passionate about life, wife, sons, history, tech, friends & politics.Mingran Wang @MingranW
9 Followers 15 FollowingXu Cao @IrohXu
77 Followers 74 Following CS PhD Student @IllinoisCS; Chief Research Scientist of PediaMed AI. ML&AIGC&LLM4AD&AI4ASD researcher.Changran Hu @changran_hu
58 Followers 118 Following SambaNova | Berkeley | Tsinghua | Co-founder of DeepMusicX @Christi29229134
9 Followers 115 Following LLMs and conversational ads @Microsoft, prev. HAI @Stanford, semi-covariance estimation @erasmusuni. views my ownDave Munichiello @davemuni
3K Followers 1K Following Managing Partner @GVteam (Google Ventures) where we support high-growth tech entrepreneurs. https://t.co/igCXzcaDbZ. Previously GtM/Ops leader @KivaSystems, @Amazon. Veteran.Junbo Li @ljb121002
85 Followers 213 Following ML student @ Sailing lab @mbzuai @mldcmu; A core team member of @llm360; Undergrad from @FudanUni Math School. Incoming Ph.D. student @UTCompSciXiaoyuan (Isaac) Wang @IsaacXiaoyuan
63 Followers 502 Following MS @CarnegieMellon | Interested in 3D vision, visual generation, and visual reasoningAna Rojo-Echeburúa @arojomaths
720 Followers 3K Following Data Science & AI || PhD in Applied Mathematics || Spanish living in Scotland || Crossfit Athlete || Content Creatorapple pie @love_cooc
743 Followers 772 Following Professional beauty,Traveler 🌍✈🚁 fitness, skiing, diving🏇⛷🧘♀️🤿Jeffrey Wolberg @JeffreyWolberg
188 Followers 1K Following Columbia '25, Computer Science; Appreciative of all the wonderful things about modern life.Helena Bawerman @HelenafjBawerm
7 Followers 387 Following Gathered on the site of girls from all US states 😉 Ready for private meetings See nude photos before a date! Watching this https://t.co/Vu1N50ME3aBlaze (Balázs Galamb.. @gblazex
1K Followers 974 Following A Smooth Guy; Developer of SmoothScroll for macOS, Windows & Google Chrome.Melanie @Sheeshe195591
179 Followers 4K Following See the world on the road, and get to know yourself on the way!Jason Weston @jaseweston
9K Followers 568 Following Research @MetaAI+NYU. Pretrain+FT: NLP from Scratch (2011). Multilayer attention+position embed+LLM: MemNets (2015). Recent (2023+):Sys 2 Attn, Self-Rewarding..Zeyu Qin @ZeyuQin_alan
119 Followers 853 Following Ph.D. student of CSE at HKUST @hkust, AI Safety, and Stability of ML methods and models.Kadir Ersoy @Ersoy_kadir1
8 Followers 118 FollowingUrmish Thakker @UrmishThakker
424 Followers 1K Following LLM @SambanovaAI | | Ex-@arm research| @mlperf1| @BigscienceW| @TXInstruments,@AMD| @WisconsinCS| @bitspilaniindiaCooper Leong @cooperleong22
97 Followers 1K FollowingHailey Schoelkopf @haileysch__
3K Followers 811 Following she/her | research scientist @aiEleuther | LLM training/infra, eval, data | LM Evaluation Harness maintainerHugging Face Status @hf_status
453 Followers 0 FollowingOpenAI @OpenAI
3.4M Followers 0 Following OpenAI’s mission is to ensure that artificial general intelligence benefits all of humanity. We’re hiring: https://t.co/dJGr6LgzPAAaron Defazio @aaron_defazio
6K Followers 362 Following Research Scientist at Meta working on optimization. Fundamental AI Research (FAIR) teamMaisa @maisaAI_
3K Followers 3 Following Maisa abstracts the complexities of AI development. Powered by KPU, the most advanced reasoning system for LLMs that overcomes their intrinsic limitations.Groq Inc @GroqInc
45K Followers 468 Following Creator of the LPU™ Inference Engine, providing the fastest speed for AI applications, designed & engineered in N. America https://t.co/DsEqVAC5DpCognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqPhysical Intelligence @physical_int
4K Followers 8 Following Physical Intelligence (Pi), bringing AI into the physical world.Jim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Michael Zhang @mzhangio
1K Followers 426 Following CS PhD Student @hazyresearch, @StanfordAILab. Robustness. Foundations of foundation models. Want to make them less shaky.SambaNova Systems @SambaNovaAI
3K Followers 713 Following We bring #AI innovations developed in advanced research to organizations around the world. Sign up for updates to stay ahead of AI: https://t.co/bGeeh5JSt0EleutherAI @AiEleuther
19K Followers 76 Following A non-profit research lab focused on interpretability, alignment, and ethics of artificial intelligence. Creators of GPT-J, GPT-NeoX, and VQGAN-CLIPDemi Guo @demi_guo_
22K Followers 693 Following Co-founder & CEO @pika_labs | ex @StanfordAILab @Harvardcat with confusing au.. @Cat_Auras
2.3M Followers 33 Following Even cat can confuse “us”. | dm for credit or removalVipul Ved Prakash @vipulved
5K Followers 841 Following Building an AI supercomputer out of spare internet parts. Founder, CEO @togethercomputeMarques Brownlee @MKBHD
6.2M Followers 472 Following Web Video Producer | ⋈ | Pro Ultimate Frisbee Player | Host of @WVFRM @TheStudioTogether AI @togethercompute
27K Followers 303 Following The future of AI is open-source. Let's build together.hazyresearch @HazyResearch
7K Followers 1K Following A research group in @StanfordAILab working on the foundations of machine learning & systems. https://t.co/JHK58TDorG Ostensibly supervised by Chris RéYang Song @DrYangSong
10K Followers 886 Following Leading the Strategic Explorations team @OpenAI. Score-Based Models. Diffusion Models. Consistency Models.Rivers Have Wings @RiversHaveWings
31K Followers 224 Following AI/generative artist. Writes her own code. Absolute power is a door into dreaming.Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistStability AI @StabilityAI
189K Followers 31 Following We are building the foundation to activate humanity's potential.daniel bashir @spaniel_bashir
694 Followers 219 Following applied typist (ml engineer), chief shenanigans officer @gradientpub nuggets @Last_Week_in_AI bad writing https://t.co/e2f47gN1JtRoss Wightman @wightmanr
18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.Lucas Beyer (bl16) @giffmana
56K Followers 443 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Tri Dao @tri_dao
18K Followers 364 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Sara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Andrej Karpathy @karpathy
978K Followers 904 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Beidi Chen @BeidiChen
6K Followers 350 Following Asst. Prof @CarnegieMellon, Visiting Researcher @Meta, Postdoc @Stanford, Ph.D. @RiceUniversity, Large-Scale ML, a fan of Dota2.Chulin Xie @ChulinXie
627 Followers 661 Following CS PhD student at UIUC and student researcher @GoogleAI; Ex research intern @MSFTResearch @NvidiaAIStanislav Fort ✨�.. @stanislavfort
10K Followers 6K Following AI @GoogleDeepMind | Stanford PhD in AI & Cambridge physics | ex-{Anthropic, Stability, Google Brain} | techno-optimism+alignment+progress+growth 🇺🇸🇨🇿AK @_akhaliq
309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxOriol Vinyals @OriolVinyalsML
166K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.Kosta Derpanis @CSProfKGD
48K Followers 198 Following #CS Associate Prof @YorkUniversity, #ComputerVision Scientist Samsung #AI, @VectorInst Faculty Affiliate, TPAMI AE, #CVPR2024/#ECCV2024 Publicity Co-chairAI Conference DL Coun.. @DlCountdown
16K Followers 11 Following Bot. I daily tweet progress towards machine learning and computer vision conference deadlines. Maintained by @chriswolfvision.Greg Yang @TheGregYang
53K Followers 661 Following Cofounder https://t.co/SpHbO7FZNV. Morgan Prize Honorable Mention 2018. Developing the theory of #TensorPrograms and the practice of scaling #neuralnetworks.Xikun Zhang @xikun_zhang_
551 Followers 56 Following #computerscience Ph.D. at @Stanford advised by @AaronNewmanLab and @Prof_Lundberg. #AI #MachineLearning #singlecell and #spatial omicsYibo Jacky Zhang @Ybo_Z
65 Followers 245 Following PhD student at Stanford University interested in machine learning from fundamental perspectives.Boxin Wang @wbx_life
540 Followers 462 Following Research Scientist at NVIDIA @nvidia. UIUC Ph.D. @IllinoisCS in Trustworthy and Scalable LLM. Previously at MSR @MSFTResearch, Google Research @googleai.Secure Learning Lab (.. @uiuc_aisecure
937 Followers 288 Following We are a computer science research group led by Bo Li at UIUC, focusing on responsible and trustworthy machine learning.🫡 Heard and understood. 💪 We are working hard to build out more for the developers creating applications Powered by Groq. Let's go!🚀
Little tech really needs to get a lot more aggressive against big tech. They're going for full control right now.
The true impact of #GenerativeAI hinges on responsible application. We prioritize fairness, transparency, #security, and #dataprivacy in our generative #AI applications. Explore our commitment to #ResponsibleAI: sambanova.ai/blog/responsib… #LLM #SovereignAI
Thanks @AndrewYNg. 100k+ developers using @GroqInc share the need for speed and latency for the reasons you mentioned. Very exciting times for the industry!
Much has been said about many companies’ desire for more compute (as well as data) to train larger foundation models. I think it’s under-appreciated that we have nowhere near enough compute available for inference on foundation models as well. Years ago, when I was leading teams…
Should you do research in a AI startup? Does it burn someone else's money and your equity? Or is it the key to success? If you do it, how do you manage it? Drawing on experiences at Xerox PARC, Amazon, Body Labs, & @meshcapade, I try to shed some light: medium.com/@black_51980/s…
Excited to present our latest research: 🦘LayerSkip! huggingface.co/papers/2404.16… We run a subset of earlier layers of an LLM, & verify/correct using the remaining layers, to achieve upto 🚀2.16x speedup on Llama 7B @AkshatS07 @bilgeacun @bwasti @Ahhegazy77 @BeidiChen @CarolejeanWu
AI NEWS: Elon Musk is reportedly raising $6 billion for xAI at a valuation of $18 billion. Plus, huge developments from SenseTime, Sanctuary AI, Adobe, Apple, Tesla Optimus, and Cognition Labs' Devin. Here's everything going on in AI right now:
Exclusive: OpenAI has been developing a web search service in its latest challenge to Google. theinformation.com/articles/opena… By @aaronpholmes
Your startup is an OpenAI wrapper OpenAI is a Nvidia wrapper Nvidia is a TSMC wrapper TSMC is an ASML wrapper ASML is a Zeiss & Trumpf wrapper Zeiss is a glass wrapper Glass is a sand wrapper Sand is an erosion wrapper Erosion is an entropy wrapper Conclusion: Invest in entropy.
@AndrewYNg Thanks for the shout out, Andrew. We are focused on improving inference throughput in tokens per second for the reasons you describe. Also pushing hard to make training and fine-tuning significantly more efficient. Same platform - training, fine-tuning and high speed…
Much has been said about many companies’ desire for more compute (as well as data) to train larger foundation models. I think it’s under-appreciated that we have nowhere near enough compute available for inference on foundation models as well. Years ago, when I was leading teams…
People haven't learned yet that they shouldn't trust cherry-picked demos in AI (both fake or engineered)? x.com/ClementDelangu…
Remember that 'air head' video made with Sora? Turns out it used a ton of rotoscoping and manual VFX. A 'head' would pop back on, and the balloon colors would keep changing from generation to generation. TL;DR researchers and developers of generative AI tools really need to…
OpenAI is a Nvidia wrapper Nvidia is a TSMC wrapper TSMC is an ASML wrapper ASML is a Zeiss wrapper Congratulations everyone you just discovered how a technologically advanced economy operates.
❓Wanna host a Llama2-7B-128K (14GB weight + 64GB KV cache) at home🤔 📢 Introducing TriForce! 🚀Lossless Ultra-Fast Long Seq Generation — training-free Spec Dec! 🌟 🔥 TriForce serves with 0.1s/token on 2 RTX4090s + CPU – only 2x slower on an A100 (~55ms on chip), 8x faster…
Yes! the same way all tech companies write their own code, all AI companies will train, optimize, run their own models (instead of out-sourcing AI to other companies through APIs).
Come work with us for 6 months on synthetic data research! Reach out to amazing @ecats_ if you are interested 🔥
Glad to join the startup panel at @UCBerkeley! Grateful for the invite @ASUC_Berkeley @ACE_Berkeley. It was an enriching experience to share my startup journey with DeepMusic and @SambaNovaAI, and learn from other panelists and the audience 🚀 #BerkeleyInnovates #LLM
" LLAMA3 still suffers non-negligent degradation in these scenarios, especially in ultra-low bit-width. " Very interesting paper in the Large Language Model space named "How Good Are Low-bit Quantized LLAMA3 Models? An Empirical Study" 📌 This research dives deep into…
Moderna has deployed 400 GPTs for things like contract checking & research. I spoke to a couple other Fortune 1000 firms that have also deployed GPTs internally. I am actually a bit surprised at how much internal experimentation is happening.
Some remarkable points: 100% adoption of ChatGPT in the legal team. Integration into core processes like dose distribution. A ton of custom GPTs for specific work flows. Scaling up necessitates AI assistance in lieu of hiring 100.000 employees. youtu.be/t3UHnKLVS1M?si…
🤝😍
First @nvidia DGX H200 in the world, hand-delivered to OpenAI and dedicated by Jensen "to advance AI, computing, and humanity":