zhou su @suhmily
Joined November 2014-
Tweets42
-
Followers32
-
Following358
-
Likes21
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
Parse json Response Twitter api ObjectiveC IOS | Code Germs codegerms.com/parse-json-res…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
#SocialVideoHelper# github.com/liu044100/Soci…
Lara Rutkowski @RutkowsLa
39 Followers 5K FollowingRayna Zanes @RZanes14528
79 Followers 5K Followingวิไลวาส.. @J4czU04CMTK9Vfr
73 Followers 1K Following ในโลกของฉัน มีกลยุทธ์การออกเดทที่จะทำให้หัวใจคุณเต้นเร็วขึ้น ดังนั้นให้ความสนใจตอนนี้! หน้าแรกอัพเดทข้อมูลการติดต่อOk Behring @behri_o
25 Followers 5K FollowingJuno Langloss @JLanglos
36 Followers 5K FollowingRikki Carkhuff @CarkhuffRi20922
77 Followers 5K FollowingMarget Lockett @LockMarge
76 Followers 5K FollowingMonnie Wildrick @MonnWildri
73 Followers 5K FollowingKaitlyn Mccord @kaitly_mcc
34 Followers 5K FollowingBeverly Kwek @bever_kw
31 Followers 5K FollowingSianna Clickner @SianClickn
43 Followers 5K FollowingNakesha Vizcarrondo @NakeshaViz21946
88 Followers 5K FollowingIsabell Tonge @IsabellTon27162
59 Followers 5K FollowingElla-mai Dipiano @DipianoMai61051
83 Followers 5K FollowingNinfa @hooksninfa25
151 Followers 3K FollowingLeia Montello @LeiaMontel93316
80 Followers 5K FollowingKeva Denapoli @DenapoKe
58 Followers 5K FollowingFatima Straseskie @FatiStraseski
49 Followers 5K FollowingElizabeth_Robi @ElizabethR31600
19 Followers 2K FollowingRebekah Uncapher @RebeUncap
32 Followers 5K FollowingZarko Grbic @zarko_g13
64 Followers 4K FollowingSofia Baxter @melbjane
72 Followers 2K FollowingJhalak @JhalakGoyal
15 Followers 106 FollowingDont Trip @DontKingpin77
2 Followers 44 FollowingOriol Vinyals @OriolVinyalsML
167K Followers 82 Following VP of Research & Deep Learning Lead, Google DeepMind. Gemini co-lead. Past: AlphaStar, AlphaFold, AlphaCode, WaveNet, seq2seq, distillation, TF.kingseeker peter fram.. @revhowardarson
15K Followers 719 Following god created him and then demanded that he dieT. Wouters @Yhg1s
2K Followers 146 Following Cat owner, Googler, Python Steering Council and @ThePSF Board, Release Manager for Python 3.12 and 3.13. (He/him or they/them.) Masto: @[email protected]Tesla Bot Journal @TeslaBotJournal
8K Followers 222 Following Chronicles of Optimus, and Other Humanoid Robots: Technology, Business, and Social DynamicsLilian Weng @lilianweng
95K Followers 148 Following Working on AI safety, past on robotics, applied research @OpenAI; Writing ML blogs to help myself & others to learn; Ideas my own.Songlin Yang @SonglinYang4
2K Followers 2K Following PhD student @MIT_CSAIL. Prev. @ShanghaiTechUni @SUSTechSZ. Working on scalable and principled methods in #ML & #NLProc. INTP | 5w4 | sx/sp | she/herDan Fu @realDanFu
4K Followers 176 Following CS PhD Candidate at Stanford, systems for machine learning. Sometimes YouTuber/podcaster. Academic Partner, @togethercompute.Simran Arora @simran_s_arora
2K Followers 212 Following CS PhD student at @StanfordAILab @hazyresearchKenneth Li @ke_li_2021
714 Followers 418 FollowingXin Wang @xinw_ai
4K Followers 991 Following Senior Researcher @MSFTResearch. PhD from @Berkeley_EECS. #artificialintelligence #LLM #multimodalEric Wallace @Eric_Wallace_
6K Followers 1K Following Researcher at OpenAI working to make language models more trustworthy, secure, and private.Leandro von Werra @lvwerra
6K Followers 310 Following Machine learning @huggingface: co-lead of @bigcodeproject and maintainer of TRL.Sharan Narang @sharan0909
2K Followers 254 Following LLMs and AI Research (Llama 2 & 3 lead) @Meta | ex @Google (PaLM lead, T5), ex @Baidu (Deep Speech 2, Sparse Neural Networks), ex @NvidiaSergey Edunov @edunov
948 Followers 103 Following Director of Engineering @ GenAI, Meta. I work on LlamasHugo Larochelle @hugo_larochelle
113K Followers 626 Following Google DeepMind researcher, machine learning professor, ex-Twitter Cortex, father of 4, wine/music/comedy enthusiastAleksandra Faust @AleksandraFaust
2K Followers 515 Following Research Scientist with Google @Deepmind. Previously, @GoogleAI in #GoogleBrain. @Waymo, @SandiaLabs, @UNM, @UIUC.Feryal @FeryalMP
9K Followers 2K Following Staff Research Scientist @DeepMind & Board of Directors @WiMLworkshop.Stephanie Chan @scychan_brains
3K Followers 2K Following Senior Research Scientist at DeepMind. Artificial and biological brains 🤖 🧠 Views are my ownBernd Bohnet @bohnetbd
1 Followers 1 FollowingLei M. Zhang @l32zhang
267 Followers 384 Following Senior Research Scientist @GoogleDeepMind Prev: @OpenAI @nanoleaf @BellLabs PhD from @UofTRishabh Agarwal @agarwl_
6K Followers 549 Following Senior Research Scientist, @GoogleDeepMind, ex-🧠. Agents that make decisions. NeurIPS Best Paper (RLiable). Mila, IIT Bombay.Avi Singh @avisingh599
2K Followers 1K Following Making LLMs a little smarter @GoogleDeepMind. Previously worked on robots. Ask for my strava and goodreads :)Shunyu Yao @ShunyuYao12
7K Followers 857 Following Language agents (ReAct, Reflexion, Tree of Thoughts) for digital automation (WebShop, SWE-bench, SWE-agent)Tim Cook @tim_cook
14.9M Followers 70 Following Apple CEO Auburn 🏀 🏈 Duke 🏀 National Parks 🏞️ “Life's most persistent and urgent question is, 'What are you doing for others?'” - MLK. he/himBrandon Smith @BSmith_Esports
120K Followers 2K Following @EASportsFCPro Commentator🎤 | #EAFC Creator w/ 2M + followers | @thefull90pod 🎙️ | @turtlebeach & @elgato partner | 📩 Business: [email protected]Rodney Niya @Rodneyniya
1K Followers 91 Following Brand Strategist & Eternal Optimist | ⚡️Tesla⚡️Owner/Fanatic | 🚘FSDBeta🚘 Tester | ❤️Compassion❤️ drives real changeBoston Dynamics @BostonDynamics
315K Followers 0 FollowingSawyer Merritt @SawyerMerritt
664K Followers 315 Following Co-Founder of @TwinBirchUSA | sustainable lifestyle apparel. $TSLA investor. EV/tech news. My posts aren’t financial advice.Ahmad Al-Dahle @Ahmad_Al_Dahle
4K Followers 53 Following #Girldad of twins. Leading GenAI @ Meta (llama, imagine, meta ai and more)亚洲金融 Asia Fin.. @AsiaFinance
267K Followers 383 Following 亚洲金融:政经领域的有意思的事儿。 AsiaFinance: For the Rich and Powerful.Yisong Yue @yisongyue
19K Followers 3K Following Machine Learning @Caltech (@YueLabCaltech). AI for invention at @AsariAILabs. Autonomous Driving at https://t.co/riZHAmvcAr. Senior Program Chair @iclr_conf.Kaiyu Yang @KaiyuYang4
2K Followers 775 Following Postdoc @Caltech CMS. Previously: @PrincetonCS, @Tsinghua_Uni. https://t.co/KZiCELQI2DHaotian Liu @imhaotian
6K Followers 397 Following building intelligence @xAI, creator of #LLaVA, cs @UWMadison, prev @MSFTResearchKevin Patrick Murphy @sirbayes
42K Followers 334 Following Research Scientist at Google Brain / Deepmind. Interested in Bayesian Machine Learning.Zhiqing Sun @EdwardSun0909
2K Followers 1K Following CS PhD @LTIatCMU working on scalable alignment. BS @PKU1898Hongyu Ren @ren_hongyu
3K Followers 594 Following Research Scientist @openai. CS PhD @stanford. Previously @apple, @googleai and @nvidiaai. I train language models.Matt Shumer @mattshumer_
51K Followers 1K Following CEO @HyperWriteAI, @OthersideAI - I make AIs do the impossible.Eric Steinberger @EricSteinb
7K Followers 478 Following Writing code that writes code on a mission to build safe superintelligence | CEO/cofounder @magicailabsAI at Meta @AIatMeta
532K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.Andrew Carr (e/🤸) @andrew_n_carr
15K Followers 3K Following science @getcartwheel AI writer @tldrnewsletter advisor @arcade_ai Past - Codegen @OpenAI, Brain @GoogleAI, world ranked Tetris playerThe students of @elonmusk's private school, Ad Astra, getting a lecture about first principles
Jensen Huang and NVIDIA's success is based on the AI revolution
This is Elon Musk's most inspiring speech that you'll find on X today.
Mistral open-sourced their tokenizer! Pumped to see that it uses Ruff. github.com/mistralai/mist…
NEWS: After 10 years, Boston Dynamics has announced it is retiring its hydraulic humanoid robot, HD Atlas. They released a farewell video (below). It's unclear what the company might do next.
I’ll be sharing more on Llama 3 very soon. It’s so cool to see what the community is already building with Llama 2 though. One of my favorites: @team_qanda & @upstageai used it to build a math-specific LLM to make personalized learning more accessible! ai.meta.com/blog/llama-2-m…
🧙♀️We not only opensource the models, but also share you how we reach that! 🚀So now let's verify step-by-step to review the whole training method of WizardLM-2 together: -------- 🧵 -------- 👉Motivation First: As the natural world's human-generated data becomes…
🔥Today we are announcing WizardLM-2, our next generation state-of-the-art LLM. New family includes three cutting-edge models: WizardLM-2 8x22B, 70B, and 7B - demonstrates highly competitive performance compared to leading proprietary LLMs. 📙Release Blog:…
Remember, OpenAI, Anthropic and DeepMind are training 10x bigger models than Opus
I will give a talk in Stanford NLP Group this Thursday (4/11) 11am PT. Welcome to join!
For this week’s NLP Seminar, we are thrilled to host @tydsh to talk about "Demystifying Attention Mechanism in Transformer and its application for Large Language Models"! When: 04/11 Thurs 11am PT Non-Stanford affiliates registration form (closed at 9am PT on the talk day):…
Flan-2 is published in JMLR jmlr.org/papers/v25/23-…. I think it's a nice piece of history. The work scaled instruction tuning with respect to model size and finetuning tasks, which both improved performance. Our MMLU was 75%, SOTA when the paper came out in Oct 2022. Our…
Once you have the forward/backward, the rest of it (data loader, Adam update, etc) are mostly trivial. The real fun starts now though: I am now porting this to CUDA layer by layer so that it can be made efficient, perhaps even coming within reasonable fraction of PyTorch, but…
A Mamba Primer (w/ Yair Schiff youtube.com/watch?v=dVH1dR… ) Mamba is a nice jumping off point to summarize foundational ideas in sequence modeling, parallel algorithms, continuous-time representations, and GPU aware algorithms. We try to put these together in the context of LMs.
On-Device 2B LLMs for actions, outperform GPT-4 🤯 The “Octopus v2: On-device language model for super agent” proposes a new method to create on-device agents. 📱🔄 Implementation 1️⃣ Define supported functions as special tokens, e.g. <func_1> and add them to the tokenizer 2️⃣…
For more details, I just published a writeup of my favorite recent developments in AI over the last few months. It includes all of the LLM proposals from last week (DBRX, Jamba, Qwen-MoE, etc.) as well!
Just dropped a 4 hour lecture on "Large Language Models": youtu.be/2yjzZfDQxy8 0:00 Basics of language models 2:30 Word2vec 16:27 Transfer Learning 19:23 BERT 1:00:39 T5 1:31:14 GPT1-3 1:53:05 ChatGPT 2:20:03 LLMs as Deep RL 2:53:00 Policy Gradient 3:32:50 Train your…
Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance Effectively optimizes the training mixture of a 1B model trained for 100B tokens, reaching a perf comparable to the one trained for 48% more steps on the default mixture repo:…
Spoke to a Microsoft engineer on the GPT-6 training cluster project. He kvetched about the pain they're having provisioning infiniband-class links between GPUs in different regions. Me: "why not just colocate the cluster in one region?" Him: "Oh yeah we tried that first. We…
The gpt-4 tokenizer is open source github.com/openai/tiktoke… If you look at the code, an interesting finding is the presence of special tokens FIM_*. This is probably for Fill-in-the-middle arxiv.org/abs/2207.14255 pretraining.
At a time where 314B parameters models are trending, come join me at #NVIDIAGTC to see what you can do with 1 or 2B parameters :-) (and coming soon, what can you do with 3B?!?)
I distilled the 49 pages of the weak-to-strong generalization paper into a 7 minutes explainer video youtube.com/watch?v=OR-vcV…