Shom @ShomLinEd
language model | sequence modeling | education | HCI Web Joined September 2021-
Tweets607
-
Followers305
-
Following2K
-
Likes29K
claude code wrapping bash usage in python subprocess calls is interesting and worrying...
True
Hire the right Chinese.
I'd like to see Meta building a lean LLM team around Narang, Allen-Zhu, Mike Lewis, Zettlemoyer and Sukhbaatar and giving them all the budget and power.
Since its fifth generation, RWKV's main progress -- outer product states, data dependent decay and delta rules -- has come only after works like RetNet, Mamba and DeltaNet with a few adjustments. I respect his efforts of training models, but he could use some more credit.
Since its fifth generation, RWKV's main progress -- outer product states, data dependent decay and delta rules -- has come only after works like RetNet, Mamba and DeltaNet with a few adjustments. I respect his efforts of training models, but he could use some more credit.
i didn't play with o3 as much but judging from my experience with claude, its love of printing probably stems from having to print out results to be collected and judged in RL loop. Its abuse of .get("key") and try catch may be caused by error penalty.
i didn't play with o3 as much but judging from my experience with claude, its love of printing probably stems from having to print out results to be collected and judged in RL loop. Its abuse of .get("key") and try catch may be caused by error penalty.
We will be presenting "APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding", a novel encoding method that enables: 🚀Pre-caching Contexts for Fast Inference 🐍Re-using Positions for Long Context Our poster session is located in Hall 3 and Hall 2B,…
We will be presenting "APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding", a novel encoding method that enables: 🚀Pre-caching Contexts for Fast Inference 🐍Re-using Positions for Long Context Our poster session is located in Hall 3 and Hall 2B,… https://t.co/oqnOCeWV7V
New Article: "Against The Achilles' Heel: A Survey on Red Teaming for Generative Models" by Lin, Mu, Zhai, Wang, Wang, Wang, Gao, Zhang, Che, Baldwin, Han, and Li jair.org/index.php/jair…
Deepseek in Jan 2025 is going through the chatgpt moment in Dec 2022. Servers going down, user base surging, rl techniques making model rise in performance.
📝Please fill in your information to get a free pass before they’re gone-only 3 days left to register! ⬇️Check the comments for the link to our questionnaire. Let’s meet and talk about innovation, AI, and opportunities! #LibrAI #AI #GITEX #FreePass #GITEX2024 #ExpandNorthStar
HOC's Fast Discrete Program Search (DPS) HOC will soon (EOY?) launch an API for our DPS solution. The interface will be simple: - You give us a set of examples (input/output pairs) - We'll give you a (Python?) function that models it And that's it. It will be an universal…
HOC's Fast Discrete Program Search (DPS) HOC will soon (EOY?) launch an API for our DPS solution. The interface will be simple: - You give us a set of examples (input/output pairs) - We'll give you a (Python?) function that models it And that's it. It will be an universal…
Tired: transformer captures long term dependency Wired: fractal exhibits long term dependency Inspired: Memory processes and 2D Ising models characterize long term dependency
Hear me out, universal structured format with thought tag and result tag
Just updated taxonomy & covered more papers 😄 github.com/Libr-AI/OpenRe…
Thanks @llm_sec for covering our paper! This is a fast growing field and we want to make sense of the LLM security landscape. Big update in github repo incoming with new taxonomy and more papers covered😄 #AISafety #jailbreak
Thanks @llm_sec for covering our paper! This is a fast growing field and we want to make sense of the LLM security landscape. Big update in github repo incoming with new taxonomy and more papers covered😄 #AISafety #jailbreak

Saber Darabi @SADarabi
322 Followers 7K Following
Elizabeth @kendall21elizab
302 Followers 3K Following
DigitalRiver @rndqrhfstb58063
4 Followers 385 Following 当社は、会社の委託を受けてオンラインの在宅ワークスタッフを募集しています。 💵 每日報酬:6000円~5万円 、給与は日払いです 🔍勤務時間:30分から80分 問い合わせ担 当者LINE追加:【https://t.co/enRovJSZRD】
Mason Wang @masonwang025
755 Followers 387 Following cs @stanford. prev cto @tilderesearch & research @stanfordnlp.
Thibaut Boissin @ThibautBoissin
249 Followers 203 Following
Qian Liu @sivil_taram
4K Followers 743 Following Researcher @ TikTok 🇸🇬 📄 Sailor / StarCoder / OpenCoder 💼 Past: Research Scientist @SeaAIL; PhD @MSFTResearch 🧠 Contribution: @XlangNLP @BigCodeProject
Junxuan Wang @JunxuanWang0929
73 Followers 71 Following PhD student, Fudan University, Interpretability
Shangbin Feng @shangbinfeng
4K Followers 2K Following PhD student @uwcse @uwnlp. Model collaboration, social NLP, networks and structures. #水文学家
Men1scus @Men1scus
173 Followers 4K Following Junior@Nankai University | Major in CS | Research in CV, GenAI | Full Stack Developer | Beginner in Crypto | Runner, Cyclist, Gym-goer | Rap enthusiast
JingyuanLiu @JingyuanLiu123
1K Followers 398 Following https://t.co/D7zLeTZRMh is all you need | Opinions are my own
Awsatui @Awsatui48400
33 Followers 1K Following
Nathan Chen @nathancgy4
663 Followers 567 Following @tilderesearch trying to (pragmatically) understand my friend, ml & open-source, 16
Xiaosen Zheng @xszheng2020
601 Followers 2K Following Researcher @ TikTok 📄 RegMix 💼 Past: PhD @sgSMU | Intern @SeaAIL 🧠 Interests: Data-Centric AI | Code AI
Zeyuan Allen-Zhu, Sc.... @ZeyuanAllenZhu
20K Followers 452 Following physics of language models @ Meta (FAIR, not GenAI) 🎓:Tsinghua Physics — MIT CSAIL — Princeton/IAS 🏅:IOI x 2 — ACM-ICPC — USACO — Codejam — math MCM
Wangchunshu Zhou @wangchunshu
3K Followers 2K Following Building personal superintelligence @OPPO, previously @AIWaves_inc. Former CS PhD student at ETHZ. Former researcher at ByteDance, Intern at MSRA and PYI at AI2
hear hill @HearHills
106 Followers 2K Following
Zhixuan Lin @zhxlin
377 Followers 632 Following PhD student at @Mila_Quebec and @UMontreal. Working on (linear complexity) long-context sequence models and RL.
Zhanpeng Zhou @zhanpeng_zhou
274 Followers 382 Following Ph.D. candidate @sjtu1896 | Exploring the theoretical foundations of deep learning.
socalled @theSoCalled_
246 Followers 7K Following
Zhang Ruichong @ZhangRuichong
53 Followers 168 Following
Bowen Li @BowenLi2121
182 Followers 182 Following 🤔 NLP Researcher at Shanghai AI Lab. Large Language Models, Semantic Parsing
michielh.eth @michieldoteth
5K Followers 3K Following 25 | Building @4Mlabs | Sharing insights on Business & AI | Tweets are my opinions.
Yifan Zhang @yifan_zhang_
325 Followers 355 Following Ph.D. student at Princeton University, focusing on LLMs.
Snoarhabit @sonarforce
42 Followers 134 Following college student who majors in math | Anki user | star wars fan | studying English with LingQ
Xinyu Yang @Xinyu2ML
998 Followers 984 Following Ph.D. @CarnegieMellon. Working on data and hardware-driven principled algorithm & system co-design for scalable and generalizable foundation models. They/Them
Daniel Sosebee @dnsosebee
257 Followers 1K Following At @recursecenter 🐙, learning automated learning & creating automated creativity ➿, playing piano 🎹, governing @sneaky_town ♟️
Rasurs @RasursooMSX
55 Followers 886 Following
𝗛𝗔𝗥⚡︎�... @harsha_gv
26 Followers 2K Following Namaste ★✨ Cybersecurity | Cloud DevSecOps Engineer✨ Passionate about programming and security✨ Design Thinker✨ @vhsindia member✨ Love All, Serve All ♡✨
Calc Consulting @CalcCon
4K Followers 2K Following Calculation Consulting is a boutique consultancy that specializes in machine learning, AI, and data science
Honglin Mu @honglin_mu
6 Followers 123 Following
Neesceau @Neesceauzf3
59 Followers 3K Following
Ondřej Čertík @OndrejCertik
1K Followers 299 Following At @Microsoft, previously @gsitechnology, @LosAlamosNatLab. Original author of @SymPy, SymEngine, @LFortranorg, LPython, co-founder of @fortranlang org.
Lifan Yuan @lifan__yuan
2K Followers 137 Following PhD student @uiuc_nlp @GoogleDeepMind. Prev: @TsinghuaNLP
Edward Z. Yang @ezyang
14K Followers 1K Following I work on PyTorch at Meta. Chatty alt at @difficultyang.
hud @hud_evals
1K Followers 6 Following RL environments + evals for agents | @ycombinator | we're hiring!
Ofir Press @OfirPress
15K Followers 6K Following I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
Yanzhe Zhang @StevenyzZhang
480 Followers 237 Following 张彦哲, Computer Science Ph.D. student @ICatGT @GeorgiaTech @SALT_NLP Previously Intern @AdobeResearch CS undergrad @ZJU_China
Ben @SolidlySheafy
274 Followers 341 Following Understanding intelligence @tilderesearch // prev math @Penn and @Cambridge_Uni
Mason Wang @masonwang025
755 Followers 387 Following cs @stanford. prev cto @tilderesearch & research @stanfordnlp.
Ai2 @allen_ai
73K Followers 409 Following Breakthrough AI to solve the world's biggest problems. › Join us: https://t.co/MjUpZpKPXJ › Newsletter: https://t.co/k9gGznstwj
Thibaut Boissin @ThibautBoissin
249 Followers 203 Following
Zeyi Sun @sunzeyi6
43 Followers 167 Following Research Intern in Shanghai AI Lab in CV PhD student in SJTU
泓君Jane @hongjun60
5K Followers 448 Following Founder of Valley 101(硅谷101)|Podcaster @thevalley101 @web3_101 https://t.co/dBbudITzrA https://t.co/RgIxFn11wx https://t.co/QQ49UEhkSp
Junxuan Wang @JunxuanWang0929
73 Followers 71 Following PhD student, Fudan University, Interpretability
Guangxuan Xiao @Guangxuan_Xiao
3K Followers 697 Following Ph.D. student at @MITEECS Prev: CS & Finance @Tsinghua_Uni
Ian Goodfellow @goodfellow_ian
346K Followers 1K Following DeepMind Research Scientist. Opinions my own. Inventor of GANs. Lead author of https://t.co/M6vl8pEQ4I Founding chairman of @pubhealthaction
Shangbin Feng @shangbinfeng
4K Followers 2K Following PhD student @uwcse @uwnlp. Model collaboration, social NLP, networks and structures. #水文学家
Xander Chin @XanderChin
1K Followers 406 Following eng @westernu @schulichleaders | building and learning for fun
Huazi @HeyHuazi
2K Followers 472 Following 👨🎨UI/UX 设计师|⛱️GAP 中|🦲无业难民|✨ 对一切保持好奇|🧩矢量Logo收集站→ https://t.co/LpAWVDEj2Y|🎙️播客《设计漫谈》|📰 写《设计漫步周刊》→https://t.co/7TbNPjMtxe
Igor Babuschkin @ibab
103K Followers 851 Following Maybe the real ASI was the friends we made along the way. Co-founder @xAI, Research & Engineering
阿西_出海 @axichuhai
22K Followers 193 Following 🚀关注AI、LLM、MCP、AI图像视频 (Interested in AI,LLM,MCP,Stable Diffusion) 💡推特自媒体副业专栏:https://t.co/wM2OB8OuYB | 推特运营咨询 | 商务合作详见↓↓
Shuchao Bi @shuchaobi
13K Followers 687 Following Research @Meta Superintelligence Labs, RL/post-training/agents; Previously Research @OpenAI on multimodal and RL; Opinions are my own.
Humanloop @humanloop
10K Followers 533 Following Humanloop is the LLM evals platform for enterprises. Trusted by Gusto, Vanta and Duolingo to ship reliable AI products.
Yi Wu @jxwuyi
1K Followers 103 Following AI/RL researcher, Assistant Prof. at @Tsinghua_Uni, leading the RL lab at @AntResearch_, PhD at @berkeley_ai, frequent flyer and milk tea lover.
John Schulman @johnschulman2
65K Followers 1K Following Recently started @thinkymachines. Interested in reinforcement learning, alignment, birds, jazz music
Samuel Timbó @io_sammt
8K Followers 350 Following Software Engineer - Researcher - Builder - Founder - Creator of Unit - https://t.co/nGnstCPiPD
Wenhao Chai @wenhaocha1
2K Followers 2K Following Ph.D. Student @PrincetonCS. Prev @Stanford @UW @pika_labs @MSFTResearch @UofIllinois @ZJU_China. I used to work on computer vision, but it's not all I do.
Shuangfei Zhai @zhaisf
2K Followers 98 Following Research Scientist & Manager, Machine Learning Research @ Apple
cat @_catwu
39K Followers 354 Following claude code pm @anthropicai prev: @indexventures, @dagster, @scale_ai
Jason Lee @jasondeanlee
18K Followers 4K Following Associate Professor at UC Berkeley. Former Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning.
Prince Canuma @Prince_Canuma
7K Followers 1K Following Apple MLX King 🤴🏽• ML Research Engineer👨🏾💻 • VLMs • LLMs • Speaker • Writer • Ex-@arcee_ai • @neptune_ai • https://t.co/iZnxoefJBU
Zach Mueller @TheZachMueller
12K Followers 591 Following Let's make billions of parameters go brr https://t.co/rUxXIfNpwh
Dimitris Papailiopoul... @DimitrisPapail
20K Followers 1K Following Researcher @MSFTResearch, AI Frontiers Lab; Prof @UWMadison (on leave); learning in context; thinking about reasoning; babas of Inez Lily.
Eigen AI @Eigen_AI_Labs
616 Followers 21 Following Built by researchers and engineers from MIT, we are pursuing Artificial Efficient Intelligence (AEI). Try GPT-OSS support: https://t.co/BQfsnXIGFo.
Jiayi Weng @Trinkle23897
3K Followers 141 Following MTS @openai, author of the entire post-training RL infra, core contributor of ChatGPT/GPT4/GPT4o etc. 30U30
James Chen @jchencxh
861 Followers 507 Following mostly representation learning for vision and generalisation @SCSatCMU hot takes on everything else