Andrew Siah @theandrewsiah
Working with kind, curious and smart people on cutting edge problems. And petting many dogs in the process. @Columbia
andrewsiah.com
New York
Joined September 2010
Tweets 192
Followers 136
Following 659
Likes 596
We organized IvyHacks.ai for the greatest students and gathered awesome researchers in industry @GoogleDeepMind @DbrxMosaicAI @AnthropicAI and more. Also started @newyorkailabs an open community for AI research and discourse! Follow us for more events!
🚨New🌟blog✍️ on ⏩ maximizing🌙 FLOPS 🚀 Training large models requires maximizing flops/GPU, especially at scale. Excited to share a few of the cool tricks in thread👀. 1/N
DBRX shows you can have EFFICIENCY and PERFORMANCE: DBRX has 16 experts (most MOEs have 8) with 4 activated per token and uses only 36B params at a time. With training efficiency boosted by MegaBlocks arxiv.org/pdf/2211.15841…, it outperforms GPT-3.5 and other open-source models.
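A rough way to see what "16 experts, 4 activated" buys: count how many distinct expert subsets a token can be routed to. The sketch below assumes the "most MoEs" baseline means 8 experts with 2 active per token, which the tweet does not state; the function name is illustrative only.

```python
from math import comb


def expert_combinations(num_experts: int, active_per_token: int) -> int:
    """Number of distinct expert subsets a router can pick for one token."""
    return comb(num_experts, active_per_token)


# DBRX configuration as described in the tweet: 16 experts, 4 active.
dbrx_combos = expert_combinations(16, 4)

# Assumed baseline (not stated in the tweet): 8 experts, 2 active.
baseline_combos = expert_combinations(8, 2)

# How many times more routing choices the finer-grained setup allows.
ratio = dbrx_combos / baseline_combos

print(dbrx_combos, baseline_combos, ratio)
```

Under that assumption the finer-grained setup has 1820 possible subsets against 28, a 65x difference.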
yes I finally wrote about it. defo my longest article so far. as usual giving access to some who RT reference: @__paleologo @KrisAbdelmessih @AgustinLebron3
Have you ever done a dense grid search over neural network hyperparameters? Like a *really dense* grid search? It looks like this (!!). Blueish colors correspond to hyperparameters for which training converges, reddish colors to hyperparameters for which training diverges.
@karpathy To anyone curious, my account focuses on creating diagrams to visualize the internal components of deep learning algorithms. I've recently made a guide for Mixtral, which shows how my diagrams and implementations are closely related! github.com/vtabbott/Neura…
Super simple code change to get value-based deep RL scale *much* better w/ big models across the board on Atari games, robotic manipulation w/ transformers, LLM + text games, & even Chess! Just use classification loss (i.e., cross entropy), not MSE!! arxiv.org/abs/2403.03950🧵⬇️
If you export your chat history from ChatGPT, you get the system prompt(s) for free, no jailbreaking or similar needed
We are excited to share Large World Model (LWM), a general-purpose 1M context multimodal autoregressive model. It is trained on a large dataset of diverse long videos and books using RingAttention, and can perform language, image, and video understanding and generation.
Kalamang Translation One of the most exciting examples in the report involves translation of Kalamang. Kalamang is a language spoken by fewer than 200 speakers in western New Guinea in the east of Indonesian Papua (endangeredlanguages.com/lang/1891). Kalamang has almost no online…
@drydenwtbrown @BasedBeffJezos not 501c3 but we want some more e/acc folks at our upcoming hackathon in NYC: ivyhacks.ai
Presenting on Gaussian Splatting and the Future of 3D
if you value intelligence above all other human qualities, you’re gonna have a bad time
oppenheimer but it's artificial intelligence (1/10)
Tutetirsh @tutetirsh1313
0 Followers 193 Following
Emerald Goulas @goulas6115
72 Followers 5K Following
Nancy @nancy_strode2
209 Followers 3K Following
Notee @Notee639150
0 Followers 236 Following Life itself is a journey, we are all worthy and should strive to travel to different lives.
Ellie Krzeminski @EllieKrzem6552
90 Followers 5K Following
Slurthurs @slurthurs76855
9 Followers 223 Following
Tosmea @Tosmea189321
0 Followers 198 Following
AndreaKeppel @zmx1lxCZctVo1bh
0 Followers 159 Following
uhcth1ihr46 @ht65ono09ld2
2 Followers 447 Following The team offers short-term investments in cryptocurrencies. With a rigorous plan, you can earn between $500 and $5,000. Click to join TG: https://t.co/PjwW8Jq3CX
Golda Ruhenkamp @GRuhenka
52 Followers 5K Following
McThesea @mc_thesea56847
160 Followers 2K Following
Quiana Tubville @tubville34990
88 Followers 5K Following
qw8tbkj6ocbpe @3fqcwyhiifci5n
2 Followers 179 Following 【coinsrw . com 】User**me:Rom88 , P*****rd:R 66888 Bal:4,289,287,11 U.S.D.T
Brittani Brinson @BrinsoBritta
49 Followers 5K Following
Trinity Mowles @TrinMowles
50 Followers 5K Following
Rosie Moncion @MonciRos
51 Followers 5K Following
Daria Edin @DariaEdin90187
58 Followers 5K Following
Darcel Kesselman @DarKesselm
47 Followers 5K Following
madhu @madhu___s
425 Followers 550 Following having fun taking ideas seriously🫧 || building a proto-university and the nyc tech & science scene https://t.co/xaB7RSdmRZ
Tessa Mondria @tessamondria
2K Followers 4K Following Data Science (QMSS) master’s student @columbia & "la Caixa" Postgraduate Fellow @becariosFLC | Former Product manager @igeneris | Media, tech & public value
Mert Bozkir @mertbozkirr
1K Followers 2K Following Relentless journey of GenAI and Growth! ⚡️ https://t.co/3DJtw5gsIn
Ben (e/sqlite) @andersonbcdefg
3K Followers 3K Following 🤖 Computer scientist, next-word-prediction enjoyer 📊 Prev. research fellow @ Stanford RegLab 🛠️ bUiLdiNg sOmeThiNg nEw (https://t.co/mdYPZmjSzN - YC S23) 🏳️🌈
Mina Heatwole @MiHeatwo
98 Followers 5K Following
Hanming Albert Yang @YangHanming
8 Followers 33 Following
Kellie Chamnanphony @chamnanpho46177
81 Followers 5K Following
Ember Augeri @AugeriEmbe61463
84 Followers 5K Following
Tony Chen @tonychenxyz
106 Followers 688 Following Undergrad @CUSEAS I am interested in building interpretable, efficient, human-interaction-friendly AI models.
Luca Antiga ⚡️ @lantiga
3K Followers 2K Following CTO @LightningAI // Co-founder @ Orobix · Tensorwerk // Manning author
Tara Forcier @forc_ta
34 Followers 5K Following
Laura Toyota @toyo_la
77 Followers 5K Following
Naomi Lamme @NaomLamme
84 Followers 5K Following
Arwa Beavers @ar_beave
41 Followers 5K Following
Amy Chen @iamamychen
2K Followers 2K Following Inspired by the rhythms of sidewalks. Based in NYC’s tech community, expanding opportunities w/ early-stage AI & enterprise software startups w/ Tola Capital.
Beatrice Bassali @BassaBeatr
75 Followers 5K Following
Magdalen Licause @MagdaleLicau
75 Followers 5K Following
Filip Wojda @filipwojda
184 Followers 247 Following vlogging at https://t.co/kpyJ84gLf0 building https://t.co/ziAeAFIsEd ✨
Evie-mae Hanaway @evie_hanaw
88 Followers 5K Following
Consuela Sprafka @ConsuelaSp65674
59 Followers 5K Following
Alishia Refazo @AlishRefaz
59 Followers 5K Following
AI Deeply @AiDeeply
403 Followers 5K Following AI is reshaping the world. Who are the people and companies driving the change? Visit our website to search more than 5,000 profiles.
Joshua Lim @joshua_j_lim
18K Followers 4K Following in crypto: co-founder, @arbelosxyz; head of derivatives, genesis trading; galaxy digital; circle. previous life: equity exotics GS; UBS
martin_casado @martin_casado
50K Followers 2K Following GP @ a16z ... questionable heuristics in a grossly underdetermined world
Mengdi Wang @MengdiWang10
1K Followers 266 Following Princeton professor in AIML, optimization and data science. Program Chair @ICLR2023. Formerly @MIT @GoogleDeepmind @Tsinghua
Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).
Hyperliquid @HyperliquidX
43K Followers 4 Following L1 with performant native components, including a perps DEX with 100+ assets and spot trading. Join the community: https://t.co/wfW5L1lYXY
Mike Schroepfer @schrep
104K Followers 278 Following Partner @Gigascale, Sr Fellow (Formerly CTO) @Meta, founder @AdditionalVent, . Investing in tech and science to fight climate change. AI
JD Ross @justindross
29K Followers 734 Following I take the side of people who build. Co-founded @Opendoor & @join_royal. Now building in insurance, studying energy problems 🤠
Jack Rae @drjwrae
9K Followers 356 Following Principal Scientist @ Google DeepMind Work on Gemini 💎♊ Compression is all you need LLMs (e.g. Gopher, Chinchilla, Gemini) 💼 Past: OpenAI, Quora
Sebastian Borgeaud @borgeaud_s
997 Followers 260 Following Research Engineer at DeepMind with a focus on Large Language Models and large scale Deep Learning
Albert Gu @_albertgu
9K Followers 90 Following assistant prof @mldcmu. chief scientist @cartesia_ai. leading the ssm revolution.
Wing Lian (caseus) @winglian
9K Followers 2K Following @axolotl_ai OSS maintainer. Axolotl AI founder. AI/ML tinkerer. Building tools for everyone.
Zico Kolter @zicokolter
15K Followers 499 Following Associate professor at Carnegie Mellon, VP and Chief Scientist at Bosch Center for AI. Researching (deep) machine learning, robustness, implicit layers.
Cody Blakeney @code_star
3K Followers 825 Following Head of Data Research @MosaicML / @databricks | Formerly Visiting Researcher @ Facebook | Ph.D | #TXSTFOOTBALL fan | https://t.co/4G6Jf3at5w
Durk Kingma @dpkingma
35K Followers 349 Following Deep learning, mostly generative models. Prev. Google Brain/DeepMind, founding team @OpenAI. Inventor of the VAE, Adam optimizer, among other things. ML PhD.
Deepak Pathak @pathak2206
16K Followers 316 Following I study topics in AI (machine learning, robotics & computer vision).
John Langford @JohnCLangford
9K Followers 35 Following Solving Machine Learning at Microsoft in New York. https://t.co/ZpdQV4IsHY pandemic past president. https://t.co/MkluiHpWF7 makes RL real. https://t.co/wK8xQaQGwf for thinking out loud.
ICLR 2024 @iclr_conf
41K Followers 40 Following International Conference on Learning Representations #ICLR2024. SPC is @yisongyue and GC is @_beenkim OpenReview:https://t.co/OD1sg0r7F8
Csaba Szepesvari @CsabaSzepesvari
8K Followers 704 Following "If there is not folly in the world, then the world itself is folly. You must understand that mistakes are not always regrets." - Paul Tobin, Bandette🤠
Jure Leskovec @jure
43K Followers 378 Following Professor of #computerscience @Stanford; Co-founder at https://t.co/hhm1j5wP0f #machinelearning #graphs.
Christopher Manning @chrmanning
127K Followers 116 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋
Jason Lee @jasondeanlee
10K Followers 3K Following Associate Professor at Princeton and Research Scientist at Google DeepMind. ML/AI Researcher working on foundations of LLMs and deep learning
OpenAI Developers @OpenAIDevs
72K Followers 0 Following Official @OpenAI account for anyone building on our APIs. Join us in building the future of AI. We ❤️ developers!
ICML Conference @icmlconf
70K Followers 17 Following Int'l Conf on ML • July 21-27, 2024 (Vienna, Austria) • #icml2024 • Contact: https://t.co/6saHKWV01y • https://t.co/6UpPvOXToj
RL Theory Virtual Sem.. @RLtheory
5K Followers 0 Following Virtual seminar series featuring the latest advances in theoretical reinforcement learning. Seminars (approximately) every Tuesday at 6pm UTC.
Eric @ericmitchellai
4K Followers 488 Following I like AI & music. Working on making LLMs easier & safer to use. Final year PhD student at Stanford advised by Chelsea Finn & Chris Manning.
Nan Jiang @nanjiang_cs
7K Followers 72 Following machine learning researcher, with focus on reinforcement learning. asst prof @ uiuc cs. Course on RL theory (w/ videos): https://t.co/vqVKwY4RJE
Elad Hazan @HazanPrinceton
11K Followers 187 Following machine learning and optimization @PrincetonCS & Google DeepMind Princeton, dad^3
James Bradbury @jekbradbury
11K Followers 8K Following Compute at @AnthropicAI! Previously JAX, TPUs, and LLMs at Google, MetaMind/@SFResearch, @Stanford Linguistics, @Caixin.
xulian - MultiPurr Ca.. @KingJulianIAm
8K Followers 2K Following Purring - Core contributor to @hyperliquidx 🫧
Sham Kakade @ShamKakade6
12K Followers 383 Following Harvard Professor. Full stack ML and AI. Co-director of the Kempner Institute for the Study of Artificial and Natural Intelligence.
Zeyuan Allen-Zhu @ZeyuanAllenZhu
8K Followers 273 Following physics of language models @ Meta / FAIR IOI - USACO - MCM - ACM/ICPC - Codejam Tsinghua - MIT - Princeton/IAS - MSR - FAIR
andrew (in CS teacher.. @__drewface
4K Followers 935 Following starting a university & bootcamp in NYC with my friends, for fun. join our software bootcamp this fall! let's do joyful work and build a good society ❤️
madhu @madhu___s
425 Followers 550 Following having fun taking ideas seriously🫧 || building a proto-university and the nyc tech & science scene https://t.co/xaB7RSdmRZ
Luca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)
Amy Chen @iamamychen
2K Followers 2K Following Inspired by the rhythms of sidewalks. Based in NYC’s tech community, expanding opportunities w/ early-stage AI & enterprise software startups w/ Tola Capital.
Ben (e/sqlite) @andersonbcdefg
3K Followers 3K Following 🤖 Computer scientist, next-word-prediction enjoyer 📊 Prev. research fellow @ Stanford RegLab 🛠️ bUiLdiNg sOmeThiNg nEw (https://t.co/mdYPZmjSzN - YC S23) 🏳️🌈
Lisa Wehden @lisawehden
14K Followers 1K Following Building @plymouthstreet to make US immigration fast and simple for top talent. Prev @bloombergbeta @join_ef @uniofoxford
Plymouth @plymouthstreet
2K Followers 13 Following Plymouth supports top international talent with work visas and green cards. Proudly built by immigrants for immigrants.
Piotr Pomorski @PtrPomorski
7K Followers 169 Following Senior Machine Learning Engineer | AI & Quant | PhD/CFA/CEO of Kalman Filter | Not an investment advice, views are my own | DM open 📧
stalequant @stalequant
1K Followers 207 Following
bilal2vec @bilaltwovec
2K Followers 781 Following ✨ research engineer • prev @googlebrain @cohere @dbrxmosaicai • se @uwaterloo
John Yang @jyangballin
2K Followers 450 Following CS/NLP MS student @princeton_nlp Previously @Berkeley_EECS
Chip Huyen @chipro
92K Followers 442 Following Data processing on GPUs @VoltronData Designing ML Systems: https://t.co/G81hL2dWmr @designmlsys #AI x #GPU
Every day the AGI-to-FSD arc seems more and more plausible.
Before accepting a startup job offer with equity, make sure you ask the company for: - Strike price - Last preferred share price (price investors paid) - Total number of shares - Liquidation preferences This will allow you to calculate the % of the company you own & value it
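As a rough illustration of the calculation that tweet describes, here is a minimal sketch. All numbers and function names are hypothetical, and it deliberately ignores liquidation preferences, dilution, taxes, and vesting, any of which can change the real value substantially.

```python
def ownership_fraction(your_shares: int, total_shares: int) -> float:
    """Fraction of the company your grant represents."""
    return your_shares / total_shares


def paper_spread_value(your_shares: int, strike_price: float,
                       preferred_price: float) -> float:
    """Naive paper value: (last preferred price - strike price) per share.

    Assumes common shares are worth the preferred price, which usually
    overstates their value; floors at zero if the options are underwater.
    """
    return max(preferred_price - strike_price, 0.0) * your_shares


# Hypothetical offer: 10,000 options out of 10,000,000 total shares,
# $1.00 strike price, $5.00 last preferred share price.
fraction = ownership_fraction(10_000, 10_000_000)
value = paper_spread_value(10_000, strike_price=1.0, preferred_price=5.0)

print(f"Ownership: {fraction:.3%}")
print(f"Paper value of the spread: ${value:,.0f}")
```

With these made-up figures the grant is 0.1% of the company and the spread is worth $40,000 on paper, before any of the caveats above.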
What are your predictions for L3? (read the rest of the thread for interesting experiments in pushing GPT-4 up to 86%)
(2/n) GPT-4 performs the best out of five models with 8.7% zero-shot. SOTA foundation models have plenty of room for improvement! - No model (even with inference-time techniques) solves a single platinum problem - Far short of average USACO competitor (35.8%)
now I’m actually living the *exact* life that I dreamed about few years ago. I work in my dream firm, in my dream role, and live in my dream city. back then I would’ve given up my thumbs if that’s what it would take. my finances are great. before this I would eat a heap of…
Introducing the SDK for @LightningAI Studios - Build production ML pipelines with Studios 🚀🚀 Production AI is super simple with Studios: - Overfit one Studio to a task (finetune, data prep, whatever) - orchestrate all Studios via the SDK - Studios share THE SAME filesystem…
“In many ways, the work of a critic is easy. We risk very little, yet enjoy a position over those who offer up their work and their selves to our judgment. We thrive on negative criticism, which is fun to write and to read. But the bitter truth we critics must face, is that in…
Which is the worst, most-hyped product of 2024 so far?
My favorite part about @honicky's Paper Club session this week on the 1-bit LLMs paper - relating it to @jefrankle's Beyond Chinchilla laws and adjusting the equations for the memory/latency characteristics of 1-bit LLMs to derive an optimal param count/data size to aim for. no…
Q: How can 1-bit LLMs match 16-bit LLaMAs? 2-3x faster inference and 20-40x more energy efficient and slightly better evals!! Great work by RJ for the @latentspacepod paper club - a deep dive into the 1-bit LLMs paper! We had a LOT of fun going thru the details this week and…
Studios can host web apps! 🤯🤯 Example: Chat with your documents using Command R+ from @cohere. Hit "Open in Studio" to get your own chat app with Command R+ lightning.ai/lightning-ai/s…
@iamamychen @cornell_tech @theandrewsiah @Columbia_Biz andrew is the 🐐. Can I come pet the 🐶, too?
@iamamychen @cornell_tech @theandrewsiah @Columbia_Biz ps: thanks for all the intros you made for IvyHacks 🫡
@theandrewsiah NewYorkLabs conquering SF hackathons already
Most people don't know that Lightning Studios offer: - free persistent storage - free persistent environments - unlimited background execution - VSCode, PyCharm, (any IDE) integration Set up your Studio environment once and reuse it again any time 🤯🤯
I allocated a week to work on my tax return but finished it in two hours this Monday. Is this real life?
SWE-agent is our new system for autonomously solving issues in GitHub repos. It gets similar accuracy to Devin on SWE-bench, takes 93 seconds on avg + it's open source! We designed a new agent-computer interface to make it easy for GPT-4 to edit+run code github.com/princeton-nlp/…
We've just crossed 1k Github stars and 100 Cloud e2e pipelines deployed with R2R. Excited to keep on building for our early users. Would love to help out if you want to implement a RAG solution.
I’ve been trying to find people to play this game for months in NYC. SF sending strong signals for me to move here 🚀✨
Today I saw the launch of another "open-source" AI wearable that has not published anything just to charge you 5x the cost At @MistralAI x @cerebral_valley hackathon in @SHACK15sf we built FRIEND - an AI Wearable that: - Works 24+ hours on a single charge - Costs ~$20 -…
It's been <24h in SF and I’m already getting pulled into hosting a hackathon? The builder energy is unmatched. @AlexReibman @hackgoofer any venue recs / SF hackathon planning advice?
What’s a common misconception about machine learning that you wish more people understood?