Vithu Thangarasa @vithursant19

Machine Learning Research at @CerebrasSystems, previously at @Tesla and @UberAILabs, and grad student at @uoguelph_mlrg and @VectorInst. Thamilan ௐ. vithursant.com San Francisco, CA Joined August 2018

Tweets: 277
Followers: 405
Following: 507
Likes: 902

Philipp Schmid @_philschmid

a week ago

Phi-3 mini model released under MIT! 🚀 Last Week Llama 3, this week Phi-3 🤯 @Microsoft Phi-3 comes in 3 different sizes: mini (3.8B), small (7B) & medium (14B). Phi-3-mini was released today, claiming to match Llama 3 8B performance! 🚀 3.8B TL;DR: 2️⃣ Instruct Versions with 4k…

13 97 364 57K 133

Susan Zhang @suchenzang

a year ago

Not to mention this insane point-cloud of a plot for their Figure 1 (aka main result) that draws 3-lines for each of the "Approaches" to claim that all 3 yield "mostly similar" results. Since when did a 10x difference become "mostly similar" in literature? [6/7]

3 3 61 10K 1

Tamay Besiroglu @tamaybes

2 weeks ago

The Chinchilla scaling paper by Hoffmann et al. has been highly influential in the language modeling community. We tried to replicate a key part of their work and discovered discrepancies. Here's what we found. (1/9)

16 131 889 313K 663

AK @_akhaliq

2 weeks ago

Introducing v0.5 of the AI Safety Benchmark from MLCommons This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use

4 39 111 21K 51

𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠 @IanCutress

3 weeks ago

More silicon news: @intel lifted the lid on Gaudi 3 today at #IntelVision. OAM and PCIe versions. ➡️TSMC N5 ➡️PCIe 600W ➡️OAM 900W air, 900W+ liquid ➡️128 GB HBM2e ➡️64 Tensor Cores ➡️24x200 GbE ➡️PCIe 5.0 x16 ➡️Supports clusters up to 8192 ➡️2xFP8, 4xBF16 vs Gaudi 2 ➡️10 tiles

9 41 265 31K 44

Vithu Thangarasa @vithursant19

a month ago

It’s only been a day 😅

SambaNova Systems @SambaNovaAI

a month ago

It’s only been a day 😅

25 98 387 1.2M 149

1 0 6 449 1

Vitaliy Chiley @vitaliychiley

a month ago

Introducing DBRX: A New Standard for Open LLM 🔔 databricks.com/blog/introduci… 💻 DBRX is a 16x 12B MoE LLM trained on 📜 12T tokens 🧠DBRX sets a new standard for open LLMs, outperforming established models on various benchmarks. Is this thread mostly written by DBRX? Yes! 🧵

23 85 473 116K 178

Jim Fan @DrJimFan

a month ago

Today is the beginning of our moonshot to solve embodied AGI in the physical world. I’m so excited to announce Project GR00T, our new initiative to create a general-purpose foundation model for humanoid robot learning. The GR00T model will enable a robot to understand multimodal…

198 1K 5K 841K 2K

VentureBeat @VentureBeat

2 months ago

Cerebras and G42 said they have broken ground on Condor Galaxy 3, an AI supercomputer that can hit eight exaFLOPs of performance. venturebeat.com/ai/cerebras-br…

0 3 5 4K 0

NYSE 🏛 @NYSE

2 months ago

"Making science fiction a reality." @CerebrasSystems CEO Andrew Feldman makes a major announcement and discusses how the company aims to lead the charge in AI innovation on #TakingStock with @trinitychavez #TSTC

1 12 42 7K 4

Cerebras @CerebrasSystems

2 months ago

📣ANNOUNCING THE FASTEST AI CHIP ON EARTH📣 Cerebras proudly announces CS-3: the fastest AI accelerator in the world. The CS-3 can train up to 24 trillion parameter models on a single device. The world has never seen AI at this scale. CS-3 specs: ⚙ 46,225 mm2 silicon | 4…

6 17 89 13K 16

𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠 @IanCutress

2 months ago

How many cores? OVER 900000! @CerebrasSystems just announced its new generation Wafer Scale Engine 3, a chip as big as your head - four trillion transistors, double the performance of WSE-2, built with TSMC N5. 125 PetaFLOPs of FP16/BF16 compute. TASTY. youtu.be/f4Dly8I8lMY

10 22 145 9K 15

Cerebras @CerebrasSystems

2 months ago

📣 ANNOUNCEMENT DAY AT CEREBRAS 📣 Today, we are thrilled to share some of the biggest announcements in our company’s history. 📢 Cerebras announces CS-3, the world’s fastest AI Chip with a whopping 4 trillion transistors 📢 Cerebras selects Qualcomm to deliver unprecedented…

16 59 306 115K 68

Aran Komatsuzaki @arankomatsuzaki

2 months ago

Google presents: Stealing Part of a Production Language Model - Extracts the projection matrix of OpenAI’s ada and babbage LMs for <$20 - Confirms that their hidden dim is 1024 and 2048, respectively - Also recovers the exact hidden dim size of gpt-3.5-turbo…

16 151 972 237K 662

Warcop @Warcop

2 months ago

Good stuff going on here. #MediSwift biomedical training benchmarks.

Cerebras @CerebrasSystems

2 months ago

Good stuff going on here. #MediSwift biomedical training benchmarks.

2 9 44 5K 20

0 1 1 245 1

Cerebras @CerebrasSystems

2 months ago

(1/n) Introducing MediSwift, the first suite of biomedical language models that employ sparse pre-training techniques to significantly reduce computational costs, while outperforming existing models up to 7B parameters on benchmark tasks such as PubMedQA. Paper:…

2 9 44 5K 20

Angry Tom @AngryTomtweets

3 months ago

It's only been 5 hours since Open AI announced Sora, and people are going crazy over it. Here are 10 wild examples you don't want to miss: 1. Snow dogs

422 2K 20K 9.4M 8K

Cerebras @CerebrasSystems

3 months ago

🎉 Cerebras overtakes peers as #1 AI Semiconductor Startup 🎉 In the freshly-updated 2023 State of AI Report Compute Index from Air Street Capital and Zeta Alpha, Cerebras is highlighted for leading all AI semiconductor startups in research publication counts, open source…

1 14 52 5K 12

Vithu Thangarasa @vithursant19

3 months ago

This is neat! Explore PyTorch models effortlessly with TorchTrail! It allows you to easily trace and visualize your model's execution, seamlessly extracting torch function and module graphs💯github.com/arakhmati/torc…

0 1 13 403 5

Awni Hannun @awnihannun

3 months ago

Collected some of the amazing projects people are building with MLX in one place: github.com/ml-explore/mlx… Looking at that list, it's hard believe MLX is just 2 months old.

8 70 381 53K 265

Claire Croshaw @ClaiCrosh

34 Followers 5K Following

Pirckoos @pirckoos59172

0 Followers 140 Following

Hertha Wiedyk @WiedyHerth

65 Followers 5K Following

LaurelAbe @18hgx5hrd7o4Zn

0 Followers 161 Following

Azalea Bend @azalea89088

101 Followers 5K Following

DaphneJimmy @k4pvKZWADf77hy2

3 Followers 237 Following

Heather Carrejo @HeathCarrej

87 Followers 5K Following

Ariah Zevallos @ZevalloAr

15 Followers 3K Following 💦25 - Lets Cam👇🖤

Elsy Mcmichael @e_mcmicha

18 Followers 4K Following 19 / Lets Cam👇😘

joshua cohen @yecohn

22 Followers 411 Following DL engineer @AI21Labs

Dahlia Bolnick @boln_dahli

84 Followers 5K Following

Claire Kreissler @clair_kreissl

13 Followers 2K Following 😚Claire | Lets Have Fun👇

Zora Huso @huso_zo

51 Followers 5K Following

Nelle Findling @FindlNel

74 Followers 5K Following

Lovella Dottin @LovelDott

47 Followers 5K Following

Tressie Selbo @tress_sel

10 Followers 2K Following Tressie , Lets Have Fun👇

Mark Kovarski @mkovarski

2K Followers 5K Following Responsible AI, Cloud, SaaS, Product 🤖 🫶🌐💡 | https://t.co/2vuiFosXlm 📪

Galilea Stiegman @GaliStieg

68 Followers 5K Following

Muoi Balliett @MuoiB88696

79 Followers 5K Following

Ravid Shwartz Ziv @ziv_ravid

2K Followers 1K Following Faculty Fellow and Assistant Professor at @NYUDataScience, working with @ylecun

Lindy Bulow @LBulow6786

86 Followers 5K Following

Kylie Courtney @court_ky

27 Followers 4K Following Kylie ~ Lets Chat👇

Imaan Difilippo @DifilippoI52295

82 Followers 5K Following

Khadijah Rehder @RehdeKhadij

61 Followers 5K Following

Paige Fandrich @PaiFandr

40 Followers 5K Following

Lillie-mae Soffer @SofferMae35208

66 Followers 5K Following

Aditi Wilshusen @wilshu_adi

46 Followers 5K Following

Esmai Vanbeveren @e_vanbevere

44 Followers 5K Following

N @men_shin_kai

85 Followers 1K Following

Claudia Milliron @ClaudMilli

11 Followers 3K Following

Courtney Zalwsky @zalwsky77669

96 Followers 5K Following

Lenora Kaitz @LKaitz74989

35 Followers 5K Following

Aleeza Mose @AleezaMo

97 Followers 5K Following

Maleah Figuroa @figur_male

28 Followers 4K Following 21 ~ Lets Cam👇💕

Mr. o @WemingMa

13 Followers 176 Following Interesting Machine Learning， DeepLearning，Data Science

Alexandra Kirkbride @AlexandraK14133

41 Followers 5K Following

Yer Delmas @DelmasYer78402

77 Followers 5K Following

Freida Stapleton @FreidStaplet

29 Followers 5K Following

Milena Bensen @BenseMile

12 Followers 3K Following Milena Lets Have Fun👇

Oscar Lynema @OLynema26371

24 Followers 2K Following Oscar ~ Earn your own Crypt$ casino👇

Biswajit Mishra @biswajitism

37 Followers 204 Following AI and HPC researcher, Truth seeker

❤️‍🔥👑 MAG.. @freakoncrypto

195 Followers 4K Following Queen of real takes ❤️‍🔥 (investing $250-500k in mainstream-adoption-oriented web3 startups). Trading 100% crypto

Candra Bouley @BouleCandr

21 Followers 4K Following 19 - Join my free content👇🔞

Hassan Hayat 🔥 @TheSeaMouse

5K Followers 4K Following Building the AI assistant for all @ https://t.co/D4gDyw97gu

Toughsh @Toughsh372489

121 Followers 3K Following

Bob Komin @BobKomin

368 Followers 1K Following

Sandi Havel @sand_hav

35 Followers 3K Following 📈23 - Earn now with crypto presale👇🔑

k_zer0s @k_zer0s

747 Followers 2K Following VC, Quantum Computing, AGI, AI, SDXL, FPGA, Startup Consulting, Senior Venture Architect at Financial Institution.

Nish Sinnadurai @NishSinnadurai

6 Followers 94 Following

Qiao Jin, MD @DrQiaoJin

1K Followers 859 Following Postdoc @NCBI @NLM_NIH. Tsinghua MD. JMIR AE. Democratizing medical knowledge. AgentMD, MedRAG, TrialGPT, GeneGPT, MedCPT, PMC-Patients, PubMedQA. Views my own.

MLCommons @MLCommons

3K Followers 131 Following Better Artificial Intelligence for Everyone

Wand AI @WandAI_

415 Followers 29 Following Wand platform, enables business users and data analysts to solve real-world business problems easily and quickly – Collaborative, measurable and scalable.

Ravid Shwartz Ziv @ziv_ravid

2K Followers 1K Following Faculty Fellow and Assistant Professor at @NYUDataScience, working with @ylecun

Aathushan Kugendran @aathushankgn

4 Followers 6 Following

Kalyan KS @kalyan_kpl

749 Followers 511 Following Katikapalli Subramanyam Kalyan (shortly Kalyan KS), Research Scientist (NLP) working on Generative AI and LLMs at @AkmmusAI.

Andrew Gao @itsandrewgao

27K Followers 2K Following techno optimist! currently: @nomic_ai @stanford; prev @LangChainAI; Z Fellow 🇺🇸

Corey Lynch @coreylynch

10K Followers 1K Following AI at @figure_robot, previously research scientist at @GoogleDeepMind.

Vivek Ponnaiyan @viveksworld

760 Followers 635 Following Founder & Angel investor. AI & Fintech junkie. Past: Chime, BMW Self driving, Bloomberg, health-tech founder. Tweets about AI, startups, learning, & football.

Mehmet Perk @mmt

917 Followers 1K Following 🤹‍♂️ Tech&Innovation @Siemens 🌮 co-founded: https://t.co/q2oseFfdT2, https://t.co/nwos8GDR4f (exited) 🎙️ Occasionally speaker, PM instructor

shrihacker @shrihacker

4K Followers 252 Following jsk ❤️🙏

Singh, Satinder Paul @PaulSatinder

1K Followers 4K Following Chief Technical Officer - Hermes Semiconductor | Strategist in Semiconductor Technology domain | VLSI/SoC/ASIC/IC (Chip) Design Specialist

Ben Pouladian @benitoz

4K Followers 1K Following Father (x3!), EE, entrepreneur, investor, real estate developer, super angel, AI 🤖, biotech 😇 @ypo @TerasakiInst ❤️ asymmetry 📈🇮🇱🇺🇸💪🏽

Yannick Scholich (e/… @YannickScholich

550 Followers 2K Following Effectively accelerating. Working on not dying and on FUN. Always needs funding and compute. Math/Applied Physics/CS

Engin @ngoteko

33 Followers 67 Following MLE @CerebrasSystems - CS PhD @UCSC

Kim Ziesemer @kziese

736 Followers 1K Following Co-founder of ZM Communications. Boulder-based PR & marketing professional. mom of two incredible little humans. Love travel, tech, new perspectives.

Isak Westerlund 🦇… @westis96

759 Followers 4K Following Exploring Amortized Inference, Language and Speech.

Biswajit Mishra @biswajitism

37 Followers 204 Following AI and HPC researcher, Truth seeker

Brett Adcock @adcock_brett

172K Followers 14 Following Founder @Figure_robot (AI Robotics) & Archer Aviation (NYSE: ACHR)

Hassan Hayat 🔥 @TheSeaMouse

5K Followers 4K Following Building the AI assistant for all @ https://t.co/D4gDyw97gu

Cuong Nguyen @cuong_qnguyen

205 Followers 645 Following Director of AI/ML Engineering @GSK. Previously: AI/ML @Genentech.

Sahil Lihas @MrSahilLihas

84 Followers 367 Following Book is still blank MS Research Scholar, IIT Madras Deep Learning, Semantic Web

Warcop @Warcop

2K Followers 2K Following Problems Demolitionist/Cloud Innovation Architect #TechFieldDay #Innovation #NetDevOps #AVTweeps #AVoIP #SMPTE #IPMX

Braydon Dymm, MD @BraydonDymm

601 Followers 565 Following Stroke Neurologist @CAMCHealth | Trained @Duke_Neurology & @UMICHNeuroRes | Interested in the intersection of AI, neurology, and education 🤖🧠👨‍🏫

Nish Sinnadurai @NishSinnadurai

6 Followers 94 Following

k_zer0s @k_zer0s

747 Followers 2K Following VC, Quantum Computing, AGI, AI, SDXL, FPGA, Startup Consulting, Senior Venture Architect at Financial Institution.

Meysam @vcmeysam

1K Followers 182 Following Training LLMs @scale_ai | ex @google @mastercard @yumbrands | Founder @wikusventures | @Solana fan | Traveled 100+ countries 🌎

Saleh Soltan @SalehSoltan

235 Followers 446 Following Principal Applied Scientist @Amazon AGI |Ph.D. @Columbia 2017 Views of my own.

Jimmy @mrgemy95

378 Followers 1K Following Mahmoud G. Salem. ML Scientist @cerebrassystems. MSc @vectorinst , @uofg. ex @GoogleAI

Bob Komin @BobKomin

368 Followers 1K Following

Quentin Anthony @QuentinAnthon15

999 Followers 129 Following I make models more efficient. Google Scholar: https://t.co/kzVsAKPdrp

dinos @din0s_

805 Followers 438 Following IR & NLP Research @ZetaVector. Interested in Neural Information Retrieval, Autonomous Agents, and AI-assisted Evaluation. Prev: MSc AI @UvA_Amsterdam

Qiao Jin, MD @DrQiaoJin

1K Followers 859 Following Postdoc @NCBI @NLM_NIH. Tsinghua MD. JMIR AE. Democratizing medical knowledge. AgentMD, MedRAG, TrialGPT, GeneGPT, MedCPT, PMC-Patients, PubMedQA. Views my own.

Cartesia @cartesia_ai

1K Followers 8 Following Cartesia is training next-gen foundation models with subquadratic deep learning architectures. Sign up for early access at https://t.co/c5og0yF1Pz

Elad Hazan @HazanPrinceton

11K Followers 187 Following machine learning and optimization @PrincetonCS & Google DeepMind Princeton, dad^3

Nikela Papadopoulou @_nikela_

251 Followers 620 Following never thinking straight | always thinking parallel Low Carbon & Sustainable Computing Lecturer @GlasgowCS

Arcadian Computers @ArcadianComp

339 Followers 906 Following We are a computer consulting / repair center. We fix (and build) laptops / desktops / servers. We offer onsite support for business, and website design/hosting.

𝕏one — exo/acc … @xone_4

151 Followers 264 Following CrD. & VFX-Artist. As a kid i always wanted to become a pirate⚓️, Now i lost one eye and have a bad leg, I sacrificed both to the Ancient Ones, Am i one now 🤔?

Ruslan Röhrich @Ruslan_0990

856 Followers 910 Following Software engineer @zeiss_micro, previosly @tngtech and PhD in physics @_amolf. Tweets on sustainability, computational imaging, and AI. 👨‍💻 🔬 🦇🔊

1X @1x_tech

9K Followers 2 Following Androids built to benefit society and meet the world's labor demand.

Jason Weston @jaseweston

9K Followers 569 Following Research @MetaAI+NYU. Pretrain+SFT: NLP from Scratch (2011). Multilayer attention+position embed+LLM: MemNets (2015). Recent (2024): Self-Rewarding LLMs & more!

Alaa El-Nouby @alaa_nouby

522 Followers 302 Following Research Scientist at @Apple. Previous: @Meta (FAIR), @Inria, @MSFTResearch, @VectorInst and @UofG . Egyptian 🇪🇬 Deprecated twitter account: @alaaelnouby

Shreyas Saxena @saxenashreyas2

34 Followers 91 Following ML Research Scientist @ Apple

Eric @ericmitchellai

4K Followers 488 Following I like AI & music. Working on making LLMs easier & safer to use. Final year PhD student at Stanford advised by Chelsea Finn & Chris Manning.

qnguyen3 @stablequan

3K Followers 1K Following Multimodal | Synthetic Data | Multimodal Lead at Ontocord AI

Ishani Thakur @ishanit5

222 Followers 2K Following " 'How's the water?' And the two young fish swim on for a bit, and then eventually one of them looks over at the other and goes 'What the hell is water?'" - DFW

Ricardo Ander-Egg @ricardoanderegg

428 Followers 2K Following (Machine Learning Engineer ⋃ Software Engineer) ∩ Medical doctor. Swimmer and dancer. https://t.co/ixUWl9iGFq @polyrand@hachyderm.io

Software Dev @swdevservice

365 Followers 2K Following I build/support SaaS w/ 300M users. Observability, identity, privacy. Interests: AI, personal finance, society. Solved my RSI/back pain. español 日本語 e/💻

Maxime Labonne @maximelabonne

12K Followers 437 Following Author of Hands-On Graph Neural Networks https://t.co/Q8victWUmR • Machine Learning Scientist

marcel - so back / ng.. @mrclbschff

829 Followers 1K Following Staff Data Scientist, Mathematician, Father of two. Deep Learning / NLP / Computer Vision / MLOps OCaml Curious Not sponsored by Spindrift

budjoskop @budjoskop

88 Followers 428 Following Swift | iOS dev

Jeff Dean (@🏡) @JeffDean

18 hours ago

An edgy post... 1 trillion edges, in fact!

Google AI @GoogleAI

21 hours ago

Graph clustering merges similar items into groups to better understand relationships in data. Today, read about our recent works, including key techniques that enabled us to scale a high-quality algorithm that can cluster trillion-edge graphs. Read more → goo.gle/3y1iXMs

20 285 2K 207K 802

11 27 326 51K 82

Jeff Clune @jeffclune

2 days ago

Delighted to share that I've been promoted to Professor (aka “Full Professor”) A huge thanks to my wife, students, collaborators, colleagues, family, & friends for everything. It's been an exhilarating, wondrous, fascinating climb, and what a view! Now, which peak to climb next?

61 8 399 27K 8

Ravid Shwartz Ziv @ziv_ravid

2 weeks ago

All the Chinchilla scaling laws are wrong?!??!😱😱😱

Tamay Besiroglu @tamaybes

2 weeks ago

16 131 889 313K 663

1 0 9 4K 7

MLCommons @MLCommons

2 weeks ago

Thanks, @JRussonHPC for this excellent writeup about the @MLCommons AI Safety v0.5 benchmark we announced earlier this week! If you want to learn more and contribute to our efforts to make AI safer for everyone, join our #AISafety working group: mlcommons.org/working-groups…

HPCwire @HPCwire

2 weeks ago

MLCommons Launches New AI Safety Benchmark Initiative ow.ly/V6NQ50RhsMv

0 2 5 864 1

0 0 5 441 0

MLCommons @MLCommons

2 weeks ago

Be sure to check out this coverage from @Business_AI of the MLCommons AI Safety v0.5 proof-of-concept benchmark that we announced earlier in the week to make AI safer for everyone: aibusiness.com/responsible-ai…

0 0 3 301 0

MLCommons @MLCommons

2 weeks ago

Want to dig deeper into the details of the @MLCommons AI Safety v0.5 benchmark proof of concept that we announced this week? Learn more about the approach, platform, and tests created by our open consortium for this first step toward evaluating AI safety: arxiv.org/abs/2404.12241

1 3 7 1K 1

Awni Hannun @awnihannun

2 weeks ago

One of my favorite things about MLX is it helps put ML research back in the hands of a single bold hobbyist. Don’t need a supercomputer to invent - just a nice laptop, a vision, and some persistence, (and maybe pip install mlx 😉)

15 30 369 43K 52

Remi Cadene @RemiCadene

2 weeks ago

Showcasing the powerful Idefics2, latest Vision LLM from Hugging Face, on a robot! 🚀

Thomas Wolf @Thom_Wolf

2 weeks ago

Time for the open-source AI robots revolution 🚀 We’ve been playing with a low-cost DJI robot controlled by 3 local open-source AI models (Whisper, Idefics2, Parler-TTS - all Apache2) & orchestrated by Dora-cs In comments a 250 lines code gist to build on top of it => enjoy!!

24 117 591 134K 355

0 7 32 11K 4

Cerebras @CerebrasSystems

3 weeks ago

🌟 Cerebras is thrilled to be selected on the 2024 Forbes AI 50! 🌟 Here are a few reasons why we made the cut: 🎉 Cerebras is the only AI chip startup on this year’s list. Learn more about our latest generation of hardware, the CS-3: cerebras.net/product-system/ 🎉 We enable top…

12 7 40 3K 2

Elon Musk @elonmusk

3 weeks ago

Precisely

8K 38K 229K 51.2M 56K

Tri Dao @tri_dao

4 weeks ago

I highly recommend this tutorial on Mamba and related models. Full of insights on model design and hardware-aware implementation!

Sasha Rush @srush_nlp

4 weeks ago

A Mamba Primer (w/ Yair Schiff youtube.com/watch?v=dVH1dR… ) Mamba is a nice jumping off point to summarize foundational ideas in sequence modeling, parallel algorithms, continuous-time representations, and GPU aware algorithms. We try to put these together in the context of LMs.

4 102 663 74K 736

0 32 251 31K 188

Abhi Venigalla @abhi_venigalla

a month ago

If you have apple silicon and > 70GB of RAM, you can run DBRX on your laptop!! Kudos to @awnihannun :) huggingface.co/mlx-community/…

7 20 187 20K 61

KaV @KaV_2599

a month ago

@vithursant19 I can't keep up

0 0 1 30 0

Dylan Patel @dylan522p

a month ago

Databricks DBRX model is AMAZING, generally great but CRUSHES code. 132B parameters, 12T token, 16 experts, 4 per forward, 36B active. ~2.6e24 HumanEval5, 0-Shot DBRX - 70.1% GPT-4 - 67% Gemini 1.5 Pro - 71.9% Mixtral - 54.8% Grok - 63.2% LLAMA 2 - 32.2% databricks.com/blog/introduci…

Jonathan Frankle @jefrankle

a month ago

Meet DBRX, a new sota open llm from @databricks. It's a 132B MoE with 36B active params trained from scratch on 12T tokens. It sets a new bar on all the standard benchmarks, and - as an MoE - inference is blazingly fast. Simply put, it's the model your data has been waiting for.