Theophile Gervet @theo_gervet
Accelerating open-source AI @MistralAI. Past: @Meta AI, PhD @SCSatCMU theophilegervet.github.io Pittsburgh, PA Joined March 2011-
Tweets1K
-
Followers1K
-
Following476
-
Likes1K
Very cool to see a personal hero whose videos taught you ML make a video about your models. Thank you for always being a great teacher @AndrewYNg!
Very cool to see a personal hero whose videos taught you ML make a video about your models. Thank you for always being a great teacher @AndrewYNg!
Back with more Apache 2.0! We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Outperforms all open models with only 39B active parameters - Native function calling and 64K context Blog: mistral.ai/news/mixtral-8… HF base: huggingface.co/mistralai/Mixt… HF instruct:…
magnet:?xt=urn:btih:9238b09245d0d8cd915be09927769d5f7584c1c9&dn=mixtral-8x22b&tr=udp%3A%2F%2Fopen.demonii.com%3A1337%2Fannounce&tr=http%3A%2F%https://t.co/OdtBUsbeV5%3A1337%2Fannounce
Hosting our first hackathon in SF, come build together!
Exciting personal update: I defended my PhD last week on Deep Learning on Graphs. Huge thanks to my committee Profs Tom Mitchell, @jure, and my advisors @rsalakhu and Christos Faloutsos. I have joined @inflectionAI to extend the benefits of intelligence to a wider audience!
We’re announcing a new optimised model today! Mistral Large has top-tier reasoning capacities, is multi-lingual by design, has native function calling capacities and a 32k model. The pre-trained model has 81.2% accuracy on MMLU. Learn more on mistral.ai/news/mistral-l…. Mistral…
Excited to release our latest model Mistral Large, try it out at chat.mistral.ai!
Excited to release our latest model Mistral Large, try it out at chat.mistral.ai!
How to get started with @MistralAI? 🔸Prompting: docs.mistral.ai/guides/prompti… 🔸RAG: docs.mistral.ai/guides/basic-R… 🔸Embeddings: docs.mistral.ai/guides/embeddi…
ChatGPT system prompt is 1700 tokens?!?!? If you were wondering why ChatGPT is so bad versus 6 months ago, its because of the system prompt. Look at how garbage this is. Laziness is literally part of the prompt. Formatted in the paste bin below. pastebin.com/vnxJ7kQk
Very cool to see HF assistants as an open-platform GPTs alternative, powered by @MistralAI Mixtral 8x7B as the default model! For now an assistant is a model + a system prompt, curious to see this evolve
Very cool to see HF assistants as an open-platform GPTs alternative, powered by @MistralAI Mixtral 8x7B as the default model! For now an assistant is a model + a system prompt, curious to see this evolve https://t.co/R2dvk4XJNn
Proud to see @MistralAI models pulling their weight on the @lmsysorg leaderboard: - Mixtral 8x7B is the best open license model - Mistral-Medium is just behind GPT-4, ahead of Claude 2.0 and Gemini Pro
We just released Mixtral 8x7B paper on Arxiv: arxiv.org/abs/2401.04088
Best books of 2023: The Fabric of Reality and The Beginning of Infinity by @DavidDeutschOxf Few books, if any, have impacted my worldview as much as Deutsch's work. He melds physics, evolution, computation, and epistemology to place human minds at the center of the physical…
This seems like the right data-centric paradigm for dexterous grasping that generalizes to any object and any purpose: learn general affordances from passive Internet-scale data and robustness from Sim2Real interaction data. Keep up the great work @anag004!
This seems like the right data-centric paradigm for dexterous grasping that generalizes to any object and any purpose: learn general affordances from passive Internet-scale data and robustness from Sim2Real interaction data. Keep up the great work @anag004!
🧐Consider this: In the real world, do multimodal data always exhibit straightforward one-to-one relationships between modalities? Join me for a discussion on how LLMs manage multimodal data with intricate intermodal connections at Hall C2! 🔥
Thank you for your support towards pushing the boundary of open-source!
Thank you for your support towards pushing the boundary of open-source!
Mixtral 8x7B is here, 11 weeks only after Mistral 7B. Outperforms Llama 2 70B and GPT 3.5 on most benchmarks, at the inference cost of a 12B dense model, with 32k tokens context size.
Mariya Toneva @mtoneva1
3K Followers 548 Following Faculty at @mpi_sws_ working at the intersection of ML, NLP, and cognitive neuroscience. Yogurt snob.Devendra Chaplot @dchaplot
8K Followers 365 Following Building next-gen AI at @MistralAI. Past: Research Scientist at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.Jeremy Cohen @deepcohen
4K Followers 870 Following PhD student in machine learning at Carnegie Mellon. The goal of my research is to turn deep learning into a real engineering discipline.Paul Liang @pliang279
5K Followers 910 Following PhD student @mldcmu @SCSatCMU. Foundations of multimodal learning & applications in social AI, NLP, and healthcare with @lpmorency and @rsalakhu.FaB San Francisco Fas.. @BeautytechSF
1K Followers 2K Following We are a community. 15k+ Entrepreneurs&Investors. Beauty&Fashion lovers. Founded in #California, now 19 chapters #wearefab💄🤳👡 🌎🚀 #FaB #Fashion #BeautyTechYiding Jiang @yidingjiang
1K Followers 468 Following PhD student @mldcmu @SCSatCMU. Formerly intern @MetaAI, AI resident @GoogleAI. BS from @Berkeley_EECS. Trying to understand stuff.Amelia Laskoskie @AmeliaLask34039
63 Followers 5K FollowingLewis Walker ➲ @lewiswalkerai
5K Followers 5K Following Follow for Generative AI insights shared daily | Deloitte AI | Ex-Goldman Sachs | LinkedIn Top AI VoiceBen Fu @benfucious
274 Followers 500 Following Partner @NewViewCap. General Partner @NextWorldCap. Investor @Gong_io @Aircall @Honeycombio @CopperInc @DataStax @DatameerTuedish @TuedishwHa_aO7
1 Followers 70 FollowingTianna Jelinski @TianJelin
65 Followers 5K FollowingCollene Mattina @colle_matti
99 Followers 5K FollowingRohan Paul @rohanpaul_ai
13K Followers 1K Following ML Engineer (e/acc) 📌 https://t.co/x0IIWfnOt8 🚀 https://t.co/QEO4CKRl1b Open LLMs is Happiness 💡 Ex Deutsche & HSBC. DM for collaboration.Maddie Aiello @aiello_mad80860
88 Followers 5K FollowingAzoth @Azoth42
48 Followers 381 FollowingChantel Castner @ChantCastner
42 Followers 5K FollowingPoorvi @Poorvi_rh
58 Followers 228 Following CV/Robotics @ Stealth Startup | MS Computer Vision @CMU | CSE @IIT BombayJosephine Rochefort @JosephinRochefo
64 Followers 5K FollowingYash @Yash11386432
4 Followers 53 FollowingAubree Woolson @wools_aubr
38 Followers 5K FollowingMuhammad Abdullah @Abdullah_kwl
42 Followers 501 Following Life is better when you're laughing...... "your time is limited,So don't waste it living someone else's life❤Madeline Kasdon @m_kasdo
42 Followers 5K FollowingDavid @DavidSHolz
54K Followers 5K Following founder @midjourney, prev founder leap motion, nasa, max planckcamenduru @camenduru
15K Followers 4K Following ML & Computer Engineer, Game Designer. #OpenSource ❤ #UE ❤ #Jupyter ❤ #AI #ML #StableDiffusion #LLM #NeRF #GaussianSplatting #T2V https://t.co/8MMNbygz1PVineeth Veetil @vin_veetil
52 Followers 142 FollowingHardik Godara @HardikGodara
20 Followers 161 Following Innovator, Tech Enthusiast, Avid Reader, Roboticist, Adventure loverShLeoMo @ShLeoMo
41 Followers 230 FollowingArif Ahmad @arif_ahmad_py
278 Followers 7K Following All things AI, Computer Science and Circuits! Prev. @GoogleAILeyla Katsuda @kats_ley
29 Followers 5K FollowingEvan @Evan734
65 Followers 2K FollowingDify.AI @dify_ai
6K Followers 121 Following An open-source LLM app development platform. GitHub:https://t.co/7MWGu2QQ1o Discord:https://t.co/MEaPZpnE5MMunish Kumar @kumar_munish_
57 Followers 236 FollowingPawan Osman @pawanosmant
397 Followers 128 Following 💻 Backend Developer, IT consultant and Networking Specialist ⚙️Avi Barazani @avibarazani326
12 Followers 60 FollowingCryptonite @battle8500
311 Followers 1K Following Economist and applied statistician. Passionate about Python | BigData | MachineLearning | DeepLearning. Believes in #Bitcoin. Builds webapps with Streamlit.shaoliang shi @ShaoliangS42597
1 Followers 87 FollowingSarah Arminta Bentley @Sarah_A_Bentley
3K Followers 2K Following Trader. PKM. AI. Mom. @tana_inc AmbassadorMa Sheen @MaSheenUprising
8 Followers 998 Following “The programme will take me a little while to run.” Fook glanced impatiently at his watch.Epsilon Guanlin Lee @Epsilon_Lee
208 Followers 2K Following PhD, MLer, CLer (NLPer), ML Engineer at https://t.co/gX6Lem59Co, have belief in interpretability research of AI/ML/NNsAaditya ; @Aaditya26082004
531 Followers 7K Following CS'26 • Machine Learning • Open-Source • Web Dev. • Algorithms • Jai Shree Krishna 🦚🪈Florent Daudens @fdaudens
11K Followers 6K Following Press Lead @HuggingFace / Passionate about AI & news / Previously @radiocanadainfo @ledevoir & coNot_Important @djagunic_tk
8 Followers 61 FollowingDerek Cheung @derekcheungsa
2K Followers 377 Following Engineer, Instructor & Investor in Canada. 💼🤵 Publicly building AI apps to solve real world problems Follow for tweets on AI, Finance, Building Cool AI AppsSimi Dolha @nuk3r
87 Followers 465 Following Civil Protection Exercises Coordination | Disaster Management | Crisis Leadership | Safety & Security | Military | Education | UEFA B Football Coach | AuthorYann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Mariya Toneva @mtoneva1
3K Followers 548 Following Faculty at @mpi_sws_ working at the intersection of ML, NLP, and cognitive neuroscience. Yogurt snob.Devendra Chaplot @dchaplot
8K Followers 365 Following Building next-gen AI at @MistralAI. Past: Research Scientist at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.Jeremy Cohen @deepcohen
4K Followers 870 Following PhD student in machine learning at Carnegie Mellon. The goal of my research is to turn deep learning into a real engineering discipline.Jia-Bin Huang @jbhuang0604
51K Followers 285 Following Associate Professor @umdcs; Part-time Research Scientist @Meta. I like pixels.Paul Liang @pliang279
5K Followers 910 Following PhD student @mldcmu @SCSatCMU. Foundations of multimodal learning & applications in social AI, NLP, and healthcare with @lpmorency and @rsalakhu.Richard Socher @RichardSocher
101K Followers 971 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindZachary Lipton @zacharylipton
59K Followers 2K Following Professor: CMU/@acmi_lab, CTO / CSO: @AbridgeHQ, Creator: @d2l_ai & https://t.co/QQt98VNLUp, Relapsing 🎷Emery Wells @emerywells
11K Followers 1K Following Founder of a unicorn, @Frame_io (acquired by Adobe). Video Pro. Apple Design Award winner.David @DavidSHolz
54K Followers 5K Following founder @midjourney, prev founder leap motion, nasa, max planckRajko Radovanović @rajko_rad
4K Followers 4K Following AI/infra @a16z (partner to amazing teams eg @MistralAI @udiomusic); Enjoy most things outdoors, care about democracy in 🇷🇸🇭🇷🇸🇮🇧🇦🇲🇪Joshua Meier @joshim5
2K Followers 1K FollowingAndrew White crow/acc @andrewwhite01
20K Followers 2K Following Head of Sci/cofounder @FutureHouseSF. Prof of chem eng @UofR (on sabbatical). Automating science with AI and robots, starting with bio.Jack Altman @jaltma
76K Followers 415 Following Investing at Alt Capital. Founder and chairman of Lattice.Alex Reibman 🖇️ @AlexReibman
23K Followers 801 Following Accelerating @agentopsai @foomvc Agents, ML, math, and data viz. Hack reporter🕶️brett goldstein @thatguybg
21K Followers 3K Following founder of something new/old | x M&A guy at Google | investor in 50+ via @launchhouse | writing https://t.co/iX1dpZ3MZY | follow for startup advice & subpar memesAlex Bowe @alexbowe
2K Followers 1K Following Exploring ideaspace | Ex AI @Cruise | Compressed Data Structures/Bioinformatics PhD | Enjoyer of thingsmargaret jennings @mjmj1oo
657 Followers 646 Following building tools 🎀 produit @mistralai we’re hiring: https://t.co/IAQ1hUfJwTLulu Cheng Meservey @lulumeservey
80K Followers 2K Following “Meservey isn’t your typical flack.” -The Information. Founder of ROSTRA. Ex-Activision and Substack. TrailRunner cofounder. Writing https://t.co/4xKo7wQTQoCeder Group @cedergroup
3K Followers 136 Following Computational and Experimental Design of Emerging materials Research (CEDER) Group at @BerkeleyMSE & @BerkeleyLabKristin Persson @KPatBerkeley
3K Followers 367 Following Professor at UC Berkeley, Director of the Molecular Foundry and the Materials Project, Senior Faculty Scientist at LBNL. Opinions all me.Robert Palgrave @Robert_Palgrave
7K Followers 1K Following Professor of Inorganic and Materials Chemistry at UCL. Director of UK National XPS Service @harwellxpsAbhishek Das @abhshkdz
6K Followers 202 Following Prev: Research Scientist at FAIR @Meta & @OpenCatalyst, PhD at @GeorgiaTech.Remi Cadene @RemiCadene
8K Followers 587 Following Robotics at Hugging Face Ex-Tesla Autopilot Optimus Postdoc Brown, PhD SorbonneCristian Bodnar @crisbodnar
5K Followers 2K Following Senior Researcher @MSFTResearch | Simulations, Geometric Deep Learning, AI4ScienceRomain Huet @romainhuet
21K Followers 7K Following Head of Developer Experience @OpenAI. Previously, Product Lead @Stripe, Platform @Twitter, Co-Founder & CTO of Jolicloud.The AI Salon @TheAISalonSF
2K Followers 4 Following The AI Salon hosts in-person conversations on the philosophical, sociological and cultural implications of artificial intelligence.Balaji @balajis
1.0M Followers 4K Following Immutable money, infinite frontier, eternal life. #BitcoinPaul Buchheit @paultoo
60K Followers 968 Following Alignment, Narrative Understanding. I’m just a butterfly, flapping my wings :)Andrew Côté @Andercot
64K Followers 1K Following engineering physicist, founder Hyperstition Inc, scout @a16z, runs @TheAISalonSF, deep-tech, physics, energy and sci-fi 🇨🇦Quanquan Gu @QuanquanGu
9K Followers 2K Following Professor @UCLA | Head of AIDD, ByteDance Research | Recent work: Self-play fine-tuning (SPIN) | Opinions are my ownJulien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @PolytechniqueOmar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Aaron Levie @levie
2.5M Followers 630 Following Lead Magician (and CEO) at Box (@box); Huge ABBA fan. I don't fully endorse anything I say below. Go ☁Pessimists Archive @PessimistsArc
91K Followers 69 Following Exploring technophobia and moral panic through the ages. A litany of shameful cynicism and spite. Curated by @louisanslowSaurabh Garg @saurabh_garg67
865 Followers 579 Following Building next-gen AI at @MistralAI | prev/ PhD @mldcmu; CS @iitbombay (undergrad); Collab @GoogleAI @awscloud @appleGuillaume Verdon @GillVerd
53K Followers 3K Following Founder & CEO @Extropic_AI • prev: Physics & AI R&D @ (Alphabet X / Google) • Founder @ TensorFlow Quantum • (PhD(ABD) + MMath) @ (IQC / UWaterloo / PI) • e/accBrave Software @brave
293K Followers 92 Following Join over 70M users with our private browser, search, Web3 access & more. It only takes 60 seconds to switch. For help @BraveSupport 🦁 #BeBrave #SwitchToBraveYangqing Jia @jiayq
12K Followers 263 Following Founder @leptonai. @UCBerkeley alumni. ex @google & @facebook. ex vp @AlibabaGroup. Open source work on caffe, @pytorch, @tensorflow, & @onnxai.Maxime Labonne @maximelabonne
12K Followers 436 Following Author of Hands-On Graph Neural Networks https://t.co/Q8victWUmR • Machine Learning ScientistAK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxHarrison Chase @hwchase17
53K Followers 410 Following @LangChainAI, previously @robusthq @kensho MLOps ∪ Generative AI ∪ sports analyticsAsh Jogalekar @curiouswavefn
26K Followers 272 Following Scientist, historian, reader, writer, collector of vintage books. Views my own.David Perell @david_perell
435K Followers 782 Following "The Writing Guy" | Jesus-Follower | I tweet about writing and creativity | My writing school: https://t.co/bzeQ7VVyS0 | My writing: https://t.co/SOE9HtxXdiChristian Keil @pronounced_kyle
21K Followers 1K Following VP @Astranis, building internet satellites ◦ host of @1stPrinciplesFM ◦ investor and believer in deep tech startupsNous Research @NousResearch
18K Followers 29 Following The AI Accelerator Company. https://t.co/vrD0aDJetoTianle Cai @tianle_cai
5K Followers 4K Following ML PhD @Princeton. Life-long learner, hacker, and builder. Tech consultant & angel investor. Prev @togethercompute @GoogleDeepMind @MSFTResearch @citsecurities.jason liu @jxnlco
19K Followers 1K Following sabbatical @southpkcommons, angel investor?? prev @stitchfix @metaEurope has less than 3% of the world’s deployed H100s
events.nationalacademies.org/42507_04-2024_… Giving a talk on evaluating large language models for mathematics through interactions (work co-lead with @katie_m_collins) on Thursday. In the same session is the one and only @ChrSzegedy!
It's a great honor to partner with @AndrewYNg @DeepLearningAI on this @MistralAI course. Super excited for the launch!
New short course with @MistralAI ! Mistral's open-source Mixtral 8x7B model uses a "mixture of experts" (MoE) architecture. Unlike a standard transformer, an MoE model has multiple expert feed-forward networks (8 in this case), with a gating network selecting two experts at…
New short course with @MistralAI ! Mistral's open-source Mixtral 8x7B model uses a "mixture of experts" (MoE) architecture. Unlike a standard transformer, an MoE model has multiple expert feed-forward networks (8 in this case), with a gating network selecting two experts at…
Yup that was definitely a good idea
Considering moving from Austin to SF to be able to meet more interesting people in person. People familiar with SF, good or bad idea at this point?
Considering moving from Austin to SF to be able to meet more interesting people in person. People familiar with SF, good or bad idea at this point?
We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Free to use under Apache 2.0 license - Outperforms all open models - Native function calling - Masters English, French, Italian, German and Spanish. - Seq_len = 64K mistral.ai/news/mixtral-8…
Mixtral 8x22B Instruct is out. It significantly outperforms existing open models, and only uses 39B active parameters (making it significantly faster than 70B models during inference). 1/n
After two good years at Microsoft Research AI4Science, I am very excited to announce that as of this month I have, together with Chad Edwards, co-founded a new startup in the field of molecular and materials discovery.
Some personal news: I’m excited to be joining Hello Robot, to lead their Embodied AI effort! For years now, I’ve been a passionate supporter of their vision of affordable, useful robots that can help people out with their day-to-day lives. I’ve previously worked at FAIR, part of…
magnet:?xt=urn:btih:9238b09245d0d8cd915be09927769d5f7584c1c9&dn=mixtral-8x22b&tr=udp%3A%2F%2Fopen.demonii.com%3A1337%2Fannounce&tr=http%3A%2F%https://t.co/OdtBUsbeV5%3A1337%2Fannounce
Very happy to partner with @awscloud to expose Mistral models on Amazon Bedrock, as we continue to bring our technology to every developer.
A colossal AI has arrived. Get large with @MistralAI. ☁️💥💻 Mistral Large is now on #AmazonBedrock. Make the most of your data with cutting-edge text generation, top-tier reasoning capabilities, & advanced language processing. #AWS #generativeAI 👉 go.aws/43GcD8V
So big news: today was my last day at Meta. It's bittersweet, since meta has some of the smartest and hardest working people in the world, and I learned a lot there. But I'm excited to work on something new that I'm passionate about - what won't be a surprise if you follow me.
It is sad that in order to be able to pursue deep hard tech over a long time horizon, (mostly) you need to have had a quicker financial success of some magnitude in your career once before, mostly with something that’s not really hard tech. Musk, Bezos, etc. It makes you assume…
Update: I left Meta yesterday. Working on @OpenCatalyst at FAIR has been one of my most cherished career journeys. The team's really strong, with ambitious step-change research in the pipeline. I'm rooting for them! As for what's next, excited to build something new! Stay tuned
Mistral AI is what OpenAI would be if it were actually open. And they just threw the largest OSS LLM hackathon to date. Over 2000 hackers applied to compete for $10k in prizes. Here’s what we saw at the @MistralAI x @cerebral_valley hackathon (🧵):
Folks still accelerating here around midnight on a Saturday @SHACK15sf for the @MistralAI hackathon. Love to see it. 👨💻👩💻🚀🤘
@GuillaumeLample presenting technical details of Mixtral 8x7B at @NVIDIAGTC ❤️