AI21 Labs @AI21Labs
AI21 Labs builds Foundation Models and AI Systems for the enterprise that accelerate the use of GenAI in production. 🥂Meet Jamba https://t.co/xUBjKZHKVH ai21.com Joined August 2019-
Tweets251
-
Followers6K
-
Following89
-
Likes279
.@origoshen, Co-Founder + Co-CEO of @AI21Labs, talks about growth opportunities for the company following a $208 million Series C funding round, and shares his perspective on the future of AI on #NYSEFloorTalk with @JudyKShaw
Building a RAG solution is easy. Building a great one is not. In our guest blog on @streamlit, our team explores the intricacies of how AI21's Contextual Answers Task-Specific Model & our RAG Engine generate context-based answers grounded in your proprietary organizational data.…
Paper of Jamba: A Hybrid Transformer-Mamba Language Model Jamba, a novel architecture which combines Attention and Mamba layers, with MoE modules, and an open implementation of it, reaching state-of-the-art performance and supporting long contexts. We showed how Jamba provides…
Paper of Jamba: A Hybrid Transformer-Mamba Language Model Jamba, a novel architecture which combines Attention and Mamba layers, with MoE modules, and an open implementation of it, reaching state-of-the-art performance and supporting long contexts. We showed how Jamba provides… https://t.co/7Q0DgpVSSA
Jamba A Hybrid Transformer-Mamba Language Model present Jamba, a new base large language model based on a novel hybrid Transformer-Mamba mixture-of-experts (MoE) architecture. Specifically, Jamba interleaves blocks of Transformer and Mamba layers, enjoying the benefits of
Last Week @AI21Labs released the production-scale Mamba implementation, and today, they released their paper. 🧐 Jamba introduces a new hybrid Transformer-Mamba mixture-of-experts architecture offering state-of-the-art performance but with significant improvements on long…
Yesterday AI21 Labs released Jamba, the first production-scale Mamba implementation as a hybrid SSM-Transformer MoE 🐍 And today, you can already finetune it with Hugging Face TRL. @Dorialexander shared a working Qlora script using 4-bit quantization on an A100 GPU (for now,…
🥹 Jamba is truly amazing! Everyone speaks about Long Context. But it's been mostly useful for ingesting in-context learning. But Jamba seems to be the first Model offering a great throughput even for higher context!
I played a little with Jamba: it looks like an amazing model. In terms of architecture, the MoE implementation is very close to Mixtral's. What's great about it is that it hasn't been fine-tuned. Curious to see how much improvement we can get through SFT. I made a little…
I played a little with Jamba: it looks like an amazing model. In terms of architecture, the MoE implementation is very close to Mixtral's. What's great about it is that it hasn't been fine-tuned. Curious to see how much improvement we can get through SFT. I made a little… https://t.co/g01vjsZxvx
Extremely cool new model release from @AI21Labs - Jamba - and it's not even a transformer! It's a hybrid model that combines Mamba (structured state space model), transformer layers and MoE technique, and it's a first production-grade Mamba based model! * It's a 52B MoE with 12B…
incredibly impressed by @AI21Labs' Jamba today. This is the first legitimate Mixtral-killer we've seen and it came out of "nowhere": buttondown.email/ainews/archive… They've helped me redefine my idea of a model "weight class" from "number of parameters" (increasingly outdated with MoEs…
Find our new model on @huggingface.
Find our new model on @huggingface.
AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistUri Eliabayev @urieli17
13K Followers 1K Following AI Consultant and lecturer, Founder of the "Machine & Deep learning Israel" community (37K members), Contributor at @haaretzLior⚡ @AlphaSignalAI
84K Followers 897 Following Covering the latest in AI R&D • ML Engineer • Ex-Mila researcher • MIT Lecturer • Building AlphaSignal, a technical newsletter read by 180,000+ ML experts.Sharif Shameem @sharifshameem
53K Followers 3K Following founder @LexicaArt • in pursuit of good explanationsnear @nearcyan
45K Followers 882 Following https://t.co/IdaJwZJCXm partner @ https://t.co/9g1MIgjiqc dms openJack Clark @jackclarkSF
68K Followers 5K Following @AnthropicAI, ONEAI OECD, co-chair @indexingai, writer @ https://t.co/3vmtHYkaTu Past: @openai, @business @theregister. Neural nets, distributed systems, weird futuresDan Padnos @DanPadnos
228 Followers 435 Following Wrangling LLMs @AI21Labs. Only my good tweets are AI-generated.Miles Brundage @Miles_Brundage
43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.Jerry Liu @jerryjliu0
45K Followers 1K Following co-founder/CEO @llama_index Careers: https://t.co/EUnMNmbCtx Enterprise: https://t.co/Ht5jwxSrQBArbel Zinger @Arbel2025
6K Followers 938 Following Tech. Food. Parenting. Product Management. Co-Founder of #צרות_בהייטק.Eran Yacobovitch @eranyac
3K Followers 725 Following Marketing Advisor | Ex-CMO @AI21labs | Ex-VP Marketing at @Lightricks | Co-founder of @hitechproblemsOri Ram @ori__ram
765 Followers 386 Following Research Scientist @GoogleAI, working on #NLProc. Previously: PhD from @TelAvivUni, Research Scientist @AI21LabsYuval the 🥑 @YuvalinTheDeep
275 Followers 200 Following I simplify complicated things, and complicate simple things. My brother went to law school, but somehow I'm the only Advocate in the family (at @AI21Labs)Ramsri Goutham Golla @ramsri_goutham
11K Followers 3K Following Shares learnings from bootstrapping 2 AI SaaS Apps to $100k ARR with no employees: https://t.co/fU8yoiYVDc https://t.co/DTyILliHVm My NLP courses: https://t.co/MYUyOxGSkAYazeed Sabri @seeds_rt
399 Followers 425 Following Co-founder and CTO @artistreeio | @Techstars Austin '23 | Hater of @apple's Safari |Lisa N. Berg Montgome.. @lisan_berg
2 Followers 29 FollowingPiero Valencia @PieroValen75728
4 Followers 88 Followingsebastianolovadina @Sebasti26467860
2 Followers 22 FollowingShannon Ramsey @_shannonramsey_
65 Followers 168 Following Connector. Disruptor. AI strategist. Vice President of Strategic Accounts at #AIfirst companies Aurea Software and Jigsaw Interactive.Santhosh kumar theyya.. @tkumarsanthosh
332 Followers 4K Following Full time IT professional since 2001.Eric Sharp @ericgsharp
587 Followers 651 Following Learn In | BookClub | Degreed | Tech Founder | Startup Advisor | Entrepreneur | Builder of Great Teams | Culture Enthusiast | Lifelong Learnerxiaoboliang @xiaobolian66449
2 Followers 97 FollowingHalil Zabun @halilzabun_
58 Followers 52 Following ML engineer 👨💻| YouTuber 🎥| Interested in machine learning, neuroscience, linguistics, psychology, video games and their intersections.Harrison McQ @hmMcQ
2 Followers 36 FollowingAlok Shah @alokshah1504
11 Followers 25 Followingsiva kumar @sivakum38269090
2 Followers 83 FollowingHannah @Hannah1143097
2 Followers 60 FollowingSaasy LLAMA @saasy_llama
11 Followers 114 Following Quit my corporate restructuring job to ride this new AI waveJacob Somer @jacob_somer_
585 Followers 3K Following AI Enthusiast & Software Engineer 💻 Building intelligent systems that make a difference.Taiping Zeng @zengtaiping
90 Followers 774 Following Spatial Navigation, Computational Neuroscience, Robotics, Artificial Intelligence, University of TokyoZar 🏄🏽♂️ @zarvintus
1 Followers 215 Following In the realm of Zarvintus, where horizons bleed into infinity, wanderlust is not merely a desire but the very essence of existence.Ajinkya @Ajinkya_Tweets
302 Followers 3K Following software engineer | stealth mode | reader | learner Strong opinions, Strongly heldelvis enriquez diaz @elvisdiazc1
92 Followers 596 Following CEO at E&M Learning, Divulgador Científico, Docente,Consultor Proyectos Ti, Subgerente SistemasTom Hannigan @thannigan_
314 Followers 2K Following Iced Americano, just black, no room for milk or sugar | @AgencyBateman | UMass '18 @UMassJournalismPedro Nascimento @pedromnasc
363 Followers 2K Following 🇧🇷/🇬🇧 Founder @findlyai (YC S22) Ex - @Google / @TwitterYuquan Chen @YuquanChen_USTC
5 Followers 431 Following PhD student @ustc. Research interest: quantum computing & ML.Roger Ng, MD @rkchee
193 Followers 822 Following Cardiologist | Board Certified in Echocardiography, Cardiovascular Diseases, Internal Medicine, Nuclear Cardiology, Cardiac CT | AI/ML DeveloperEytan Assaf @eytan_assaf
2 Followers 42 FollowingNguyễn Trúc @LucieNguyen102
1 Followers 24 FollowingMax @Spartacus0523
482 Followers 4K Following An idiot focused on the future of finance. Retired/Disabled.Conor O'Hare @conorohare91
1K Followers 4K Following Distiller at Cooley Distillery WSET Spirits qualified All opinions are my ownscriptable @scriptable
212 Followers 1K Following A handbook for software engineers - by Mitch Allen - For hardware see https://t.co/jUimknNHzD - typos are my own - free the edit button!Güçlü Akpınar @gucluakpinar
984 Followers 4K FollowingOctavIan @octavicristea
27 Followers 492 FollowingRayan H. Assaad @RayanAssaad
146 Followers 1K Following Assistant Professor of Civil Engineering; New Jersey Institute of Technology (NJIT)Jonathan Sadeghi @JonathanSadeghi
239 Followers 1K Following Senior Research Engineer at @_FiveAI/@BoschGlobal, previously PhD at @LivUni. statistical machine learning, uncertainty, computer vision...Muhammad Abdullah @Abdullah_kwl
42 Followers 501 Following Life is better when you're laughing...... "your time is limited,So don't waste it living someone else's life❤Billy D @BillyDtwatter
224 Followers 701 Following All things Cleveland sports and The Ohio State Buckeyes! By the way, your government isn’t on your side, right or left. Meet in the middle, strength in numbers!RiboRibal @RiboRibal
7 Followers 68 FollowingSaliëns @Contact_Saliens
138 Followers 1K Following Sic deinde, quicumque alius transiliet moenia mea.Mohammed AIT OUFKIR @moufkir
17 Followers 87 FollowingJBGrowing 🇨🇦�.. @JessieTweeting
3K Followers 4K Following Dad, husband, brother and son. Army Veteran. Advocate for societal change & the environment. Comments my own.. He/Himsblomley @sblomley
1 Followers 4K FollowingAkhil Sathuluri @akhil_sathuluri
32 Followers 2K FollowingShreya Kapoor @SKapoor_18
333 Followers 1K Following PhD @CogCoVi |Formerly Data Scientist @MPI-CBS| https://t.co/HWJLt7Jhwk. Life Science Informatics @UniBonnVineeth Veetil @vin_veetil
50 Followers 142 FollowingFarad Alam @farad_alam
36 Followers 69 Following Django | LangChain Developer https://t.co/Ivgox8gjWZclem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI buildersDan Padnos @DanPadnos
228 Followers 435 Following Wrangling LLMs @AI21Labs. Only my good tweets are AI-generated.Yuval the 🥑 @YuvalinTheDeep
275 Followers 200 Following I simplify complicated things, and complicate simple things. My brother went to law school, but somehow I'm the only Advocate in the family (at @AI21Labs)Andrej Karpathy @karpathy
979K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥Wing Lian (caseus) @winglian
9K Followers 2K Following @axolotl_ai OSS maintainer. Axolotl AI founder. AI/ML tinkerer. Building tools for everyone.Tri Dao @tri_dao
19K Followers 365 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Artificial Analysis @ArtificialAnlys
6K Followers 359 Following Independent analysis of AI models and hosting providers - choose the best model and API provider for your use-caseAlvin-GenAI @AlvinWeb3
1K Followers 1K Following building multi-modal AI app, enterprise AI consultant | TMT veteran 20+ years | ex-Alibaba Group VP | Aspen Institute Global Leadership NetworkAndrew Bolis @AndrewBolis
24K Followers 153 Following AI & Marketing Consultant 🚀 $190M in Attributed Revenue 📢 Former CMO 📈 I help companies leverage AI tools, optimize their marketing & grow their revenue.Shai Oster @beijingscribe
11K Followers 2K Following Asia Bureau Chief for The Information. [email protected]. Ex-WSJ, ex-Bloomberg.Philipp Schmid @_philschmid
16K Followers 651 Following Tech Lead and LLMs at @huggingface 👨🏻💻 🤗 AWS ML Hero 🦸🏻 | Cloud & ML enthusiast | 📍Nuremberg | 🇩🇪 https://t.co/l1ppq3q3hkTeknium (e/λ) @Teknium1
29K Followers 3K Following Cofounder @NousResearch, prev @StabilityAI Github: https://t.co/LZwHTUFwPq HuggingFace: https://t.co/sN2FFU8PVE Support me on Github SponsorsMurray Shanahan @mpshanahan
16K Followers 314 Following Professor at Imperial College London and Principal Scientist at Google DeepMind. Tweeting in a personal capacity. To send me a message please use emailDogan Ural @doganuraldesign
20K Followers 387 Following ✦ Designer ✦ AI Explorer ✦ Prompt Curator ✦ Daily Crafting & Sharing ✦ Generative AI, Design, UI/UXGreg Kamradt @GregKamradt
24K Followers 720 Following Building AI + B2B products 🖥️ Content: https://t.co/kLERwNtzqi Feedback is great: https://t.co/A6mrmjCem5 Prev. @digits @salesforcelmsys.org @lmsysorg
37K Followers 172 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtmVin Vashishta @v_vashishta
29K Followers 1K Following Author, Best Seller “From Data To Profit.” I wrote the playbook for enterprise data & AI monetization. AI advisor, speaker, & course instructor.Alvaro Cintas @dr_cintas
47K Followers 128 Following Educating about AI, Cybersecurity and Technology | Professor & Computer Engineer, PhD | ✍️ @therundownaiMin Choi @minchoi
66K Followers 762 Following AI Educator. 𝕏 about AI, solutions and interesting things. Showing how to leverage AI in practical ways for you and your business.Chase Lean @chaseleantj
61K Followers 337 Following AI educator. I share practical ways to use AI tools every day.Yohei @yoheinakajima
71K Followers 8K Following VC by day, builder by night: @untappedvc, @babyagi_, @pixelbeastsnft. Build-in-public log: https://t.co/UdHHGbZba5Anthony Noto @anthonynoto
77K Followers 17K Following Proud Father of Fab 5, Bear & Bodie; Husband to Klo; CEO of SoFi; previously Twitter COO, NFL CFO; West Point Class of ‘91 DSBD‼️ Wharton MBAGary Marcus @GaryMarcus
145K Followers 7K Following “A beacon of clarity”. Spoke at US Senate AI Oversight committee. Founder/CEO Geometric Intelligence (acq. by Uber). Rebooting AI & Taming Silicon Valley.Ethan Mollick @emollick
211K Followers 552 Following Professor @Wharton studying AI, innovation & startups. Democratizing education using tech Book: https://t.co/CSmipbJ2jV Substack: https://t.co/UIBhxu4bgqMark Suster @msuster
332K Followers 2K Following 2x entrepreneur. Sold both companies (last to @salesforce). Now @UpfrontVC looking to invest in passionate entrepreneurs.Kirk Borne @KirkDBorne
447K Followers 6K Following Advisor to startups. Freelancer. Global Speaker. Founder @LeadershipData. Top influencer in #BigData #DataScience #AI #IoT #ML #B2B. PhD Astrophysics @CaltechRowan Cheung @rowancheung
497K Followers 377 Following Founder @therundownai. Sharing the latest developments in the world of artificial intelligence.Robert Scoble @Scobleizer
504K Followers 68K Following Follow me on my new podcast with AI startups, Unaligned. Tech industry color commentator since 1993. Author/Blogger. Former strategist @Microsoft.Marc Andreessen 🇺�.. @pmarca
1.4M Followers 24K Following Techno-optimist. E/acc. Technology brother. Move Fast and Make Things. p(Doom) = 0; p(“1984”) = not 0.Paul Graham @paulg
1.9M Followers 772 FollowingONE ZERO BANK @ONEZEROBANK
2K Followers 54 Following ONE ZERO- בנקאות פרטית שעובדת בשבילך. לפתיחת חשבון - https://t.co/TEZxQdYKeMAWS Partner Network @AWS_Partners
82K Followers 2K Following The AWS Partner Network (APN) is the global partner program for @awscloud. 🤝 Check out the APN Blog: 📖 https://t.co/phfBUdXfTVSwissCognitive, AI Ve.. @SwissCognitive
150K Followers 105K Following We are committed to unleashing the power of AI in the business world. With our AI research, advisory, and ventures, we bring a blend of expertise to the Table.Amazon Web Services @awscloud
2.2M Followers 465 Following The official account for Amazon Web Services (#AWS). ☁️ For help, please contact: @AWSSupportFirdaus Adib @epireve
734 Followers 2K Following AI Product Enthusiast. Data Science. Biohack. Into blogging.Yoav Levine @YoavLevine
306 Followers 83 FollowingAI Dungeon @aidungeon
10K Followers 67 Following Explore infinite adventures. Developed by @LatitudeGamesAI. Chat with devs and other players on our Discord! https://t.co/rtS8siV7cXEric Schmidt @ericschmidt
2.2M Followers 224 Following Former Executive Chairman & CEO and tweets from Schmidt FoundationDan Roth @DanRothNLP
2K Followers 54 Following VP/Distinguished Scientist, AWS AI Labs and the Eduardo D. Glandt Distinguished Professor, CIS, University of Pennsylvaniajack @jack
6.5M Followers 3 Following npub1sg6plzptd64u62a878hep2kev88swjh3tw00gjsfl8f237lmu63q0uf63mBill Gates @BillGates
64.6M Followers 586 Following Sharing things I'm learning through my foundation work and other interests.Fei-Fei Li @drfeifei
456K Followers 1K Following Prof (CS @Stanford), Co-Director @StanfordHAI, CoFounder/Chair @ai4allorg, Researcher #AI #computervision #ML AI+healthcareJeff Bezos @JeffBezos
6.4M Followers 355 Following Amazon. Blue Origin. Washington Post. Bezos Earth Fund. Bezos Academy.@_philschmid @AI21Labs Thanks for the fixed fix! the chart doesn't do justice to Mamba-based models though. For the same # of Active Params the cost is lower in Mamba-based models vis a vis Transformer-based.
The FMOps infrastructure stack visualized. 13 foundation model providers: - @GoogleAI - @cohere - @AnthropicAI - All open-source on @huggingface - @AI21Labs - @xai - @OpenAI - @inflectionAI - @01AI_Yi - @AIatMeta - @MistralAI - @Baidu_Inc - @StabilityAI turingpost.com/p/recap2fmops
With #RAG being all the rage right now, there are so many different solutions out there -- how do you choose your stack? Check out this blog post from @AI21Labs about their RAG Engine and Contextual Answers #TSM to see what their particular solution offers 👀
Building a RAG solution is easy. Building a great one is not. In our guest blog on @streamlit, our team explores the intricacies of how AI21's Contextual Answers Task-Specific Model & our RAG Engine generate context-based answers grounded in your proprietary organizational data.…
Releasing Jambert, my first official fine-tune of Jamba by @AI21Labs Still experimental, but on a specialized task where Mamba has the potential to shine: RAG synthesis of document (not so long for now, but this 256k context length window has potential…). huggingface.co/Pclanglais/Jam…
🐍 Jambatypus-v0.1 I fine-tuned a Jamba model on the Platypus dataset using @axolotl_ai. I'm also releasing LazyAxolotl - Jamba, a notebook to train Jamba, merge the adapter, and upload it to @huggingface. 🤗 Model: huggingface.co/mlabonne/Jamba… 💻 Colab: colab.research.google.com/drive/1alsgwZF…
Jamba from @AI21Labs scales to 256K context-window and is released as open-weights model. Will the hybrid Attention+Structured State Space models usher in a new era for open source LLMs? codingwithintelligence.com/p/jamba-are-hy… I explore this Q and more in this week's CoWI! Benchmarks 👇
I also played with Alpaca (2k test version) and created a Jambalpaca model: huggingface.co/mlabonne/Jamba… It doesn't follow instructions as well but it's quite good in general. This model/architecture has a lot of potential. cc @AI21Labs @altryne @Dorialexander
AI21 and Databricks show open source can radically slim down AI Two new large language models, Jamba and DBRX, dramatically reduce the compute and memory needed for predictions, while meeting or beating the performance of top models such as GPT-3.5 and Llama 2.…
Jamba has been introduced by @AI21Labs, a new approach in #genAI. Combining the Mamba model with transformers, Jamba aims to optimize gen AI. Learn more about this hybrid model in @TechCrunch: bit.ly/3TXSOXx
🏀 @AI21Labs' Jamba, is a 🆕 open LLM with a hybrid architecture combining Mamba & Transformer elements, offers significant improvements in memory footprint, throughput, and efficient handling of long contexts compared to pure Transformer or SSM models gradientflow.com/ai21labs-jamba/
AI21 Labs Breaks New Ground with ‘Jamba’: The Pioneering Hybrid SSM-Transformer Large Language Model Quick read: marktechpost.com/2024/03/28/ai2… Available on Hugging Face: huggingface.co/ai21labs/Jamba… #ArtificialIntelligence @AI21Labs
Minimal Fine-tuning example on a Colab (Pro) Notebook (A100 - 40GB) for @AI21Labs #Jamba colab.research.google.com/drive/1OMmaLGK…
🚀💥The power of open AI innovation!👀 Jamba SSM-Transformer MoE @AI21Labs ai21.com/jamba Qwen1.5-MoE-A2.7B @AlibabaGroup qwenlm.github.io/blog/qwen-moe/ DBRX 132B MoE @databricks databricks.com/blog/introduci… Samba-CoE v0.3 @SambaNovaAI new SOTA on AlpacaEval sambanova.ai/blog/accurate-…
For Jamba from @AI21Labs - It seems every design decision was taken to maximize the performance gained from a single A100
@AI21Labs @_philschmid Thanks to you mostly. This is a massive service for open science. (What gets me excited is that we submitted a similar concept of hybrid Mamba on 256k context in January but with the speed of EU GPU grant don’t even have the compute yet :D)
Jamba model launch takeaways - Potentially a new leader for ultra-long prompt use-cases (RAG) ‣ First open-source model of this size to combine MAMBA state-space model architecture, Mixture-Of-Experts (MOE) and the transformer ‣ 256k context window, more than 2X the size of…
What a week, huh? Captain, it's only Thursday - @databricks releases DBRX, a 132B MoE - @AI21Labs releases Jamba, a 52B Transformer/Mamba MoE - @AlibabaGroup releases Qwen-1.5-MoE, a 2.7B MoE All on hf.co! 🤗 Learn about MoE here: huggingface.co/blog/moe
Jamba, the world’s first production-grade Mamba based model is released by @AI21Labs 📌 New architecture with Joint Attention and Mamba 📌 Supports 256K context length 📌The only model in its size class that fits up to 140K context on a single GPU 📌3X throughput on long…
@natolambert @AI21Labs @Alibaba_Qwen What a great week for the open source world.
whoah ok, new architecture models are dropping on #thursdAI x.com/i/spaces/1owxw… Jamba is a combined Mamba + Joint Attention 256K context support SSM-Transformer MoE (that's a mouthful) from @AI21Labs 🔥 See Phillip's announcement for all details and come chat with us…
Jamba released! @AI21Labs just released the first production-scale Mamba implementation! Jamba is a hybrid SSM-Transformer MoE rivaling open transformer-based LLMs 🚀 TL;DR: 🧠 52B parameters with 12B active during generation 👨🏫 16 experts with 2 active in generation 🆕 New…