We just released Mixtral-8x22B-v0.1 and Mixtral-8x22B-Instruct-v0.1: - Free to use under Apache 2.0 license - Outperforms all open models - Native function calling - Masters English, French, Italian, German and Spanish. - Seq_len = 64K mistral.ai/news/mixtral-8…
Mixtral-8x22B outperforms all open models across reasoning, knowledge, code and math capabilities while offering the best performance/cost ratio. Mixtral-8x22B-v0.1 on HF: huggingface.co/mistralai/Mixt… Mixtral-8x22B-Instruct-v0.1 on HF: huggingface.co/mistralai/Mixt…
@dchaplot It does well on my NYT Connections benchmark x.com/lechmazur/stat…
@dchaplot Congratulations. Didn't expect to get the instruct version publicly so soon. It can run at 3.75 bpw on 72GB of VRAM.
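The 3.75 bpw / 72GB claim checks out with back-of-the-envelope arithmetic. A minimal sketch, assuming Mixtral-8x22B's published total of roughly 141B parameters (the bits-per-weight figure is the quantization level from the reply, not something Mistral ships):

```python
def quantized_weight_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GiB, at a given
    quantization level. Ignores KV cache and activation overhead."""
    return n_params * bits_per_weight / 8 / 2**30

# ~141e9 total params at 3.75 bits per weight -> roughly 62 GiB of weights,
# leaving headroom for context on a 72GB setup.
weights_gib = quantized_weight_gib(141e9, 3.75)
print(f"{weights_gib:.1f} GiB")
```

Note the remaining budget has to absorb the KV cache, which at the model's 64K sequence length is not negligible, so usable context will be shorter than the full window on 72GB.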
@dchaplot Very cool! Let me know if someone wants to come on-air to talk about it. I host two nationally syndicated radio shows about Data and the Information Economy, DM Radio and InsideAnalysis. We'd love to get you some free publicity for this amazing innovation! #LLM #AI #Innovation
@dchaplot Why is there no comparison to Command R+ on multilingual performance?
@dchaplot Excited with your updates and advances 👏 👨💻🧪
@dchaplot 🫡Got It. Many applications are on the way.