It was incredible seeing @lateinteraction in action at @MIT_CSAIL with @ecardenas300!! 🔥🏛️🔥 Omar gave an amazing talk spanning everything from ColBERT to multi-hop Baleen RAG and DSPy! 🧠

The slide image below shares the part of the talk that has resonated with us ever since: when to use which LLM optimization strategy based on model size:

• 100B+: Instruction Tuning (Command R+, GPT-4 / Claude Opus / Gemini Ultra)
• 7-13B: Few-Shot Examples (Llama2, Mistral 7B)
• <1B: Gradient Descent (T5-Large)

There is certainly some overlap here, and you can gradient-descent tune 7B-sized LLMs fairly easily, but I think this is a really nice nugget for thinking about the DSPy compilers at a high level, and for picking which one to reach for first when you start optimizing your programs based on the LLM you will be using!

Aside from the technical discussion 😂, it was really amazing seeing Omar at MIT! He has put together an unbelievable dissertation, and the future is bright for DSPy, ColBERT, and the emerging community! Go Omar! 🚀
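To make the 7-13B bucket concrete, here's a rough sketch of what "optimize with few-shot examples" can look like as a DSPy compile step. The Mistral 7B endpoint, the toy trainset, and the exact_match metric are placeholders assumed for illustration (not from Omar's slide); the pattern of BootstrapFewShot(...).compile(program, trainset=...) is the takeaway.

```python
# Hedged sketch (not from the talk): compiling a DSPy program with bootstrapped
# few-shot examples for a ~7B model, per the 7-13B bucket in the heuristic above.
import dspy
from dspy.teleprompt import BootstrapFewShot

# Assumption: an OpenAI-compatible server (e.g., vLLM) is hosting Mistral 7B
# locally; swap in whichever LM client you actually use.
lm = dspy.OpenAI(
    model="mistralai/Mistral-7B-Instruct-v0.2",
    api_base="http://localhost:8000/v1/",
    api_key="EMPTY",
)
dspy.settings.configure(lm=lm)

# A minimal program: a single chain-of-thought QA module.
qa = dspy.ChainOfThought("question -> answer")

# Toy trainset (placeholder examples, not real data).
trainset = [
    dspy.Example(question="What late-interaction retriever did Omar build?",
                 answer="ColBERT").with_inputs("question"),
    dspy.Example(question="What framework compiles LM programs?",
                 answer="DSPy").with_inputs("question"),
]

# Simple metric: did the gold answer show up in the prediction?
def exact_match(example, pred, trace=None):
    return example.answer.lower() in pred.answer.lower()

# The few-shot regime: bootstrap demonstrations from the trainset and bake
# the ones that pass the metric into the compiled program's prompts.
fewshot = BootstrapFewShot(metric=exact_match, max_bootstrapped_demos=4)
compiled_qa = fewshot.compile(qa, trainset=trainset)

print(compiled_qa(question="What late-interaction retriever did Omar build?").answer)

# For the other buckets: 100B+ models often work with instructions alone
# (zero-shot dspy.Predict / dspy.ChainOfThought), while sub-1B models are small
# enough to fine-tune with gradient descent (see dspy.teleprompt.BootstrapFinetune).
```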
@CShorten30 @lateinteraction @MIT_CSAIL It was amazing to witness this live. 🤌 Congrats on all your research, Omar!
@CShorten30 @lateinteraction @MIT_CSAIL @ecardenas300 he cooked something, he deserved it, congrats Omar!
@CShorten30 @lateinteraction @MIT_CSAIL @ecardenas300 Happy to see the DSPy gang together! :)
@CShorten30 @lateinteraction @MIT_CSAIL @ecardenas300 a high-alpha heuristic by @lateinteraction 🧐

• 100B+: do Instruction Tuning (Command R+, GPT-4 / Claude Opus / Gemini Ultra)
• 7-13B: do Few-Shot Examples (Llama2, Mistral 7B)
• <1B: do Gradient Descent (T5-Large)