For agents to improve over time, they can’t afford to forget what they’ve already mastered.
We found that supervised fine-tuning causes more forgetting than RL when training on a new task!
Want to find out why? 👇
Had a great time presenting OPRM at ASAP!
We talked about recurrent memory overflows, Long Context vs. RAG, and possible scaling paradigms for recurrent LLMs.
Check it out👇
Recording: youtu.be/O1_qqNAK7XE
Slides: asap-seminar.github.io/assets/slides/…
Today we're launching Subconscious: a new platform for building agents with long-horizon reasoning and tool use, backed by MIT research.
One API call. Tool use. Context beyond existing limits.
If you're building agents, let's talk.
📄🚨 New!
Tired of waiting minutes for LLMs to "think"?
Test-time scaling (o3, DeepSeek-R1) lets LLMs reason before answering, but users are left clueless, with no visibility into progress and no control.
Not anymore!
We expose the LLM’s internal 🕰️, and show how to monitor 📊 & overclock it⚡
🧵👇
We know quadratic-time Attention and its linear-time variants, such as linear attention and State Space Models. But what lies in between?
Introducing Log-Linear Attention with:
- Log-linear time training
- Logarithmic inference cost (in both time and memory)
- Hardware-efficient Triton kernels
Thanks @MIT_CSAIL for featuring our work!🖊️🎨
Huge thanks to the CSAIL news team for the fun article + video!!
We'll be presenting SketchAgent at #CVPR2025 next week — come say hi if you're curious how LLMs can be used to collaboratively sketch!🖌️
👉 bit.ly/43mTme1
Sometimes the best way to express an idea is by sketching it out.
A system from MIT CSAIL & Stanford captures this iterative process by teaching LLMs to create sequential sketches. It could work w/ users to visually communicate concepts: bit.ly/4kfXFhk
Overflow Prevention Enhances Long-Context Recurrent LLMs
OPRM chunk-based inference (sketched in code below):
- Split the context into chunks
- Process chunks in parallel (speculative prefill)
- Select the best one (e.g., lowest entropy)
- Decode only from that chunk
Advantages:
- No training required…
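To make the flow concrete, here is a minimal sketch of this chunked pipeline in Python, assuming a Hugging Face-style causal LM. The model name, prompt format, chunk size, and entropy estimate are illustrative placeholders, not the OPRM implementation.

```python
# Sketch of chunk-based inference: prefill each context chunk independently,
# score each chunk by the entropy of its next-token distribution, and decode
# only from the most confident chunk. All names here are illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "EleutherAI/pythia-160m"  # placeholder model for illustration
tok = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForCausalLM.from_pretrained(MODEL_NAME).eval()

def chunked_answer(context: str, query: str,
                   chunk_tokens: int = 512, max_new_tokens: int = 64) -> str:
    """Answer `query` by selecting the lowest-entropy chunk and decoding from it."""
    ids = tok(context, return_tensors="pt").input_ids[0]
    chunks = [ids[i:i + chunk_tokens] for i in range(0, len(ids), chunk_tokens)]

    best_prompt, best_entropy = None, float("inf")
    for chunk in chunks:  # independent prefills; in practice these can run as one batch
        prompt = tok.decode(chunk) + "\n\nQuestion: " + query + "\nAnswer:"
        inputs = tok(prompt, return_tensors="pt")
        with torch.no_grad():
            logits = model(**inputs).logits[0, -1]  # next-token logits for this chunk
        probs = torch.softmax(logits, dim=-1)
        entropy = -(probs * torch.log(probs + 1e-12)).sum().item()  # uncertainty proxy
        if entropy < best_entropy:  # keep the most confident chunk
            best_entropy, best_prompt = entropy, prompt

    out = model.generate(**tok(best_prompt, return_tensors="pt"),
                         max_new_tokens=max_new_tokens)
    return tok.decode(out[0], skip_special_tokens=True)
```

Because each chunk is prefilled independently, the per-chunk forward passes can be batched, which is where the speculative-prefill parallelism in the steps above comes from.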
605 Followers · 293 Following · PhD student at @CseHuji | Passionate about model weights as a new data modality, and yoga - not necessarily in that order 😉 | Ex Intern Google Research.
15K Followers · 7K Following · I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
1K Followers · 864 Following · Professor at Saarland University @LstSaar @SIC_Saar.
Previously PhD at Stanford @stanfordnlp.
Machine learning, language, and cognitive science.
822 Followers · 2K Following · NLP/Code Generation PhD at FAIR (Meta AI) and INRIA - previously researcher at Stanford University - MS Stanford 22’ - Centrale Paris P2020
4K Followers · 2K Following · Head of Volumetric 3D Video at Meta
Prev Projects: Hyperscape, MapAnything, Dynamic 3D Gaussian Splatting, SplaTAM, HOTA +more
Prev PhD at RWTH + CMU + Oxford
55K Followers · 0 Following · We are building a world class AI R&D company in Tokyo. We want to develop AI solutions for Japan’s needs, and democratize AI in Japan. https://t.co/1q07mb3TzE
1K Followers · 8K Following · AI inference, speculative decoding, open source. Built novel decoding algorithms – default in Hugging Face Transformers (150+ ⭐). Making AI faster + cheaper
56K Followers · 924 Following · Neuroscientist: consciousness, perception, & dreamachines. TED speaker, & author: Being You - A New Science of Consciousness.
10K Followers · 108 Following · AI21 Labs builds Foundation Models and AI Systems for the enterprise that accelerate the use of GenAI in production.
Meet Maestro
https://t.co/IJyxlWYJoV
920 Followers · 386 Following · Research Scientist @NVIDIA. Making LLMs, e.g., Hymba, Nemotron series. Ex @Harvard @Meta @Tencent | Views and opinions are my own
1K Followers · 296 Following · Language in Context: NLP, Networks, Social Dynamics and Politics. Opinions are mine. 👉BB needs to go. now👈
Personal (parenting and other failures): @orenTsur
4K Followers · 1K Following · Research Intern @Google, Ph.D. Student @Cornell_CS, interested in machine (continual) learning and understanding what is called intelligence.
50K Followers · 9K Following · I lead @Cohere_Labs. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, ML reliability. Changing spaces where breakthroughs happen.
1.4M Followers · 570 Following · The Massachusetts Institute of Technology is a world leader in research and education. Related accounts: @MITevents @MITstudents @MIT_alumni
7K Followers · 3 Following · Tweeting interesting papers submitted at https://t.co/rXX8x0HzXV.
Submit your own at https://t.co/QhbJKXBd4Q, and link models/datasets/demos to it!
52K Followers · 64 Following · Student of mind and nature, libertarian, chess player, cancer survivor. @ Keen, UAlberta, Amii, https://t.co/u8za2Kod54, The Royal Society, Turing Award
1K Followers · 846 Following · Researcher @IBMResearch. Postdoc @berkeley_ai. PhD @TelAvivUni. Working on Compositionality, Multimodal Foundation Models, and Structured Physical Intelligence.
26K Followers · 881 Following · Research Scientist Director in Meta FAIR. Reasoning, Optimization and Understanding LLM. Novelist in spare time. PhD in @CMU_Robotics.