Nando de Freitas 🏳️‍🌈 @NandoDF

I research intelligence to understand what we are, and to harness it wisely. I lead a wonderfully creative AI team at @GoogleDeepMind who inspire me every day.
scholar.google.com/citations?user…
London, England
Joined April 2009

Tweets: 10K
Followers: 97K
Following: 656
Likes: 17K

Michael Black @Michael_J_Black

20 hours ago

The name has changed but it remains the world's first 3D human foundation model. ChatPose (formerly PoseGPT) has a new name at the request of the #CVPR2024 reviewers. Same great work from @meshcapade and @PerceivingSys. Final #CVPR2024 version on arXiv: arxiv.org/abs/2311.18836

4 47 212 29K 92


UCL DARK @UCL_DARK

2 days ago

We're excited to announce that the Genie Team from @GoogleDeepMind will be our next invited speakers!
Title: Genie: Generative Interactive Environments
Speakers: @ashrewards, @jparkerholder, @YugeTen
Sign up: eventbrite.co.uk/e/ucl-dark-spe…
📌 90 High Holborn
📅 Tue 30 Apr, 17:00

2 10 40 7K 7

Aran Komatsuzaki @arankomatsuzaki

2 days ago

SnapKV: LLM Knows What You are Looking for Before Generation
- Automatically compresses KV caches
- Consistent decoding speed with a 3.6x increase in generation speed and an 8.2x enhancement in memory efficiency
repo: github.com/FasterDecoding…
abs: arxiv.org/abs/2404.14469

6 55 305 34K 225


Nando de Freitas 🏳️‍🌈 @NandoDF

2 days ago

Can someone create a leaderboard with metrics that also measure the features Oriol highlights here: 1. multimodal performance: understanding and generating video, audio, touch, actions, proprioception. 2. long-context: long understanding and generation. I agree Gemini 1.5 Pro…

Oriol Vinyals @OriolVinyalsML

2 days ago

7 51 270 91K 45

2 6 31 14K 12

Aran Komatsuzaki @arankomatsuzaki

2 days ago

Microsoft presents Multi-Head Mixture-of-Experts
Achieves notable improvements over the baseline MoE by using multiple MoE heads
repo: github.com/yushuiwx/MH-MoE
abs: arxiv.org/abs/2404.15045

6 114 571 39K 382


Alex Dimakis @AlexGDimakis

3 days ago

Phi-3 just released by Microsoft. Three small size models (3.8B, 7B and 14B) trained on highly filtered and synthetic data. They report impressive performance since the 3.8B model (trained on 3T tokens) has MMLU of 69% matching Llama3 8B, and the 7B Phi-3 model has 75% MMLU,…

3 19 96 16K 36


Nando de Freitas 🏳️‍🌈 @NandoDF

3 days ago

Wonderful to see this! Thanks @edwardbeeching and team 🙏 @scott_e_reed @konradzolna @SashaVNovikov @maidotgimenez @gbarthmaron

Edward Beeching @edwardbeeching

3 days ago


4 37 173 27K 103


0 1 17 5K 3

Nando de Freitas 🏳️‍🌈 @NandoDF

4 days ago

Fully agree that multimodal LLMs are the solution to robotics. This is why my team pioneered Gato (General AgenT One): arxiv.org/pdf/2205.06175… which we built over two years, from 2020 to 2022. One of the most important parts of Gato was the data and its engineering pipeline.…

Bindu Reddy @bindureddy

5 days ago

109 265 1K 188K 466


4 10 100 32K 52

Justin Hart @justin_hart

2 weeks ago

Last July I used @runwayml to do some animations to a classic 80s ballad (you know it!) over some MidJourney images of anthropomorphic flowers. (see my pinned tweet) Well, enter @HaiperGenAI - I thought I'd give them a fair shake with MJ v6 images of the same ilk. It is...cool

13 14 48 26K 17


Nando de Freitas 🏳️‍🌈 @NandoDF

4 days ago

How to attain Multimodal World Models is a great open question in AI. The solutions will likely lead to more grounded models that interact better with people and make better physics predictions. Hopefully, they will enable scientific generalisation, but this too I feel is an open…

TuringPost @TheTuringPost

5 days ago

13 77 362 90K 335


3 12 45 17K 30

Thomas Wolf @Thom_Wolf

5 days ago

Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…

Guilherme Penedo @gui_penedo

5 days ago

37 326 1K 524K 723


23 300 2K 287K 964

lmsys.org @lmsysorg

5 days ago

Congrats @GoogleDeepMind on shipping Gemini 1.5 Pro to public review! Upon capacity & latency testing, we have now brought Gemini 1.5 Pro up to the Arena🤖 Big improvement from Pro 1.0 to 1.5 across the board, and exceptionally strong long context understanding. Come test and…

Google DeepMind @GoogleDeepMind

2 weeks ago

16 237 957 239K 202


7 49 376 254K 61

AI at Meta @AIatMeta

a week ago

In addition to Llama 3, today we’re also publishing a new paper: Imagine Flash: Accelerating Emu Diffusion Models with Backward Distillation ➡️ go.fb.me/g4r584 This work from GenAI researchers is enabling new image generation features in Meta AI on @WhatsApp & web.

11 196 1K 90K 225


AK @_akhaliq

a week ago

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
Despite the impressive capabilities of Large Language Models (LLMs) on various tasks, they still struggle with scenarios that involve complex reasoning and planning. Recent work proposed advanced

4 87 357 72K 256


Nando de Freitas 🏳️‍🌈 @NandoDF

6 days ago

One of the greatest minds of our times has died. This is a huge blow to the fields of philosophy, morality, consciousness and intelligence. I adored his teachings even though sometimes it took me years to get them. His ideas will live on. @danieldennett

4 21 168 15K 22


lmsys.org @lmsysorg

7 days ago


lmsys.org @lmsysorg

a week ago

Early 1K votes are in and Llama-3 is on FIRE!🔥The New king of OSS model? Vote now and make your voice heard! Leaderboard update coming very soon. https://t.co/L9h9QrCkjl

10 34 299 120K 34


17 84 534 168K 71


elvis @omarsar0

a week ago

Emerging AI Agent Architectures
Researchers from IBM and Microsoft present this concise summary of emerging AI agent architectures. It focuses the discussion on capabilities like reasoning, planning, and tool calling which are all needed to build complex AI-powered agentic…

8 176 591 59K 687


Roberto Nickson @rpnickson

a week ago

My full conversation with Mark Zuckerberg on the breaking Meta AI announcements, fighting in the UFC, metaverse, Ray-Ban Metas, and future technologies. But what was really touching was his thoughts on legacy and fatherhood. Timestamps: 00:00 Intro 00:37 Meta AI announcements…

105 235 1K 1.0M 1K


Jay Alammar @JayAlammar

a week ago

AI Agents will take the abilities of LLMs to a whole new level. Here's how to build a simple agent that can use software tools like searching the web or writing and running python code (LLMs love to write @matplotlib code for you). youtube.com/watch?v=5drn2D…

cohere @cohere

a week ago

0 15 72 49K 40

4 37 210 33K 178


Sebastian Borgeaud @borgeaud_s

a week ago

Great analysis, approach 3 is finally in agreement! The loss scale was too low in our paper, resulting in premature termination of L-BFGS, and leading to bad fits. After fixing this we can reproduce your findings! We're also open sourcing the data in the paper, stay tuned :)