Dan Deutsch @_danieldeutsch

Research Scientist at Google Translate working on text generation evaluation danieldeutsch.github.io San Francisco Joined September 2012

Tweets

90
Followers

609
Following

90
Likes

122

Markus Freitag @markuseful

2 months ago

Our Google Translate team is bringing a strong presence to #ACL2025 in Vienna this week! 🇦🇹 My group is excited to present several of our latest papers. 👇 Don't miss them!

1 5 52 3K 3

Two new datasets from Google Translate targeting high and low resource languages! WMT24++: 46 new en->xx languages to WMT24, bringing the total to 55 SMOL: 6M tokens for 115 very low-resource languages WMT24++: huggingface.co/datasets/googl… SMOL: huggingface.co/datasets/googl…

2 26 87 16K 52

iseeaswell꩜bʂky @iseeaswell

7 months ago

😼SMOL DATA ALERT! 😼Anouncing SMOL, a professionally-translated dataset for 115 very low-resource languages! Paper: arxiv.org/pdf/2502.12301 Huggingface: huggingface.co/datasets/googl…

3 11 34 4K 11

Download Image

Yusuf Kocyigit @mykocyigit

8 months ago

Thrilled to share our latest findings on data contamination, from my internship at @Google! We trained almost 90 Models on 1B and 8B scales with various contamination types using machine translation as our task and analyze the impact of contamination. arxiv.org/abs/2501.18771

3 20 85 11K 33

Jurik Juraska @JurikJuraska

10 months ago

🚀 We have just released bfloat16 variants of all 3 MetricX-24 models, offering nearly identical performance to their float32 counterparts, but with a 50% smaller memory footprint. ✨ We hope this makes the XL and XXL models more accessible! 🔗 GitHub: github.com/google-researc…

Jurik Juraska @JurikJuraska

10 months ago

1 6 18 2K 7

0 2 2 338 0

Jurik Juraska @JurikJuraska

10 months ago

🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin? 🔗 Code: github.com/google-researc…

1 6 18 2K 7

Dan Deutsch @_danieldeutsch

10 months ago

Super simple and effective way of significantly increasing the performance of your evaluation metric!

Mara Finkelstein @marafinkels

10 months ago

Super simple and effective way of significantly increasing the performance of your evaluation metric!

4 11 50 12K 41

Download Image

0 0 8 881 2

Dan Deutsch @_danieldeutsch

10 months ago

The Google Translate Research Team is looking for interns this summer! Apply here if you will graduate from a PhD program in the 2025-2026 academic year, and send me an email to let me know that you applied google.com/about/careers/…