Our Google Translate team is bringing a strong presence to #ACL2025 in Vienna this week! 🇦🇹 My group is excited to present several of our latest papers. 👇 Don't miss them!
Two new datasets from Google Translate targeting high and low resource languages!
WMT24++: 46 new en->xx languages to WMT24, bringing the total to 55
SMOL: 6M tokens for 115 very low-resource languages
WMT24++: huggingface.co/datasets/googl…
SMOL: huggingface.co/datasets/googl…
Thrilled to share our latest findings on data contamination, from my internship at @Google! We trained almost 90 Models on 1B and 8B scales with various contamination types using machine translation as our task and analyze the impact of contamination.
arxiv.org/abs/2501.18771
🚀 We have just released bfloat16 variants of all 3 MetricX-24 models, offering nearly identical performance to their float32 counterparts, but with a 50% smaller memory footprint. ✨ We hope this makes the XL and XXL models more accessible!
🔗 GitHub: github.com/google-researc…
🚀 We have just released bfloat16 variants of all 3 MetricX-24 models, offering nearly identical performance to their float32 counterparts, but with a 50% smaller memory footprint. ✨ We hope this makes the XL and XXL models more accessible!
🔗 GitHub: github.com/google-researc…
🌐 Meet MetricX-24, our SOTA machine translation evaluation metric and a successor to the successful MetricX-23. 🚀 Now open-source in PyTorch/Transformers! 🎉 Ready to take this top performer in the WMT24 Metrics Shared Task for a spin?
🔗 Code: github.com/google-researc…
The Google Translate Research Team is looking for interns this summer! Apply here if you will graduate from a PhD program in the 2025-2026 academic year, and send me an email to let me know that you applied
google.com/about/careers/…
303 Followers 2K Following🤖 AI PhD @uniofoxford
✏️ NLP x CSS x graphs
🤓 ¿¡ Talk nerdy to me !?
🔍 Find me almost everywhere on the internet @elleismatic
16K Followers 1K FollowingSenior Research Scientist - @google, Adjunct Faculty - @iitmadras, @iitbombay, Ex: @NICT_Publicity
Use of my tweets without permission ➡️ legal action
7K Followers 6K FollowingCenter for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw
@[email protected]
881 Followers 294 FollowingPostdoc @Mila_Quebec @McGill_NLP 🇨🇦 PhD in #NLProc from @Edin_CDT_NLP 🏴 interpretability x memorisation x (non-)compositionality. she/her
264 Followers 725 FollowingML engineer at Apple. PhD in Neural Machine Translation. Background in ML, data science and SW engineering. Chinese lang enthusiast. Life-long learner. 学无止境
9K Followers 52 FollowingThe official account of the Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics.
11K Followers 1K FollowingI like tokens! I lead the OLMo data team at @allen_ai w/ @kylelostat. Open source is fun 🤖☕️🍕🏳️🌈 Opinions are sampled from my own stochastic parrot
11K Followers 1K FollowingCS professor @GeorgiaTech @gtcomputing @ICatGT @mlatgt. Natural language processing, machine learning, LLMs, social media research.
50K Followers 3K FollowingAI alignment + LLMs at Anthropic. On leave from NYU. Views not employers'. No relation to @s8mb. I think you should join @givingwhatwecan.
2K Followers 56 FollowingChief AI Scientist, Oracle, and the Eduardo D. Glandt Distinguished Professor, CIS, University of Pennsylvania. Former VP/Distinguished Scientist, AWS AI Labs.
321 Followers 2K FollowingAnyone living in an anyhow town. Cryptic crossword setter Gussalufz for #TheHinduCrossword and elsewhere. @[email protected] & @viresh.bsky.social
2K Followers 2K FollowingMachine translation research for big tech and big academia and director of the @aclanthology. Tweets here are mostly personal.
428 Followers 662 FollowingPhD candidate at @stonybrooku in @stonybrooknlp; Prev: @GoogleAI research, @ai2_aristo @allen_ai, @sfresearch; MS from @jhuclsp