Markus Freitag @markuseful
Head of Google Translate Research freitagmarkus.github.io Mountain View, CA Joined April 2012
Tweets 251
Followers 1K
Following 203
Likes 958
When LLMs make mistakes, can we build a model to pinpoint the error and indicate its severity and type? Can we incorporate this fine-grained info to improve the LLM? We introduce LLMRefine [NAACL 2024], a simulated annealing method to revise LLM output at inference. @GoogleAI @ucsbNLP
We're looking for a final-year PhD intern passionate about working on automatic metrics for machine translation and NLP in Mountain View. If interested, please send an email to me and @_danieldeutsch. The ideal candidate should have experience working on automatic evaluation and…
[New paper!] Can LLMs truly evaluate their own output? Can self-refine/self-reward improve LLMs? Our study reveals that LLMs exhibit biases towards their output. This self-bias gets amplified during self-refine/self-reward, leading to a negative impact on performance. @ucsbNLP
🗼TowerLLM 13B 🗼 We are releasing TowerLLM 13B, a SOTA open-source LLM for translation-related tasks! You can check the model on our @huggingface collection page: huggingface.co/collections/Un… Updated results can be found here: unbabel.com/nl/announcing-… Paper will be out soon...
🌐Exciting News in Machine Translation! 🚀MetricX-23, our SOTA evaluation metric, is now OPEN-SOURCE in PyTorch/Transformers! 🎉There are three model sizes available, all trained on 1m+ human judgments of MT quality! 🔗Code github.com/google-researc… 🔗Paper www2.statmt.org/wmt23/pdf/2023…
I'm thrilled to share that the WMT General MT Shared Task for 2024 is now officially open! This year, we've introduced several changes to further advance the field of machine translation (MT) research and align with the shift toward LLMs. Here’s what’s new:
Marcin Junczys-Dowmun.. @marian_nmt
2K Followers 396 Following NLP. NMT. Main author of Marian NMT. Research Scientist at Microsoft Translator. Non-NLP silliness and stuff on @emjotde
Graham Neubig @gneubig
31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.
Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).
Sebastian Gehrmann @sebgehr
5K Followers 2K Following Head of NLP, CTO office, @Bloomberg. (he/him) Generating natural language, one word at a time. Also making sense of that language afterwards. views my own
Andre Martins @andre_t_martins
2K Followers 397 Following NLP/ML researcher in Lisbon ([email protected])
Kelly Marchisio (St. .. @cheeesio
1K Followers 558 Following Multilingual NLP @cohere. Formerly: PhD @jhuclsp Alexa Fellow @amazon dev @Google MPhil @cambridgenlp EdM @hgse 🔑🔑¬🧀 (@kelvenmar20)
Raj Dabre @prajdabre1
3K Followers 758 Following NLP/Machine Translation/NLG/Deep Learning. Researcher-@NICT_Publicity. Adjunct Faculty-@iitmadras. Visiting Professor-@iitbombay. Ex-@KyotoU_News. #nlproc
Djamé.. @zehavoc
6K Followers 3K Following Associate professor in NLP, engaged citizen. Tweeting about work, life and stuffs that I care about. All my tweets can be used freely. Personal account.
Antonis Anastasopoulo.. @anas_ant
3K Followers 2K Following Assist. Prof at George Mason CS #nlproc MT, ASR, and documentation of endangered languages.
Leshem Choshen 🤖.. @LChoshen
4K Followers 548 Following 🥇 Collaborative LLMs 🥈 Opinionatedly sharing #ML & #NLP 🥉 Propagating us underdogs we owe science an alternative hype @IBMResearch & @MIT_CSAIL
William Wang @WilliamWangNLP
14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.
Huda Khayrallah @HudaKhay
1K Followers 910 Following Machine Translation/#NLProc/ML Researcher at Microsoft. Past: @UCBerkeley CS ugrad; @LiltHQ research intern; @jhuCLSP/@jhuCompSci PhD
Sara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.
Tom Kocmi @KocmiTom
603 Followers 178 Following Senior researcher at Microsoft Translator (he/him) | AI Evaluation (LLMs, MT, Multilingulity)
Shruti Rijhwani @shrutirij
4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball player
Leo Boytsov @srchvrs
7K Followers 2K Following Sr. Research Scientist @AWS Labs (ph-D @LTIatCMU) working on unnatural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.
Machel Reid @machelreid
2K Followers 1K Following Research Scientist @GoogleDeepMind Working on LLMs on the Gemini Team; did gemini 1.5 pro
Fred Blain @fblain
865 Followers 932 Following • Assistant Professor in AI @TilburgU_DCA & @ISMT_TiU, @TilburgU • Co-founder hackerspace @haum72 • #PGP FC7C3BC0
Nikita Moghe @nikita_moghe
944 Followers 1K Following PhD student at CDT in NLP, University of Edinburgh. Prev: IIT Madras | University of Mumbai. She/her. On the industry job market
Benjamin Marie @bnjmn_marie
472 Followers 168 Following Researcher in LLM / multimodal dialogue / machine translation.
Sheight @Sheight124619
1 Followers 81 Following
Abdulrahman Tabaza @embed_dim
4 Followers 799 Following enjoyer of various vector spaces, encoders and modalities
Harsh Pareek @harshhpareek
713 Followers 3K Following ML @prodigaltech, ex-(@Meta|@UTAustin|@iitbombay), 1/sqrt(2) (e/acc+AINotKillEveryone)
Ella-rose Exford @EllaExford31013
80 Followers 5K Following
Nusaybah Aguiniga @NusaybahAg83115
13 Followers 3K Following
Ava-grace Urse @AvaUrse15730
90 Followers 5K Following
retweet @dailyretwee
136 Followers 598 Following
Issath @onissathkhan
41 Followers 424 Following
manduka @manduka334465
8 Followers 29 Following
BaohaoLiao @baohao_liao
145 Followers 262 Following PhD for NLP @UvA_Amsterdam. Previously study @RWTH @sjtu1896
Shangbin Feng @shangbinfeng
1K Followers 1K Following PhD student @uwcse @uwnlp. Understanding and expanding the knowledge abilities of LMs, social NLP, networks and structures. he/him. #水文学家
Joana Nazaroff @nazaro_joan
13 Followers 3K Following
Daisy-mae Vandevort @MaeVandevo32936
10 Followers 3K Following
Xin Zhang | 张鑫 @xinzhangai
85 Followers 342 Following NLP | AI, CS PhD student at https://t.co/TULb8Mltx8, Battling with my **long-covid** 💪
AI @psuhag8636
15 Followers 335 Following
Kun (Kevin) SUN @Sharp_K_Sun
219 Followers 2K Following Scientist Researcher @ Tübingen University and Professorial Research Fellow @ Fudan University, and interested in LLMs, NLP, and computational cognition.
Parker Riley @prk_riley
2 Followers 1 Following
Rong Ching Chang @AnnCC12
689 Followers 5K Following Fascinated by ML, LLMs, GNN, Multimodal models in social media. Ph.D. student @ucdavis
Shabnam Behzad @Shabnam_Behzad
137 Followers 407 Following CS PhD Student @Georgetown. Interested in CL, NLP.
Velma Fobbs @VelmFobbs
41 Followers 5K Following
HyoJung Han @h__j___han
172 Followers 152 Following Ph.D. @umdcs @ClipUmd. ex Research Intern @AIatMeta FAIR. Multilingual and Multimodal NLP for seamless communication by tackling language/background barriers
Dean Carignan @DeanCarignan
922 Followers 1K Following Chief of Staff for @Microsoft's Chief Scientific Officer; exploring responsible practices in AI, Data Science, ML Ops. Ex: @MSFTReseach @Mckinsey, @Worldbank
Simean Seng @Simean_568
4 Followers 68 Following
Shreyas Vaidya @shreyasvaidya23
160 Followers 1K Following Nothing beats the joy of solving interesting problems Third year UG majoring in CS @iitjodhpur
Mohammad hadi Nicknam @_Nicknam_
23 Followers 502 Following CS @ Aut , (Tehran Polytechnique)، R.A. @ SUT working on Computer Vision , This account is on the training process to find his optimum point ... , @__nicknam__
Jin Xu @JinXu429447
0 Followers 32 Following
Aimerou @AmrouNdiaye1
407 Followers 596 Following Research Scientist • Nature Lover • Otaku • Tijani Hopeful
Chao-Wei Huang @cwhuang_wh
61 Followers 428 Following PhD student at National Taiwan University. Former intern @AmazonScience and @MetaAI. NLP, Retrieval, and Dialogue Systems.
Mamoru B Komachi @mamoruk
12K Followers 6K Following Professor (Graduate School of Social Data Science) at Hitotsubashi University. Please note that I may block private accounts that do not approve mutual follows, as well as accounts with zero posts.
Hiroyuki Deguchi @de9uch1_
225 Followers 222 Following Machine Translation, (Approximate) k Nearest Neighbor Search, Decoding, Efficiency @ NAIST, NICT ← Ehime University / Gentoo / LISP / Bebop Jazz Pianist
Mkrtich Mkrtchyan @mckr09
93 Followers 510 Following
Shulin Zhang @shulin_zh
74 Followers 378 Following Ph.D. Candidate at the University of Georgia | Computational Linguistics
Mohammad Rifat Arefin @mo_rifat
63 Followers 238 Following CS PhD Student at @utarlington | Software Engineering | Program Analysis
Shruti Singh ⇾ @shr.. @shruti_rsingh
225 Followers 2K Following Representation Learning for Scientific Literature | #NLProc | Fulbright fellow @yale | CS Ph.D. Student @iitgn | Past @daiictofficial
Angana Borah @AnganaBorah2
46 Followers 188 Following Ph.D. @UMichCSE. Computational Social Science and NLP research. Master's (CS-ML) @GeorgiaTech. Prev @UTAustin, @DFKI.
Alexandra DeLucia @Alexir563
547 Followers 742 Following Sony intern Fall 2023. Computer Science PhD Student at @jhuclsp in the @mdredze group. @rollinscollege alum.
Amir Kargaran @amir_nlp
608 Followers 2K Following PhD student in Computer Science #NLProc at University of Munich @CisLmu / Woman, Life, Freedom 🕊
CEPIC HKBU @CEPIC_corpus
5 Followers 94 Following
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
Marcin Junczys-Dowmun.. @marian_nmt
2K Followers 396 Following NLP. NMT. Main author of Marian NMT. Research Scientist at Microsoft Translator. Non-NLP silliness and stuff on @emjotde
Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGz
Graham Neubig @gneubig
31K Followers 588 Following Associate professor at CMU, studying natural language processing and machine learning.
Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).
Yi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼
Sebastian Gehrmann @sebgehr
5K Followers 2K Following Head of NLP, CTO office, @Bloomberg. (he/him) Generating natural language, one word at a time. Also making sense of that language afterwards. views my own
Andre Martins @andre_t_martins
2K Followers 397 Following NLP/ML researcher in Lisbon ([email protected])
Kelly Marchisio (St. .. @cheeesio
1K Followers 558 Following Multilingual NLP @cohere. Formerly: PhD @jhuclsp Alexa Fellow @amazon dev @Google MPhil @cambridgenlp EdM @hgse 🔑🔑¬🧀 (@kelvenmar20)
Sebastian Ruder @seb_ruder
80K Followers 1K Following Multilingual LLMs @cohere • Prev: @GoogleDeepMind • Newsletter: https://t.co/7JGh2qpG98
William Wang @WilliamWangNLP
14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.
Sara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.
Google DeepMind @GoogleDeepMind
944K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.
EMNLP 2024 @emnlpmeeting
12K Followers 41 Following EMNLP 2024 - The 2024 Conference on Empirical Methods in Natural Language Processing, November 12 –16, 2024 Hashtag: #EMNLP2024
Dipanjan Das @dipanjand
4K Followers 308 Following Senior Director of Research at @GoogleDeepmind. Working on improving the factuality of LLM generated content.
Tom Kocmi @KocmiTom
603 Followers 178 Following Senior researcher at Microsoft Translator (he/him) | AI Evaluation (LLMs, MT, Multilingulity)
Alexis Conneau @alex_conneau
24K Followers 113 Following Audio AGI Research Lead @OpenAI - GPT-Next - Past: XLM, Unsupervised ASR, Unsupervised MT, Wav2vec 2.0/XLSR, MUSE, Unsupervised cross-lingual transfer
NeurIPS Conference @NeurIPSConf
112K Followers 35 Following New Orleans, Dec 10-16, 23. https://t.co/ga8aOw615g Tweets to this account are not monitored. Please send feedback to [email protected].
Fred Blain @fblain
865 Followers 932 Following • Assistant Professor in AI @TilburgU_DCA & @ISMT_TiU, @TilburgU • Co-founder hackerspace @haum72 • #PGP FC7C3BC0
Parker Riley @prk_riley
2 Followers 1 Following
Xavier Garcia @xgarcia238
68 Followers 36 Following
Chryssa Zerva @chryssaZrv
378 Followers 325 Following Assistant Professor @informatica_IST, @ist_tecnico. Interested in understanding uncertainty in data, models, life. NLP, ML and climbing fan.
Andriy Burkov @burkov
19K Followers 141 Following Author of 📖 The Hundred-Page Machine Learning Book and the 📖 Machine Learning Engineering book
Adam Sadovsky @asadovsky
556 Followers 294 Following Distinguished Software Engineer / Senior Director, Gemini
Conference on Languag.. @COLM_conf
2K Followers 6 Following https://t.co/GhGCMEoa4A Abstract submission: March 22, 2024
Jonathan Pilault @J_Pilault
167 Followers 483 Following former intern @GoogleDeepMind PhD @Mila_Quebec
Tu Vu @tuvllms
3K Followers 894 Following Research Scientist @GoogleDeepMind & Assistant Professor @VT_CS. PhD from @UMass_NLP. #NLProc
Douglas Eck @douglas_eck
12K Followers 905 Following Google DeepMind lead and recovering faculty member. Sometimes a musician. https://t.co/ucS5UXich2.
Hyung Won Chung @hwchung27
18K Followers 231 Following Research Scientist @OpenAI. Past: @Google Brain / PhD @MIT
Vikas Raunak @vyraun
506 Followers 5K Following Senior Research Scientist at Microsoft Azure AI. Working on Reliability Problems in AI (LLMs, MT, Speech). Carnegie Mellon Graduate. IIT Indore Gold Medalist.
Aakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to change
Hugo Larochelle @hugo_larochelle
113K Followers 626 Following Google DeepMind researcher, machine learning professor, ex-Twitter Cortex, father of 4, wine/music/comedy enthusiast
Nando de Freitas 🏳.. @NandoDF
97K Followers 659 Following I research intelligence to understand it and to harness it wisely. Part of AlphaGo tuning, AlphaCode, learning to learn, Lyria, Imagen2, Gato, rGemma
Mia Chen @MiaXuChen
49 Followers 33 Following
Yanping Huang @bignamehyp
310 Followers 100 Following
Dustin Tran @dustinvtran
40K Followers 649 Following Research Scientist at Google DeepMind. I lead evaluation at Gemini / Bard. AI, Bayesian statistics, deep learning.
Ankush Garg @agbgarg
26 Followers 125 Following
Charles Sutton @RandomlyWalking
17K Followers 1K Following Research scientist @GoogleAI / Previously academic @InfAtEd / Deep learning to help people write code. / @[email protected] / ❤️s:🐱🐶☕️🍕
Max Welling @wellingmax
32K Followers 432 Following
Daniel Cer @daniel_m_cer
385 Followers 725 Following Research Scientist at @GoogleAI, @googIeresearch.
Jeff Pitman @_jrp_
129 Followers 158 Following Eng Director, Google Bard. Disclaimer: my opinion, not Google's.
Ethan Dyer @ethansdyer
725 Followers 121 Following
Daphne Ippolito @daphneipp
1K Followers 72 Following I am a senior research scientist at Google. I research topics in natural language generation.
Kelvin Guu @kelvin_guu
3K Followers 333 Following Senior staff research scientist @ Google DeepMind leading cross-functional teams of 40+ (research/eng/PM/UI/UX), turning our SOTA research into new AI products.
Jason Eisner @adveisner
8K Followers 547 Following Professor of CS at Johns Hopkins University, Director of Research at Microsoft Semantic Machines, ACL Fellow. My tweets speak only for me.
hinrich schuetze @HinrichSchuetze
456 Followers 110 Following
Young @yjkim362
346 Followers 263 Following Principal Researcher, Large language models, NLP, @Microsoft GenAI
Thang Luong @lmthang
20K Followers 100 Following Senior staff scientist @GoogleDeepMind. PhD @StanfordNLP. PI #AlphaGeometry. Co-lead #Bard Multimodality, now #Gemini. Co-founder #MeenaBot (later LaMDA).
Miles Brundage @Miles_Brundage
43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.
Prof. Andy Way @tarfandy
2K Followers 368 Following Deputy Director @adaptcentre, Full Professor @dcu. 3x @uni_of_essex alumnus (BA,MSc,PhD). Recently naturalised ☘️🇮🇪 All views my own etc.
David Chiang @davidweichiang
2K Followers 554 Following Associate Professor of Computer Science and Engineering at University of Notre Dame. Natural language processing, formal grammars, machine learning
Marcin Junczys-Dowmun.. @emjotde
502 Followers 141 Following Research scientist, playing around with open source, turning diet pops into neural networks. Private account, serious stuff on @marian_nmt.
Jörg Tiedemann @TiedemannJoerg
750 Followers 72 Following
Ulrich Germann @UlrichGermann
35 Followers 84 Following
nothing gets my heart rate up like waiting for eval results on new models to come in
Llama 3 has arrived! Taaa-daaam! ai.meta.com/blog/meta-llam…
@markuseful This is really great! We were just thinking about "how many (and what) items should be annotated". 🙂
Observation: Relying too heavily on prompt engineering can stifle the creativity and exploration spirit of PhD students. It's crucial to remember that breakthroughs and fundamental innovations come from diving deep, questioning, and reimagining the boundaries of what's possible.
IMO, the word "super-human" is not a well-defined term. An average human tends to be pretty bad at most things. If you ask a random human to do GSM, I doubt they can even get 20%. So most LLMs now are super-average-human on math tasks.
We are hiring for a machine learning engineer role to drive making our research + weight releases as accessible as possible to the wider community. 🔥 If you care about model efficiency, tooling, usability, translating research into impact -- get in touch! jobs.lever.co/cohere/3dbae8b…
🥳🍾 It's official - I'm a tenured associate professor! This job is incredible luck and privilege, and @ITUkbh is an amazing place to work at!!! More PhD student and postdoc positions will be announced soon.
Happy to announce our first HPLT model release!
First datasets, then models! Initial HPLT models (LLMs and MT) are out: hplt-project.org/models, some still running 🏃 We explain what we are doing in the deliverables section: hplt-project.org/deliverables Meanwhile, we keep cooking IA peta-data-bytes 🥘, enriching, dashboarding 📊
Today we release the Tower paper! 🗼 Tower is an open-weight suite of multilingual models — built on top of LLaMA-2 — for translation-related tasks. It supports 10 different languages. Paper: arxiv.org/pdf/2402.17733… Models and data: huggingface.co/collections/Un… 🧵Thread below.
How does scaling affect LLM finetuning? We explored the impact of pretrain/finetuning data size, model size, and finetuning methods on LLM performance in downstream tasks like MT and summarization. Paper: arxiv.org/abs/2402.17193 w/ Zhongtao Liu,@ColinCherry, and @orf_bnw
Super proud of the work we have been doing in Tower and this is just the beginning! There is more to come…
@SarahTheHaider lectured/instructed less, encouraged me to try, fail, and learn more
📢SPAN-ACES is here! We annotated the error spans in the incorrect-translation for every example in the ACES dataset. 🤗huggingface.co/datasets/nikit… with Arnisa Fazla, @chantalamrhein, @KocmiTom, Mark Steedman, @alexandrabirch1, @RicoSennrich, @LianeGuillou