Paco Guzmán @guzmanhe
Researcher in Language Technologies guzmanhe.github.io San Francisco, CA Joined March 2009-
Tweets382
-
Followers292
-
Following144
-
Likes111
Announcing the alpha release of torchtune! torchtune is a PyTorch-native library for fine-tuning LLMs. It combines hackable memory-efficient fine-tuning recipes with integrations into your favorite tools. Get started fine-tuning today! Details: hubs.la/Q02t214F0
Are you a PhD student interested in memorisation, generalisation and the role of data in the era of LLMs? Come do an internship with me at @AIatMeta! metacareers.com/jobs/704140718… (Send me a ping if you apply)
SeamlessStreaming is an AI translation model that can deliver state-of-the-art results on streaming translation with <2 seconds of latency. One core piece of our latest Seamless Communication research work by teams at FAIR. More on this project ➡️ bit.ly/4165c9z
My team launched a suite of models that enable near real-time and expressive AI translations last week. This is a little personal reflection/thank-you note to everyone who contributed to it. (1/n)
My team launched a suite of models that enable near real-time and expressive AI translations last week. This is a little personal reflection/thank-you note to everyone who contributed to it. (1/n)
Last week we released “Seamless”, a new streaming speech-to-speech translation (S2ST) model capable of maintaining expressivity. @AIatMeta This 🧵 deep-dives how we achieved an expressivity-preserving S2ST system. ai.meta.com/blog/seamless-…
Excited to share our team’s work on SeamlessStreaming! Sharing more details in this thread
Excited to share our team’s work on SeamlessStreaming! Sharing more details in this thread
The recent Seamless family of models by @AIatMeta translate speech in near-real time streaming mode and preserve the expressivity of speech. How did we quantify the quality of this expressivity preservation? Here is a thread on it: x.com/cointegrated/s…
The recent Seamless family of models by @AIatMeta translate speech in near-real time streaming mode and preserve the expressivity of speech. How did we quantify the quality of this expressivity preservation? Here is a thread on it: x.com/cointegrated/s…
And boom, just like that, the new version of SeamlessM4T has already been integrated into 🤗 transformers! @huggingface @metaai Use it right now with minimal installation and just a few lines of code! Links below ⬇️
Meta just released a new collection their open access "Seamless" translation models 🔊 They do speech-to-text, text-to-speech, speech-to-speech, text-to-text 💬🔄📝 The Expressive model keeps speech rate, pauses and style 🗣️ 📁 Models and demos: huggingface.co/collections/fa…
Seamless: speech to text, text to speech, text to text, and speech to speech, transcription and translation in 100 languages. From FAIR.
Seamless: speech to text, text to speech, text to text, and speech to speech, transcription and translation in 100 languages. From FAIR.
1/ We just made speech translation a whole lot better! Introducing Seamless, an AI model that translates speech in real-time while also maintaining similar vocal style. Github: github.com/facebookresear… Site/Paper: ai.meta.com/research/seaml… HF: huggingface.co/collections/fa…
1/ We just made speech translation a whole lot better! Introducing Seamless, an AI model that translates speech in real-time while also maintaining similar vocal style. Github: github.com/facebookresear… Site/Paper: ai.meta.com/research/seaml… HF: huggingface.co/collections/fa…
Very excited to announce Seamless Communication, one more step towards breaking language barriers! Paper: ai.meta.com/research/publi… Code/Models/Metadata/Data: github.com/facebookresear… github.com/facebookresear… and github.com/facebookresear… Hugging Face: huggingface.co/collections/fa… Page:…
Very excited to announce Seamless Communication, one more step towards breaking language barriers! Paper: ai.meta.com/research/publi… Code/Models/Metadata/Data: github.com/facebookresear… github.com/facebookresear… and github.com/facebookresear… Hugging Face: huggingface.co/collections/fa… Page:…
Today we're sharing the next milestone in our Seamless Communication research — a new family of AI translation models that preserve expression and deliver near-real time streaming translations. More on this new work ➡️ bit.ly/3uBZAYG More on the individual models 🧵
We're proud to share that our work on SeamlessM4T — the first all-in-one, multilingual multimodal translation model — was recognized today as part of @TIME's Best Inventions of 2023 list! More details and the full #TIMEBestInventions list ➡️ bit.ly/3Qd7Cyn
🚨 New preprint 🚨 Can we use LLMs to generate more than one translation when there's gender ambiguity? We try it in "Gender-specific MT with LLMs", our new paper with Pierre Andrews, Pontus Stenetorp, @artetxem and @costajussamarta. arxiv.org/abs/2309.03175 🧵(1/n)
@yoavgo The wrong turn was made when we normalize personal attacks in our community. You can raise concerns w/o the need to directly point at ppl. These positions are elected, by members of the community and the elected ppl have volunteered their time to serve the community.
Removing the anonymity period is only going to benefit the big names and the big labs. For the researchers underrepresented regions that would mean: * Submit papers without arXiving (to avoid reviewers bias) * Competing with more papers that have their famous authors revealed.
Announcing Belebele, a first-of-its-kind multilingual reading comprehension dataset. This dataset is parallel for 122 language variants, enabling direct comparison of how well models understand different languages. Dataset ➡️ bit.ly/47UTSAh
We are hiring a Research Scientist to build the future of speech and language technologies! If you recently completed PhD or are about to graduate and are interested, please email [email protected] or apply directly at metacareers.com/jobs/124944106…
Marcin Junczys-Dowmun.. @marian_nmt
2K Followers 397 Following NLP. NMT. Main author of Marian NMT. Research Scientist at Microsoft Translator. Non-NLP silliness and stuff on @emjotdeAntonis Anastasopoulo.. @anas_ant
3K Followers 2K Following Assist. Prof at George Mason CS #nlproc MT, ASR, and documentation of endangered languages.Shruti Rijhwani @shrutirij
4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball playerNathan Schneider @complingy
4K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.socialLuciana Benotti @LucianaBenotti
3K Followers 1K Following Investigadora @unc_cordoba sobre #NLProc. Sueño con un mundo en el que las computadoras ayuden a todas las personas a vivir mejor---no sólo a unas pocas.Kareem Darwish @kareem2darwish
2K Followers 343 Following Principal scientist working on natural language processing and social computing; YouTuber; AuthorMaha Elbayad @melbayad
617 Followers 629 Following Research Scientist @AIatMeta. @centralesupelec, @ENS_ParisSaclay and @UGrenobleAlpes alum. 💬 My opinions are my own | she/her@𝘾𝙝𝙖𝙏𝙤.. @ChaToX
4K Followers 2K Following ICREA Professor @UPFBarcelona. Lead of @WSSC_UPF. Algorithmic fairness, social computing, @BigCrisisData (Nonbinary: they/them) 🏳️🌈Preslav Nakov @preslav_nakov
2K Followers 2K Following Professor, MBZUAI LLMs, Jais-chat, "fake news", disinformation, propaganda, media bias Past: UC Berkeley, NUS, BAS, SUManuel Mager (Turatem.. @pywirrarika
887 Followers 1K Following Applied Scientist | Amazon AWS Posts are my own opinion.Walid Magdy 🇵🇸 @Walid_Magdy
4K Followers 294 Following Reader (associate prof) @InfAtEd Fellow @turinginst Director of @SMASH_Edin. Interests: Computational social science & NLPBenjamin Muller @ben_mlr
816 Followers 2K Following Research in AI. Focusing on scaling models to the largest number of languages. Postdoc at FAIR @metaai.Houda BOUAMOR @hbouamor
684 Followers 398 Following Associate Professor @CMU-Q, Associate Area Head of Information Systems, NLP and Machine Learning ExpertJeff Wang 👨🚀 @jffwng
3K Followers 734 Following Product Lead @AIatMeta (FAIR). I like language models. I also like non-language models. Previously at Twitter and startupsAlice @JennyBrown50421
4 Followers 554 Followingretweet @dailyretwee
137 Followers 598 FollowingMehar Bhatia @bhatia_mehar
985 Followers 2K Following NLP || Grad CS Student at @UBC Vancouver 👩🎓|| @UBC_NLP @VectorInst || Studying culture, reasoning, alignment, fairness and biasesAbhinav Arora @abhinavarora28
21 Followers 227 FollowingDavid Dalé @cointegrated
117 Followers 45 Following Natural language processing specialist. Working as research engineer at @AIatMeta. Improving machine translation. Supporting Ukraine. Doing good and bad stuff.Skyler Wang @skylrwang
512 Followers 411 Following Sociologist @AIatMeta // Incoming Assistant Professor @mcgillu // I study technology and AI (though only on Twitter when I have to).Guilherme H. Bueno @ghbueno
121 Followers 2K FollowingHeriberto Avelino @HeribertoAveli2
80 Followers 822 FollowingFedor Vitiugin @vitiugin
196 Followers 268 Following Researcher at @CSAalto ex-@DTIC_UPF @UPFBarcelona Interested in #TransferLearning #SocialMedia #InformationRetreival #NLPNathaniel R. Robinson @robinson_n8
181 Followers 284 Following PhD @jhuclsp @JHUCompSci, formerly @LTIatCMU @NLProc for *many* languages 🌍 english | عربي | kreyòl | français | español 🌎Eduardo Sánchez @eduardosg_ai
131 Followers 402 Following Research Scientist at @Meta. PhD Student at @ucl_nlp. Formerly MSc AI at @UM_DACS & BSc CS at @MatCom_UH. I work on Low-Resource MT & LLMs. #NLProcSway @SwayThem
2K Followers 1K Following A leading social-first digital agency & Emmy-winning production studio with capabilities across owned, paid & earned media. | Mexico 🇲🇽 & USA 🇺🇸 | #SwayThemLoic Barrault @LoicBarrault
278 Followers 316 FollowingSwapnil Sumit @IamSumitPaneru
681 Followers 706 Following Freelancer, Digital Entrepreneur, Digital EvangelistOmar Espejel |📍Lon.. @espejelomar
3K Followers 2K Following Crypto Dev Advocacy at @StarknetFndn 🐺 previously at @StarkWareLtd | Host Hacia Afuera Podcast | ex-Machine Learning 🤗 @huggingfaceChao-Wei Huang @cwhuang_wh
61 Followers 428 Following PhD student at National Taiwan University. Former intern @AmazonScience and @MetaAI. NLP, Retrieval, and Dialogue Systems.Kaushik Ram Sadagopan @kauterry
99 Followers 561 Following AI research at FAIR @AIatMeta | Past: @Stanford @iitmadrasLuis @RisingforceLuis
43 Followers 566 FollowingHirofumi Inaguma @HirofumiInaguma
1K Followers 981 Following Research scientist at Fundamental AI Research (FAIR) @MetaAI / speech processing / Ph.D. from Kyoto University in 2021Godspower Eseurhobo @remote_geo
443 Followers 464 Following Founder & Remote Leader @Afrisplash | AI Program Tech Product Manager @CLEARGlobalOrg | Product Advisor @ReelinHQ | AFM @FIDE_chess | [email protected] 📩Prince Osei Aboagye @kp_aboagye
26 Followers 198 Following Staff Research Scientist @Visa Research || Formerly: Ph.D. Student @UUtah || Research Interest: Natural Language Processing, Ethical and Responsible AI.Sravya Popuri @sravyapopuri388
105 Followers 352 Following Research Engineer @ MetaAI - Fundamental AI Research (FAIR). Working on speech to speech translation.Lucie-Aimée Kaffee @frimelle
1K Followers 2K Following Computer Scientist, PhD. Applied Policy Researcher @huggingface 🤗 ML & Society; Wikipedia & languages are my ♡camenduru @camenduru
15K Followers 4K Following ML & Computer Engineer, Game Designer. #OpenSource ❤ #UE ❤ #Jupyter ❤ #AI #ML #StableDiffusion #LLM #NeRF #GaussianSplatting #T2V https://t.co/8MMNbygz1PVikas Raunak @vyraun
507 Followers 5K Following Senior Research Scientist at Microsoft Azure AI. Working on Reliability Problems in AI (LLMs, MT, Speech). Carnegie Mellon Graduate. IIT Indore Gold Medalist.Alok Parlikar @happyalu
288 Followers 223 Following @[email protected] | Works for @cobaltspeech ❤️: NixOS, Thinkpad, Golang, Zig, Idli, MosambiArmen Aghajanyan @ArmenAgha
6K Followers 263 Following Research Scientist @ Meta AI (FAIR) https://t.co/8XF2vtiIVy Opinions are my own.Bismarck B. Odoom @BismarckBamfo
361 Followers 3K Following PhD student @jhuclsp | #NLProc | #SpeechProcGlobal Tech Summit @GlobalTechMeet
146 Followers 2K Following Welcome to the Global Tech Summit! Join us for an exciting exploration of the latest advancements in technology, featuring top industry leaders and innovators.Belen Alastruey @b_alastruey
506 Followers 317 Following PhD student @AIatMeta & @PSL_univ. Previously: @amazon Alexa, @apple MT, @mtupc1Javier Ferrando @javifer_96
277 Followers 480 Following PhD Student @la_UPC. Interpretability in NLPEverlyn Asiko @everlyn_asiko
764 Followers 513 Following PhD Fellow(ADTP-DS) @QL_Africa | Machine Translation researcher | @AIMSacza Graduate | Data Science Technical Mentor @moringaschool | KamiLimu Cohort 4.0 menteeMarco Trombetti @marcotrombetti
698 Followers 246 Following Incurable optimist always in love with big ideas. Entrepreneur and investor. @translation @picampusromeTu Anh Dinh @TuAnhDinh23
18 Followers 204 Following PhD student @ AI4LT lab, Karlsruher Institut für TechnologieMarcin Junczys-Dowmun.. @marian_nmt
2K Followers 397 Following NLP. NMT. Main author of Marian NMT. Research Scientist at Microsoft Translator. Non-NLP silliness and stuff on @emjotdeYann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Antonis Anastasopoulo.. @anas_ant
3K Followers 2K Following Assist. Prof at George Mason CS #nlproc MT, ASR, and documentation of endangered languages.Shruti Rijhwani @shrutirij
4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball playerNathan Schneider @complingy
4K Followers 1K Following Computational Linguist and Professional Nerd at Georgetown University he/him pronouns, ALL the prepositions @[email protected] @complingy.bsky.socialLuciana Benotti @LucianaBenotti
3K Followers 1K Following Investigadora @unc_cordoba sobre #NLProc. Sueño con un mundo en el que las computadoras ayuden a todas las personas a vivir mejor---no sólo a unas pocas.Kareem Darwish @kareem2darwish
2K Followers 343 Following Principal scientist working on natural language processing and social computing; YouTuber; AuthorMaha Elbayad @melbayad
617 Followers 629 Following Research Scientist @AIatMeta. @centralesupelec, @ENS_ParisSaclay and @UGrenobleAlpes alum. 💬 My opinions are my own | she/her@𝘾𝙝𝙖𝙏𝙤.. @ChaToX
4K Followers 2K Following ICREA Professor @UPFBarcelona. Lead of @WSSC_UPF. Algorithmic fairness, social computing, @BigCrisisData (Nonbinary: they/them) 🏳️🌈Omar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽Preslav Nakov @preslav_nakov
2K Followers 2K Following Professor, MBZUAI LLMs, Jais-chat, "fake news", disinformation, propaganda, media bias Past: UC Berkeley, NUS, BAS, SUWalid Magdy 🇵🇸 @Walid_Magdy
4K Followers 294 Following Reader (associate prof) @InfAtEd Fellow @turinginst Director of @SMASH_Edin. Interests: Computational social science & NLPHouda BOUAMOR @hbouamor
684 Followers 398 Following Associate Professor @CMU-Q, Associate Area Head of Information Systems, NLP and Machine Learning ExpertJeff Wang 👨🚀 @jffwng
3K Followers 734 Following Product Lead @AIatMeta (FAIR). I like language models. I also like non-language models. Previously at Twitter and startupsThomas Scialom @ThomasScialom
6K Followers 232 Following AGI Researcher @MetaAI -- Lead Llama 2 and Postraining Llama 3. Also CodeLlama, Galactica, Toolformer, Bloom, Nougat, GAIA, ..Loic Barrault @LoicBarrault
278 Followers 316 FollowingAnkur Bapna @ankurbpn
723 Followers 564 Following Audio in Gemini. Low resource multilingual nlp and speech. At Google Deepmind.thamar | @thamar_solorio
2K Followers 675 Following NLP Prof @MBZUAI, & @UH, Director @RiTUAL_Lab. Friend, mother, partner, loves sunny days and live music. EiC @reviewAcl and ARR board. Views are my own.Eduardo Sánchez @eduardosg_ai
131 Followers 402 Following Research Scientist at @Meta. PhD Student at @ucl_nlp. Formerly MSc AI at @UM_DACS & BSc CS at @MatCom_UH. I work on Low-Resource MT & LLMs. #NLProcAK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxSravya Popuri @sravyapopuri388
105 Followers 352 Following Research Engineer @ MetaAI - Fundamental AI Research (FAIR). Working on speech to speech translation.Vaibhav (VB) Srivasta.. @reach_vb
11K Followers 169 Following GPU poor @Huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my ownThomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceJHU CLSP @jhuclsp
5K Followers 664 Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSiDY @[email protected]Armen Aghajanyan @ArmenAgha
6K Followers 263 Following Research Scientist @ Meta AI (FAIR) https://t.co/8XF2vtiIVy Opinions are my own.Vishrav Chaudhary @vishrav
511 Followers 561 Following Researcher @Microsoft Turing. Working on Large Scale LMs and Machine Translation. Ex- @MetaAI @LTIatCMU alum.Ahmed Mourad @ahsmourad
222 Followers 389 Following Computer Scientist | searchin 4 my RI | triathlete in the makingGustavo Aguilar @tavoaguilar91
197 Followers 403 Following Applied Scientist, Alexa AI | PhD '20 @CSatUH | Chess player ♟️| Salvadoran 🇸🇻 Opinions are my own.Alexis Conneau @alex_conneau
24K Followers 113 Following Audio AGI Research Lead @OpenAI - GPT-Next - Past: XLM, Unsupervised ASR, Unsupervised MT, Wav2vec 2.0/XLSR, MUSE, Unsupervised cross-lingual transferJavier Ferrando @javifer_96
277 Followers 480 Following PhD Student @la_UPC. Interpretability in NLPWiNLP @WiNLPWorkshop
4K Followers 443 Following Widening NLP (WiNLP) aims to elevate underrepresented voices in #NLProc. We care about #diversity and #inclusion. We will be #acl2023 and #emnlp2023Mark Riedl @mark_riedl
32K Followers 1K Following AI for storytelling, games, explainability, safety, ethics. Professor @GeorgiaTech. Associate Director @MLatGT. Time travel expert. Geek. Dad. he/himIngmar Weber @ingmarweber
5K Followers 5K Following @AvHStiftung Professor in AI at @SIC_Saar. Sophie's dad. #SocietalComputing. He/his. [email protected] Opinions are personal and not of my employer.QCRI ALT Group @qcrialt
379 Followers 94 Following Arabic Language Technologies Research Group at Qatar Computing Research Institute, HBKUSergey Edunov @edunov
953 Followers 103 Following Director of Engineering @ GenAI, Meta. I work on LlamasXimena Gutiérrez @XimGutierrez
615 Followers 948 Following Computational linguist. Currently associate professor @ceiich_unam @UNAM_MX. Before I was a postdoc researcher @uzh_spur.Kenton Murray @kentonmurray
876 Followers 797 Following Natural Language Processing Research Scientist JHU. | PhD from Notre Dame. | Formerly at QCRI, CMU, and Princeton.Adi Renduchintala @rendu_a
414 Followers 686 Following Applied Research Scientist @NVIDIA, former: Research Scientist @MetaAI, PhD @jhuclsp also lurking on Mastodon [email protected]Boz @boztank
110K Followers 1K Following CTO @Meta. Leading Reality Labs and working on AR, VR, AI, and more. Built v1 of FB News Feed, Messenger, Groups, Mobile Ads. TweetDelete 6moVes Stoyanov @vesko_st
2K Followers 550 Following Head of AI at @magicaltome. Ex-Language Researcher at @FacebookAI. Large LMs and multilingual NLP. @JHUCLSP and @Cornell alumn. https://t.co/WTSCasqDI6Shiyue Zhang @byryuer
2K Followers 1K Following Research Engineer @TechAtBloomberg | ex PhD student at UNC-Chapel Hill (@unccs @uncnlp) | Bloomberg PhD Fellow | Past Intern at @MetaAI @MSFTResearch | #NLProcNaman Goyal @NamanGoyal21
1K Followers 562 Following Research engineer, LLM scaling at GenAI Meta | Worked on: llama2, llama, OPT, blenderbot, XLMR, Bart, RobertaArya McCarthy @aryamccarthy
1K Followers 824 Following massively multilingual #NLProc • translation • @amazon fellow • jelinek fellow • PhD @jhuclsp • was @googleai, @duolingo, and @facebookaiMyle Ott @myleott
2K Followers 496 Following Founding engineer @character_ai. Previously at Meta AI (FAIR)Andre Niyongabo Rubun.. @andre_niyongabo
443 Followers 648 Following CS PhD student @Vertaix_ @Princeton | Interned @MetaAI, @Huawei & @WhaleCloud2 | Studied @la_UPC & @UESTC1956 | Previously @MasakhaneNLPMaja Popović ( @amel.. @amelija16mp
804 Followers 234 Following ADAPT Centre, DCU (Natural Language Processing)Kenneth Heafield @zngu
997 Followers 134 Following Making language models fast since 2011. Research Scientist at Meta.How come long context adaptions of Llama 3 that are being released only report performance on long context benchmarks? Do we assume that context extension happens for free without impacting model performance? Show us your MMLU, GSM8K, ARC-C and DROP!
Can't overstate how much effort the team has put into making Llama 3 happen, it was a wild ride, but totally worth it!
Feeling incredibly grateful for the entire team's dedication and hard work on the release of #Llama V3. It was a journey of long hours and immense effort, but we did it! Excited to finally put this in the hands of our amazing open source community.
People seem to over-index on the 15T number after Llama 3. While the number matters, what is even more important is the quality and diversity of those tokens. If there was a good way to measure those, that would have been an impressive result to report.
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
@ollama Omg you guys are so fast! Thank you for making it run on my laptop 😍
The real king is still training 💪😝 But go go go 70B and 8B!
Early 1K votes are in and Llama-3 is on FIRE!🔥The New king of OSS model? Vote now and make your voice heard! Leaderboard update coming very soon.
I'm super proud of the work we've been doing in Tower.
Today we release the Tower paper! 🗼 Tower is an open-weight suite of multilingual models — built on top of LLaMA-2 — for translation-related tasks. It supports 10 different languages. Paper: arxiv.org/pdf/2402.17733… Models and data: huggingface.co/collections/Un… 🧵Thread below.
Our team (very kind folks doing great research) @AIatMeta is hiring PhD research interns on audio, speech and Multimodal generative models ! apply 👉metacareers.com/jobs/128854308…
Reflecting on Claudine Gay, I'm reminded that a fundamental of racism--that we should all be aware of--is the disparate application of rules: People from one race* are disproportionately punished for "breaking a rule" that ppl from another are virtually never punished for.🧵
@RTNJO3 Try the demo for yourself on @huggingface ⬇️ huggingface.co/spaces/faceboo…
Real-time , low-latency, speech-to-speech translation that preserves the voice and expression of the speaker.
SeamlessStreaming is an AI translation model that can deliver state-of-the-art results on streaming translation with <2 seconds of latency. One core piece of our latest Seamless Communication research work by teams at FAIR. More on this project ➡️ bit.ly/4165c9z
📣 Launching Audiobox Today! Demo: audiobox.metademolab.com Paper: fburl.com/bf23asfn Grant: fburl.com/zgsiz4bu Audiobox is a unified model for sound and speech generation boasting its controllability, performance, and efficiency.
Starting today you can try our new foundation research model for audio generation. The demo includes Zero shot TTS, Text to sound effects, Infilling and more! Try Audiobox ➡️ bit.ly/3GE2bEk
When you meet the next generation of mujeres aguerridas in NLP, like @GisVallejo and @jocelyndunstane my 💜 fills with joy! @emnlpmeeting #NLProc #dioslashaceyellassejuntan
Last week we released “Seamless”, a new streaming speech-to-speech translation (S2ST) model capable of maintaining expressivity. @AIatMeta This 🧵 deep-dives how we achieved an expressivity-preserving S2ST system. ai.meta.com/blog/seamless-…
Became a noogler just for this (and for what comes next 😉). Congrats to the team for all the progress!!
I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks,…
The recent Seamless family of models by @AIatMeta translate speech in near-real time streaming mode and preserve the expressivity of speech. How did we quantify the quality of this expressivity preservation? Here is a thread on it: x.com/cointegrated/s…
@AIatMeta A 🧵 on how we evaluated expressivity of speech translation by the SeamlessExpressive model. Expressivity includes vocal style (e.g. pitch and timbre), overall expressive intent (including emotions, general intonational pattern), and rhythm (e.g. tempo and pauses).
To build our models in a responsible manner, we worked on assessing and strengthening the safety of our models to quantify and mitigate potential harms. We worked on a new multimodal and massively detector (MuTox) and mitigation (MinTox) for added toxicity shorturl.at/vEQ27
We present our robust watermarking module, which digitally labels our outputs to prevent potential misuse of our systems.
@victorckumar @NeilLevy10 Fffffffs. Don't tweet about a paper if your role is to be an anonymous reviewer.
@victorckumar @NeilLevy10 Your role is to help their research become stronger, not to publicly ridicule their work.
Amazing job by @AIatMeta launching a new speech-to-speech model that preserves unique vocal styles and expressions for speech translation. Current models are focused on translating the content and then using a monotone and/or robotic text-to-speech without proper personality.…