WAVLab | @CarnegieMellon @WavLab
Shinji Watanabe's Audio and Voice Lab | WAVLab @LTIatCMU @SCSatCMU | Speech Recognition, Speech Enhancement, Spoken Language Understanding, and more. shinjiwlab.github.io Joined August 2021-
Tweets300
-
Followers2K
-
Following144
-
Likes323
Sharing our initial leaderboard and open data release for š¶Music Arenaāļø! Music is subjective and multi-dimensional. A key goal of Music Arena is to provide insights beyond binary preferences! š§µ
ARECHO has been accepted by #neurips25 as spotlight! Many thanks to all the co-authors for their great effort and support!
ARECHO has been accepted by #neurips25 as spotlight! Many thanks to all the co-authors for their great effort and support!
espnet v.202509 released š github.com/espnet/espnet/⦠Includes many updates + fixes for NumPy 2.0 & Python 3.12 (thanks Nelson!). This is the last major update before we shift to the next-gen framework, ESPnet3 Interested in collaborating? Let us know!
Thanks, @HungyiLee2, for visiting CMU! Great discussions, inspiring research exchanges, and exciting seeds for collaboration ahead.
Iām happy to share that our paper, "Discrete Audio Tokens: More Than a Survey!", has been accepted at TMLR. š š Read: arxiv.org/pdf/2506.10274 š Explore our tokenizer database & submit yours: poonehmousavi.github.io/dates-website/ā¦
Iām happy to share that our paper, "Discrete Audio Tokens: More Than a Survey!", has been accepted at TMLR. š š Read: arxiv.org/pdf/2506.10274 š Explore our tokenizer database & submit yours: poonehmousavi.github.io/dates-website/ā¦
For my CMU course, we built the OWSM v4 demoš Check it out here: huggingface.co/spaces/espnet/ā¦
For my CMU course, we built the OWSM v4 demoš Check it out here: huggingface.co/spaces/espnet/ā¦
OWSM-V4 is now available as a demo! It includes both the OWSM-V4 Medium model and OWSM-V4 CTC model, each with about 1B parameters. š Try it out here: huggingface.co/spaces/espnet/ā¦
Our work on OWSM v4 received the Best Student Paper Award at #Interspeech2025! šš Huge congratulations to the team! šš Iām especially happy to see our open science efforts for speech foundation models recognized by the community. š š isca-archive.org/interspeech_20ā¦
Excited to be presenting 3 papers at #Interspeech2025 in Rotterdam this week! 1. Chain-of-Thought Reasoning for E2E Spoken Dialogue Systems Adds reasoning to real-time dialogue models, with open-source toolkit. Oral ā Thu, 10:10 | Dock 10B arxiv.org/abs/2506.00722
Meet Masao next week in #Interspeech2025 We use sound event, language and speaker context to prune large speech models for ASR and Speech Translation.
Meet Masao next week in #Interspeech2025 We use sound event, language and speaker context to prune large speech models for ASR and Speech Translation.
I will be presenting 3 papers from @WavLab at #Interspeech2025 š³š± One is OWSMv4 (led by @pengyf21), nominated for best student paper isca-archive.org/interspeech_20⦠It focuses a lot on data cleaning, particularly for non-English languages It will be an oral on Tues 15:10 at dock 10B.
Can we make discrete speech units lightweightšŖ¶ and streamableš? Excited to share our new #Interspeech2025 paper: On-device Streaming Discrete Speech Units arxiv.org/abs/2506.01845 (1/n)
While expanding more evaluation metrics in VERSA (github.com/wavlab-speech/ā¦), weāve been thinking bigger -> a unified, fast, and effective solution for evaluating them all at once. Meet UniVERSA at Tue, 14:10-14:30 š A12O3 ā Speech Assessment (Presented by @shinjiw_at_cmu )
Heading to #Interspeech2025! Iāll be involved in a tutorial, regular and special sessions (22 papers) & MLC-SLM workshop ā and excited to chat about our new project: ESPnet3, CHiME-9, Urgent3, LARC, and YODAS++. If youāre interested, come say hi ā letās collaborate! š
š¢ Excited to announce our 2-day workshop on "Foundations of Speech and Audio Foundation Models" at TTI Chicago, happening September 4ā5! š Info & registration: sites.google.com/view/speech-ai⦠š Poster submissions welcome! Join us for talks, discussions, and community building!
Excited to share our beta release of Music Arena, a live evaluation platform for state-of-the-art AI music generation models! š§ Listen to the latest models and š³ļø vote for your favorite āļø music-arena.org āļø github.com/gclef-cmu/musi⦠š arxiv.org/abs/2507.20900
Meows, music, murmurs and more! We train a general purpose audio encoder and open source the code, checkpoints and evaluation toolkit.
Meows, music, murmurs and more! We train a general purpose audio encoder and open source the code, checkpoints and evaluation toolkit.
3/3 of my first time Interspeech submissions got accepted āø( ' įµ ' )āø ļ¾ļ½Æļ¾ļ½°! 1 as first author, 1 shared first authorship with a colleague, and 1 as co-author. See you in Rotterdam!

Shinji Watanabe @shinjiw_at_cmu
4K Followers 363 Following I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.
Desh Raj @rdesh26
3K Followers 2K Following Research Scientist @Meta GenAI | Previously: @jhuclsp, @IITGuwahati
arXiv Sound @ArxivSound
6K Followers 32 Following Sound-related articles (https://t.co/dxVYgWJGOw and https://t.co/b90N0Zzvjs) on https://t.co/HHqPequzVU
Jonathan Le Roux @JonathanLeRoux
2K Followers 310 Following Speech and audio research scientist at MERL. Opinions never really my own. š¦https://t.co/6pSuhzw3fb
ć¾ć£ćć¼ @ymas0315
2K Followers 2K Following
Siddharth Dalmia @siddalmia05
2K Followers 448 Following Audio LLMs @ Waveforms AI | #SpeechProc and #NLProc | Previously Research Scientist @GoogleDeepmind | PhD @LTIatCMU @SCSatCMU
Mirco Ravanelli @mirco_ravanelli
4K Followers 2K Following Deep learning for Conversational AI. Creator of SpeechBrain.
Robin Scheibler @fakufakurevenge
855 Followers 925 Following Grower of cucumbers š„, tomatoes š , and chilli peppers š¶ļø. I ⤠audio, microphone arrays, IoT, Python, and data.
Delip Rao e/Ļ @deliprao
61K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: éå , improv, running š
laurent besacier @laurent_besacie
1K Followers 777 Following Principal Scientist at Naver Labs Europe & Professor at Univ. Grenoble Alpes - now on bluesky @lbesacie.bsky.social
Graham Neubig @gneubig
40K Followers 710 Following Associate professor @LTIatCMU. Co-founder/chief scientist @allhands_ai. I mostly work on modeling language.
Hirofumi Inaguma @HirofumiInaguma
1K Followers 1K Following Multimodal, Speech at Fundamental AI Research (FAIR) @MetaAI
Wen-Chin Huang @unilightwf
1K Followers 629 Following åå¤å±å¤§å¦ę å ±å¦ē ē©¶ē§å©ę. Assistant professor, Nagoya University. Voice conversion & synthesis. Trilingual, street dancer, golfer. Tweets are my own opinions.
imotts @imotts
985 Followers 700 Following ē°å¢é³åę/åęćć§ćććć±ćÆć«ć« åē°āē°č¾ŗåāRāSOKENDAIāéäæ”ä¼ē¤¾N
yamakatz @kyama0321
1K Followers 1K Following š»š¼šØš»āšš§š»āš»š§š¦»šš¢ å°éćÆč“č¦ćč£č“ęč”ćŖć©é³éæå¦å Øč¬ćē¾åØćÆäŗŗéć®ęč¦ćč£å©ć»ę”å¼µććę°ēć»ęč”ć»č£ ē½®ć»ē°å¢ć®ęŖę„ć«čå³ććć
Christian Steinmetz @csteinmetz1
5K Followers 2K Following AI for Audio & Music ⢠Research Scientist @sunomusic ⢠PhD Student @c4dm MSc @mtg_upf ⢠Previously Intern @Adobe @Meta @Dolby
mat @ballforest
6K Followers 3K Following PokĆ©mon GO / Outlier detection / Anomaly detection / Robust statistics / Functional Analysis / Statistical mechanics / Kernel methods / Dynamical system
šæļøšš»šļæ½... @SythonUK
1K Followers 1K Following ć¤ć³ćæć¼ćććäøć§ć®č°č«ćÆ å»ŗčØēć§ćÆćŖććć ćććŖć主義ćććć
tandat @Titanium040820
2 Followers 79 Following
Hamza @hamzadotnet
63 Followers 109 Following Founder and Creator. Built https://t.co/3eJS3FcYms (Failed) -- Building https://t.co/UfV592RZFM (Locked in)
Jene Martinell @JeneMartin58896
0 Followers 42 Following
Sai Darshini Nannapur... @SaiDarshiniN
20 Followers 182 Following #BuddingEntrepreneur #Beauracrat_NJPChemicals #SoapsIndustry | Padmanagar,Knr | 100%ReliableProducts | DetergentCake | Dishwasher | TextileFactory| Writerāļø
Afroza Rahman @itsafrozaj
1K Followers 5K Following š©āš»Coder š©āšComputer Engineer š¬Researcher - @AI @ML @ComputerVision @Healthcare @UELšBook Loverš±Tech Geek R1- #100daysofcoding š§#womanintech š“
Maria Teleki @Maria_Teleki_
12 Followers 217 Following Howdy š¤ | PhD in CS @ Texas A&M šļø Speech, AI & all things conversation š¶ Apolloās human | š¶ Rowing to 1M meters
Huang-Cheng Chou @huangchengchou
7 Followers 117 Following Postdoc @USC_SAIL @uscviterbi @USC Visiting Scholar @ntu_spml Edu.: PhD @BiicLab @NTHU_TAIWAN Alum.: @AmazonAlexa @ITRI_Taiwan @joolapickle @UT_Dallas; Realtek
Abhijay Sharma @AbhijayBro
0 Followers 38 Following
Ajith Kuriakose @Ajitkur
21 Followers 1K Following
ēå¢ę @wngzngwn1
0 Followers 91 Following
HarrietHewlett @2yi8S3R31YM5u
28 Followers 1K Following
Soroosh Mariooryad @sorooshooryad
153 Followers 375 Following Staff Research Scientist @GoogleDeepMind Gemini Audio core team āš Generative speech and language research at GDM foundational research unit
Weyori Joshua Akowuje @WeyoriJosh
4 Followers 173 Following
Atif Saleem @malikatifsaleem
443 Followers 8K Following A dreamer and an avid learner. Art and brains fascinate me but hearts put me in awe. My views are my own and donāt represent my employer in any way.
malouke @malouke2
325 Followers 2K Following Artificial intelligence (AI) ,machine learning (ML), and deep learning (DL), computer science,Computer Vision Natural Language Processing in finance
a @fghhhvghjjgfhjj
0 Followers 532 Following
Hassan Saeed @HassanS96922773
147 Followers 3K Following
Aurian @sound_au
14 Followers 96 Following PhD student at Telecom Paris in ADASP Group. Currently working on learning music representations.
Stefano Perna, Ph.D. @st3p_dot_io
75 Followers 311 Following AI Research Scientist @Translation and PhD student in Multimodal AI | Speech and Language Processing
Lokesh RLN @lokeshrln09
3 Followers 182 Following Trying to Work in | NLP | LLMs | Computer Vision | ML Research.
åØcharlie @charlie35868108
77 Followers 2K Following č„äŗŗäŗŗęäŗåÆåäøēåæ ååå®å Øļ¼č„åŖäŗéæčæäøäŗęå¤ēęē¼åäøēåæ ååē¾å„½
åéććć²ć @WBegnifico
15 Followers 214 Following
Koen Dewulf @Koeninorbit
129 Followers 3K Following Observatory crest ::ā.. Ik grasduin hier alleen maar .. en laat de duinen schoon achter ::ā.. Directeur Myria ::ā.. @MyriaBe
M Ng @tt0x6af
1 Followers 457 Following
Mubarak Hussain @MHQureshi
3K Followers 4K Following #Salesforce Trailblazer #Agentblazer | CRM+AI+DATA | Salesforce Certified Professional | MuleSoft Certified | Oracle Certified | Quotes | #CertifiedPro | š
Unnati Patel @unnati9_patel
200 Followers 3K Following
Abraham Mathews @_AbrahamMathews
45 Followers 293 Following Christian. Learning about AI. On Hiding.
ćć @aya172957
1K Followers 2K Following é½ē«å¤§ CS B4 唩ē°ē , ē³ę± (@ishiike_)ć®éēŗč , @triC_PR, @arcircle, NLP, Speech processing, DeepFakeę¤ē„ ć«čå³ćććć¾ććsubā@ayu271828
Amr Kayid @amr_kayid
102 Followers 4K Following šŖ¼šŖprev: @runwayml @Cohere š³ Research FORai / @CohereForAI š§āāļø @ManifoldRG @OpenMinedOrg šµ @TU_Muenchen š¤š©šŖš§
xyz @xyzYERC
0 Followers 19 Following
NaderEssam @_naterxo
461 Followers 60 Following
Eli @elipughresearch
124 Followers 423 Following https://t.co/gGEzeggqzD š¼ prev Msft speech, Stanford Math, CS. Created this as a way to keep up with new tech stuff. Personal is @elipughtri
Nelson Yalta @NelsonYalta
4 Followers 47 Following Electronic Eng. Researcher. AI. Speech Processing. Sound Processing. Deep Learning. E2E
Somkel @NSomkelechukwu
19 Followers 515 Following A programmer who loves challenging tasks š Full Stack Web Developer
TienDat @TienDat011000
4 Followers 270 Following
nesroul @nesroul
1 Followers 141 Following
hang chen @igorchen1997
1 Followers 45 Following
Darong @Darong82860184
13 Followers 350 Following
Nabeel Dev š @NabeelAmin70rb
355 Followers 4K Following Associate Researcher at Riphah International University Sub Campus | Graduated in Computer Science | Learn to Code | AI Engineer @optemastech| Freelancer
Hrishikesh H Pillai @hrishilabs
0 Followers 27 Following
Valentina Tardelli @ValentinaT32922
87 Followers 6K Following
professor 18 @professora18
715 Followers 7K Following Human intelligence. Brain Ninja. AI analysis and modeling. Futurist. News from the future. Fitness maniac. Music lover. A man of 808 months.
Shinji Watanabe @shinjiw_at_cmu
4K Followers 363 Following I'm working at CMU (2021-). I was working at NTT (2001-2011), MERL (2012-2017), and JHU (2017-2020). Speech and Audio Processing is my main research topic.
AK @_akhaliq
428K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace š¤) dm for promo ,submit papers here: https://t.co/UzmYN5YmrQ
Desh Raj @rdesh26
3K Followers 2K Following Research Scientist @Meta GenAI | Previously: @jhuclsp, @IITGuwahati
arXiv Sound @ArxivSound
6K Followers 32 Following Sound-related articles (https://t.co/dxVYgWJGOw and https://t.co/b90N0Zzvjs) on https://t.co/HHqPequzVU
Jonathan Le Roux @JonathanLeRoux
2K Followers 310 Following Speech and audio research scientist at MERL. Opinions never really my own. š¦https://t.co/6pSuhzw3fb
AI at Meta @AIatMeta
716K Followers 288 Following Together with the AI community, we are pushing the boundaries of whatās possible through open science to create a more connected world.
Heiga Zen (å Ø ē³ę²³... @heiga_zen
9K Followers 192 Following Principal Scientist (Director) @GoogleDeepMind / GDM Tokyo site leadļ¼ę³¢ē¬å°āäøåæäøāé“鹿é«å°āå巄大 (IBM TJ Watson intern)āę±č꬧å·ē āGoogle (Speechš¬š§āBrainšÆšµ) āGoogleDeepMind
Shinnosuke Takamichi ... @forthshinji
5K Followers 385 Following Speech researcher / é³å£°ē ē©¶č . https://t.co/f8hJL8R1Lm
ć¾ć£ćć¼ @ymas0315
2K Followers 2K Following
PyTorch @PyTorch
453K Followers 77 Following Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundation
Wei-Ning Hsu @mhnt1580
2K Followers 133 Following Research Scientist @ Meta FAIR / audio generation, self-supervised learning, speech processing
Siddharth Dalmia @siddalmia05
2K Followers 448 Following Audio LLMs @ Waveforms AI | #SpeechProc and #NLProc | Previously Research Scientist @GoogleDeepmind | PhD @LTIatCMU @SCSatCMU
Alexis Conneau @alex_conneau
35K Followers 190 Following Co-founder and CEO https://t.co/efv72CKpAG (@WaveFormsAI) - Ex @OpenAI GPT-4o/AVM Audio Research Lead - #Her #TARS - Ex @AIatMeta, @Polytechnique (X11)
Graham Neubig @gneubig
40K Followers 710 Following Associate professor @LTIatCMU. Co-founder/chief scientist @allhands_ai. I mostly work on modeling language.
Hirofumi Inaguma @HirofumiInaguma
1K Followers 1K Following Multimodal, Speech at Fundamental AI Research (FAIR) @MetaAI
Wen-Chin Huang @unilightwf
1K Followers 629 Following åå¤å±å¤§å¦ę å ±å¦ē ē©¶ē§å©ę. Assistant professor, Nagoya University. Voice conversion & synthesis. Trilingual, street dancer, golfer. Tweets are my own opinions.
imotts @imotts
985 Followers 700 Following ē°å¢é³åę/åęćć§ćććć±ćÆć«ć« åē°āē°č¾ŗåāRāSOKENDAIāéäæ”ä¼ē¤¾N
šæļøšš»šļæ½... @SythonUK
1K Followers 1K Following ć¤ć³ćæć¼ćććäøć§ć®č°č«ćÆ å»ŗčØēć§ćÆćŖććć ćććŖć主義ćććć
Yongyi Zang @yongyi_zang
236 Followers 742 Following Director of Research @ Smule + Independent Researcher; Doing more applied stuff with Smule, and more ambitious stuff independently. Opinions are my own.
Shikhar @ShikharSSU
303 Followers 1K Following Turning noise intoā¦slightly better noise. https://t.co/9gtrEjheT0
Muramasa @Muramasa_2
770 Followers 1K Following Research scientist / Speech synthesis / 社ä¼äŗŗå士@å大
Berkeley Biological &... @BerkeleySCLab
1K Followers 495 Following Lab @UCBerkeley for biological and artificial language. PI @begusgasper
Lior Alexander @LiorOnAI
106K Followers 2K Following Covering the latest in AI development ⢠ML Eng since 2017 ⢠Building @AlphaSignalAI into the #1 source of news for AI devs ā At 250k readers.
Felix @felix_red_panda
5K Followers 2K Following speech synthesis and LLM nerd, DMs open, working on LLM stuff
Chris Donahue @chrisdonahuey
5K Followers 1K Following GenAI for *human* creativity in music + more. Assistant prof at CMU CSD, š¼ G-CLef lab. Part time Google DeepMind, Magenta (views my own)
Rafael Valle @RafaelValleArt
1K Followers 186 Following Research Manager and Scientist at NVIDIA. UC Berkeley alumn. Love, music, set and setting!
Jungo Kasai ē¬ äŗę·³... @jungokasai
2K Followers 504 Following Co-founder & CTO @kotoba_tech | Research Assistant Prof. @TTIC_Connect | PhD from @nlpnoah at @UW | IBM PhD Fellow | å«ę£ē¾©č²č±č²”å£ē | @Yale Undergraduate
Ankit Shah @ankits0052
2K Followers 8K Following LLM Arch Assoc Director @Accenture Ph.D. @LTIatCMU. Past @GoogleAI Sharing insights about AI research, LLMs, multimodal AI, coding & tech. š Views are my own
Antonis Anastasopoulo... @anas_ant
3K Followers 2K Following Assist. Prof at George Mason CS #nlproc MT, ASR, and documentation of endangered languages.
Tejes Srivastava @tejessri
17 Followers 148 Following Masterās Student @UChicagoCS advised by @ChenhaoTan, interested in AI Audio/Speech and Text
Ganesh Kini @gkayakg
105 Followers 1K Following PhD candidate at UCSB | Interested in machine learning, deep learning, signal processing | Masters from IISc
Loren Lugosch @lorenlugosch
2K Followers 996 Following Machine learning @ ; audio & language; Freigeisterei und Vielgeisterei; "at once a man of business and a man of rhyme"
Dong Zhang @dongzha35524835
569 Followers 607 Following MS Student at FudanNLP Lab @FudanUniv | Developing SpeechGPT-Series
Rui Liu @RuiLiu60711141
369 Followers 329 Following Professor at Inner Mongolia University. working on speech synthesis, deep learning, natural language processing.
Yu-An (Andy) Chung @iamyuanchung
173 Followers 317 Following Studying representation learning, self-supervised learning, generative modeling methods for speech and audio
Yoshiaki Bando @yoshipon0520
2K Followers 858 Following éčøé³ē°å¢ēč§£ | 深層ćć¤ćŗå¦ēæ | ćć¢ć
lester violeta @lesterphv
181 Followers 651 Following teaching computers to sing and speak | phd-ing @nagoyauniv
Li Sheng /listen/ ass... @cs_lisheng
651 Followers 6K Following ā2025 new faculty of Science Tokyo āSpeech tech+multilingual+multimodal+security āWelcome collaboration, discussion CV: https://t.co/naL0tJB3sI
Fabian Ritter-Gutierr... @Fabian_acoustic
186 Followers 653 Following Chilean doing PhD on Speech in Singapore. I rarely use this social media. Active on: https://t.co/JQxD1cDaZs
JIACHEN LIAN @LianJiachen
179 Followers 123 Following EECS PhD at UCB | Berkeley Artificial Intelligence Research(BAIR) | Snooker Lover
Andrew Rouditchenko ļæ½... @arouditchenko
452 Followers 550 Following PhD student at MIT working on multi-modal and multilingual speech. I was an intern at @AIatMeta and @Apple MLR.
Ian (Yi-Jen) Shih @yijenshih
132 Followers 235 Following CS Ph.D. @UTAustin @UTCompSci , Previously a NTUEE Undergraduate Interested in Music Information Retrieval, Speech processing and Deep learning.
Phillip Rust @rust_phillip
381 Followers 594 Following Research Scientist @AIatMeta (FAIR) ⢠PhD @coastalcph
Shih-Lun (Sean) Wu @slseanwu
261 Followers 92 Following music/audio/speech proc, generative models PhD student (now), EECS MIT MSc '24, SCS CMU BSc '21, CS Nat'l Taiwan U casual classical pianistš¹ & violistš»
Martijn Bartelds @BarteldsMartijn
533 Followers 376 Following Postdoctoral Scholar @stanfordnlp | Formerly @univgroningen, @tudelft and @Penn
Yuki Saito @ysaito_human
695 Followers 531 Following Lecturer (Sr. Assistant Professor) @ UTokyo-SaruLab, Japan, JST ACT-X (ꬔäøä»£AIć»ę°ēę å ± 2023 ~ 2026), ē¹å®ćć§ćć¼@ē£ē·ē (JST BOOST č„ęē ē©¶č ęÆę“, 2025 ~ 2030)
Huck Yang @huckiyang
846 Followers 757 Following Sr. Research Scientist @NVIDIAAI Generative Communications | Ph.D. MSc @GeorgiaTech | Past: @GoogleAI @AmazonScience | š£ļø omni
Kwanghee Choi @juice500ml
205 Followers 157 Following Master's student @LTIatCMU, working on speech AI at @shinjiw_at_cmu's @WavLab
Sravya Popuri @sravyapopuri388
159 Followers 384 Following Tech Lead Manager for mid-training, long context and synthetic data for Llama models at Meta Gen AI
Yung-Sung Chuang @YungSungChuang
1K Followers 681 Following PhD student @MIT_CSAIL | Intern @MetaAI @Microsoft @MITIBMLab | BS @NTU_SPML in #Taiwan
dongchao @dongchaodudu
79 Followers 83 Following A PhD student in The Chinese University of Hong Kong, focusing on large audio foundation models.
Heng-Jui Chang @hjchang87
170 Followers 165 Following š PhD Candidate @MIT_CSAIL š§Ŗ Research Scientist Intern @AIatMeta
å®å½ å“ @yihanwu93398629
1 Followers 3 Following
DailyAudioPapers @mlsp4audio
785 Followers 637 Following Daily tweets on selected arXiv papers on audio (eessā¤AS/csā¤SD) | Brief reviews of interesting papers | Machine learning | Signal processing
Liu Songxiang @shaunliu231
49 Followers 110 Following Focuses on general spoken language processing, speech and singing generation. Ph.D. from HCCL @CUHKofficial
Kai-Wei Chang @KaiWeiChang5
91 Followers 136 Following 3rd Ph.D. student at National Taiwan University. Currently working on prompting speech LLMs. SpeechPrompt / SpeechGen. Ex-intern at @Meta @RealityLabs.
Puyuan Peng @PuyuanPeng
2K Followers 511 Following Research Scientist @Meta Superintelligence Lab. Speech & Audio. Previously @utaustin @uchicago @bnu_1902