- Tweets: 118
- Followers: 164
- Following: 499
- Likes: 433
📢 Join our Conversational AI Reading Group! 📅 Thurs, Sep 25th | 11 AM - 12 PM EST 🎙 Speaker: Themos Stafylakis @themosst 📖 Topic: "Advances in Speaker Recognition: Pruning, Deepfake Detection, and Learning without Temporal Labels" 🔗 Details: (poonehmousavi.github.io/rg)
If you missed my session presenting our recent work “Discrete Audio Tokens: More Than a Survey!”, you can now find the recording on our YouTube channel and the slides on our website: ▶️ YouTube: youtu.be/iGNotmn5J5A?si… 🌐 Website: poonehmousavi.github.io/rg.html#fall20…
I’ll be presenting our survey paper “Discrete Audio Tokens: More Than a Survey!” at the first Fall 2025 session of the Conversational AI Reading Group. Looking forward to seeing you there and discussing ideas!
We’re back with a new series of Conversational AI Talks. Everyone’s invited! Feel free to share with your network. 🗓 Every Thursday, 11:00 AM – 12:00 PM EDT 🚀 Kicking off on September 18th with an exciting lineup of speakers. 🔗 More details: poonehmousavi.github.io/rg
I’m happy to share that our paper, "Discrete Audio Tokens: More Than a Survey!", has been accepted at TMLR. 🎉 📄 Read: arxiv.org/pdf/2506.10274 🔎 Explore our tokenizer database & submit yours: poonehmousavi.github.io/dates-website/…
📢 Presenting our paper “LiSTEN: Learning Soft Token Embeddings for Neural Audio LLMs” — an interpretable fine-tuning method for spoken language understanding. 🗓 Wed, Aug 20 | 08:30–10:30 📍 A11-P2B-03 Hope to see you there! 📄 arxiv.org/pdf/2505.18517 @ISCAInterspeech
Our pick of the week by @beomseok_lee_: "ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs" by Pooneh Mousavi, @yingzhi_wang, @mirco_ravanelli, and @CemSubakan (2025) arxiv.org/abs/2505.19937 #SLU #speech #multimodal #LLM
📢 Join our Conversational AI Reading Group! 📅 Thursday, June 19th | 11 AM - 12 PM EST 🎙 Speaker: Yuki Mitsufuji (@mittu1204) - SonyAI 📖 Topic: "AI for Creators: Pushing Creative Abilities to the Next Level" 🔗 Details: (poonehmousavi.github.io/rg)
"Discrete Audio Tokens: More Than a Survey!," Pooneh Mousavi, Gallil Maimon, Adel Moumen, Darius Petermann, Jiatong Shi, Haibin Wu, Haici Yang, Anastasia Kuznetsova, Artem Ploujnikov, Ricard Marxer, Bhuvana Ramabhadran, Benjamin Elizalde, Loren Lugosch… ift.tt/GA4ZC6u
🎵💬 If you are interested in Audio Tokenisers, you should check out our new work! We empirically analysed existing tokenisers from every angle - reconstruction, downstream, LMs and more. Grab yourself a ☕/🍺 and sit down for a read!
🌟🌟 Great collaboration, with a diverse all-star team led by @MousaviPooneh - check it out👇 📄Paper - arxiv.org/abs/2506.10274 🌐Website (+updating tokeniser DB!) - poonehmousavi.github.io/dates-website/
🚀 We're excited to announce our latest work: "Discrete Audio Tokens: More Than a Survey!" It presents a comprehensive survey and benchmark of audio tokenizers across speech, music, and general audio. preprint: arxiv.org/pdf/2506.10274 website: poonehmousavi.github.io/dates-website/
📢 Join our Conversational AI Reading Group! 📅 Thursday, June 12th | 11 AM - 12 PM EST 🎙 Speaker: Andros Tjandra 📖 Topic: "Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound" 🔗 Details: (poonehmousavi.github.io/rg)
📢 Join our Conversational AI Reading Group! 📅 Thursday, May 29th | 11 AM - 12 PM EST 🎙 Speaker: Yossi Adi @adiyossLC 📖 Topic: "On The Landscape of Spoken Language Models" 🔗 Details: (poonehmousavi.github.io/rg)
Learn about speaker diarization, the science behind it, and the future of diarization at @pyannoteAI research labs youtu.be/ECqxZgVevuI?fe…
... in which I'll talk about my decade-old love for speaker diarization and the loss functions used to train underlying neural networks https://t.co/f5WHG4UMVO
🗣️🧠 Speech Language Models require lots of compute to train, right? In our new paper, we test whether it is possible to train an SLM on 1xA5000 GPU in 24 hours. The results may surprise you (they even surprised us)! Tips, open source resources, full paper 👇🏻
@convAI2024 Thank you for having me, and thank you all the listeners! I had a great time 🙌 If you missed it, here's the recording and the slides! Recording: youtube.com/watch?v=REH034… Slides: poonehmousavi.github.io/assets/slides/…
📢 Join our Conversational AI Reading Group! 📅 Thursday, May 15th | 11 AM - 12 PM EST 🎙 Speaker: Wen-Chin Huang (@unilightwf) 📖 Topic: "Automatic Quality Assessment for Speech and Beyond" 🔗 Details: (poonehmousavi.github.io/rg), (youtube.com/@CONVAI_RG)
📢 Join our Conversational AI Reading Group! 📅 Thursday, May 8th | 11 AM - 12 PM EST 🎙 Speaker: Leda Sari 📖 Topic: "The Voicebox Model and Its Applications" 🔗 Details: (poonehmousavi.github.io/rg)

Catherine @shiromeshi007
237 Followers 1K Following AI Computing Engine: Its GPUs are widely used for deep learning and artificial intelligence training and reasoning, becoming the "gold standard" for AI hardware
Gisella @JacyntheAr48749
76 Followers 3K Following
Joan Serrà @serrjoa
2K Followers 564 Following Does research on machine learning at Sony AI, Barcelona. Works on audio analysis, synthesis, and retrieval. Likes tennis, music, and wine.
Frank Zalkow @swpffm
102 Followers 132 Following
Li Sheng /listen/ ass... @cs_lisheng
653 Followers 6K Following ◆2025 new faculty of Science Tokyo ◆Speech tech+multilingual+multimodal+security ◆Welcome collaboration, discussion CV: https://t.co/naL0tJB3sI
まっすー @ymas0315
2K Followers 2K Following
adamf @AdamForsted
58 Followers 609 Following
sanchit kabra @sanchitkabra4
155 Followers 545 Following CS grad @virginia_tech CS @bitspilaniindia ML/AI Why am i
傅丰元 Bob Fu @fm100
6K Followers 3K Following i make content&context, build in community. real-time ai/voice/video @rtedevcommunity & @AgoraIo | 灵感买家俱乐部 (Inspiration Buyers Club) | 离线 (Offline) | 利器 (Liqi) 🐦 Helping each other complete our own projects: https://t.co/OHxgXPINRJ
DG. @dataghees
1K Followers 6K Following scaling speech native LLMs @rimelabs the future is willed into existence. bioML, discovering new science, housing, industrial policy, local politics.
Parshin Shojaee @ParshinShojaee
3K Followers 1K Following PhD student @VT_CS | AI for Science, Math, Code, Reasoning | Intern @Apple | prev @Adobe
Sathvik Udupa @SathvikUdupa
65 Followers 564 Following Graduate Student, BUT Speech@FIT. Previously, SPIRE Lab, IISc.
Mori Kiyotada @KiyotadaMr
132 Followers 190 Following 2-year Master’s student specializing in speech recognition and perception. Living in Japan as a Japanese person.
Stefano Perna, Ph.D. @st3p_dot_io
76 Followers 311 Following AI Research Scientist @Translation and PhD student in Multimodal AI | Speech and Language Processing
Maryam Afshari @AfshariMaryam95
11 Followers 493 Following
chen zarfati @chenzarf
35 Followers 200 Following
Yigitcan Ozer @yiit_ozer_
290 Followers 634 Following postdoc @yamagishilab, NII | prev. research intern @SonyAI_global, Ph.D. at AudioLabs Erlangen, researcher @FraunhoferIIS
yingzhi wang @yingzhi_wang
38 Followers 93 Following Research on Speech & Audio, collaborator @SpeechBrain1
Enno Hermann @enno_hermann
195 Followers 527 Following Postdoc at @Idiap_ch - Speech. Coqui TTS fork maintainer.
Nonlinear Camel @nonlinear_camel
3 Followers 54 Following
Nima Nooshiri @nimanzik
474 Followers 995 Following Data Scientist at BDiM GmbH | PhD in Seismology | Digital Signal Processing | Applied Deep Learning | AI Dev | Prev.: @GFZ_Potsdam and @DIAS_Dublin
Yossi Adi @adiyossLC
894 Followers 377 Following Assistant Professor @ The Hebrew University of Jerusalem, CSE; Research Scientist @ Meta AI (FAIR); Drummer @ Lucille Crew 🤖🥁🎤🎧🌊
Julien Hauret @jhauret33
20 Followers 132 Following Ph.D. Student - Deep Learning & Speech Processing @LeCnam
ryu @ryu0000000001
316 Followers 185 Following Nothing is boring. No knowledge is irrelevant: only not relevant *yet*. - Jonathan Gorard
Ace Jiachen Luo @jiachenluo96
139 Followers 4K Following keep it simple and humble 😀 # multimodal foundation model, healthcare, human, society, ecology @CHUK @QMUL @Cambridge @UCAS
Avihu Dekel @AvihuDkl
286 Followers 570 Following Deep Learning Researcher at IBM. Sharing works I find interesting. Might also write about: Food, Cello, Cute animals, Israel and...
Ivan @_fentropy
400 Followers 1K Following Interested in Speech Recognition/Computer Vision/NLP/Bayesian ML. Wrote a bit in these languages: Python/R/C++. Lots of shitposting. RU (mainly)/EN
Loren Lugosch @lorenlugosch
2K Followers 997 Following Machine learning @ ; audio & language; Freigeisterei und Vielgeisterei; "at once a man of business and a man of rhyme"
kodhandarama(shreeram... @cricketrasika
127 Followers 1K Following PhD in speech synthesis, carnatic rasika, on a quest to visit all national parks
Anya @anyapiunova
36 Followers 559 Following
あまねゆみこ @amaneyumik6343
65 Followers 2K Following
VenusGrote @b53JS4LC2f16928
80 Followers 2K Following
armin zd @armin__zd
0 Followers 241 Following
MT Group at FBK @fbk_mt
1K Followers 441 Following #MachineTranslation Research Unit @FBK_research. #nlproc #deeplearning #ai
Beomseok LEE @beomseok_lee_
47 Followers 143 Following PhD student @uniTrento. Affiliated in @naverlabseurope and @fbk_mt. Ex research engineer @samsungresearch
Matteo Negri @negri_teo
425 Followers 508 Following Researcher at Fondazione Bruno Kessler, mainly on #machinetranslation and #NLProc.
Luisa Bentivogli @luisabentivogli
327 Followers 196 Following Head of the @fbk_mt research unit at @fbk_research Interested in #machinetranslation #nlproc #FairnessML • She/her • Views are my own
Hervé "pyannote" Bre... @hbredin
2K Followers 704 Following Hervé Bredin /👨🏻💻 Creator of 🎹 pyannote / ⚒️ Co-founder and CSO @pyannoteAI /👨🏼🔬 Researcher @CNRS (on leave)
الجزيرة - عا... @AJABreaking
3.0M Followers 1 Following Al Jazeera's round-the-clock breaking news coverage. For reports and coverage of events on the Arab and international scenes, follow our account @AJArabic
قناة الجزير... @AJArabic
24.0M Followers 26 Following Al Jazeera: the opinion and the other opinion. Follow our breaking news at @AJABreaking
BBC Dari @bbcafghanistan
762K Followers 5 Following Official BBC Dari account. Our WhatsApp number: 00448000121010. News and personal, social, economic, and arts stories from Afghanistan and the world.
Maryam Eslami @Maryam_Eslami
3K Followers 3K Following Research Scientist @UofIllinois & @FBK_research Materials Science & Electrochemistry (Cover photo by @mahedmousavi)
العربیه فار... @AlArabiya_Fa
462K Followers 47 Following Al Arabiya Farsi was launched in 2008 as part of the Al Arabiya network.
Barak Ravid @BarakRavid
398K Followers 822 Following Global Affairs Correspondent for Axios. CNN analyst. Washington correspondent for Israel's channel 12. Author of Trump's Peace. link in Bio
Denny Zhou @denny_zhou
22K Followers 540 Following Founded the Reasoning Team in Google Brain (now in the Gemini Core team of Google DeepMind). Build LLMs to reason. Opinions my own.
Alexis Conneau @alex_conneau
35K Followers 190 Following Co-founder and CEO https://t.co/efv72CKpAG (@WaveFormsAI) - Ex @OpenAI GPT-4o/AVM Audio Research Lead - #Her #TARS - Ex @AIatMeta, @Polytechnique (X11)
Minje Kim @minje_research
418 Followers 246 Following Associate Professor at CS@UIUC; Visiting Academic at Amazon Lab126; Want to share my thoughts on audio & AI research, graduate studies, and life.
Piotr Żelasko @PiotrZelasko
1K Followers 698 Following AI + Speech @ Nvidia. PhD @ AGH-UST, ex-JHU. My interests: speech processing technologies; ML/AI software engineering. Building OSS for Speech AI.
Yusuf Aytar @yusufaytar
1K Followers 149 Following Research Scientist @ DeepMind. Making machines smarter. Views are my own.
حامد عاقل @aghel_ir
14K Followers 5K Following A humble servant of God | My homeland is my life | #آقای_امام_حسین | Any insult gets you blocked
Xiaohua Zhai @XiaohuaZhai
11K Followers 311 Following Researcher at Meta (previously at OpenAI Zürich, Google DeepMind)
Convai_rg @convAI2024
252 Followers 1 Following
Jing Liu @JLiu_Compuling
367 Followers 1K Following 2nd year PhD student @CoML_ENS | Msc @LeuvenAi| ResMA @CLSRadboud| reverse engineer language acquisition using NN
Martijn Bartelds @BarteldsMartijn
533 Followers 375 Following Postdoctoral Scholar @stanfordnlp | Formerly @univgroningen, @tudelft and @Penn