Hugo @Mldhug

PhD student in multimodal learning for audio understanding at @telecomparis, ex-MVA (ENS Paris Saclay)

Joined August 2019

Tweets: 12

Followers: 40

Following: 337

Likes: 51

Salah Zaiem @salah_zaiem

7 months ago

Given a number of ASR models of different sizes, how can I allocate an audio sample to the smallest one that will be good enough? @Mldhug worked on this question during his internship, and ended up with interesting conclusions you will find in our paper!

Hugo @Mldhug

7 months ago

1 4 12 1K 2

0 3 9 1K 2

Maksym Andriushchenko 🇺🇦 @maksym_andr

8 months ago

It's really surprising how far one can go with *linear* predictors in the autoregressive setting. Interesting theory and experiments on TinyStories: a linear model (with 162M params :-) ) can generate totally coherent text with few grammatical mistakes. arxiv.org/abs/2309.06979

4 46 296 78K 141

Puyuan Peng @PuyuanPeng

10 months ago

Why is Whisper so robust to background noise? Not because Whisper suppresses them, but because Whisper 𝐮𝐧𝐝𝐞𝐫𝐬𝐭𝐚𝐧𝐝𝐬 them! Check out the amazing work by Yuan Gong @YGongND. They reveal this emergent capability of Whisper, and get SOTA *simultaneous* ASR + audio tagging

4 19 129 12K 53

Jack (in SF) Langerman @jacklangerman

10 months ago

Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language. Looks promising; I'll have to try and see if it stands up to some poking ;-) Love that they get around the need for multimodal training. ar5iv.org/abs/2306.16410 github.com/ContextualAI/l…

0 1 8 1K 3

AK @_akhaliq

10 months ago

DreamDiffusion: Generating High-Quality Images from Brain EEG Signals paper page: huggingface.co/papers/2306.16… paper introduces DreamDiffusion, a novel method for generating high-quality images directly from brain electroencephalogram (EEG) signals, without the need to translate…

12 148 608 137K 210

Wei-Ning Hsu (Attending ICASSP) @mhnt1580

11 months ago

Super excited to finally launch Voicebox🤩, the most versatile speech generative model ever💬👄 Demo page: voicebox.metademolab.com

AI at Meta @AIatMeta

11 months ago

52 448 2K 441K 524

7 29 196 416K 39

Lucas Beyer (bl16) @giffmana

11 months ago

Who killed non-contrastive image-text pretraining? @AlecRad and @_jongwook_kim with the below Fig2 in CLIP. Who collected the 7 Dragonballs and asked Shenron to resurrect it? Yours truly, in this new paper of ours. Generative captioning is not only competitive, it seems better!

17 93 582 211K 302

Orr Zohar @orr_zohar

11 months ago

Thanks for tweeting, @ak! We’re super excited about the future of text-only vision model selection! 🙏 @MarsScHuang @kcjacksonwang @syeung10

AK @_akhaliq

11 months ago

1 15 75 35K 22

1 6 15 19K 7

AK @_akhaliq

11 months ago

Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration paper page: huggingface.co/papers/2306.09… Although instruction-tuned large language models (LLMs) have exhibited remarkable capabilities across various NLP tasks, their effectiveness on other data…

3 37 141 54K 51

Sagar Vaze @Sagar_Vaze

11 months ago

We'll present GeneCIS at #CVPR2023 (Highlight) TL;DR: While most image representations are *fixed*, we present a general way to train and evaluate models which can adapt to different *conditions* on the fly. Code: github.com/facebookresear… Project page: sgvaze.github.io/genecis/ 🧵

AK @_akhaliq

11 months ago

1 15 108 53K 34

1 15 69 33K 28

Markita Capurro @CapurrMarkit

65 Followers 5K Following

Suzanna Victorine @SuzannaVic50137

102 Followers 5K Following

Leana Brueckner @BruecknerL46438

36 Followers 5K Following

Myrtie Tench @MyrtieT36381

72 Followers 5K Following

Claire Longsworth @ClaireLong38076

24 Followers 4K Following 😙19 - Join my free content👇💛

Lin Fahrenbruck @fahrenbruc18863

52 Followers 5K Following

Aurian @sound_au

4 Followers 37 Following PhD student at Telecom Paris in ADASP Group. Currently working on learning music representations.

Jorgi Kukla @JorgiKuk

83 Followers 5K Following

Kelli Bradford @KelliB58262

126 Followers 3K Following

Maysa Senesenes @mays_senesen

27 Followers 5K Following

Sandie Tewell @SandieTewe23786

11 Followers 2K Following Sandie / Lets CamChat👇

Morgan Guzon @guzo_mor

37 Followers 5K Following

Huck Yang @huckiyang

569 Followers 528 Following Sr. Research Scientist @NVIDIAAI Generative Error Correction | Ph.D. @GeorgiaTech | Past: @GoogleAI @AmazonScience | 🗣️ education

Michel Olvera @michelolzam

163 Followers 1K Following PhD in Computer Science @Univ_Lorraine. Machine learning and signal processing enthusiast. Researcher at @telecomparis.

Elena Georgieva 🎶 @elenatheodora1

588 Followers 619 Following PhD Candidate @nyuMARL. Former lecturer & M.A.'19 @Stanford @CCRMA I do music/AI research, audio engineering, and vocals. she/her

Jing Liu @JLiu_Compuling

295 Followers 1K Following 1st year PhD student @CoML_ENS | Msc @LeuvenAi| ResMA @CLSRadboud| interested in memory-augmented language models

Becca Baker @BeccaBaker73854

144 Followers 3K Following

Matthieu Rouif @matthieurouif

9K Followers 3K Following Co-founder & CEO @photoroom_app (YC S20) - Photo editor to sell it like a pro. ex GoPro, Stanford, Polytechnique

Michal Valko @misovalko

5K Followers 2K Following Llama @AIatMeta Paris & Inria & MVA - Ex: Gemini and BYOL @GoogleDeepMind

Maureen de Seyssel @Maureendss

513 Followers 638 Following PhD from @CoML_ENS in speech, ml and cognition. Ex research intern @MetaAI. @CoML_ENS. unsupervised (multilingual) speech representations

Thibaut Loiseau @thibaut_loiseau

2 Followers 47 Following

Yasser Dahou @dahou_yasser

171 Followers 614 Following PhD Candidate at Dublin City University.

Yassine Elkheir @YassineElkheir

21 Followers 416 Following

Axel @Axelito9803

2 Followers 21 Following

Robin Alg @AlgayresR

96 Followers 164 Following Research Scientist at @mbzuai on LLM for biology. Formerly PhD at @ENS_ULM @Inria and gap year at @AIatMeta (FAIR) https://t.co/cvmsk7mbUs

Ognjen Todic @ognjen_todic

495 Followers 2K Following Entrepreneur & Engineer. Building on-device speech recognition solutions @ https://t.co/mrhEaR4f9O and organically growing the business.

Salah Zaiem @salah_zaiem

548 Followers 2K Following Research Scientist @GoogleDeepMind. Speech and Audio processing. Alumni @Polytechnique and @telecomparis

Raphaël Smagacz @RaphSma

179 Followers 221 Following 💼 Head of communications at France 3 Nouvelle-Aquitaine, @F3PoitouChtes area and #NoA // @comf3aqui

Maxime Poli @mmaximepoli

27 Followers 118 Following PhD student @CoML_ENS, @ENS_Ulm

Malard Loic @LoicMalard70333

8 Followers 11 Following

Pourcel Julien @PourcelJulien

20 Followers 388 Following PhD student at @inria (@flowersinria team) | @ENS_ParisSaclay (MVA)

Amric Trudel @AmricTrudel

46 Followers 107 Following Consultant Data Scientist @ OCTO Technology // Alumni @ 42 Paris // Ancien président de 42-AI // Alumni @ McGill University

ANDREWS Roland @ANDREWSRoland1

32 Followers 260 Following phd Sierra + Willow . X2019-MVA deep learning, optimisation, diffusion

Shashs @Shashs4

8 Followers 155 Following

Lena Dupont @LenaDupont16

1 Followers 2 Following

Daniel Jack @Stritzer91

175 Followers 291 Following Netanyahou est un nazi, et leurs soutiens le sont tout autant. Mort au gouvernement israélien

Laura @Laura_Schoo

6 Followers 23 Following

Xavier @xavier_brt

323 Followers 490 Following

Mr-Thibzer @LemassonThibau1

12 Followers 51 Following ⚡️🏀⚡️

youm @youmgui

40 Followers 141 Following

Yixiao Zhang @Yixiao_Zhang_

640 Followers 284 Following Intern at @stabilityai, ex @sonyai_global @yamahamusic_jp @MSFTResearch intern; final year AI Music PhD student @c4dm and @apple. I am looking for full-time job

Leo Boytsov @srchvrs

7K Followers 2K Following Sr. Research Scientist @AWS Labs (ph-D @LTIatCMU) working on unnatural language processing, speaking πtorch & C++. Opinions sampled from MY OWN 100T param LM.

Xin Eric Wang @xwang_lk

7K Followers 1K Following Multimodal and Embodied AI Researcher / Professor @UCSC. Director of https://t.co/Y4swOBag21. AI for Humanity in the long run. he/him

Aurian @sound_au

4 Followers 37 Following PhD student at Telecom Paris in ADASP Group. Currently working on learning music representations.

Emmanouil Benetos @em.. @emmanouilb

1K Followers 635 Following Reader @QMEECS @c4dm @intelsensing @DERI_QMUL and Turing Fellow @turinginst - research on machine listening / audio analysis.

Annamaria Mesaros @AnnamariaMsros

193 Followers 64 Following Associate Professor, Tampere University, Finland

Hilde Kuehne @HildeKuehne

3K Followers 763 Following Professor for CS at @UniBonn and affiliated Professor at MIT-IBM Watson AI lab @MITIBMLab - Multimodal learning and video understanding

Suno @suno_ai_

33K Followers 0 Following Make any song you can imagine

Katie Kang @katie_kang_

1K Followers 418 Following PhD student @berkeley_ai

Alexander H. Liu @alex_h_liu

138 Followers 109 Following Ph.D. Student @MIT_CSAIL

Elio Quinton @elio_elioo

780 Followers 536 Following AI - Audio - Music. VP of AI @UMG. Former @c4dm @qmul

Russ Salakhutdinov @rsalakhu

100K Followers 140 Following UPMC Professor of Computer Science at Carnegie Mellon University, ex-Director of AI research at @Apple, co-founder Perceptual Machines (acquired by Apple)

Hao-Wen (Herman) Dong.. @hermanhwdong

768 Followers 258 Following Incoming Assistant Professor at University of Michigan | PhD Candidate at UC San Diego | Generative AI for Music & Audio

ParisAI @parisai

1K Followers 109 Following #AI meetup for bluechips, startups & academics in Paris with a strong product focus. Run by @raffikamber @AlexCFlamant @nathanbenaich

Gaurav @mitts1910

1K Followers 234 Following Senior Researcher @ Microsoft working on Responsible AI and Multi-modal Content Understanding

Irina Rish @irinarish

9K Followers 994 Following prof UdeM/Mila; Canada Excellence Research Chair; AAI Lab head https://t.co/UzlrC7ZrGF; INCITE project PI https://t.co/0rV7szd7rH; CSO https://t.co/XDhj6MEtUj

Prateek Jain @jainprateek_

4K Followers 624 Following Learning machine learning at Google Research

Timothy Gowers @wtgow.. @wtgowers

45K Followers 188 Following Mathematician. Professor holding the Combinatorics chair at the Collège de France. Also fellow of Trinity College Cambridge.

Huck Yang @huckiyang

569 Followers 528 Following Sr. Research Scientist @NVIDIAAI Generative Error Correction | Ph.D. @GeorgiaTech | Past: @GoogleAI @AmazonScience | 🗣️ education

Wei Ping @_weiping

783 Followers 220 Following Principal Research Scientist @NVIDIA. Working on large language models and generative models. Views are my own.

Rafael Valle @RafaelValleArt

941 Followers 179 Following Research Manager and Scientist at NVIDIA. UC Berkeley alumn. Love, music and motorcycles!

zhifeng kong @ZhifengKong

59 Followers 28 Following Research Scientist at NVIDIA

vik @vikhyatk

7K Followers 521 Following teaching computers how to see // prev: @awscloud

Peeters Geoffroy @GeoffroyPeeters

909 Followers 209 Following Full professor at Telecom Paris, Institut Polytechnique de Paris (audio signal processing, deep learning, music information retrieval)

Stefan Lattner @deeplearnmusic

1K Followers 186 Following MusicAI @ Sony CSL Paris

Sanjeel Parekh @SanjeelParekh

17 Followers 37 Following Research Scientist at Meta Reality Labs Research

Javier Nistal @latentspaces

825 Followers 507 Following Researcher @SonyCSLMusic. Former PhD @TelecomParis_ | @MIDASconsoles | Intern @JukedeckRandD, @SoundCloud. Exploring deep generative models for sound and music

Eduardo Fonseca @edfonseca_

1K Followers 558 Following Research Scientist @GoogleAI. Sound Understanding. Previously @mtg_upf. He/him.

Amirhossein Habibian @amir_habibian

911 Followers 290 Following Research Scientist (Director) @Qualcomm AI Research

Alon Ziv @lonziks

313 Followers 52 Following PhD Student @ The Hebrew University of Jerusalem; ex Research Scientist Intern @ Meta AI (FAIR); ex DL Researcher @ Mobileye.

Negar Rostamzadeh @negar_rz

10K Followers 809 Following Sr RS @Google Research

Seungwhan Shane Moon @shane_moon

584 Followers 197 Following | Research Scientist @ Facebook | | PhD @ LTI SCS, CMU |

Djamé.. @zehavoc

6K Followers 3K Following Associate professor in NLP, engaged citizen. Tweeting about work, life and stuffs that I care about. All my tweets can be used freely. Personal account.

DCASE Challenge @DCASE_Challenge

641 Followers 29 Following Challenge on Detection and Classification of Acoustic Scenes and Events https://t.co/j9544Zu0yJ https://t.co/cQqlshpkU7…

Laurent Mazare @lmazare

1K Followers 75 Following Co-founder and CTO @kyutai_labs

Mathieu Fontaine @MathieuFontai19

160 Followers 226 Following Assistant Professor at Telecom Paris. Working on machine listening.

Felix Kreuk @FelixKreuk

2K Followers 759 Following AI Research @ FAIR, Meta AI. CS PhD from BIU.

Soham Deshmukh @sohamdesh_

130 Followers 228 Following speech and audio @Microsoft @CarnegieMellon

Vivek Kumar @vivek_kumar

2K Followers 614 Following Senior Manager, Sound Understanding at @googleai. Ex @Dolby & @Broadcom. Talks and Investments 👉🏽 https://t.co/Iqmk4l7YMF

Benjamin Elizalde @benjaminelizal

79 Followers 60 Following Researcher; Traveler

Sepp Hochreiter @HochreiterSepp

10K Followers 395 Following Pioneer of Deep Learning and known for vanishing gradient and the LSTM. I mostly tweet about random ArXiv papers which sparked my interest.

Alaa El-Nouby @alaa_nouby

521 Followers 302 Following Research Scientist at @Apple. Previous: @Meta (FAIR), @Inria, @MSFTResearch, @VectorInst and @UofG . Egyptian 🇪🇬 Deprecated twitter account: @alaaelnouby

Victor Sanh @SanhEstPasMoi

9K Followers 2K Following Dog sitter by day, Scientist at @huggingface 🤗 by night

Nando de Freitas 🏳.. @NandoDF

97K Followers 659 Following I research intelligence to understand it and to harness it wisely. Part of AlphaGo tuning, AlphaCode, learning to learn, Lyria, Imagen2, Gato, rGemma

Jade Copet @jadecopet

598 Followers 343 Following AI Research @ FAIR, Meta AI

Polina Kirichenko @polkirichenko

3K Followers 1K Following PhD student at New York University, Visiting Researcher at @MetaAI FAIR Labs 🇺🇦

Hamid Eghbalzadeh @heghbalz

2K Followers 5K Following AI Research @AIatMeta, Opinions @ MyOwn.

Elena Georgieva 🎶 @elenatheodora1

588 Followers 619 Following PhD Candidate @nyuMARL. Former lecturer & M.A.'19 @Stanford @CCRMA I do music/AI research, audio engineering, and vocals. she/her

Natalie Schluter @natschluter

5K Followers 486 Following #NoJusticeNoPeace-- Machine Learning Researcher at Apple MLR-- All tweets/opinions my own

Magdalena Fuentes @mfu3ntes

791 Followers 265 Following Assistant Professor of Music Technology and Integrated Design & Media at @nyuMARL and @IdmNyu

Lucas Beyer (bl16) @giffmana

2 days ago

If you ever actually looked at these benchmarks, the model predictions, and what the claimed "human performance" means, you would know. Hint: it's unrelated to intelligence. Looks like many people, especially more prominent ones, are commenting and opining blindly.

Pedro Domingos @pmddomingos

4 days ago

Interesting how in all these domains AI is asymptoting at roughly human performance - where's the AI zooming past us to superintelligence that Kurzweil etc. predicted/feared?

265 148 1K 1.1M 389

7 2 84 23K 21

Rafael Valle @RafaelValleArt

2 weeks ago

Audio Dialogues is finally out! It describes how we leveraged pre-trained LMs and joint audio and language embeddings to produce a dataset that gives Audio LLMs the ability to have multi-turn dialogues with users. arxiv.org/abs/2404.07616

0 6 51 3K 27

Salah Zaiem @salah_zaiem

3 weeks ago

Hello Twitter! A few weeks ago, I defended my PhD thesis (Title below). I want to thank everybody who joined or helped along the way, and especially my supervisors, jury members and colleagues. I have since joined the Google DeepMind team here in Paris. Good things ahead (I hope 🤞!)

15 3 92 4K 3

Salah Zaiem @salah_zaiem

a month ago

Hello Twitter, I am happy to announce that I will be defending my PhD thesis next week! 👨‍🎓🔔 🕞When? Monday, March 25th, 15h30. 📗Title: Informed self-supervised speech representation learning. 🏫Where? @telecomparis Feel free to DM me for a Zoom link or for details to join!

7 3 64 3K 0

Armen Aghajanyan @ArmenAgha

a month ago

There is a commonly held belief that Transformers have no inductive bias and that this bias is learned throughout the training process. This is not true. Transformers have very strong inductive biases.

4 26 281 173K 219

François Chollet @fchollet

2 months ago

People saying "Hollywood is over" remind me of those who were saying "Google is over" one year ago. Same intellectual caliber.

45 32 497 65K 36

(((ل()(ل() 'yoav))))👾 @yoavgo

2 months ago

many say now "to do video gen well, the system must learn a world model and understand the physics" but to me *the* big lesson from LLMs is how much impressive performance can be faked *without* underlying model and semantic understanding, just by mimicking observed patterns.

40 36 520 48K 67

Jia-Bin Huang @jbhuang0604

2 months ago

R2: While the results are impressive, this is a simple combination of diffusion transformer (ICCV 2023) and latent diffusion model (CVPR 2022). Limited novelty. Weak reject.

OpenAI @OpenAI

2 months ago

Introducing Sora, our text-to-video model. Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions. openai.com/sora Prompt: “Beautiful, snowy…

10K 33K 141K 95.8M 40K

56 157 2K 362K 297

Grady Booch @Grady_Booch

3 months ago

If you need $7 trillion to build the chips and the energy demand equivalent to the consumption of the United Kingdom, then - with a high-level of confidence - I assure you that you have the wrong architecture.

134 365 3K 645K 209

François Fleuret @francoisfleuret

3 months ago

I am getting a bit confused by the benchmarks / leaderboard for LLMs. Every 5 min there is a tweet about a company I never heard about whose 1b model crushes whatever 30b model was the king 5 min before. We need a TL;DR of the LLM landscape.

30 17 315 59K 58

AK @_akhaliq

3 months ago

Nvidia presents Audio Flamingo A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities paper page: huggingface.co/papers/2402.01… Augmenting large language models (LLMs) to understand audio -- including non-speech sounds and non-verbal speech -- is critically…

2 96 497 45K 209

Alon Ziv @lonziks

4 months ago

Happy to share MAGNeT 🧲! pages.cs.huji.ac.il/adiyoss-lab/MA… A single non-autoregressive model, for text-to-music and text-to-sound generation, with quality on par with SOTA models, while being 7x faster. We open-sourced our code (including training) on audiocraft! + a Gradio demo. [0/6]

22 163 624 267K 453

Maureen de Seyssel @Maureendss

5 months ago

Proud to announce that I successfully defended my PhD yesterday at @ecolenormalesup! 👩‍🎓 🥳 I'm so grateful to have been accompanied throughout by the two best directors Emmanuel Dupoux (@CoML_ENS) and Guillaume Wisnewski, who even agreed to pose for a quick post-defence picture📸

8 0 72 5K 2

Titouan Parcollet @ParcolletT

5 months ago

By the way, to the Dr. 3/10 reviewer from any typical ML conference: speech is not a "narrow" field. ML conferences explicitly call for papers dealing with Audio.

1 2 24 3K 1

Andrew Ng @AndrewYNg

5 months ago

@geoffreyhinton I'd like to respectfully point out that the logic in this argument is based on a flawed model for how scientists think. Scientists don't just take a weighted average of others' opinions to form their own. A good scientist takes as input lots of data, including others' opinions,…

236 655 8K 1.1M 800

Demis Hassabis @demishassabis

6 months ago

Thrilled to share #Lyria, the world's most sophisticated AI music generation system. From just a text prompt Lyria produces compelling music & vocals. Also: building new Music AI tools for artists to amplify creativity in partnership w/YT & music industry deepmind.google/discover/blog/…

96 539 3K 1.2M 1K

Yannic Kilcher 🇸🇨 @ykilcher

6 months ago

I have a secret for you... #manufacturedoutrage

Nikki Teran @DrNikkiTeran

6 months ago

Will releasing the weights of large language models grant widespread access to pandemic agents? Turns out, yes, probably. 1/5

61 109 463 631K 354

18 28 538 81K 27

Friedemann Zenke @hisspikeness

6 months ago

1/6 We updated “Implicit variance regularization in non-contrastive SSL,” by @manu_halvagal and @AxelLaborieux with new results for NeurIPS camera-ready: arxiv.org/abs/2212.04858 neurips.cc/virtual/2023/p…

2 38 152 29K 71

Robin Alg @AlgayresR

7 months ago

We have a second accepted paper at #EMNLP2023 (with @adiyossLC) on Generative Spoken Language Modelling (textless NLP). We managed to generate clean speech sentences using continuous units instead of the discrete HuBERT. @CoML_ENS

1 2 15 1K 2

Salah Zaiem @salah_zaiem

7 months ago

Hugo @Mldhug

7 months ago

The preprint of our work (with @salah_zaiem and @AlgayresR) on sample-dependent ASR model selection is available on arXiv! In this paper we propose to train a decision module that, given an audio sample, selects the smallest model sufficient for a good transcription.