Anoop Kunchukuttan @anoopk
I am a researcher in the Machine Translation group at Microsoft India, and co-founder and co-lead of AI4Bharat, a research center at IIT Madras for Indian NLP. anoopk.in | Hyderabad, India | Joined September 2008
Tweets: 841
Followers: 808
Following: 426
Likes: 592
We are glad to present this shared task, following up on work at @ai4bharat to build resources for Indic MT. Looking forward to a lot of participation to address challenges with the most diverse language group (22 languages, 12 scripts, 4 families, 1.8B speakers).
AI4Bharat discord will go live soon! Time to involve the community at scale :)
How can we extend the capabilities of English LLMs to other languages? Sharing a survey that I did recently of the literature in this area: anoopkunchukuttan.gitlab.io/publications/p…
We are happy to share IndicLLMSuite, an open-source collection of resources and tools to build LLMs at scale for Indian languages, spanning 22 languages. It sets the stage for Indian LLMs and for deeper investigation of multilingual and multicultural LLMs. #AI4Bharat #indicnlp
🎉 🎉 🎉 Presenting our blog on IndicVoices! IndicVoices is an ongoing journey spanning 16,237 speakers, 145 Indian districts and 22 Indic languages! Blog: ai4bharat.iitm.ac.in/blog/indicvoic… Paper: arxiv.org/abs/2403.01926 Dataset: ai4bharat.iitm.ac.in/indicvoices/ Kindly help spread the word!
AI4Bharat releases the IndicVoices dataset: the most diverse speech dataset for Indian languages, spanning 22 languages, over 16,000 speakers, and 145 districts. You can read more here: linkedin.com/feed/update/ur… Quick links: Paper: lnkd.in/djXFmw7C Blog: lnkd.in/d6hKh8Bm
We thank Mr. Sunil Wadhwani for the generous support to his alma mater. We look forward to working closely with him to take the WSAI to great heights!! @rbc_dsai_iitm @cerai_iitm @IBSE_IITM
Happy to share that our paper has been accepted to EACL 2024. arxiv.org/abs/2305.05214 Work done during @KaushalMaurya94's internship at Microsoft. Rahul Kejriwal @msades
Presenting our early arxiv pre-print on RomanSetu: our exploration on using romanization to extend capabilities of English-heavy LLMs to languages using other scripts. It is an interesting hypothesis we are exploring, keep watching...
Happy Republic Day! Microsoft Translator now supports 2 more Indian languages: Manipuri and Chhattisgarhi. Happy to be part of this work! This continues our research to bring translation support to diverse languages. news.microsoft.com/en-in/microsof… @MicrosoftIndia @mstranslator
Beware, these websites have nothing to do with AI4Bharat!
It's kind of absurd to insinuate that open source AI is more dangerous than any other kind of AI. The more you can study something, the more you can understand it and think of counter-measures. There's a reason most secure servers on the internet run on Linux, an open source OS.
Kerala state IT textbooks teach GIMP and I've seen students making some cool artworks with it using a mouse. Here's this year's state IT Fair digital painting entries: schoolwiki.in/Ssitm:Homepage…
in the end, the closest AI analogy to the "source" in "open-source" is still the training data and a bit of recipe. not the few hundred lines of pytorch code (generic), nor the trained weights (a compiled executable). pushes the threshold quite high in today's AI world though
COLM is focused on language modeling broadly, where neurips/icml/iclr are focused on ML (broadly), and ACL* venues are focused on NLP (broadly). They overlap, but the focus is different, both in direction and scope. Anyway, submit your best work to #colm!
We are pleased to announce that we will begin recruiting AI residents (and associates) for 2024-25. The AI resident program is a year-long pre-doctoral program that allows you to work intensively on NLP, Speech and Vision projects. Apply below: forms.gle/WvZhDm8sM5Go1m…
Calling all IndicTrans2 users... 🥁🥁 We are happy to announce that now both the model and TOKENIZER are AutoClass callable. This makes the entire setup end-to-end HF compatible, and streamlines PEFT/fine-tuning procedures with the HF trainer. [1/n]
Yesterday we had our first paper reading session in the @ai4bharat discord covering LoRA, DoRA and REFT (@aryaman2020). This is to be the first of many. Recordings: drive.google.com/drive/u/0/mobi… Link to discord: discord.com/invite/KSkKswzh Join us!
It’s the year 2030. Meta has released LLaMa 9 ProMax. GPT-8 can answer your MMLU question before you finish typing. The baseline for multilingual LLMs is still BLOOM and Aya.
CreoleVal has been accepted to TACL! Hearty congratulations to all authors, especially @heather_nlp who grinded hard to get this work complete! Updated manuscript to go up soon. This marks my 4th paper on Creoles!
Extremely excited to present CreoleVal which is a collection of multilingual multitask benchmarks for Creoles. This is the first of its kind effort to collect data and train models spanning 28 Creoles. arxiv.org/abs/2310.19567 1/N
Most of y'all don't live in Japan and it shows! 🫢
@anoopk Thanks for sharing the survey. I added Airavata, Navarasa & other models on indic.chat to collect real world data. Will also expand it to leaderboard in future.
Presenting a talk about large language models to an audience of judges and judicial officers in Canberra tomorrow. I hope they judge me well.
📢 🦁Thrilled to share the v1 of @allen_ai 🦁 WildBench -- benchmarking LLMs with real user tasks on curated hard tasks, instance-specific checks, length penalty on Elo, fair comparisons, updating datasets, human evaluation collection, and community-driven contributions.
Introducing AI2 WildBench! We aim to benchmark LLMs with challenging tasks from real users in the wild. 🤗 Link: hf.co/spaces/allenai… 🤩 What great features does it offer? 🌟x9 ⬇️ 🌟1. Challenging & Real: We carefully curate a collection of 1024 hard…
We have a short paper accepted at NAACL Findings! We study the role of subwords in multilingual MT, comparing the ability of different subword methods to induce cross-lingual synergy and reduce interference. Paper: arxiv.org/pdf/2403.20157… (work with @janmbuys)
@aakrit Innovation is happening in India. Brilliant research scientists are building core technology, not wrappers. Unfortunately we do not have investors in India that can back such tech. This is a key differentiator. Investors in India back products & services hence no innovation…
Thrilled to receive the prestigious Annual Research Excellence Award at @IITHyderabad for the @cse_iith department! 🏆 #Honored #phdlife #ResearchExcellence #greatful
EMNLP 2024 will take place in Miami, Florida from Nov 12th to Nov 16th, 2024, at the Hyatt Regency Miami Hotel. More information: 2024.emnlp.org #EMNLP2024
Santhosh is a prolific open source pioneer through his work on Wikipedia/Wikimedia. A good opportunity to ask everything you wanted to.
Hosting an AMA (Ask Me Anything) at r/developersIndia now. reddit.com/r/developersIn…
Congratulations to Pranjal Dutta, winner, and Jogendra Nath Kundu, honorable mention, of the @Indiaacm Doctoral Dissertation Award. TCS is proud to be the founding sponsor for this #award - bit.ly/3TiEmYq @TCS #TCSResearch #InventingForImpact #PhDChat #AcademicTwitter
The Era of 1-bit LLMs. Problems with full-precision LLMs: Large Language Models (LLMs) have achieved remarkable results in natural language processing, but this comes at ever-increasing model size. The size of LLMs poses issues for…
The paper considers the impact of word order and its effect on various NLU tasks. Interesting work 👏. I believe follow-up work should focus on information extraction tasks and explore concepts like dependency length and valency.
LLMs perform tasks well even given scrambled sentences. When the model can reconstruct the unscrambled sentence, it ignores order, but only then 🧵 arxiv.org/abs/2402.18838 Chen O'Donnell @sivareddyg @Mila_Quebec @mcgillu