Pratik Joshi @Roprajo
Research Engineer @GoogleDeepMind | Teaching machines to code | Prev @LTIatCMU @GoogleAI, @MSFTResearch @BITSPilaniGoa
pratikmjoshi.github.io
Mountain View, CA
Joined March 2018
Tweets 100
Followers 2K
Following 478
Likes 2K
I had such fun talking with Gretchen Huizinga about my research and all the wonderful work that I get to do with my amazing colleagues at @IndiaMSR
CodeGemma is out, it's fast and awesome! Check out the technical report for more details: goo.gle/codegemma.
Games like Fallout:New Vegas, Skyrim, Dark Souls, Witcher 3,GOW are immersive experiences that've told stories in a way no other medium can. Intrigued to see what's next, but more than the tech, it's about the creativity and vision behind it. A game with both will be a must-play.
This is so cool. Love the technical report analysis behind going for a careful synthetic pipeline (diffs, instruction schema, condensed error states) over using more noisy verbose real inputs.
Ever noticed how Pixar adapts movies for international markets? The beloved newscaster in Zootopia is a jaguar in Brazil, a panda in China, a koala in Australia … While machine translation (MT) has only dealt with language in speech/text thus far, we extend the scope of MT to…
We're starting to roll out API support for Gemini 1.5 Pro for developers. We're excited to see what you build with the 1M token context window! We'll be onboarding people to the API slowly at first, and then we'll ramp it up. In the meantime, developers can try out Gemini 1.5…
This is looking very very good...
This is such an exciting and important outcome of this effort, empowering new coders to use complex codebases and even contribute. Wish I had this when I first started coding, and used to hesitate to create a PR because I thought I misunderstood/missed something.
Today, I am very proud to share what we have been working on for the last 14 months. ✨ Introducing Aya -- a new state-of-the-art for massively multilingual models. 🔥🎉
# Portrayals of AI People sometimes read a bit too specifically into my bio "Building a kind of JARVIS". I name JARVIS in general terms only, as one of my favorite popular portrayals of an AI - a helpful, conversational, empowering e/ia automation. An aid against evil and…
History of American southern accents. I’m obsessed with how she switched through them so effortlessly
This is one of my favorite sounds I made for the game! The heartbeat sound is actually my daughter's heartbeat while she was still in the womb - I recorded it via the 3.5mm output of a baby doppler. The longer tonal elements are from a children's choir warming up in a gymnasium!
A nice way to end the year with some data: 66 Good News Stories You Didn't Hear About in 2023 futurecrunch.com/goodnews2023/
Gemini is here, and it's great at coding! I'm very excited to share some of my work on Gemini's coding abilities during last year. Check out: * our blog post: blog.google/technology/ai/… * our video on coding abilities: youtu.be/LvGmVmHv69s?si… #google #ai #codegeneration
I'm thrilled to have been a part of this huge team effort ♊! So happy to have contributed to code capabilities, long live code generation!
My 1.5yrs at MSR with Monojit and Kalika as mentors was an amazing research experience. Monojit is a thoughtful, caring advisor who challenges you to think outside the box. I've learnt a lot from his guidance. Highly recommend this opportunity!
How would you choose the best data instances to label, that maximize the performance of a model on target data? What if your target data is multilingual and you have no annotators in those languages? Our new work, DeMuX, addresses this problem. arxiv.org/abs/2311.06379 (1/n)
Presenting now at Nord-083, @ICCVConference !
Really enjoyed being part of this panel discussion on foundational models and safety with @random_walker, @ruchowdh, and @mbogen! Starts at 32:25. Thanks @CenDemTech for hosting this.
I'm heading to Paris for #ICCV2023 to present our work: openaccess.thecvf.com/content/ICCV20… This project was my first foray into multimodal ML and CV research, so this is daunting but exciting. Looking forward to seeing old faces and new! Feel free to DM to chat or meet.
Shruti Rijhwani @shrutirij
4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball player
Shaily @shaily99
5K Followers 2K Following PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on ethics and evaluation in #NLProc. Usually ranting, often about research & DEI. 📚 @readsndrants
Monojit Choudhury @monojitchou
3K Followers 556 Following Professor at @mbzuai, #AI #Ethics #NLProc #LinguisticsOlympiad #artlover #foodlover #traveller #philosopher #puzzlist, ex-Microsoft Research
Kritika Prakash @kritipraks
9K Followers 1K Following Researcher and artist. PhD in Computer Science @UChicago. Loves coffee, cats, cafes, cinnamon rolls, and chai. ENFJ-T.
Divy Thakkar @divy93t
5K Followers 2K Following Strategy, Programs & Product @GoogleAI , HCI Researcher. Ph.D @CityUniLondon Alumni @iift1963 @daiictofficial. Personal views.
Kayo Yin @kayo_yin
8K Followers 556 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵
Arkil @arkil_patel
757 Followers 828 Following PhD Student at Mila (@Mila_Quebec) and McGill (@mcgillu) | Research in ML/NLP | Prev @allen_ai @MSFTResearch | alum @bitspilaniindia
Sebastian Raschka @rasbt
267K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.
Sumanth @sumanthd17
2K Followers 1K Following PhD’ing @iitmadras @AI4Bharat, Google PhD Fellow, Past life - @GoogleAI @Mila_Quebec @IIITSC
Khyati Jain @jnkhyati
1K Followers 651 Following Supercharging teachers with GenAI @Google. Heart lies at @bitspilanigoa Prev adventures: @TeamSundial @MSFTResearch @riken_en @last9io @GoogleAi @HRI_Prayagraj
Partha Talukdar @partha_p_t
4K Followers 215 Following Researcher @googleai, Faculty @iiscbangalore, Founder @kenomeio
Vidhi Jain @viddivj
3K Followers 3K Following Graduate student at @CMU_Robotics. student researcher @Google @GoogleDeepMind Robotics. @MetaAI Resident 2021. Previously at @IndiaMSR, @bitspilaniindia She/her
Simran Khanuja @simi_97k
2K Followers 897 Following NLP | PhD Student @LTIatCMU | Predoctoral Researcher @Google | Microsoft Research | BITS Pilani, Goa
Vivek Gupta @keviv9
2K Followers 5K Following PostDoc @cogcomp UPenn | Ph.D. CS @UUtah | @iitkanpur. @Bloomberg & @MSFTResearch Fellow | ex-@MetaAI, @IBM, @Verisk, @samsungresearch, @Synopsys #nlp #ml
kalikabali @kalikabali
2K Followers 518 Following love reading, fiction and poetry. love food, making and eating. work in languages, and in technology. work harder as a mom. ponder over education for all.
Harshita Diddee @ihsrahedid
641 Followers 698 Following LTI PhD @SCSatCMU | Prev: RF at @MSFTResearch | Interested in Data Quality Estimation
Satwik Bhattamishra @satwik1729
407 Followers 643 Following CS PhD student at Oxford | Ex - Research fellow at Microsoft Research India, Undergrad at BITS Pilani
Siddharth Dalmia @siddalmia05
1K Followers 445 Following Research Scientist @GoogleDeepmind | #SpeechProc and #NLProc | PhD from @LTIatCMU @SCSatCMU | Ex-intern @GoogleAI, @AWSCloud, @FacebookAI
Sriram Rajamani @SriramRajamani
3K Followers 473 Following Geek, technologist, research junkie. Dad, husband, son, brother & uncle. Managing Director, MSR India. Working with wonderful colleagues and friends.
Aakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to change
Kanishka Utagikar @KanishkaUt18947
34 Followers 118 Following 19 || UG '27 || Machine Learning || Competitive Programming || Python || Looking to Explore
Gurgaon Meditation @MeditationIN
55K Followers 60K Following When you go to Truth by discarding your karma, habits and body which are your human mind and are reborn from there, you can become complete and live eternally.
KB @katiebowles_
642 Followers 5K Following Advancing AI for Healthcare at Scale at @AbridgeHQ | $150M Series C 🚀 | We're Hiring!
Peter Morales @PeterMoralesX
218 Followers 2K Following Founder of funded Stealth AI Startup. Interested in AI development at the edge? DM.
Alo @Hal90910
0 Followers 2K Following
uhsayer @uhsayer
80 Followers 1K Following
Pratyush Shukla @PratyushSh_
4 Followers 116 Following
Nicolas Keller @Nicolas_Keller
839 Followers 5K Following Interested in science-based startups. Having the time of my life @meshcapade; angel investor; ex Vsquared Ventures, ex @FRANKAROBOTICS; @iGEM alumnus
Faria Huq @FariaHuqOaishi
572 Followers 1K Following PhD Student @SCSatCMU working with @jeffbigham working on Agents 🤖 and Interaction📱. Prev- SGI Fellow'21 @MIT_CSAIL, Tero labs.
cmiller @cmiller41913842
83 Followers 3K Following Radiologist at Duke University Medical Center @DukeRadiology; student in #AI for Product Innovation @DukeEngineering; @StanfordEng
Budhalabs @budhalabs
8 Followers 20 Following
Deepak Sunny @s07492729
4 Followers 106 Following
Meet Jain @MeetJain495531
17 Followers 62 Following Harkirat Cohort 2.0 Week 8 | IITM BS Data Science | ST AI & ML Engineering TCET | TCET Open Source Executive Director | Frontend, Backend, AI, ML, Blockchain
ko @samuraii7777
147 Followers 2K Following
Hareessh P @PHareessh
34 Followers 315 Following
Sarthak Arora @sarthakvarora
63 Followers 222 Following Research @UCBerkeley | AI for Good Foundation | Ethics | Queer In AI (he/him)
Ram Samarth @chaostocolor8
5 Followers 159 Following 🌐 CSE Student @ IIIT KOTTAYAM | Graph Representation Learning 🤖 Federated Learning Enthusiast |
Hitesh Kandala @HiteshK03
78 Followers 360 Following Unlocking core memories by traveling ✨ Research Fellow @MSFTResearch | EE @IITBombay '22
Amit Vikram Raj @avr_027
7 Followers 387 Following Studying from Home | ML Engineering | NLP | Writing good code
Edmar Miyake @emiyake
38 Followers 462 Following
Franjo Ivancic @fivancic
341 Followers 783 Following Senior Staff Software Engineer & Manager at Google. https://t.co/GNlq6Pi68d
Dipesh Singnurkar @dipsss31
11 Followers 76 Following
APIMatic.io @APIMatic
2K Followers 1K Following Developer Experience Beyond API Docs 🚀 Generate high quality SDKs from your #OpenAPI definition with #DXAutomation
Mohith @Mohith7548
80 Followers 338 Following Data Scientist | ML Engineer | Computer Science Graduate
Amitkumar Rajpurohit @AmitkumarRajpur
46 Followers 867 Following Computer Science, Algorithms, Python, Distributed Systems, Machine Learning
rajkanna @thilina78247
36 Followers 786 Following
Prathamesh Devadiga @PrathameshD_8
8 Followers 78 Following Machine Learning, Deep Learning, NLP enthusiast | MLOps | Research @ IIT-Indore, @ MedInn-TechLabs | Co-Founder and President of Catalysis (NGO) | OxML '24
Sudhanshu @Sudhanshu_rgh
111 Followers 938 Following
Rahul Ramesh @theCoderDotIn
585 Followers 982 Following co-Founder/CTO at https://t.co/nMLYLCabOi Motorcycles | Computers | Good Food
Yann LeCun @ylecun
711K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.
Shruti Rijhwani @shrutirij
4K Followers 499 Following * Research Scientist @GoogleDeepMind * #NLProc research * PhD from @LTIatCMU * Amateur woodworker, scuba diver, foosball player
Shaily @shaily99
5K Followers 2K Following PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on ethics and evaluation in #NLProc. Usually ranting, often about research & DEI. 📚 @readsndrants
Andrej Karpathy @karpathy
978K Followers 904 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥
Monojit Choudhury @monojitchou
3K Followers 556 Following Professor at @mbzuai, #AI #Ethics #NLProc #LinguisticsOlympiad #artlover #foodlover #traveller #philosopher #puzzlist, ex-Microsoft Research
Graham Neubig @gneubig
31K Followers 586 Following Associate professor at CMU, studying natural language processing and machine learning.
Kritika Prakash @kritipraks
9K Followers 1K Following Researcher and artist. PhD in Computer Science @UChicago. Loves coffee, cats, cafes, cinnamon rolls, and chai. ENFJ-T.
Divy Thakkar @divy93t
5K Followers 2K Following Strategy, Programs & Product @GoogleAI , HCI Researcher. Ph.D @CityUniLondon Alumni @iift1963 @daiictofficial. Personal views.
(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K Following
Kayo Yin @kayo_yin
8K Followers 556 Following PhD student @berkeley_ai @berkeleynlp working on interpretability and signed languages. Former @msftresearch @deepmind @carnegiemellon @polytechnique. 🇫🇷🇯🇵
François Chollet @fchollet
469K Followers 770 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.
Arkil @arkil_patel
757 Followers 828 Following PhD Student at Mila (@Mila_Quebec) and McGill (@mcgillu) | Research in ML/NLP | Prev @allen_ai @MSFTResearch | alum @bitspilaniindia
Sebastian Raschka @rasbt
267K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.
Sumanth @sumanthd17
2K Followers 1K Following PhD’ing @iitmadras @AI4Bharat, Google PhD Fellow, Past life - @GoogleAI @Mila_Quebec @IIITSC
Khyati Jain @jnkhyati
1K Followers 651 Following Supercharging teachers with GenAI @Google. Heart lies at @bitspilanigoa Prev adventures: @TeamSundial @MSFTResearch @riken_en @last9io @GoogleAi @HRI_Prayagraj
Partha Talukdar @partha_p_t
4K Followers 215 Following Researcher @googleai, Faculty @iiscbangalore, Founder @kenomeio
Vidhi Jain @viddivj
3K Followers 3K Following Graduate student at @CMU_Robotics. student researcher @Google @GoogleDeepMind Robotics. @MetaAI Resident 2021. Previously at @IndiaMSR, @bitspilaniindia She/her
Simran Khanuja @simi_97k
2K Followers 897 Following NLP | PhD Student @LTIatCMU | Predoctoral Researcher @Google | Microsoft Research | BITS Pilani, Goa
Franjo Ivancic @fivancic
341 Followers 783 Following Senior Staff Software Engineer & Manager at Google. https://t.co/GNlq6Pi68d
Nithish Kannen @NithishKannen
450 Followers 2K Following Languages @GoogleAI | Ex- @AmazonScience London, @IBMResearch | @CNERG @IITKgp | #NLPProc
Abhinav Gupta @backpropper
803 Followers 5K Following phd student @Mila_Quebec | ms @CILVRatNYU @NYU_Courant | previously @GoogleDeepMind @AIatMeta @GoogleAI @labsdotgoogle @MSFTResearch @AdobeResearch
Joshua Howland @JoshuaHowland10
14 Followers 96 Following
Arnav Gupta @championswimmer
48K Followers 2K Following Mobile Apps | Engineering | Product | Memes ✉️ https://t.co/Ke95JbIJrn 🎙 https://t.co/BjdvqMlchx 💼 planet-scale entertainment platform 🚘 road trips @SayaniBh
Prashant Krishnan @GillyPrash
118 Followers 513 Following
Sedrick Keh @sedrickkeh2
158 Followers 185 Following research engineer @ToyotaResearch prev: machine learning @mldcmu interested in natural language generation and evaluation
Jack Rae @drjwrae
9K Followers 353 Following Principal Scientist @ Google DeepMind Work on Gemini 💎♊ Compression is all you need LLMs (e.g. Gopher, Chinchilla, Gemini) 💼 Past: OpenAI, Quora
Daniel Han @danielhanchen
7K Followers 934 Following Building @UnslothAI. Finetune LLMs 30x faster https://t.co/aRyAAgKOR7. Prev ML at NVIDIA. Hyperlearn used by NASA. I like maths, making code go fast
Kundan Krishna @kundan_official
538 Followers 547 Following PhD student at CMU. past gigs at Adobe, Amazon, Google.
Shreyas Anupindi @ravian_42
2 Followers 47 Following
Sara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.
Cohere For AI @CohereForAI
15K Followers 174 Following We are a research lab and open science initiative that seeks to solve complex machine learning problems. Join us in exploring the unknown, together.
Aman Sanger @amanrsanger
15K Followers 656 Following building @cursor_ai at @anysphere https://t.co/EdcQJ2dv0J | https://t.co/vJ5zNuT6WO
Nimesh Ghelani @nims11
107 Followers 231 Following Research Engineer @GoogleDeepmind Working on AI for Code
Center for Democracy .. @CenDemTech
39K Followers 2K Following The Center for Democracy & Technology. Shaping technology policy and architecture, with a focus on equity and justice. @CDTEU for our EU-based team.
Syeda Nahida Akter @SNAT02792153
152 Followers 477 Following PhD student at @LTIatCMU @SCSatCMU. Working on Multimodal Question Answering #NLProc
Sumangala Patki @sumangala_17
37 Followers 183 Following Robotics graduate student at UCSD | @BITSPilaniGoa '20 | Yoga Practitioner 🧘♀️
main @main_horse
8K Followers 474 Following AGI Believer. Haven't applied @OpenAI. Likes are not always endorsement.
Rachit Bansal @rach_it_
892 Followers 1K Following Pre-doctoral Researcher @GoogleAI • Prev. @dtu_delhi '22 @technionlive @AdobeResearch • Anything `science', ~cosmos, and Oxford commas
Eric Chu @its_ericchu
2K Followers 793 Following Research scientist @ Google DeepMind. AI reasoning + alignment/safety to help humans. Gemini, Bard, PaLM 2. Prev PhD @ MIT.
Sanket Vaibhav Mehta,.. @sanketvmehta
688 Followers 1K Following Research Scientist @GoogleAI | Ph.D. @LTIatCMU @SCSatCMU @CarnegieMellon | Past @AdobeResearch, @IITRoorkee
Federico Villa @federicovillaw
1K Followers 1K Following Design Lead, Gemini @GoogleDesign. Graduate Professor @CACollegeofArts. Sharing reflections on design, life and getting through. ⚡🇨🇴
Sai Avinash @avinash_70
51 Followers 179 Following
Mahesh Sathiamoorthy @madiator
9K Followers 932 Following LLMs and Data. Discuss about data for LLMs: https://t.co/x4iAft5cHV Ex-GoogleDeepMind
Alexandre Kirchmeyer @a_kirchmeyer
63 Followers 830 Following MSML @CMU | prev @PrincetonVL, @Polytechnique
Aakanksha Chowdhery @achowdhery
7K Followers 3K Following LLMs @ Google DeepMind :: PaLM, Gemini // Previously @MSFTResearch, @Stanford, @Princeton // views my own and subject to change
Anirudh Khatry @AnirudhKhatry
423 Followers 739 Following Incoming CS PhD @UTAustin | Research Fellow at @ProseMsft, @Microsoft | AI4Code | Guitarist | VJTI ‘21
Vedant Misra @vedantmisra
2K Followers 292 Following AI researcher @DeepMind (Gemini, Minerva, PALM) | Alum @OpenAI (Codex, Grokking) | @HubSpot | Founder/CEO Kemvi (acq HUBS) | Physics @Columbia
Ishita Mediratta @ishitamed
497 Followers 1K Following Researcher 🚀 @AIatMeta | Ex @GanAIOfficial @mldcmu @bitspilaniindia | Apple WWDC 2016 & Facebook F8 Scholar | AI 🤖, Cricket 🏏 & Bollywood 💃
Kaushik Shivakumar @19kaushiks
155 Followers 193 Following @GoogleDeepMind prev. BS and MS from @berkeley_eecs
Sidharth Raja @sidharth_raja
614 Followers 1K Following Capturing sound waves @Google. DEL → BLR → SFO.
Varun Godbole @VarunGodbole
259 Followers 617 Following Software engineer at @GoogleDeepMind. Working on Gemini.
Gabi Surita @gssurita
254 Followers 259 Following Building the Al counterculture. Sometimes converts coffee into code. (she/her) 🏳️🌈 🇧🇷
Dhruv Batra @DhruvBatraDB
14K Followers 323 Following Senior Director (FAIR @MetaAI). Professor (@GeorgiaTech). Co-founded CaliperAI. Researcher in AI. @CarnegieMellon alum.
Connor Shorten @CShorten30
16K Followers 15K Following Research Scientist @weaviate_io! Mostly working on Generative Feedback Loops with DSPy and Filtered ANN. Host of the Weaviate podcast! DSPy playlist below!
Anku Rani @anku__rani
405 Followers 446 Following NLP @UofSC | prev. @verisk @cactusglobal @apptio @pixiu_in @NITIAayog | post grad @plakshaUniv| under grad @unishivaji
Ritam Dutt @Ritam_Dutt
230 Followers 318 Following Ph.D. Student @ Language Technologies Institute, CMU
Nathan Benaich @nathanbenaich
51K Followers 32K Following solo member of investment staff @airstreet, brewing ambition @airstreetcafe, next token predictor @airstreetpress
rohan anil @_arohan_
12K Followers 2K Following Principal Engineer, @GoogleDeepMind Gemini. prev PaLM-2. Tinkering with optimization and distributed systems. opinions are my own.
Junkai Zhang @JunkaiZZ
438 Followers 1K Following CS Ph.D. student @UCLA | Machine Learning, Generative Model, AI for Science | Previous Mathematics B.S. @Tsinghua
Miranda Bogen @mbogen
3K Followers 1K Following Director of the AI Governance Lab @CenDemTech / responsible AI + policy
📢 Exciting new work on hierarchical generalization in transformers! Do all training objectives lead to hierarchical generalization?🌲 Nope! Language modeling objective is special. Curious as to why? 🧐 Generalizing hierarchically might be "simpler" for an LM.
📢 New Paper! Ever wondered why transformers are able to capture hierarchical structure of human language without incorporating an explicit 🌲 structure in their architecture? In this work we delve deep into understanding hierarchical generalization in transformers. (1/n)
New work on evaluating LLMs for generation in Indic Languages: IndicGenBench 👉5 diverse tasks, 29 Indic languages, >100k examples. 👉Curated using human translations ensuring high quality. 👉Multi-way parallel dataset. arxiv.org/abs/2404.16816 github.com/google-researc… (1/n)
Very pleased IndicGenBench is now out, lots of headroom for PhD students to start cracking 🙂 Particularly happy about the 29 languages coverage, including first-time generative evals (or any evals?) for many languages , eg, Garhwali, Konkani, Rajasthani, etc. Enjoy! @GoogleAI
📢 Releasing TRI's open-source Mamba-7B trained on 1.2T tokens of RefinedWeb! Mamba-7B is the largest fully recurrent Mamba model trained and is a state-of-the-art recurrent LLM. 🚀🚀🚀 huggingface.co/TRI-ML/mamba-7…
One of the most annoying things about twitter now is that when I put someone's name in the search or try and tag them, I am bombarded with a list of blue tick accounts instead of people I know and follow.
My team and I are moving from Google Research to Google DeepMind - with a bunch of other teams. Super pumped for the future! #DeepMindAI blog.google/inside-google/…
Build what you need and use what you build. This is a core philosophy of my research. It shifts the focus away from publishing “papers” to what really matters — impact. This thread unpacks why I think this is a successful approach to science. 1/10 Or see: perceiving-systems.blog/en/post/build-…
Awesome post! "Improvement only bottlenecked by high-quality annotated data"
At this point I feel like we understand pretty well what's going on with LLMs: - Outputs are roughly equivalent to kernel smoothing over positional embeddings (arxiv.org/pdf/1908.11775…) - The learned computation model is *probably* bounded by RASP-L (arxiv.org/pdf/2310.16028…) -…
@JeffDean liked my cool tshirt at the @agihouse_org hackathon today
We won the best paper award at @llm4code at @ICSEconf! Thank you to the amazing collaborators at @ProseMsft! A very special thank you to organizers of the first LLM4Code workshop!
Thanks to all the attendees for showing up to the LLM4Code workshop! Special congrats to all the award winning authors! 🔥 We hope to see you all again in #llm4code 2025! 🍁
Good feature for others to read and understand each metric/score easily. Similar to this, 2 click reproductions (2CR) in Pyserini gives you command to exactly reproduce the score, which I also find is helpful: castorini.github.io/pyserini/2cr/m…
Most leaderboards just give you scores, leaving one wondering: what does 76.8% mean? In HELM, we are committed to full transparency, meaning clicking on a score will reveal the full set of instances, and you can even inspect the exact prompt (which we know makes a big…
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments The first-of-its-kind scalable, real computer environment for multimodal agents, supporting task setup, execution-based evaluation, and interactive learning across various operating…
Check out our latest work at @ProseMsft which will be presented today at @llm4code at @ICSEconf!
Ever wish you could uncover data insights with just a snap of your fingers? 🤔 Say hello to automated insights – making data exploration a breeze! (1/7) #DataScience #AI #InsightAutomation
Curious about socially-intelligent AI? Check out our paper on underlying technical challenges, open questions, and opportunities to advance social intelligence in AI agents: Work w/ @lpmorency, @pliang279 📰Paper: arxiv.org/abs/2404.11023 💻Repo: github.com/l-mathur/socia… 🧵1/9
Given smarter open models really happy @ai_minion is a single tuned forward pass instead of a bunch of crazy routers, tools, prompts, etc etc
Bias and variance tradeoff machinelearningflashcards.com
The super exciting TED talk on the SixthSense technology by @pranavmistry 14 years back inspired me a lot in many ways over the years 🔥. Finally got a chance to meet him and discuss research 😍. The TED talk video which I have watched a thousand times: youtube.com/watch?v=YrtANP……
Postdoc opening in Languages group at Google Deepmind based out of Bangalore Topics: LLMs, multilinguality, multimodality, RAI, etc. Strong candidates may apply by sending cv to [email protected] with [LLM-Postdoc] in subject by Apr 26 DM/email for any questions
New work with @andrew_ilyas and @aleks_madry on tracing predictions back to individual components (conv filters, attn heads) in the model! Paper: arxiv.org/abs/2404.11534 Thread: 👇
How do model components (conv filters, attn heads) collectively transform examples into predictions? Is it possible to somehow dissect how *every* model component contributes to a prediction? w/ @harshays_ @andrewilyas, we introduce a framework for tackling this question!…
We are beyond excited to be hosting a meetup on May 1st in San Francisco: DSPy End-to-End! 🌉🔥 Super grateful to our collaborators @arizeai and @cohere for co-hosting this with us, and beyond excited to be featuring a talk from @lateinteraction! See you in San Francisco, it…
One prompt does not fit all language models ☝️ Luckily for you, DSPy automates the task of prompt engineering! Here is a thread with a few things to know about the collection of compilers in DSPy. It is also outlined in a new blog post from @CShorten30 and I, “Your Language…