Ankesh Anand @ankesh_anand
research scientist @googledeepmind, working on gemini // prev @mila_quebec, @googleai, @msftresearch, @iitkgp ankeshanand.com London, England Joined December 2011-
Tweets910
-
Followers3K
-
Following600
-
Likes5K
Let’s just say that 1.5 Pro’s abilities have been severely underestimated.
Bunch of improvements to Gemini 1.5 Pro today: * No waitlist on the API. * Better at reasoning, multimodal, instruction-following and a bunch of other evals. * Video+Audio support: Video inputs can now parse audio as well on AIStudio. developers.googleblog.com/2024/04/gemini…
【Gemini 1.5 Proの性能が群を抜いてすごい件。GPT-4、Claude 3を凌駕】 目でも見えないくらい細かい生成AI企業のカオスマップをGemini 1.5 Pronで解析したところ、5分くらいずっとAIが動いて企業名を書き出した。 GPT-4:解析不能 Claude 3:本当に一部のみ 比較すると差は歴然。…
Gemini 1.5 is free in AI Studio right now, with 1M context length. 😉 Waitlist should move pretty fast, sign up here: aistudio.google.com/app/waitlist/9…
Gemini 1.5 is free in AI Studio right now, with 1M context length. 😉 Waitlist should move pretty fast, sign up here: aistudio.google.com/app/waitlist/9…
Congrats to the anthropic team on shipping! GPQA is kinda the only eval I trust on this table, and opus seems clearly better there than any other publicly available model.
Congrats to the anthropic team on shipping! GPQA is kinda the only eval I trust on this table, and opus seems clearly better there than any other publicly available model.
For this simple retrieval task: - short input: Gemini 1.5 wins - medium input: methods other than Gemini 1.5 perform poorly - long input: methods other than Gemini 1.5 fail totally
IMO, Gemini 1.5 Pro is the largest leap in long form video understanding since I started my PhD in long form video + language understanding back in 2016, and more will be coming 🔥
IMO, Gemini 1.5 Pro is the largest leap in long form video understanding since I started my PhD in long form video + language understanding back in 2016, and more will be coming 🔥
I gave Gemini 1.5 Pro Mr. Beast’s latest video. It’s 22min and 347,849 tokens. I asked it to “Reply with [NAME OF GUY ON BLUE FOOTBALL] has [TOTAL CASH PRIZE POOL] of [FOOD ITEM BEING ADVERTISED].” Watch the model get it 100% correct. Insane leap for AI.
Shit, Google wasn't kidding. Gemini 1.5 Pro just went straight from a full movie to a summary in seconds. No transcription, no intermediate steps. Just visual tokens -> summary. Next up, validating the haystack tests.
To preempt any confusion: Multimodal queries don't go through Pro / Ultra yet, but that's coming soon too!
To preempt any confusion: Multimodal queries don't go through Pro / Ultra yet, but that's coming soon too!
How do the SOTA large Vision-Language Models (or Multimodal LLMs) performance on this task? A striking performance gap from average humans (e.g., 99% vs. 39-72% on the real-world subset)! Congrats to the Gemini team for the impressive lead over GPT-4V @JeffDean! Curious about…
🔥Breaking News from Arena Google's Bard has just made a stunning leap, surpassing GPT-4 to the SECOND SPOT on the leaderboard! Big congrats to @Google for the remarkable achievement! The race is heating up like never before! Super excited to see what's next for Bard + Gemini…
@eladgil @patrickc In AI at least, the real 30 under 30 imo you have never heard of. They are 5 layers down the org chart from the CEO. They are usually not on Twitter, they have an unmaintained LinkedIn, they don’t go on podcasts, and they maybe published at one point but don’t do so anymore. They…
Gather-Attend-Scatter (GATS), a novel module that combines pretrained foundation models operating at different rates into larger multimodal networks. Paper: arxiv.org/abs/2401.08525
We have some new findings on synthetic data I'm excited about: * Synthetic data improves mathematical reasoning, at scale! * Fine-tuning with synthetic data turns out to be better than fine-tuning on GT human written answers.
We have some new findings on synthetic data I'm excited about: * Synthetic data improves mathematical reasoning, at scale! * Fine-tuning with synthetic data turns out to be better than fine-tuning on GT human written answers.
the gemini team moves very fast and the energy is contagious! this is going to break a lot of narratives and assumptions in the coming year :) s/o to @LencKarel @antoine77340 and @drjwrae for pairing on this!
the gemini team moves very fast and the energy is contagious! this is going to break a lot of narratives and assumptions in the coming year :) s/o to @LencKarel @antoine77340 and @drjwrae for pairing on this!
Jim Fan @DrJimFan
228K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Irina Rish @irinarish
9K Followers 992 Following prof UdeM/Mila; Canada Excellence Research Chair; AAI Lab head https://t.co/UzlrC7ZrGF; INCITE project PI https://t.co/0rV7szd7rH; CSO https://t.co/XDhj6MEtUjDanijar Hafner @danijarh
14K Followers 867 Following Building AI that makes autonomous decisions using world models, artificial curiosity, and temporal abstraction @DeepMindEric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pyobibyte @y0b1byte
15K Followers 2K Following Kurin ViTaly, senior research scientist @IsomorphicLabs, ML PhD from @UniofOxford on RL, Multitask learning & GraphsBehnam Neyshabur @bneyshabur
18K Followers 689 Following Senior Staff Research Scientist @GoogleDeepMind, Interested in reasoning w. LLMs, traveling & backpackingDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Eugene Vinitsky @EugeneVinitsky
13K Followers 2K Following Anti-cynic. Artificial narrow intelligence. Autonomous vehicles, multi-agent learning, and transportation. RS at Apple, Asst. Prof at @nyutandon. He/him.Tim Rocktäschel @_rockt
29K Followers 1K Following Open-Endedness Team Lead @GoogleDeepMind, Professor of AI @AI_UCL, PI @UCL_DARK, @ELLISforEurope Scholar. ex @MetaAI (FAIR), @CompSciOxford. Opinions my own.Felix Hill @FelixHill84
9K Followers 777 Following Research Scientist, Deepmind I try to think hard about everything I tweet, esp on 90s football and 80s music None of my opinions are really someone else'sEthan Caballero is bu.. @ethanCaballero
8K Followers 2K Following ML PhD student @Mila_Quebec ; previously @GoogleDeepMindNathan Benaich @nathanbenaich
51K Followers 31K Following solo member of investment staff @airstreet, brewing ambition @airstreetcafe, next token predictor @airstreetpressSara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Hattie Zhou @oh_that_hat
5K Followers 764 Following Finding \hat{y} Give me anonymous feedback: https://t.co/7aBNrpbad8Miles Brundage @Miles_Brundage
43K Followers 10K Following Policy research at @openai. I mostly tweet about AI, animals, and sci-fi. He/him. Views my own.Andreas Kirsch 🇮�.. @BlackHC
9K Followers 4K Following Past: 🧑🎓 DPhil @AIMS_oxford @ExeterCollegeOx @UniofOxford (4.5yr) 🧙♂️ RE @DeepMind (1yr) 📺 SWE @Google (3yrs) 🎓 @TU_Muenchen 👤 Fellow @nwspkrohan anil @_arohan_
12K Followers 2K Following Principal Engineer, @GoogleDeepMind Gemini. prev PaLM-2. Tinkering with optimization and distributed systems. opinions are my own.Petar Veličković @PetarV_93
30K Followers 555 Following Staff Research Scientist @GoogleDeepMind | Affiliated Lecturer @Cambridge_Uni | Associate @clarehall_cam | GDL Scholar @ELLISforEurope. Monoids. 🇷🇸🇲🇪🇧🇦Rishabh Agarwal @agarwl_
6K Followers 539 Following Senior Research Scientist, @GoogleDeepMind, ex-🧠. Agents that make decisions. NeurIPS Best Paper (RLiable). Mila, IIT Bombay.Nathan Lambert @natolambert
25K Followers 684 Following Figuring out AI @allen_ai, "rl boi" DM me papers. Writes @interconnectsai, talks @retortai Has phd and some credentialsHassan Hayat 🔥 @TheSeaMouse
4K Followers 4K Following Building the AI assistant for all @ https://t.co/D4gDyw97gucharmingElis @EllisCharming
11 Followers 571 Following Technology & Entrepreneurship - Tweets/ReTweets/Opinions are my own.V Sriram @VSriram23
141 Followers 3K Following_Luminous_ @Luminous847311
0 Followers 400 Following Nice to meet you. My hobbies are reading, food and sports. I like cats😘 I like to meet new friends while traveling🎉🎉🎉Harsh Desai @dreamerharsh
1 Followers 3K FollowingHeather (Heat ☀️).. @blonde_buddha
67 Followers 499 Following Living in the moment, guided by mindfulness 🌿💜 | Minimalist by nature | Sharing the joy of yoga and mindfulness | Your guide to happiness 🧘♀️✨The Fool 🃏 @yourthefool
187 Followers 117 Following Esoteric ramblings from a divinity poisoned weirdo. Anarchist transhumanist. Anticapitalist tech enthusiast. θ∆ they/she 30 something. Queer AF.Samad Koita @samadkoita
25 Followers 442 FollowingShreyas Vaidya @shreyasvaidya23
159 Followers 1K Following Nothing beats the joy of solving interesting problems Third year UG majoring in CS @iitjodhpurThomas Paes @thpaes
90 Followers 1K FollowingMichał Jaroń @jaron_michal
2K Followers 3K Following Machine learning engineer in love with soccer analytics. ⚽️📈 Ex data analyst at Legia Warszawa. Alumni of University of Warsaw - Master of Computer Science.Rory Greig @rorygreig1
648 Followers 4K Following Research Engineer at Google DeepMind, interested in AI Alignment and Complexity Science.Eva Louise Marie Gabr.. @e681554349
6 Followers 3K FollowingAdham Elarabawy @adhamelarabawy
401 Followers 726 Following Machine Learning Research at https://t.co/KFczZn3Dmb | Previously @scale_ai @google, @berkeley_aiNaman Jain @StringChaos
903 Followers 887 Following CS PhD @UCBerkeley | Projects - R2E, LiveCodeBench, Chatbot-Arena Coding, RAFT, Data Quality | Past: @AWS @MSFTResearch @iitbombayшан симен @tyson_carl
242 Followers 5K Following Interested in Computational Psychiatry and Private EquityIshan @radshaan
13K Followers 2K Following Undergrad electrical engineering and computer science. Past: sw & fw for compact access control. Current: hw & fw for FSAE cars. Getting good at getting goodMichael Hajster @michaelhajster
127 Followers 338 Following Enterprise AI Consulting & Startups. Deeply committed to learning and leading in AI technology. Passionate about connecting and sharing insights.Ate-a-Pi @8teAPi
36K Followers 2K Following self aware neuron; historian from 2130; epistemic polluter; 95 yr old man;Sayan Chakraborty @shockrobortyy
177 Followers 877 Following ML @Qualcomm (prev: @BrownUniversity, @paytminsider, @bigbinary, @clarisights)Chunqi Li @lichunqi
58 Followers 519 FollowingMahyar Ebrahimi @EbrahimiMahyar
112 Followers 1K Following "Opinion is ultimately determined by the feelings, and not by the intellect." -- Herbert Spencerspaghettski @spaghettski
117 Followers 289 FollowingNithin singh @Nithin_sin
31 Followers 997 FollowingHumans 2.0 @IndianFriends
10K Followers 5K Following Higher intelligence from super natural resources & artificial intelligence through science, technology, healthcare, finance, management & divinity to enlighten!Martin Fan @perfectoid_ai
367 Followers 8K FollowingJim @Jim90165537
65 Followers 642 FollowingNick Gibb @gibbnicholas
217 Followers 1K Following Here for the endless LLM developments Work @hellobluedotCricco @cricco_bomber
405 Followers 513 Followingsummitbytes @summitbytes
65 Followers 191 Following Your go-to source for curated news on transformative science and technologyNehul Patel @Nehul1
97 Followers 463 Following🥀 @PsyNetMessage
469 Followers 3K Following⍼ₑᵣᵢ𝒸 @LericDax
7K Followers 4K Following applied anthropologist of tech, space, & media, VR/ML/AI researcher & developer, coder, cyberneticist, semiotician, founder @AzothCorp, lead @MnemosyneLabsbioshok(INFJ) @bioshok3
19K Followers 2K Following AGI/AI Alignment/Existential risk/X-risk/Super Intelligence/Singularity/技術トレンド https://t.co/2vsogSTe3X(MBTIの哲学) ルッキズムに祈るセカイ系 INFJ/5w4/ILI ※AI研究者ではありませんGoogle DeepMind @GoogleDeepMind
941K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.Jim Fan @DrJimFan
228K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Irina Rish @irinarish
9K Followers 992 Following prof UdeM/Mila; Canada Excellence Research Chair; AAI Lab head https://t.co/UzlrC7ZrGF; INCITE project PI https://t.co/0rV7szd7rH; CSO https://t.co/XDhj6MEtUjAI at Meta @AIatMeta
526K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.Rosanne Liu @savvyRL
32K Followers 965 Following Cofounded & running @ml_collective. Host of Deep Learning Classics & Trends. Research at Google DeepMind. DEI/DIA Chair of ICLR & NeurIPS. Writing https://t.co/IbycyGfnDRDanijar Hafner @danijarh
14K Followers 867 Following Building AI that makes autonomous decisions using world models, artificial curiosity, and temporal abstraction @DeepMindSoumith Chintala @soumithchintala
185K Followers 871 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.Lucas Beyer (bl16) @giffmana
56K Followers 442 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pEdward Grefenstette @egrefen
36K Followers 773 Following FR/US/GB AI/ML Person, Director of Research at @GoogleDeepMind, Honorary Professor at @UCL_DARK, @ELLISforEurope Fellow. All posts are personal.yobibyte @y0b1byte
15K Followers 2K Following Kurin ViTaly, senior research scientist @IsomorphicLabs, ML PhD from @UniofOxford on RL, Multitask learning & GraphsShane Gu @shaneguML
28K Followers 1K Following Research Scientist & Manager @GoogleDeepMind Tokyo/MTV. ex: @GoogleAI Brain, @OpenAI. (JP: @shanegJP)Sergey Levine @svlevine
79K Followers 122 Following Associate Professor at UC Berkeley Co-founder, Physical IntelligenceBehnam Neyshabur @bneyshabur
18K Followers 689 Following Senior Staff Research Scientist @GoogleDeepMind, Interested in reasoning w. LLMs, traveling & backpacking(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Marc G. Bellemare @marcgbellemare
13K Followers 350 Following CSO & co-founder, Reliant AI. Ex RL research lead at Google Brain, DeepMind. Known for Atari 2600 RL benchmark, Distributional RL (MIT Press 2023).Corry Wang @corry_wang
24K Followers 251 Following Strategy @ Google | Formerly tech equity research @ Bernstein Research. All opinions expressed are my own, and do not represent Google'sWen-Ding Li @xu3kev
2K Followers 5K Following Program Synthesis & ML. Previously Student Researcher at @google. Previously intern at @theteamatx. Mastodon: [email protected]Melvin Johnson @melvinjohnsonp
941 Followers 280 Following Researcher @ Google Research. Multilingual NLP and MT. Previously, Stanford CS.nisten @nisten
10K Followers 5K Following fullstack-dev democratizing intelligence @skunkworks_ai | 🦝.ai | prev https://t.co/68jAlAVBKR |nay 1.5 pro @nschucher
273 Followers 1K FollowingRamona Comanescu @ramona_crg
125 Followers 388 Following Research Engineer at @GoogleDeepMind working on Gemini, Fairness, Safety ♊️ Previously @Meta ML grad @Cambridge_Uni & CompSci @EdinburghUni @InfAtEdAbhishek Sinha @a7b2_3
278 Followers 665 Following Perception @Waymo | MSCS @Stanford Deep learning enthusiast..ardent arsenal fanudio @udiomusic
26K Followers 0 FollowingLogan Kilpatrick @OfficialLoganK
92K Followers 2K Following Lead product for @Google AI Studio and working on the Gemini API, helping developers build with AI, my views!Dwarkesh Patel @dwarkesh_sp
51K Followers 697 Following Being pretrained Host of Dwarkesh Podcast https://t.co/3SXlu7fy6N https://t.co/rEhnfYywXY https://t.co/hQfIWdM1UnGoogle AI Studio @googleaistudio
809 Followers 21 Following Google AI Studio is a free, web-based developer tool that enables you to quickly develop prompts and then get an API key to use in your app developmentAndy Coenen @_coenen
1K Followers 1K Following AI controllability @Google (PAIR) | Artist | Builder | Empowering creativityCognition @cognition_labs
123K Followers 19 Following Makers of Devin, the first AI software engineer. We are an applied AI lab focused on reasoning, and code is just the beginning. Join us: https://t.co/tpfZwEwGiqNikolay Savinov 🇺�.. @SavinovNikolay
1K Followers 0 Following Research Scientist at @GoogleDeepMind Work on LLM pre-training in Gemini ♊ 10M context length in Gemini 1.5 Pro 📈Joe Stanton @joe_stant
483 Followers 1K Following Engineer @ Google DeepMind, working on Gemini Inference & Deployment. Previously Tech Director at @RedBadgerTeamAlexander Chen @alexanderchen
8K Followers 1K Following Creative Director at Google Creative Lab, working on AI. Opinions are my own. https://t.co/bSOAeDObmzSimon @tokumin
1K Followers 2K Following Gemini, Labster. Previous: PaLM2, YouTube, Discover, Search, NBU, Pixel, Android, Carto, JRPass, UNEP, Conservation International/World Bank, SANBIJing Yu Koh @kohjingyu
3K Followers 486 Following Machine Learning PhD student @CarnegieMellon. Previously: fulltime vision-and-language research @GoogleAI, undergrad @sutdsg. 🇸🇬Machel Reid @machelreid
2K Followers 1K Following Research Scientist @GoogleDeepMind Working on LLMs on the Gemini Team; did gemini 1.5 proNithya Attaluri @attaluri_nithya
230 Followers 601 Following research engineer @googledeepmind // @miteecs @mitcsail bs & meng ‘23Mario Lucic @MarioLucic_
3K Followers 147 Following Staff Research Scientist @ https://t.co/pXedOGSgT3. Gemini Video and Audio-video understanding.Yi Tay @YiTayML
28K Followers 97 Following Chief scientist & Co-founder @RekaAILabs past: Research Scientist @Google Brain 🧠 currently learning to be a dad 🍼👶Dustin Tran @dustinvtran
40K Followers 649 Following Research Scientist at Google DeepMind. I lead evaluation at Gemini / Bard. AI, Bayesian statistics, deep learning.Antoine Yang @AntoineYang2
702 Followers 410 Following Research Scientist @GoogleDeepMind, Gemini multi-modal 💎. Prev: PhD @Inria & @ENS_ULM, MEng @Polytechnique.Isabel🌻 @isabelunraveled
24K Followers 1K Following understanding the self🪞🦋 new essay: https://t.co/k7PDN8kkCuCollin Burns @CollinBurns4
11K Followers 274 Following Superalignment @OpenAI. Formerly @berkeley_ai @Columbia. Former Rubik's Cube world record holder.Gabriel Dulac-Arnold @gabepsilon
767 Followers 577 Following RL Research Google @DeepMind, previously Google Brain, previously @DeepMind. Poking at neural networks to make them do something. 🔥🌍🔥 🏳️🌈 ✊Yikang Shen @Yikang_Shen
995 Followers 232 Following Research staff member at MIT-IBM Watson Lab. PhD from Mila.James Bradbury @jekbradbury
10K Followers 8K Following Compute at @AnthropicAI! Previously JAX, TPUs, and LLMs at Google, MetaMind/@SFResearch, @Stanford Linguistics, @Caixin.Clive Chan @itsclivetime
6K Followers 2K Following intelligence per picojoule @openai // prev dojo @tesla_ai raptor @spacex // proud sponsor of the 😌 emojiCem Anil @cem__anil
2K Followers 1K Following Machine learning / AI Safety at @AnthropicAI and University of Toronto / Vector Institute. Prev. student researcher @google (Blueshift Team) and @nvidia.Nitish @StrongDuality
3K Followers 1K Following language modeling research @OpenAI | views are my ownFuzhao Xue @XueFz
4K Followers 541 Following Ph.D. candidate@NUSingapore, Intern of GEAR @NVIDIA | Google PhD Fellow | LLM, Foundation Model Scaling | Ex-@GoogleBrain | Zero-shot Cooking Learner🧑🍳Gabriel Barth-Maron @gbarthmaron
657 Followers 402 Following Staff Research Engineer @GoogleDeepMind | 🎓 MSc @BrownUniversityKarel Lenc @LencKarel
92 Followers 50 FollowingXiang Yue @xiangyue96
2K Followers 421 Following Postdoc @LTIatCMU. PhD from Ohio State @osunlp. Training & evaluating foundation models. Pushing the boundaries of AI🤖. Previously @MSFTResearch.Sholto Douglas @_sholtodouglas
15K Followers 849 Following Scaling Gemini @Deepmind - working towards intelligence too cheap to meterWenhu Chen @WenhuChen
11K Followers 516 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.Fangyu Liu @hardy_qr
1K Followers 1K Following Research Scientist @GoogleDeepMind building Gemini♊. PhD @CambridgeLTL . BMath @UWaterloo . From 成都🐼.Swaroop Mishra @Swarooprm7
5K Followers 894 Following Research Scientist @GoogleDeepMind (Gemini). Pioneering LLM Research 🔥. Instruction tuning, Factuality, Reasoning and next gen Product. Opinions my own.Antoine Miech @antoine77340
1K Followers 454 Following Ornithologist @GoogleDeepMind 🦩, Gemini MultimodalSoravit “Beer” Ch.. @schangpi
316 Followers 571 Following A computer scientist from Bangkok, Thailand. @GoogleAI. Ex-member of @ShaLabUSC at @CSatUSC & @BrownCSDept. He/Him.Michela Paganini @WonderMicky
7K Followers 2K Following Staff Research Scientist @DeepMind | LLMs, Evals & Model Understanding | Previously: @facebookAI | @Yale Physics PhD | @CERN | @BerkeleyLab | @UCBerkeleyI am super excited to share our Llama3 preview models (8B and 70B). I am proud to have been a part of this amazing effort over the past 8 months. We still have some super cool stuff coming up in the coming months... until then, enjoy playing with these preview models…
1/n Gemini 1.5 Pro is surprisingly rich and insightful! Here's a small test I just did to answer the question "Explain why Talking Head AI that mimic humans speaking and expressions are primarily researched in Asia. What is the underlying motivations for this?" Here are…
We studied In-Context learning with hundreds to thousands of examples. My favorite example: I sent *one million* tokens to Gemini 1.5 Pro for linear classification with 64 dimensional integer-valued vectors and many-shot learning performs similarly to k-Nearest Neighbours.
@arankomatsuzaki If they allow you to keep the many-shot kv cache warm for you, that could replace fine-tuning for a good number of use cases. Probably easier to deploy given the weights are the same
If you haven’t tried AI Studio, you can try Gemini 1.5 pro for free! aistudio.google.com
After receiving community feedback, we added @GoogleDeepMind Gemini 1.5 Pro's results. 👇 Gemini 1.5 Pro's vision ability was significantly improved compared to 1.0 Pro and matched GPT-4's performance on our VisualWebBench! 🏆 Its action prediction (e.g., predicting what would…
🚀Introducing VisualWebBench: A Comprehensive Benchmark for Multimodal Web Page Understanding and Grounding. visualwebbench.github.io 🤔What's this all about? Why this benchmark? > Back in Nov 2023, when we released MMMU (mmmu-benchmark.github.io), a comprehensive multimodal…
Long context chat feels quite different and very impressive. I have been testing a long context chat with Gemini 1.5 pro preview that's well over 100k tokens, and the way it's able to draw on earlier conversational context feels like a huge step up in assistant capability.
Interesting to see a big new multimodal model come out, especially as Gemini 1.5 has been the only publicly available model that could work with video I have only played with it a bit, and the model seems pretty capable, but it hallucinates much more than Gemini on video so far
Meet Reka Core, our best and most capable multimodal language model yet. 🔮 It’s been a busy few months training this model and we are glad to finally ship it! 💪 Core has a lot of capabilities, and one of them is understanding video --- let’s see what Core thinks of the 3 body…
i'm at a loss. no one warned me multimodal input has made it this far. i fed one of my favorite songs, "the stars vs creatures" by colleen into gemini 1.5 pro, without telling it anything else. it can hear.
@OfficialLoganK Man, people in the comment section here are obviously confusing Gemini 1.5 Pro and Gemini Advanced. Two different monsters.
Can Gemini 1.5 actually read all the Harry Potter books at once? I tried it. All the books have ~1M words (1.6M tokens). Gemini fits about 5.7 books out of 7. I used it to generate a graph of the characters and it CRUSHED it.
New audio mode in Gemini is ridiculous. I uploaded a full raw 2+ hours of audio in there, and asked a direct question and in less than 30 seconds it processed 250K tokens of audio, and extracted valuable insights from this audio. Also in the the pic of @googleaistudio : 1)…
Excited to share Udio, a music generation app. I co-founded Udio with the brilliant @yaroslav_ganin @conormdurkan @DavidDingAI @avincentsanchez Thanks to them and to all our supporters. Let's make some music!
Introducing Udio, an app for music creation and sharing that allows you to generate amazing music in your favorite styles with intuitive and powerful text-prompting. 1/11
Guys… Gemini 1.5 is insanely good. It excels in transcribing, recognizing people being transcribed with sufficient examples, and multilingual capabilities. It surpasses both GPT-4 and Claude 3 in coding—an all-around amazing tool. A mind-blowing improvement over Gemini 1.0…
@ankesh_anand I've been telling everybody that Gemini is being massively slept on. Currently my favorite model.
@ankesh_anand @profjoeyg I think a similar arena for LMMs exists: github.com/OpenGVLab/Mult… (however I don't think they have as many models or as large a user-base) vlarena.opengvlab.com also does not seem to be as regularly updated as chatbot-arena.
Introducing Udio, an app for music creation and sharing that allows you to generate amazing music in your favorite styles with intuitive and powerful text-prompting. 1/11
@ankesh_anand Gemini 1.5 Pro is now even better 🙌