Dylan Slack @dylanslack20
Researcher at ... Ph.D. @UCIbrenICS. Prev @awscloud and @googleAI. I tweet about misc findings + plug my papers dylanslacks.website San Fransisco Joined March 2019-
Tweets376
-
Followers566
-
Following565
-
Likes5K
Do LLMs hold knowledge that might be dangerous in the hands of a malicious user? Can hazardous knowledge be unlearned? Introducing WMDP: an open-source eval benchmark of 4,157 multiple-choice questions that serve as a proxy measurement of LLM’s risky knowledge in biosecurity,…
📣 Announcing the release of the WMDP LLM benchmark, designed by Scale’s Safety, Evaluations, and Analysis Lab (SEAL) in partnership with @ai_risks (CAIS)! 🧵 scale.com/blog/measuring…
@xanderatallah .@scale_AI will build exactly this :) coming soon!
New paper! Q-probe is a lightweight approach to RL on top of LLMs. We learn a linear value function on the LLM embeddings and use a variant of rejection sampling to define a policy. Results linked in the thread from first author @ke_li_2021 on coding problems and RLHF. 🧵
New paper! Q-probe is a lightweight approach to RL on top of LLMs. We learn a linear value function on the LLM embeddings and use a variant of rejection sampling to define a policy. Results linked in the thread from first author @ke_li_2021 on coding problems and RLHF. 🧵
My colleague Willow Primack used DALL-E to illustrate Allen Ginsberg’s Howl, and it was just too good not share (with permission). Here’s a teaser. Howl, Illustrated by AI I saw the best minds of my generation destroyed by madness, starving hysterical naked
Excited to share our work, "Skill Set Optimization", a continual learning method for LLM actors that: - Automatically extracts modular subgoals to use as skills - Reinforces skills using environment reward - Facilitates skill retrieval based on state allenai.github.io/sso 🧵
[#eacl2024 paper] TL;DR We introduce 𝗴𝗿𝗮𝗱𝗶𝗲𝗻𝘁-𝗯𝗮𝘀𝗲𝗱 𝗿𝗲𝗱 𝘁𝗲𝗮𝗺𝗶𝗻𝗴 (𝗚𝗕𝗥𝗧), an effective method for triggering language models to produce unsafe responses, even when the LM is finetuned to be safe through 𝑎𝑙𝑖𝑔𝑛𝑚𝑒𝑛𝑡.
Hiring for two Product Leads for our skyrocketing Generative AI Business scale.com/careers/435774… scale.com/careers/431576…
Hiring for two Product Leads for our skyrocketing Generative AI Business scale.com/careers/435774… scale.com/careers/431576…
intern applicants: i'm actually very likely more interested in your crazy little unfinished side project or nerdy interest than your gpa – you can proudly show them!
The @scale_AI team put a spotlight⚡️on our work "Pushing Mixture of Experts to the Limit" arxiv.org/abs/2309.05444 and showcase the impact on LLaMA-2. Really nice blog post + working implementation. 🔥 scale.com/blog/fine-tuni…
First talk of the day by @sameer_ has begun!! Join us at the XAI-in-Action workshop in Room 271 !
If you happened upon a tin of Cafe Du Monde coffee at #NeurIPS2023, allow extra time at the airport for TSA to double/triple check it
Scale's MLE @dylanslack20, presented his poster “Post Hoc Explanations of Language Models Can Improve Language Models” at NeurIPS 2023 yesterday! He presented a method that uses post-hoc model explanations to automatically construct Chain-of-Thought (CoT) examples, leading to…
Tuesday 5:15-7:15 pm #1424: Post Hoc Explanations of Language Models Can Improve Language Model With @SatyaScribbles , @Jiaqi_Ma_ , @dylanslack20 , @sameer_ , @hima_lakkaraju
Scale is at #NeurIPS2023 in New Orleans! What’s better than a shoggoth sticker? A sax-playing shoggoth sticker, of course! 🎷 🐙 Drop by booth #1222 in Hall D for demos of our latest advancements powering gen AI and autonomy including the Automotive Foundation Model. Make sure…
Will be at #NeurIPS2023 next week! If you’re an LLM researcher / research engineer interested in robust evaluations, safety, red teaming or scalable oversight, let’s chat! Mainly hiring for SEAL but also happy to chat about collaboration opportunities.
Walmart tech vet, AI2 research scientists are behind new Seattle startup Spiffy geekwire.com/2023/walmart-t… via @geekwire
this model is really impressive, congrats to the team on the release! 🎉
𝙷𝚒𝚖𝚊 𝙻.. @hima_lakkaraju
16K Followers 836 Following Professor @Harvard; PI @ai4life_harvard; Co-founder @trustworthy_ml; #AI #ML #Safety; Stanford PhD; MIT @techreview #35InnovatorsUnder35Sameer Singh @sameer_
7K Followers 2K Following Cofounder @SpiffyAI and Assoc Prof at @UCIrvine, working on reliable LLMs, explanations for AI+ML, adversaries for NLP, and debugging/evaluation.Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Ofir Press 🖋 @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Ana Marasović @anmarasovic
4K Followers 604 Following Asst prof @UUtah · Ex @allen_ai @uwnlp postdoc @HD_NLP PhD · she/her 🇭🇷Stephan Mandt @StephanMandt
2K Followers 556 Following ML Professor @UCIrvine, previously @blei_lab, @Princeton. #GenerativeAI, #Compression, #AI4Science. Program Chair @aistats_conf 2024; General Chair AISTATS 2025Asma Ghandeharioun @ghandeharioun
2K Followers 489 Following Research Scientist @GoogleAI working on ML interpretability & human-centered AI, PhD from @MITBill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscSunnie S. Y. Kim @sunniesuhyoung
2K Followers 1K Following PhD student @VisualAILab @PrincetonHCI. AI transparency and explainability. First name pronounced as sunny☀ she/her https://t.co/c3atPcWlR1Shaily @shaily99
5K Followers 2K Following PhD @LTIatCMU Prev: @GoogleAI @MSFTResearch. Working on ethics and evaluation in #NLProc. Usually ranting, often about research & DEI. 📚 @readsndrantsJasmijn Bastings @jasmijnbastings
4K Followers 2K Following Sr Research Scientist @GoogleDeepMind. Interested in gender, feminism, fairness, bias & ethics in #NLProc/#AI. Views my own. She/they.RosalindHansom @Evc6I610GYV2M
0 Followers 180 FollowingAdam @Adam81935513
1 Followers 12 FollowingArif Ahmad @arif_ahmad_py
278 Followers 7K Following All things AI, Computer Science and Circuits! Prev. @GoogleAIBrian Huang @brianryhuang
1K Followers 1K FollowingDave xrp lion @Davexrp_lion1
457 Followers 3K FollowingAW @AW489030117156
8 Followers 23 Followingweb3 marketer @MarketerWeb3
1 Followers 4K FollowingSummer Yue @summeryue0
1K Followers 218 Following Director of Safety and Standards at Scale AI. Prev: RLHF lead on Bard, researcher at Google DeepMind / Brain (LaMDA, RL/TF-Agents, superhuman chip design)James Kang @JamesKa29162538
18 Followers 125 Followingsrste @contactsrste
39 Followers 224 FollowingWhat makes sense @car_heroes
45 Followers 183 FollowingJunlan Lu @junlan_lu
132 Followers 377 FollowingJiao Sun @sunjiao123sun_
2K Followers 365 Following Research Scientist at Google Gemini \n\n NLP PhD @ USC, Amazon ML Fellow \n\n ex-{Google Brain, Alexa AI} nlper, IIIS Tsinghua-RenZac Kenton @ZacKenton1
1K Followers 1K Following Research Scientist in AI safety at DeepMind. Views are my own and don't represent DeepMind.Ahmad Beirami @abeirami
4K Followers 2K Following Building safe, helpful, and scalable generative AI @Google | ex-{@AIatMeta, @EA, @MIT, @Harvard, @DukeU} | @GeorgiaTech PhD | زن زندگی آزادی | opinions my ownchaissa @chaissa12
347 Followers 5K FollowingYo Shavit @yonashav
4K Followers 830 Following policy for v smart things @openai. Past: CS PhD @HarvardSEAS/@SchmidtFutures/@MIT_CSAIL. Tweets my own; on my head be it.ƒ @fjlin
90 Followers 5K FollowingIan Fisher @ianafisher
93 Followers 48 Following Software engineer. Ex-Google. Creator of https://t.co/5iqNSYMf6O. I write Outsider Art on Substack: https://t.co/IdhhOXdpMoliuyong @forrestbing
265 Followers 5K Following I am a researcher in AIGC, Multi-modality and VitrualHuman tech direction☁️ @rototavukcu
72 Followers 536 Following personal microblog of a researcher & man of simple pleasures.Yangsibo Huang @YangsiboHuang
1K Followers 726 Following PhD candidate @Princeton. Prev: @GoogleAI @AIatMeta.MARkkkk @mazhenguan1
82 Followers 570 FollowingLinetteMarroquin @LinetteMar8501
50 Followers 2K Followingnick nassuphis @NNassuphis
120 Followers 5K FollowingRahul @rahulkharwadkar
190 Followers 354 Following Technologist, Startup ex-entrepreneur, P&L leader, Semiconductor industryEmily Li @EmilyLiJiayao
382 Followers 624 Following Researcher @carnegiemellon | Founder @acadiaai | Prev research @modern_ai, ML @ evolution_devices, Founder @ arquestssern | Data-centric & Multimodal AIAbdullah Mamun @AB9Mamun
26 Followers 122 Following Ph.D. Student at Arizona State University - Computer ScienceRongfei Lu @lrf138
163 Followers 567 Following Legal AI, data privacy, aerospace and policy | @uber, @StanfordAILab, @CodeXStanford, @HooverInst, @dartmouthElisa Nguyen @_elinguyen
267 Followers 472 Following PhD Student in STAI group (https://t.co/MMYVUHhbeR) at @uni_tue and @MPI_IS (IMPRS-IS), part of @KImachtSchule and @VivaconAguaEric Wallace @Eric_Wallace_
6K Followers 1K Following Researcher at OpenAI working to make language models more trustworthy, secure, and private.Alexandr Notchenko @Gang1man
783 Followers 5K Following Engineer ⋂ Scientist ⋂ Maker Prev: Co-founder and CTO at https://t.co/YweWtzzgM8 PhD grad from @Skoltech Organiser and Co-Founder of @ods_aiRyan Connor 🟪 @_RyanRConnor
720 Followers 3K Following Research @blockworksres. The most profitable arb is time horizon. e/acc since before it had a name. at ryanconnor on farcasterBurny — Effective O.. @burny_tech
14K Followers 6K Following Transhuman engineer in singularity! Lover of AI & omnidisciplionary metamathemagics! Hypercuriousia! Omniperspectivity! Shapeshifting metafluid! Freedom 4 all!Faria Huq @FariaHuqOaishi
573 Followers 1K Following PhD Student @SCSatCMU advised by @jeffbigham reimagining Agents 🤖 and Interaction📱. Prev- SGI Fellow'21 @MIT_CSAIL, Tero labs.Hugh Zhang @hughbzhang
1K Followers 522 Following open source ai @scale_AI. co-created @gradientpub.Jun Yuan @yuanjunandrew
4 Followers 43 Following Ph.D. candidate of Data Science at New Jersey Institute of Technology (NJIT)Xiaotian (Max) Han @XiaotianHan1
951 Followers 1K Following CS Ph.D. @TAMU | Ex-Research Intern @Amazon @Meta @Snap | #machine_learningDarko @Darko1521056
272 Followers 4K Following𝙷𝚒𝚖𝚊 𝙻.. @hima_lakkaraju
16K Followers 836 Following Professor @Harvard; PI @ai4life_harvard; Co-founder @trustworthy_ml; #AI #ML #Safety; Stanford PhD; MIT @techreview #35InnovatorsUnder35Yann LeCun @ylecun
712K Followers 719 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingAndrej Karpathy @karpathy
979K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxFrançois Chollet @fchollet
470K Followers 769 Following Deep learning @google. Creator of Keras. Author of 'Deep Learning with Python'. Opinions are my own.Sameer Singh @sameer_
7K Followers 2K Following Cofounder @SpiffyAI and Assoc Prof at @UCIrvine, working on reliable LLMs, explanations for AI+ML, adversaries for NLP, and debugging/evaluation.Emma Pierson @2plus2make5
9K Followers 797 Following CS faculty @cornell_tech; past @MSFTResearch, @StanfordAILab, @Rhodes_Trust scholar. Health+inequality+ML. "On the whole, though, I take the side of amazement."Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzKristian Lum @KLdivergence
22K Followers 1K Following Research Scientist at Google DeepMind | @FAccTConference OG | Past Twitter META, @hrdag & UPenn, UChicago faculty |Sebastian Raschka @rasbt
267K Followers 906 Following Machine learning & AI researcher writing at https://t.co/A0tXWzG1p5. LLM research engineer @LightningAI. Previously stats professor at UW-Madison.Tal Linzen @tallinzen
16K Followers 893 Following Professor @nyuling and @NYUDataScience, research scientist @GoogleAISara Hooker @sarahookr
39K Followers 7K Following I lead @CohereForAI. Formerly Research @Google Brain @GoogleDeepmind. ML Efficiency at scale, LLMs, @trustworthy_ml. Changing spaces where breakthroughs happen.Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Dan Roy @roydanroy
45K Followers 2K Following ML / AI researcher, emphasis on theory. Research Director and Canada CIFAR AI Chair, @VectorInst Professor, @UofT (Statistics/CS)Percy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistChristoph Molnar @ChristophMolnar
30K Followers 1K Following Author of Interpretable Machine Learning https://t.co/gJKlTA2deP | Newsletter: https://t.co/6fQuMr8yI8Christopher Manning @chrmanning
127K Followers 116 Following Director, @StanfordAILab. Assoc. Director, @StanfordHAI. Founder, @stanfordnlp. Prof. CS & Linguistics, @Stanford. IP @aixventureshq. 🇦🇺 Do #NLProc & #AI. 👋Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pBesmira Nushi 💙�.. @besanushi
2K Followers 739 Following Researcher @MSFTResearch artificial intelligence, human-machine collaboration, technology & society.Jingna Zhang @zemotion
78K Followers 485 Following Gundam pilot wannabe. Photographer, AD. Building new social platform for art 👉 @cara_hq | https://t.co/iM0FwRT0Qz ✨ in Tokyo!🗼Zac Kenton @ZacKenton1
1K Followers 1K Following Research Scientist in AI safety at DeepMind. Views are my own and don't represent DeepMind.Aish Fenton @aishfenton
2K Followers 2K Following Director of Machine Learning at Netflix, FP fan, Kiwi, and cat dad. He/Him.Ahmad Beirami @abeirami
4K Followers 2K Following Building safe, helpful, and scalable generative AI @Google | ex-{@AIatMeta, @EA, @MIT, @Harvard, @DukeU} | @GeorgiaTech PhD | زن زندگی آزادی | opinions my ownKate Park @goldenkatepark
10K Followers 150 Following Director of Product @Scale_AI, Previously Staff AI Product Manager @Tesla Autopilot, @Palantir, @Uber, @Google, Stanford CSPeyman Milanfar @docmilanfar
67K Followers 264 Following Distinguished Scientist at Google Research. Computational Imaging, Machine Learning, and Vision. Tweets = personal opinions. May change or disappear over time.Groq Inc @GroqInc
46K Followers 470 Following Creator of the LPU™ Inference Engine, providing the fastest speed for AI applications, designed & engineered in N. America https://t.co/DsEqVAC5DpTeortaxes▶️ @teortaxesTex
7K Followers 1K Following Ours is the age of unaligned utilitarians. Other problems are relatively unimportant, but sometimes I tweet about them anyway. (кто/кого)Xian Li @xl_nlp
2K Followers 242 Following Research Scientist @MetaAI. NLP, ML. Opinions are my own.Yo Shavit @yonashav
4K Followers 830 Following policy for v smart things @openai. Past: CS PhD @HarvardSEAS/@SchmidtFutures/@MIT_CSAIL. Tweets my own; on my head be it.Ian Fisher @ianafisher
93 Followers 48 Following Software engineer. Ex-Google. Creator of https://t.co/5iqNSYMf6O. I write Outsider Art on Substack: https://t.co/IdhhOXdpMoRussell Kaplan @russelljkaplan
11K Followers 652 Following Director of engineering @Scale_AI. Former startup founder, ML scientist @Tesla Autopilot, researcher @StanfordSVL.Alexis Conneau @alex_conneau
24K Followers 113 Following Audio AGI Research Lead @OpenAI - GPT-Next - Past: XLM, Unsupervised ASR, Unsupervised MT, Wav2vec 2.0/XLSR, MUSE, Unsupervised cross-lingual transferYangsibo Huang @YangsiboHuang
1K Followers 726 Following PhD candidate @Princeton. Prev: @GoogleAI @AIatMeta.Rongfei Lu @lrf138
163 Followers 567 Following Legal AI, data privacy, aerospace and policy | @uber, @StanfordAILab, @CodeXStanford, @HooverInst, @dartmouthElliot Creager @elliot_creager
561 Followers 527 Following machine learning research ~ assistant professor @WaterlooENG ~ faculty affiliate @VectorInst @TorontoSRI i draft my tweets in /tmp/ (he/him)Devendra Chaplot @dchaplot
8K Followers 365 Following Building next-gen AI at @MistralAI. Past: Research Scientist at Facebook AI Research. Ph.D. @SCSatCMU, BTech @iitbombay CS.Guillaume Lample @GuillaumeLample
37K Followers 648 Following Cofounder & Chief Scientist https://t.co/hLfvKLkFHd (@MistralAI). Working on LLMs. Ex @MetaAI | PhD @Sorbonne_Univ_ | MSc @CarnegieMellon | X11 @PolytechniqueMistral AI @MistralAI
91K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCPFaria Huq @FariaHuqOaishi
573 Followers 1K Following PhD Student @SCSatCMU advised by @jeffbigham reimagining Agents 🤖 and Interaction📱. Prev- SGI Fellow'21 @MIT_CSAIL, Tero labs.Asma Ghandeharioun @ghandeharioun
2K Followers 489 Following Research Scientist @GoogleAI working on ML interpretability & human-centered AI, PhD from @MITHugh Zhang @hughbzhang
1K Followers 522 Following open source ai @scale_AI. co-created @gradientpub.Kara Swisher @karaswisher
1.5M Followers 2K Following “Vitriolic” and now “shrill”media lady, though dogs can hear me loud and clearTomas Pfister @tomaspfister
201 Followers 69 Following Head of AI Research @GoogleCloud, Researcher #ML #AI #computervisionManaal Faruqui @manaalfar
3K Followers 646 Following Senior Staff Research Scientist @Google Bard. Love eating, movies, travel and politics. Spread love, not war.Timothy Luong (Chongz.. @chongzluong
234 Followers 139 Following Memer of Technical Staff @cartesia_ai | Class Clown @ Scale Angels Fund | Ex @Airbnb 🏠 @AmazonLab126 🤖 @scale_AI 👽 @tryexponent 🫡 & other startups...Summer Yue @summeryue0
1K Followers 218 Following Director of Safety and Standards at Scale AI. Prev: RLHF lead on Bard, researcher at Google DeepMind / Brain (LaMDA, RL/TF-Agents, superhuman chip design)Alex Nichol @unixpickle
8K Followers 389 Following Code, AI, and 3D printing. Opinions are my own, not my computer's...for now. Husband of @thesamnichol. Co-creator of DALL-E 2. Researcher @openai.Tim Dettmers @Tim_Dettmers
29K Followers 821 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.Dean Carignan @DeanCarignan
923 Followers 1K Following Chief of Staff for @Microsoft's Chief Scientific Officer; exploring responsible practices in AI, Data Science, ML Ops. Ex: @MSFTReseach @Mckinsey, @WorldbankCrémieux @cremieuxrecueil
88K Followers 904 Following I write about genetics, 'metrics, and demographics. Read my long-form writing at https://t.co/8hgA4nNS2A.Thomas Scialom @ThomasScialom
6K Followers 232 Following AGI Researcher @MetaAI -- Lead Llama 2 and Postraining Llama 3. Also CodeLlama, Galactica, Toolformer, Bloom, Nougat, GAIA, ..Stella Biderman @BlancheMinerva
15K Followers 748 Following Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/herEunkyung Jo @ekjo15
722 Followers 821 Following PhD student @UCI_Informatics, @Purdue_UX and @SNUnow alum. HCI. Personal Informatics. Self-Tracking. Health & Wellbeing. Mental Health.Abhinav Sharma @abhinavsharma
1K Followers 460 Following Director, Product Design at @Scale_AI. Previously Cofounder CEO @InsightBrowser (YC W19), ML Eng at Facebook, Design and Product Lead at QuoraChris Lattner @clattner_llvm
79K Followers 182 Following Building beautiful things like Mojo🔥 and MAX @Modular, lifting the world of production AI/ML software into a new phase of innovation. We’re hiring! 🚀🧠jack morris @jxmnop
11K Followers 765 Following getting my phd in nlp @cornell_tech 🚠 // academic optimist // tweeting from the snack aisle at trader joesJelani Nelson @minilek
22K Followers 184 Following Professor @Berkeley_EECS. Research Scientist (part-time) @GoogleAI. Founder @addiscoder. 🇻🇮🇺🇸🇪🇹William Wang @WilliamWangNLP
14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Tri Dao @tri_dao
19K Followers 365 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Professor life is off to a great start! Honored to receive a grant from Apple ML Research and to be named a Google Research Scholar. Looking forward to more work developing ML methods for healthcare and equity Pictured: an apple, Google, and me
Scatter plot with top-left good and yolo axes is the new radar plots where ours surrounds everything.
Good morning: @SnowflakeDB’s new 480B parameter #LLM is made of 128 experts! It’s bigger than #Grok and is now the largest *fully open source (Apache 2.0* LLM! 🧵👇 how does it compare to Llama 3, Mixtral, and GPT4?
More exciting news today -- Gemini 1.5 Pro result is out! Gemini 1.5 Pro API-0409-preview now achieves #2 on the leaderboard, surpassing #3 GPT4-0125-preview to almost top-1! Gemini shows even stronger performance on longer prompts, in which it ranks joint #1 with the latest…
Congrats @GoogleDeepMind on shipping Gemini 1.5 Pro to public review! Upon capacity & latency testing, we have now brought Gemini 1.5 Pro up to the Arena🤖 Big improvement from Pro 1.0 to 1.5 across the board, and exceptionally strong long context understanding. Come test and…
it’s a good model sir
More exciting news today -- Gemini 1.5 Pro result is out! Gemini 1.5 Pro API-0409-preview now achieves #2 on the leaderboard, surpassing #3 GPT4-0125-preview to almost top-1! Gemini shows even stronger performance on longer prompts, in which it ranks joint #1 with the latest…
had a nightmare where i published a paper and the next day MKBHD literally just cooks me in a fully fledged paper review video
Should we trust LLM evaluations on publicly available benchmarks?🤔 Our latest work studies the overfitting of few-shot learning with GPT-4. with @HarshaNori Vanessa Rodrigues @besanushi and Rich Caruana Paper: arxiv.org/abs/2404.06209 More details👇 [1/N]
As we increasingly rely on #LLMs for product recommendations and searches, can companies game these models to enhance the visibility of their products? Our latest work provides answers to this question & demonstrates that LLMs can be manipulated to boost product visibility!…
From the LLaMa 3 blogpost - they use a combination of rejection sampling, DPO and PPO for post-training. Really interested to know what tasks/parts of the process each algorithms benefits the most.
We have a new preprint out - your language model is not a reward, it’s a Q function! 1. The likelihood of the preferred answer must go down - it’s a policy divergence 2. MCTS guided decoding on language is equivalent to likelihood search on DPO 3. DPO learns credit assignment
At least on the startup side, it's definitely not a bygone era. This shit is unreasonably on point even today, we just have way more Russ Hannemans
Silicon Valley premiered 10 years ago on HBO. While many of the dynamics and situations depicted still exist today, it already feels like a period piece, a window into a bygone era
venture capital
Is there a job where they pay you $250,000 a year to hang out in the park and get dinner with your friends?
Me: My paper has been rejected so many times, it has stopped bothering me anymore A friend (doing PostDoc): Congrats, you've mastered the ultimate PhD skill and are ready to graduate! Definitely need more friends like him lol 😂
The crappiness of the Humane AI Pin reported here is a great example of the underappreciated capability-reliability distinction in gen AI. If AI could *reliably* do all the things it's *capable* of, it would truly be a sweeping economic transformation. theverge.com/24126502/human…