Yuxiang (Jimmy) Wu @YuxiangJWu
Co-founder @WecoAI | UCL PhD | Natural Language Processing | Machine Learning | formerly intern @allen_ai @MetaAI yuxiang.me Joined October 2014-
Tweets188
-
Followers1K
-
Following1K
-
Likes2K
Here you go, no more waitlisting and just run it locally! Local version of AIDE currently serves as a powerful tool for data scientists and machine learning engineers to explore draft designs.
Here you go, no more waitlisting and just run it locally! Local version of AIDE currently serves as a powerful tool for data scientists and machine learning engineers to explore draft designs.
Great works from @zhengyaojiang, @YuxiangJWu and the @WecoAI team! Maybe I can delegate all my request calling pandas for data processing to Weco's AIDE 😀
Great works from @zhengyaojiang, @YuxiangJWu and the @WecoAI team! Maybe I can delegate all my request calling pandas for data processing to Weco's AIDE 😀
Watching AIDE autonomously tackle ML problems by designing, implementing, and iterating on code, conducting experiments, and evaluating results has been nothing short of fascinating. Can't wait to see how it will revolutionize DS and ML workflows, enabling a broader range of…
Watching AIDE autonomously tackle ML problems by designing, implementing, and iterating on code, conducting experiments, and evaluating results has been nothing short of fascinating. Can't wait to see how it will revolutionize DS and ML workflows, enabling a broader range of… https://t.co/ww4OetWzGf
We're excited to announce AIDE has become the first human-level AI agent for data science! AIDE outperforms half of human data scientists on a wide range of Kaggle competitions, surpassing conventional AutoML, LangChain agents, and ChatGPT with human assistance. 🏆
The concept of LLM-driven software engineers is intriguing, but in most cases, they still fall short of human engineers without human oversight. Our AI agent has changed the game by outperforming half of the data scientists in Kaggle competitions with zero human intervention!
The concept of LLM-driven software engineers is intriguing, but in most cases, they still fall short of human engineers without human oversight. Our AI agent has changed the game by outperforming half of the data scientists in Kaggle competitions with zero human intervention!
We are working on an open leaderboard for measuring hallucinations in LLMs! 🚀🚀🚀 Read more about it here: huggingface.co/blog/leaderboa… Leaderboard address (and discussion forum, if you'd like to provide feedback or help 🙂): huggingface.co/spaces/halluci…
We are working on an open leaderboard for measuring hallucinations in LLMs! 🚀🚀🚀 Read more about it here: huggingface.co/blog/leaderboa… Leaderboard address (and discussion forum, if you'd like to provide feedback or help 🙂): huggingface.co/spaces/halluci… https://t.co/9HPmqMseKa
Today we’re announcing Weco AI and our first product, AIDE: your AI agent for Machine Learning. Simply describe your task in natural language, and AIDE will search the design space to deliver source code and a report for you. Join the waitlist now at weco.ai (1/3)
Extremely excited to announce new work (w/ @MinqiJiang) on learning RL policies and world models purely from action-free videos. 🌶️🌶️ LAPO learns a latent representation for actions from observation alone and then derives a policy from it. Paper: arxiv.org/abs/2312.10812
🚀Excited to share this work by @zodiacJRH and team on enhancing LLM robustness using Natural Language Explanations (NLEs)! When prompted with few-shot human-written explanations, ChatGPT can generate better NLEs for improving robustness of LLM. arxiv.org/abs/2311.07556
🚀Excited to share this work by @zodiacJRH and team on enhancing LLM robustness using Natural Language Explanations (NLEs)! When prompted with few-shot human-written explanations, ChatGPT can generate better NLEs for improving robustness of LLM. arxiv.org/abs/2311.07556
🔍 Our latest work: Deciphering GPT-4V on Knowledge-Intensive Visual Q&A, Link: arxiv.org/abs/2311.07536. 🌟 Explore how this powerhouse nails common sense, world knowledge, decision-making rationale, revealing its visual understanding, reasoning, and explanations. 🧠💡#GPT4V #AI
Beyond excited to publish our newest Weaviate Podcast with @Nils_Reimers! 🎙️🎉 Nils is one of my favorite people in the world to discuss Search with! Discussion topics include Cohere's Rerankers, Metadata, Long Doc Embeddings, RAG, and more!📚 youtube.com/watch?v=KITxQz…
Great news! “Efficient Transformers with Dynamic Token Pooling” has been accepted to #ACL23! We increase the efficiency *and* performance of Transformer LM by jointly segmenting and modelling language. @PontiEdoardo @AdrianLancucki @JChorowski 📜arxiv.org/abs/2211.09761
Great news! “Efficient Transformers with Dynamic Token Pooling” has been accepted to #ACL23! We increase the efficiency *and* performance of Transformer LM by jointly segmenting and modelling language. @PontiEdoardo @AdrianLancucki @JChorowski 📜arxiv.org/abs/2211.09761
It was a great pleasure to be in Cardiff to talk about ChatArena @_chatarena and future directions in multi-LLM collaboration.
It was a great pleasure to be in Cardiff to talk about ChatArena @_chatarena and future directions in multi-LLM collaboration.
🧠Determining the smartest LLM is a fascinating challenge! At ChatArena, we're working on creating game environments and a benchmark of their social capabilities. Excited to discuss this in the Weaviate Podcast with @CShorten30. Let's build this together: chatarena.org
🧠Determining the smartest LLM is a fascinating challenge! At ChatArena, we're working on creating game environments and a benchmark of their social capabilities. Excited to discuss this in the Weaviate Podcast with @CShorten30. Let's build this together: chatarena.org
Thrilled to have joined @CShorten30 on the Weaviate Podcast to discuss ChatArena! A fascinating platform enabling multiple #ChatGPT to engage in diverse game environments. Can't wait to see what the community can build by orchestrating multiple #LLMs in @_chatarena 🤖🗨️
Thrilled to have joined @CShorten30 on the Weaviate Podcast to discuss ChatArena! A fascinating platform enabling multiple #ChatGPT to engage in diverse game environments. Can't wait to see what the community can build by orchestrating multiple #LLMs in @_chatarena 🤖🗨️
Wait, did ChatGPT just solve prisoners' dilemma? 🤔
I recently decide to change my twitter handle from @mindjimmy to @YuxiangJWu, because I realise that the old one is a bit confusing. Don't worry, I will keep posting interesting works and findings in AI, NLP and Large Language Models. #AI #LLM #GPT4 @_chatarena
Akari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxPasquale Minervini �.. @PMinervini
7K Followers 4K Following Researcher in ML/NLP at the University of Edinburgh (faculty @InfAtEd @EdinburghNLP), @ELLISforEurope, @UCL_NLP, PI for @Clarify2020, https://t.co/WydvfU8ugz he/theyJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Yu Su @ysu_nlp
6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biologicalBill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscYao Fu @Francis_YAO_
14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep runningWenhu Chen @WenhuChen
11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.Ori Ram @ori__ram
765 Followers 386 Following Research Scientist @GoogleAI, working on #NLProc. Previously: PhD from @TelAvivUni, Research Scientist @AI21LabsSebastian Riedel (@ri.. @riedelcastro
15K Followers 470 Following Researcher in NLP/ML @deepmind, @ucl_nlp, @[email protected] on MastodonTim Dettmers @Tim_Dettmers
29K Followers 821 Following PhD Student at @UW. I blog about deep learning and PhD life at https://t.co/Y78KDJJFE7.Ofir Press @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Xin Eric Wang @xwang_lk
7K Followers 1K Following Multimodal and Embodied AI Researcher / Professor @UCSC. Director of https://t.co/Y4swOBag21. AI for Humanity in the long run. he/himWenhao Yu @wyu_nd
2K Followers 622 Following Senior Research Scientist at @TencentGlobal AI Lab in Seattle | Bloomberg PhD Fellow | Ex. @MSFTResearch @allen_ai @NotreDame @BloombergPatrick Lewis @PSH_Lewis
4K Followers 655 Following London-based AI/NLP Research Scientist. I co-lead the RAG & tool use team at Cohere w/ @s_hofstaetter. Previous Fundamental AI Research at Meta AI, FAIR, UCL AILuca Soldaini 🎀 @soldni
6K Followers 1K Following I like tokens! Lead for OLMo data team at @allen_ai (Dolma 🍇), OSS is fun, @QueerInAI organizer 🤖☕️🍕they/them (views mine, not my employer’s)Swaroop Mishra @Swarooprm7
5K Followers 894 Following Research Scientist @GoogleDeepMind (Gemini). Pioneering LLM Research 🔥. Instruction tuning, Factuality, Reasoning and next gen Product. Opinions my own.Victor Zhong @hllo_wrld
4K Followers 450 Following ML+NLP assistant prof @UWCheritonCS. Formerly @MSFTResearch @MetaAI, @SFResearch via @MetamindIO, @uwnlp, @StanfordNLP, @eceuoft.ssteevens @Steevens43
159 Followers 5K FollowingMaximilian Wolf @MaxWolf_01
31 Followers 97 Followingsq z @s1833815
5 Followers 18 FollowingAnnalisa Fernandez @BecauseCulture
10K Followers 4K Following Tech culture and language in UX, AI, LLMs, data, social media, and privacy. Speaker on cultural differences and global inclusion. Ex Latam M&A 🇺🇸🇧🇷🇪🇸Un Biagini @u_biag
50 Followers 5K FollowingAzucena Spincic @AzuceSpin
68 Followers 5K FollowingWalter Goldsby @GoldsWalt
62 Followers 5K FollowingAngelo Sabota @AngeloS96471
61 Followers 5K Followingasyraff @asyraffhi
8 Followers 1K FollowingLily-grace Ursua @UrsuaLily28371
75 Followers 5K FollowingDulce Engert @dulc_enge
64 Followers 5K FollowingArif Ahmad @arif_ahmad_py
273 Followers 7K Following All things AI, Computer Science and Circuits! Prev. @GoogleAIAlayna Dagg @AlaynaDagg50221
100 Followers 5K FollowingLongyue Wang @wangly0229
889 Followers 454 Following Dr. | Research Fellow @ Tencent AI Lab | IEEE Senior Member | Previously @DCU PhD & RA, @TencentGlobal InternRubie Prinn @RPrinn17710
75 Followers 5K FollowingAI Papers Podcast @aipaperspodcast
919 Followers 2K Following A digestible daily update on the latest AI Research Papers. Brought to you by @pocketpodappparia @pariawshahi
120 Followers 6K FollowingHenry Grafé @GrafeHenry97431
7 Followers 32 FollowingMichael Johnson @onemoremichael
463 Followers 5K Following Co-Founder of Ref | Leaving the resume behind. Al-native platform that surfaces relevant & authentic context on candidates, validated by Al-assisted referrals.Samantha Dobosz @SamanthaDo462
33 Followers 5K FollowingLiu Xiaochen @lxc0422
118 Followers 1K Following #ArtificialIntelligence #MachineLearning #ComputerVision #3Dreconstruction #3Dmeasurement #ImageProcessing #Robotics #Programming #PackagingGus Kuehne... America.. @GusKuehne
14K Followers 13K Following I can't be wrong Trump, DeSantis, V.Ram, Kennedy & Gaetz agree with me? I initiated largest foreign trade case in US history on behalf of US Manuf. ... & won.Kayla Stambaugh @KaylaS1778
81 Followers 5K FollowingGuillaume Bouchard @gbouchar
427 Followers 279 Following AI Entrepreneur, Research Scientist and Investor. CEO and Co-Founder of Checkstep.LKW Ankauf Heidelberg @lkwheidelberg
16 Followers 90 Following Möchtest du dein altes Lastkraftwagen verkaufen? Wir bieten den besten Ankaufservice in Heidelberg an. Erhalte sofortiges Angebot!Arnetta Maccauley @arnet_maccau
17 Followers 3K FollowingGe Zhang @GeZhang86038849
743 Followers 448 Following Founder: M-A-P(https://t.co/CGWz8Jr9K9) Incoming Ph.D. student: Computer Science @UWaterloo MSc: ECE & DS @UMich BSc: Computer Science @ BUPT白犬 @baiquan04025970
0 Followers 52 FollowingJacek Łubiński @jumbojacek
286 Followers 120 Following VC @market1capital, backing tech companies of tomorrow building platforms, networks and infrastructureHuan Sun (OSU) @hhsun1
3K Followers 480 Following Associate Professor (with Tenure) in CSE, endowed CoE Innovation Scholar, The Ohio State University (NLP and Data Mining)AW Smith @iammaestro04
31 Followers 234 FollowingTaki @Takihasanrafi
53 Followers 757 Following Foundation Models / LLMs / VLMs / Responsible AI | PhD in CS @Hanyang!Ciline @ciline
15 Followers 845 Followinglin yu @linyu61852547
0 Followers 19 Followingmarvo @marvos76302504
6 Followers 130 FollowingAkari Asai @AkariAsai
11K Followers 650 Following Ph.D. student @uwcse & @uwnlp. NLP. IBM Ph.D. fellow (2022-2023). Meta student researcher (2023-) . ☕️ 🐕 🏃♀️🧗♀️🍳AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingYann LeCun @ylecun
711K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Pasquale Minervini �.. @PMinervini
7K Followers 4K Following Researcher in ML/NLP at the University of Edinburgh (faculty @InfAtEd @EdinburghNLP), @ELLISforEurope, @UCL_NLP, PI for @Clarify2020, https://t.co/WydvfU8ugz he/theyJim Fan @DrJimFan
229K Followers 3K Following @NVIDIA Sr. Research Manager & Lead of Embodied AI (GEAR Lab). Creating foundation models for Humanoid Robots & Gaming. @Stanford Ph.D. @OpenAI's first intern.Yu Su @ysu_nlp
6K Followers 857 Following Dist. Assist. Prof.@OhioState, Director @osunlp, 20% Researcher@Microsoft. I like to think about intelligence, artificial or biologicalPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | PianistAndrej Karpathy @karpathy
979K Followers 905 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥AI at Meta @AIatMeta
532K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.Bill Yuchen Lin 🤖 @billyuchenlin
6K Followers 2K Following Research @allen_ai. I evaluate (multi-modal) LLMs, build agents, and study the science of LLMs. Previously: @GoogleAI & @MetaAI FAIR @nlp_uscWilliam Wang @WilliamWangNLP
14K Followers 718 Following UCSB NLP Lab + ML Center. https://t.co/6TOnqbk6YT https://t.co/KJYhnav3Et Mellichamp Chair Prof. at UCSB CS. PhD @ CMU SCS. Areas: #NLProc, Machine Learning, AI.Yi Tay @YiTayML
29K Followers 97 Following chief scientist / cofounder @RekaAILabs 🫠 past: research scientist @google brain 🤯 currently learning to be a dad 🍼Kyunghyun Cho @kchonyc
61K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).Yao Fu @Francis_YAO_
14K Followers 2K Following PhD @EdinburghNLP on LLMs and Machine Reasoning. Ex. @Columbia @PKU1898 @MITIBMLab @allen_ai AGI has yet to come, so keep runningRichard Socher @RichardSocher
101K Followers 971 Following CEO @youSearchEngine Investing at @aixventuresHQ Before: Stanford Adj Prof in AI/NLP, Chief Scientist at Salesforce, MetaMindWenhu Chen @WenhuChen
11K Followers 520 Following AI researcher @UWaterloo @GoogleAI @VectorInst. Interested in natural language processing, diffusion models. I direct TIGER-Lab at UWaterloo.Kai Zou @anMe_kz
4K Followers 40 Following Founder and CEO at https://t.co/YPVwP0HF5C, https://t.co/QRm3Mj3azx, https://t.co/rG5uII6TfJNetMind.AI @NetmindAi
29K Followers 92 Following NetMind Power is a decentralized platform aimed at democratizing AI computing power. Telegram: https://t.co/cYOXxXdzRT ; Discord: https://t.co/YStJyP1T1iCreatify AI @CreatifyLab
776 Followers 104 Following Redefining ad creation with the power of AI. Short video ads generated from a product link. Your shortcut to engaging video ads. 🚀Ge Zhang @GeZhang86038849
743 Followers 448 Following Founder: M-A-P(https://t.co/CGWz8Jr9K9) Incoming Ph.D. student: Computer Science @UWaterloo MSc: ECE & DS @UMich BSc: Computer Science @ BUPTMoucheng Xu @moucheng_xu
131 Followers 376 Following Research Scientist @odin_vision, Medical Image Computing, Deep LearningEric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pJiawei Zhao @jiawzhao
317 Followers 174 Following PhD Candidate @Caltech, Fmr Research Intern @nvidia1LittleCoder💻 @1littlecoder
12K Followers 1K Following AI, ML, Open Source at - https://t.co/EKsvaArRIkGarry Tan @garrytan
433K Followers 4K Following President & CEO @ycombinator —Founder @Initialized—PM/designer/engineer who helps founders—YouTuber—San Francisco Democrat accelerating the boom loop—e/accAndrew Gao @itsandrewgao
24K Followers 2K Following techno optimist! currently: @nomic_ai @stanford; prev @LangChainAI; Z Fellow 🇺🇸Dmitry Alimov @dmitryalimov
6K Followers 7K Following Tech VC and entrepreneur. Curious. Investing and building in AI. Built companies in media and tech. Founder @frontiervc. Learned things @harvard, @stanfordJack Altman @jaltma
76K Followers 415 Following Investing at Alt Capital. Founder and chairman of Lattice.Peter Zakin @pzakin
3K Followers 4K Following investor @ upfront ventures. dev tools, ai, data. Sold Hyper Travel to @tradeshift. ex @venmo, YC S2010.Yingchen Xu @YingchenX
405 Followers 254 Following CS PhD at @ucl_dark and @MetaAI I do research in reinforcement learning. 🤖️🎨⛰️Chenyang Lyu 吕晨�.. @Chenyang_Lyu
719 Followers 678 Following Postdoc at @MBZUAI, PhD from @ml_labs_irl and @dcucomputing @dcu interested in Natural Language Processing, mainly Large Language Models (LLMs).Cecilia Ziniti @CeciliaZin
8K Followers 5K Following Founder & CEO @gcai_co | General Counsel & CLO | Ex @Amazon, @MoFoLLP, @replit, @BloomTech. Writes re AI, tech, business, law, in-house counsel, and leadership.Stella Biderman @BlancheMinerva
15K Followers 748 Following Open source LLMs and interpretability research at @BoozAllen and @AiEleuther. My employers disown my tweets. She/herAlex Volkov (Thursd/A.. @altryne
25K Followers 1K Following ✨ AI Evangelist with @weights_biases 🪄🐝 🎙️ Host of @thursdai_pod Founder and CEO @ https://t.co/qbC0EP7h1k AI Consultant GPU POOR Def. not an owl *hoot*John Rush @johnrushx
20K Followers 3K Following 20 bootstrapped Tools For Busy Founders. Sharing lessons on Startups & Growth. ⑴https://t.co/PJscUxOC4Z ⑵https://t.co/wxaRNYF9F5 ⑶https://t.co/hS4xMWThHi … ⒇⇢https://t.co/Fpjq9yZPMZTanishq Mathew Abraha.. @iScienceLuvr
54K Followers 1K Following PhD at 19 | Founder and CEO at @MedARC_AI | Research Director at @StabilityAI | @kaggle Notebooks GM | Biomed. engineer @ 14 | TEDx talk➡https://t.co/xPxwKTq6QbDominik Schmidt @schmidtdominik_
247 Followers 276 Following Research Engineer @WecoAI, previously @ucl_dark, @Microsoft, @tu_wienAdina Yakup @AdeenaY8
3K Followers 465 Following @huggingface 🤗 | Contributing to Chinese ML community.OpenAI Developers @OpenAIDevs
71K Followers 0 Following Official @OpenAI account for anyone building on our APIs. Join us in building the future of AI. We ❤️ developers!Microsoft @Microsoft
13.9M Followers 2K Following We're on a mission to empower every person and every organization on the planet to achieve more. Support: @MicrosoftHelpsa16z @a16z
763K Followers 47 Following we invest in software eating the world https://t.co/A9eTFq6Xbx https://t.co/MXGUBJoMi4 Sign up for our newsletters: https://t.co/vkcLgyb2qXAI Breakfast @AiBreakfast
167K Followers 210 Following The latest rumors and developments in the world of artificial intelligence. DM to include your AI project in the newsletter.Midjourney @midjourney
338K Followers 0 Following New research lab. Exploring new mediums of thought. Expanding the imaginative powers of the human species. Join our beta: https://t.co/yAUpCWJRziDeepLearning.AI @DeepLearningAI
221K Followers 30 Following We are an education technology company with the mission to grow and connect the global AI community.Robert Stojnic @rbstojnic
3K Followers 488 Following Open source AI. ⌛Past: Llama 2 and Llama 3 technical leadership at Meta AI, Papers with Code co-creator.Tengyu Ma @tengyuma
26K Followers 512 Following Assistant professor at Stanford; Co-founder of Voyage AI (https://t.co/wpIITHLgF0) ; Working on ML, DL, RL, LLMs, and their theory.sophiaalthammer @sophiaalthammer
913 Followers 594 Following Member of Technical Staff in Retrieval-Augmented Generation Team @cohere, previously PhD in neural Information Retrieval @tu_wienSebastian Hofstätter @s_hofstaetter
1K Followers 254 Following RAG & tool use modelling co-lead @Cohere; PhD in efficient neural information retrieval from @tu_wienArthur Mensch @arthurmensch
40K Followers 873 Following Co-founder and CEO @MistralAI. Apply https://t.co/yHGRZAtjcxLisa Alazraki @LisaAlazraki
651 Followers 790 Following #ML & #NLProc PhD student @ImperialCollege. Prev. research intern @GoogleAI. Reasoning, planning & LLMs as agents. Reposted papers are my reading list 📚Tri Dao @tri_dao
19K Followers 365 Following Incoming Asst. Prof @PrincetonCS, Chief Scientist @togethercompute. Machine learning & systems.Manling Li @ManlingLi_
3K Followers 419 Following Postdoc @Stanford, Incoming Assistant Professor @Northwestern, PhD @UIUC. Working on Knowledge Foundation Models, especially for Multimodal data (Language + X).Manchester NLP @Manchester_NLP
46 Followers 7 Following The Natural Language Processing Group at the University of Manchester | @OfficialUoMLianhui Qin @Lianhuiq
4K Followers 397 Following Incoming Assistant Professor at UCSD CSE. Currently postdoc at AI2 Mosaic. NLP, ML, AI. I’m recruiting PhD students.Yuandong Tian @tydsh
16K Followers 806 Following Research Scientist and Senior Manager in Meta AI (FAIR). AI-guided Optimization and Representation Learning. Novelist in spare time. PhD in @CMU_Robotics.Simon Shaolei Du @SimonShaoleiDu
6K Followers 2K Following Assistant Professor @uwcse. Postdoc @the_IAS. PhD in machine learning @mldcmu.Open Philanthropy @open_phil
15K Followers 17 Following Open Philanthropy's mission is to help others as much as we can with the resources available to us.Helen Toner @hlntnr
21K Followers 1K Following Interests: China+ML, natsec+tech, brains+words+absurdity | Current: @CSETGeorgetown (opinions my own) | Former: @open_philMeta presents Layer Skip Enabling Early Exit Inference and Self-Speculative Decoding We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for
It's been exactly one week since we released Meta Llama 3, in that time the models have been downloaded over 1.2M times, we've seen 600+ derivative models on @huggingface and much more. More on the exciting impact we're already seeing with Llama 3 ➡️ go.fb.me/xsqzz8
Also check out our new paper: arxiv.org/abs/2404.05221 - A unified perspective of reasoning algorithms (which lies behind 🦙LLM reasoners) - AutoRace 🏁: automated _evaluation_ of LLM reasoning chains - Analysis of LLMs and reasoning algorithms (⛓️CoT, 🌲ToT, 🎶RAP, …)
Releasing 🔥LLM Reasoners v1.0🔥 🥇Popular library for advanced LLM reasoning - Reasoning-via-Planning (RAP)🎶 - Chain-of-Thoughts (CoT)⛓️ - Tree-of-Thoughts (ToT)🌴 - Grace decoding💄 - Beam search🔎 🥇Enhances #Llama3, GPT4, LLMs on @huggingface llm-reasoners.net
@WecoAI This tool sounds like a game-changer for simplifying machine learning code generation! Excited to see how AIDE can automate and streamline development processes.
Introducing AdvPrompter! 🚀Our new optimization technique that crafts prompt-dependent, human-readable adversarial suffixes in real-time, enhancing security for LLMs🛡️! 1️⃣Generate suffix in ~2 seconds, ~800x faster than methods like GCG and AutoDAN. 2️⃣Higher success rate, in…
New short course with @MistralAI ! Mistral's open-source Mixtral 8x7B model uses a "mixture of experts" (MoE) architecture. Unlike a standard transformer, an MoE model has multiple expert feed-forward networks (8 in this case), with a gating network selecting two experts at…
Nice to see performance measured on actual benchmarks instead of just reporting perplexity. Looks like a great resource for anyone building Language Models! 🙌
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Llama3 reminds everyone of the misconception about scaling laws again: it's not that a larger model is always better, but that a larger model is cheaper to train if you want to reach the same performance. Yes, this might be somewhat counter-intuitive, but this is one of the key…
Llama 3 was trained using intra-document causal masking, as suggested by @yuzhaouoe's paper "Analysing The Impact of Sequence Composition on Language Model Pre-Training"! 🚀🚀🚀 arxiv.org/abs/2402.13991
So glad to share that I am one of the recipients of an @OpenAI Superaligment Fast Grant on the topic of #CoTfaithfulness 🥳🥳
The superalignment fast grants are now decided! We got a *ton* of really strong applications, so unfortunately we had to say no to many we're very excited about. There is still so much good research waiting to be funded. Congrats to all recipients!
Hey everyone! I am SUPER excited to publish our newest Weaviate podcast with Kyle Davis, the creator of RAGKit! 🎙️🔥 At a high-level, the podcast covers our understanding of RAG systems through 4 key areas: (1) Ingest / ETL, (2) Search, (3) Generate / Agents, and (4) Evaluation.…
So excited about integrating Weights & Biases with DSPy -- and showing how this helps you optimize RAG apps built with Cohere and Weaviate! 🔌😂👍 I am fascinated with how much more accessible optimization is becoming for building software applications. So cool to be…
Instead of having to manually write examples of your task (or examples of rationales when adding things like Chain-of-Thought prompting), DSPy orchestrates generating synthetic examples and controlling their quality. We are super happy to share an update on adding…
Excited to announce the Compass Beta, a very powerful multi-aspect data search system powered by a new embedding model, Compass. We're looking for help stress-testing the model's capabilities and finding where it breaks. Sign up here: txt.cohere.com/compass-beta/
Check out this Open-Source AI Data Scientist!! 🧑💻
Here you go, no more waitlisting and just run it locally! Local version of AIDE currently serves as a powerful tool for data scientists and machine learning engineers to explore draft designs.
Here you go, no more waitlisting and just run it locally! Local version of AIDE currently serves as a powerful tool for data scientists and machine learning engineers to explore draft designs.
We are open sourcing AIDE, a machine learning code generation agent. Simply describe the problem in natural language; the agent will begin experimenting on your local machine and then deliver the solution code. github.com/WecoAI/aideml