Wei Shi @weishi
Central Region, Singapore Joined November 2008-
Tweets59
-
Followers83
-
Following929
-
Likes3
@bllchmbrs @ApacheSpark @matei_zaharia @alighodsi True. Its just begining. I try to learn DSPy by example: RAG fueled by DSPy medium.com/@JacekWo/rag-f…
Excited to share new RAG Demo Application Template! code: github.com/diicellman/dsp… Built with @FastAPI & @Gradio, powered by @stanfordnlp DSPy, and made fully local with @ollama. It's an example for devs exploring DSPy, RAG, or creating AI-driven apps locally.
Meta presents AdvPrompter Fast Adaptive Adversarial Prompting for LLMs While recently Large Language Models (LLMs) have achieved remarkable successes, they are vulnerable to certain jailbreaking attacks that lead to generation of inappropriate or harmful content.
The best CUDA intro course by @nvidia with 460 bite sized videos. It was the course released with Udacity 9 yrs ago. It is kinda old, but you can grasp core ideas around it. youtube.com/playlist?list=…
A reminder that most evaluation benchmarks are garbage
Two free medium-compute Mixture-Of-Experts research ideas: Prerequisite: Mixtral 8x7B is 32 layers, at each layer there are 8 experts, each token is assigned to 2 experts at a given layer. 1) Dynamic Expert Assignment in MoE Models Every token is assigned to 2*32=64 experts in…
DSPy is an amazing tool because it gives engineers the power to move from model provider to model provider in. a PRINCIPLED WAY. Engineers just don't know it yet. I want to change that.
✨ Excited ✨ Wanna get a whirlwind tour of memorization in diffusion models, how to find it, how to mitigate it, drop by the talk! Will discuss all my 3 papers (including style memorization) on this topic. Papers: 1. arxiv.org/abs/2212.03860 (CVPR'23) 2.…
✨ Excited ✨ Wanna get a whirlwind tour of memorization in diffusion models, how to find it, how to mitigate it, drop by the talk! Will discuss all my 3 papers (including style memorization) on this topic. Papers: 1. arxiv.org/abs/2212.03860 (CVPR'23) 2.…
Why are Signatures a thing in DSPy? Standard prompts conflate interface (“what should the LM do?”) with implementation (“how do we tell it to do that?”). DSPy signatures isolate the former so we can infer & learn the latter from data — in the context of a bigger program.
Why are Signatures a thing in DSPy? Standard prompts conflate interface (“what should the LM do?”) with implementation (“how do we tell it to do that?”). DSPy signatures isolate the former so we can infer & learn the latter from data — in the context of a bigger program.
Make Your LLM Fully Utilize the Context Microsoft researchers and collaborators present an approach to overcome the lost-in-the-middle challenge common in LLMs. It applies an explicit "information-intensive" training procedure on Mistral-7B to enable the LLM to fully utilize…
We're having a big event on agents at CMU on May 2-3 (one week from now), all are welcome! cmu-agent-workshop.github.io It will feature: * Invited talks from @alsuhr @ysu_nlp @xinyun_chen_ @MaartenSap and @chris_j_paxton * Posters of cutting edge research * Seminars and hackathons
We're having a big event on agents at CMU on May 2-3 (one week from now), all are welcome! cmu-agent-workshop.github.io It will feature: * Invited talks from @alsuhr @ysu_nlp @xinyun_chen_ @MaartenSap and @chris_j_paxton * Posters of cutting edge research * Seminars and hackathons
🤖🏆LangGraph: Can Language Models Solve Olympiad Programming? 🤖🏆 Last week, Princeton researchers released the USACO benchmark dataset and showed that a zero-shot GPT-4 agent only passes 8.7% of the questions. We've implemented this paper in LangGraph and created a tutorial…
Emerging AI Agent Architectures Researchers from IBM and Microsoft present this concise summary of emerging AI agent architectures. It focuses the discussion on capabilities like reasoning, planning, and tool calling which are all needed to build complex AI-powered agentic…
5/ A Survey on Retrieval-Augmented Text Generation for LLMs - presents a comprehensive overview of the RAG domain, its evolution, and challenges. x.com/omarsar0/statu…
5/ A Survey on Retrieval-Augmented Text Generation for LLMs - presents a comprehensive overview of the RAG domain, its evolution, and challenges. x.com/omarsar0/statu…
We've been looking quite a bit lately at measuring the language-understanding ability of LLMs. This is an arxiv preprint on this line of work (w/ @rtsarfaty @VikaBsmv). arxiv.org/abs/2404.06283
How Faithful are RAG Models? This new paper aims to quantify the tug-of-war between RAG and LLMs' internal prior. It focuses on GPT-4 and other LLMs on question answering for the analysis. It finds that providing correct retrieved information fixes most of the model…
Snowflake dropped a 408B Dense + Hybrid MoE 🔥 > 17B active parameters > 128 experts > trained on 3.5T tokens > uses top-2 gating > fully apache 2.0 licensed (along with data recipe too) > excels at tasks like SQL generation, coding, instruction following > 4K context window,…
As you are graduating from ideas to engineering, one of the key concepts to be aware of is Parallel Computing and Concurrency Control. 🔒🔓 I am SUPER excited to share our 94th Weaviate podcast with Magdalen Dobson Manohar! Magdalen is one of the most impressive scientists I…
1/n Learning to Search: How LLMs Can Master Problem-Solving The ability to plan, strategize, and search for solutions lies at the heart of intelligent behavior. While recent advancements in large language models (LLMs) have demonstrated impressive capabilities in various tasks,…
Growing up, Paul Graham’s principles of using less words gave me a lot of solace. In a British colony like India at a Catholic school, we were taught the opposite: Use fancy words. Prefer verbosity. Signal your class. This distracted so many people from: Just. Knowing. Stuff.
Candace @candace18oropes
110 Followers 3K FollowingLydiaPiers @pKw78kmhCg48fi
0 Followers 121 FollowingPhusare @Phusare198265
0 Followers 172 FollowingSopsayt @sopsayt42901
2 Followers 188 FollowingSusan @susan62roberts
185 Followers 3K FollowingDeshoyt @deshoyt17343
0 Followers 122 Followingคำวิภาน.. @ao8pdk2aR0j6eY
60 Followers 1K Following เข้าร่วมกับฉันและเชื่อมต่อกันผ่านโซเชียลมีเดีย อย่าลืมติดตามฉัน ฉันจะอัปเดตข้อมูลการติดต่อของฉันในหน้าแรกKathleen Williams @KathleenWi76546
106 Followers 3K FollowingBarbara @barbara66ramire
205 Followers 3K FollowingAshley Alexander @AlexanderA64047
62 Followers 3K FollowingCarlene @Carlene00577385
90 Followers 3K FollowingMica Teo @micateo94
19 Followers 111 Followingp9xa7eueraw1w @1snc3pu01ws8ns
2 Followers 395 Following The team offers short-term investments in cryptocurrencies. With a rigorous plan, you can earn between $500 and $5,000. Click to join TG: https://t.co/VG4QztIxLaÚt Hoàng @tHong54917540
78 Followers 856 FollowingAnna Evans @AnnaEvans77211
97 Followers 3K FollowingTara Garcia @TGarcia90504
68 Followers 3K FollowingDiya Franco @DiyaFranco17695
10 Followers 709 FollowingJoanne Cunliffe @JoanneCunliff10
2K Followers 3K FollowingKaren @Aveniformh9
10 Followers 281 Following Question For My Crush do you think of me as much as I think of you?Jordan Miller @JordanMill36668
86 Followers 2K FollowingAfroz Mohiuddin @afrozenator
1K Followers 5K Following Research Engineer at Google Brain. Interested in Science, Psychology, Investing, Design and generally almost everything. Good Thoughts, Good Words, Good Deeds.SheilaNick @515LIGiF31OHTy
15 Followers 396 Following I swear in the name of God, don't miss an opportunity to earn 500-5000usdc every day. https://t.co/MzZhjyNK7MKaren Fisher @KarenFiinny
1K Followers 3K FollowingVelvet Artist @VelvetArti89977
13 Followers 2K FollowingFastpitch_ @Fastpit29876951
21 Followers 2K FollowingZ_oe @Zoe609107672154
17 Followers 2K FollowingTar__tlette @TTlette79043
14 Followers 1K FollowingLeigh Foster @LeighFoste12933
93 Followers 3K FollowingRighondish @righondish5328
2 Followers 164 Following I live alone now and enjoy business, traveling, shopping, food and music. I have a calm personality and I hope we can be friends.小花 @qlfUDKqaCZ2B9Ka
102 Followers 862 FollowingKhánh @Khanh20052609
34 Followers 703 FollowingChomba Bupe @ChombaBupe
7K Followers 2K Following Tech entrepreneur | machine intelligence https://t.co/zzD5ZNb0OW https://t.co/h0mJxdVxQqShane W @spw_enterprise
0 Followers 398 FollowingNina Wu @NinaWu85737633
12 Followers 268 Followingchen guang @mornshiner
6 Followers 52 FollowingLindy @Lindy9426
2 Followers 29 FollowingFenohariniaina Saria @8naes
14 Followers 15 FollowingGeorgeWalke @Gwalke883
12 Followers 9 FollowingPat Verga @pat_verga
745 Followers 252 Following NLP researcher @cohere. Formerly Google DeepMind and @umass_nlpRomboDawg @dudeman6790
323 Followers 11 Following Self: https://t.co/1Qw3zmIX4T Org: https://t.co/E7dqCGwE8USteven Feng @stevenyfeng
1K Followers 275 Following Stanford CS PhD student @stanfordnlp @StanfordAILab. Master's from Carnegie Mellon @LTIatCMU. NLP, Computer Vision, Machine Learning, and AI research.Bill Chambers @bllchmbrs
1K Followers 806 Following 👷 https://t.co/ODHNO6YBx7 ✍️ https://t.co/cX04twkyJ5 1x indie exit. 1x O'Reilly author. 🦄 🚀 - Anyscale, Databricks, $PCOR Talks about Startups, Data, AIMica Teo @micateo94
19 Followers 111 FollowingRoy Shilkrot @RoyShilkrot
587 Followers 1K Following Teaching Professor AI @MIT ML/CV @TuftsUniversity Chief Scientist @tulipinterfaces, open source ❤️ https://t.co/qbuKVQJLgP, join me https://t.co/qbLTVNw6YwAston Zhang @astonzhangAZ
5K Followers 92 Following Research Scientist at the #llama team of Meta Generative AI, designing and training large language models. Opinions are my own.Davis Blalock @davisblalock
12K Followers 165 Following Research scientist + first hire @MosaicML. @MIT PhD. I write + retweet technical machine learning content. If you write a thread about your paper, tag me for RTHarry Tormey 🇮🇪.. @htormey
6K Followers 746 Following 🔨Eliminating toil with AI @stepchangelabs. Full stack, Python, AI, Native iOS & #ReactNative developer. EX: Eng @coinbase @apple @facebook. ☘️ 🏠 🇺🇸swyx @swyx
91K Followers 3K Following Anti-ego ideas for anti-ergodic life. Founder, @smolmodels ▹ Listen: @latentspacepod ▹ Read: @coding_career ▹ Join: @aiDotengineerInfinity AI (YC W24) @toinfinityai
3K Followers 35 Following A script-to-movie foundation model. Our first product allows people to generate talking-head videos. Try it here: https://t.co/vxiFOR9wpNJiang Chen @jiangc1010
75 Followers 92 Following Head of AI Platform & Ecosystem @ Zilliz; Prev: PM & TL @ Google Search IndexingDamien Henry @dh7net
7K Followers 891 Following Cofounder @ClipdropApp, acquired by @heyjasperai AI x Images @googlearts Created GoogleCardboard @GoogleViva Technology @VivaTech
81K Followers 3K Following #VivaTech is Europe’s biggest startup & tech event |📍Paris: May 22-25, 2024 |💡Daily fix of tech & startup news, insights & innovations!Fred Ehrsam @FEhrsam
218K Followers 2K Following Exploring the frontier. Co-founder of @Paradigm and @Coinbase.Shunyu Yao @ShunyuYao12
7K Followers 858 Following Language agents (ReAct, Reflexion, Tree of Thoughts) for digital automation (WebShop, SWE-bench, SWE-agent)Packy McCormick @packyM
189K Followers 3K Following Not Boring || Not Boring Capital || Age of Miracles || Advisor @a16z crypto Techno-OptimistMckay Wrigley @mckaywrigley
147K Followers 439 Following I make AI stuff. Teaching AI skills @TakeoffAI, building codegen tools @CodewandAI, open source AI chat @ChatbotUI. Investing in AI startups.Kanishk Gandhi @gandhikanishk
922 Followers 692 Following Phd @Stanford CS; w/ Noah Goodman, Dorsa Sadigh | Prev: @LakeBrenden @NYUDataScience, @IITKanpur, @Path_AIHyperWrite @HyperWriteAI
9K Followers 5 Following Your AI Personal Assistant — writing and productivity tools + ai agents // building self-driving mode for your computer | by @OthersideAIZeyuan Allen-Zhu @ZeyuanAllenZhu
8K Followers 273 Following physics of language models @ Meta / FAIR IOI - USACO - MCM - ACM/ICPC - Codejam Tsinghua - MIT - Princeton/IAS - MSR - FAIRPhilipp Schmid @_philschmid
16K Followers 653 Following Tech Lead and LLMs at @huggingface 👨🏻💻 🤗 AWS ML Hero 🦸🏻 | Cloud & ML enthusiast | 📍Nuremberg | 🇩🇪 https://t.co/l1ppq3q3hkLysandre @LysandreJik
7K Followers 582 Following Head of Open-Source at Hugging Face. Maintainer of 🤗/Transformers. I tweet about Open Source. He/himNiels Rogge @NielsRogge
10K Followers 690 Following ML Engineer @ML6team, part-time at @huggingface. @KU_Leuven grad. General interest in machine learning, deep learning. Making AI more accessible for everyone!Soyeong Jeong @SoyeongJeong97
129 Followers 161 Following NLP Researcher | Ph.D. student at KAIST Information Retrieval, Open Domain Question Answering, Retrieval-Augmented GenerationMachine Learning Stre.. @MLStreetTalk
19K Followers 383 Following AI YouTube & Audio Podcast (MLST). Run by Dr. Tim Scarfe @ecsquendor and featuring co-host @DoctorDuggar https://t.co/bVe6XB85YDSydMathInst @SydMathInst
818 Followers 422 Following Sydney Mathematical Research Institute (SMRI) Transforming maths in Australia w/ research and public outreach. [email protected] YouTube https://t.co/tsujOpWYV6Higher Order Company @higherordercomp
1K Followers 2 Following We are HOC, a tech startup with the goal of building the inevitable massively parallel future of computers.Taelin @VictorTaelin
17K Followers 903 Following Founder of @HigherOrderComp Building the massively parallel future of computing Reaching AGI to cure all diseases and suffering is all that mattersKsenia Se @Kseniase_
3K Followers 3K Following I build @TheTuringPost, equipping you with in-depth knowledge and forward-thinking analysis to make smarter decisions about AI & ML • mom to 4 boys and 1 girlOmar Khattab @lateinteraction
11K Followers 2K Following CS PhD candidate @StanfordNLP. 2022 Apple Scholar in AI/ML. Author of ColBERT (https://t.co/2ZtgXoa1np), DSPy (https://t.co/BH7WmMKDXR), & various retrieval & LM systems.MilaNLP @MilaNLProc
4K Followers 447 Following The Milan Natural Language Processing Group #NLProc #ML #AIKa Ling Wu @wukaling
646 Followers 346 Following Co-founder & CEO @upsolveai (@ycombinator W24) | Customer-facing analytics & reporting as a service, powered by AI | Ex-@PalantirTech | [email protected]Herumb Shandilya @krypticmouse
2K Followers 427 Following NLP Engineer @SixHQai | Working on ColBERT, DSPy @Stanford | Learning @forai_ml | Chill @CreworkHQ | Ex-ML @sosimplified @DRDO_IndiaKangwook Lee @Kangwook_Lee
2K Followers 676 Following Assistant Professor, ECE, UW-Madison / Leading deep learning research @ KRAFTONGuillem Simeon @guillemsimeon
307 Followers 330 Following (He/him) Physicist. PhD student, @UPFBarcelona. Geometric deep learning and AI for science. 🏳️🌈Midjourney @midjourney
338K Followers 0 Following New research lab. Exploring new mediums of thought. Expanding the imaginative powers of the human species. Join our beta: https://t.co/yAUpCWJRziShizhe Diao @shizhediao
1K Followers 928 Following On job market actively seeking industry positions ML NLP PhD | Intern @BytedanceTalk @sinovationvc Finetune your own LLMs with LMFlow: https://t.co/UTykmQAYPTDiana @sdianahu
6K Followers 247 Following Group Partner @ycombinator. Prev co-founder CTO @escherreality and Head of AR platform @nianticlabsSophie @lebrechts
901 Followers 839 Following COO @allen_ai formerly AI/ML @Apple, SVP Strategy & Ops https://t.co/4Z5RuqSEkZ, PhD from @BrownUniversity, post-doc @CarnegieMellonAmplify Partners @AmplifyPartners
6K Followers 210 Following Early-stage investors for technical founders building the future of infra, AI/ML, data and cybersecurity.Tom Blomfield @t_blom
48K Followers 837 Following Group Partner at @ycombinator Cofounded @gocardless and @monzoAI Breakfast @AiBreakfast
168K Followers 210 Following The latest rumors and developments in the world of artificial intelligence. DM to include your AI project in the newsletter.7 projects that every AI engineer must explore:
today i found out that this one australian guy has been toiling away making incredibly detailed Neural Circuit Diagrams with the vibe of a 1950s issue of Popular Mechanics, but content fit for the 2020s behold. the Transformer
Three Schemes to Revolutionize Nuclear Power: The future of nuclear energy could lie with supersmall reactors ... bit.ly/1DTMFQ0