supercoderhawk @supercoderhawk
NLP engineer at patsnap. NLP, deep learning researcher. github.com/supercoderhawk Shanghai, People's Republic of China. Joined November 2016
Tweets: 452
Followers: 72
Following: 2K
Likes: 321
🕊️The Paloma paper is truly impressive - a must-read for anyone caring about the language model evaluation. It addresses two crucial questions that had previously left me puzzled: ❓Can the validation loss on one corpus (e.g., C4) represent all domains? The answer is no🚫.…
RAG And Context Understanding A great diagram that showcases the challenges with RAG benchmarking and LLM context understanding RAG systems are complex because of the following 4 issues. Stuffing the context of the LLM rarely helps and typically confuses the LLM We need a…
Microsoft presents UFO A UI-Focused Agent for Windows OS Interaction paper page: huggingface.co/papers/2402.07… introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a…
New paper: How can you tell when a model is hallucinating? Let it cheat! An expert doesn't need to cheat, so if your model learns to cheat, there must be something it doesn't know. Our general new approach for measuring uncertainty: arxiv.org/abs/2402.08733
An incredible skill that I have witnessed, especially at OpenAI, is the ability to make “yolo runs” work. The traditional advice in academic research is, “change one thing at a time.” This approach forces you to understand the effect of each component in your model, and…
so i guess this is a thing now universities running ads to resell students' data for training llms 💰💰💰
It’s year 2024, and n-gram LMs are making a comeback!! We develop infini-gram, an engine that efficiently processes n-gram queries with unbounded n and trillion-token corpora. It takes merely 20 milliseconds to count the frequency of an arbitrarily long n-gram in RedPajama (1.4T…
Large Language Model (LLM) agents promise to free us from mundane tasks, but how should they best interact with our world? Introducing CodeAct, an agent {framework, instruction-tuning dataset, model}, employs executable Python code to unify the actions of LLM agents. 🧵1/
Continual Learning for LLMs One of the biggest challenges of working with LLMs is keeping them updated. Continual learning aims to enhance the overall linguistic and reasoning capabilities of LLMs. This survey paper provides an overview of developments in continual learning.…
A Novel RAG Approach That Understands The Whole Document Context RAG has rapidly evolved to be the standard way to apply LLMs in production. However, most methods are still limited because most existing methods retrieve only short contiguous chunks from a retrieval corpus,…
Lots of compelling AI research ideas this week ranging from self-correcting RAG to sparsified LVLMs. A few papers I’ve been reading this week: - OLMo - SliceGPT - MoE-LLaVa - Corrective RAG - Rephrasing the Web - Redefining Retrieval in RAG - LLMs for Mathematical Reasoning…
We just open-sourced SQLCoder-70B! It outperforms all publicly accessible LLMs for Postgres text-to-SQL generation by a very wide margin. SQLCoder is finetuned on @AIatMeta's CodeLlama-70B model that was released yesterday on less than 20,000 hand-curated prompt completion…
(1/5)🚀 Our OpenMoE Paper is out! 📄 Including: 🔍ALL Checkpoints 📊 In-depth MoE routing analysis 🤯Learning from mistakes & solutions Three important findings: (1) Context-Independent Specialization; (2) Early Routing Learning; (3) Drop-towards-the-End. Paper Link:…
I'm currently looking into different metrics and frameworks around Retrieval-Augmented Generation (RAG) evaluation. This is a first brain dump. But the landscape is already quite broad. What RAG evaluation metrics and frameworks have you already tested? And which ones did you…
MuGI: Enhancing Information Retrieval through Multi-Text Generation Integration with Large Language Models Proposes a framework that leverages LLM text generation to expand queries and substantially improves IR performance. 📝arxiv.org/abs/2401.06311 👨🏽‍💻github.com/lezhang7/Retri…
Improving Information Retrieval in LLMs One effective way to use open-source LLMs is for search tasks, which could power many other applications. This work explores the use of instruction tuning to improve a language model's proficiency in information retrieval (IR) tasks.…
Here’s a neat paper by Barnett et al. (@DeakinA2I2) that outlines 7 failure points in building a RAG pipeline over your data. 🚫 Missing content (did not index it) 🚫 Missing in top-k retrieved set 🚫 Missing in reranked set 🚫 Not extracted (in context but LLM couldn’t use) 🚫…
There was a lot of cool RAG research in the past year or two, and luckily for you, all of these efforts are tracked under one place! “Retrieval-Augmented Generation for Large Language Models: A Survey” by Gao et al. does an admirable job categorizing all RAG research into three… https://t.co/uf0U8YdgWV
Although there is abundant work studying long-context LLMs, most of it discusses architecture / positional encoding; almost no existing papers discuss data. In this work, we take a close look at data influence on context scaling yaofu.notion.site/Understanding-…
New RAG technique alert 🚨 We’ve come up with an advanced RAG technique in @llama_index that lets you ask structured questions over many documents ✨: 1. Model each document as a metadata dictionary - store more attributes beyond a simple text summary. (e.g. a row in SQL… https://t.co/F72ZdY1nVa

YvonneHalifax @SrtzB8qrxR2bI8
1 Followers 60 Following
MeroyPartridge @Z0o98tTJ42sh6b
0 Followers 57 Following
Rerna @Rerna046234
1 Followers 308 Following
TristaHearst @YDc2k7j7E4NfC
120 Followers 4K Following Professional eye-roller | Amateur wine critic 🍷👀
DefenseStocksX... @Geeixo177807
36 Followers 1K Following 15-30% Monthly | 2 High-Conviction Stocks.Short-Term Gains: 15-20% in Days/Weeks.DM "JOIN" for WhatsApp Alerts. Live Trade Signals • Market Analysis
Neanal @Neanal661
45 Followers 2K Following
Yunxin Li @LyxTg
1K Followers 508 Following Ph.D. Candidate. Currently focusing on multimodal reasoning and planning with large models. Past research interns: ByteDance Seed, Tencent PCG/AILab.
Xinyu Yang @Xinyu2ML
998 Followers 984 Following Ph.D. @CarnegieMellon. Working on data and hardware-driven principled algorithm & system co-design for scalable and generalizable foundation models. They/Them
Nan HUO @NanHUO9637
564 Followers 842 Following CS PhD Student @HKUniversity. Previously M.S. @JohnsHopkins.
Ziyang Luo @ChiYeung_Law
1K Followers 3K Following Research Scientist @salesforce | Agents Researcher | Ex @MSFTResearch @AlibabaGroup @NUSingapore @HKBU_NLP
Nsite @NsiteOvRJs
12 Followers 634 Following
Huan Wang @huan__wang
2K Followers 2K Following Director @ Salesforce Research. Research Interest: Large Language Model, Action Agent, Reinforcement Learning, Time Series Analytics, Learning Theory.
Tracey @davistracey51
395 Followers 3K Following
Xuheng Li @xuhengli_
957 Followers 2K Following CS PhD candidate @UCLA, supervised by @QuanquanGu | RL, deep learning theory, diffusion model | Previously BSc @PKU1898 | Stargazer
Rohan Jha @Robro612
225 Followers 377 Following CS PhD Student @jhuclsp Previously: Intern @JinaAI_, MS CS @UTAustin, BS AI @carnegiemellon Interested in Information Retrieval and NLP
Abdul Aziz @aziz_cuCSE
88 Followers 3K Following CS Lecturer @ IIUC | BSc Engg. in CSE @University of Chittagong. Research Assistant @CSECU-DSG. Research Interest includes multimodal AI and Multilingual NLP
Yangqiu Song @yqsong
2K Followers 1K Following Associate Professor at HKUST, working on knowledge graphs, NLP, knowledge discovery on texts and graphs
Cunxiang Wang @CunxiangWang
761 Followers 1K Following [email protected], Postdoc@tsinghua, working with Prof. Jie Tang. PhD advised by Prof. Yue Zhang. Prev: Interned @AWScloud. LLM Evaluation, Posttraining
Tianshu Zhang @Tianshu_OSU
396 Followers 614 Following Ph.D student @osunlp @OhioStateCSE. Ex-intern @IBMResearch, @Adobe. Lead author of TableLlama. #NLProc
Franco Maria Nardini @fmnardini
774 Followers 2K Following Senior Researcher @ ISTI-CNR, Pisa, Italy. Information Retrieval, Machine/Deep Learning, Efficiency, Big Data.
Yuhan Liu @YuhanLiu_nlp
462 Followers 886 Following CS PhD student @NYU_Courant advised by @eunsolc, previous intern @tsvetshop
Ningyu Zhang@ZJU @zxlzr
3K Followers 2K Following Associate Professor @ZJU_China. Research interests include NLP, LLM, KG, Agent, Knowledge Editing.
Qinyuan Cheng @cheng_qinyuan
506 Followers 698 Following Alignment researcher and Physicist, PhD student at FNLP Lab @FudanUniv; MOSS team; Building Context Intelligence; True Dota2 fans
Kimberly @logan44kimberly
283 Followers 3K Following
Hamid Naderi Yeganeh @naderi_yeganeh
59K Followers 31K Following Research Student @UCL Maths. Mathematical artist. Email: naderiyeganeh at gmail dot com
Songlin Yang @SonglinYang4
12K Followers 3K Following research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
Ivelisse @hoare_ivelisse3
286 Followers 3K Following
Make money easily @4euN6UF6l8k2M
8 Followers 556 Following MEXC focuses on financial management, stocks, cryptocurrencies, digital assets and investments. Currently, new users can get free dollars when they sign up.
Mathew Manoj @MathewManoj19
7 Followers 150 Following
jinyang (patrick) li @jinyang34647007
1K Followers 1K Following CS PhD student @HKUniversity. Previously M.S. in @Columbia. Intern at @MSFTResearch, prev. at @AlibabaGroup. LLM, SQL Intelligence, Code Gen for New User Exp
Sharon @sharonwalther35
273 Followers 3K Following
Carol Walker @CarolWahlom
2K Followers 2K Following
Kristine @kristine_bollin
312 Followers 3K Following
Bob Elliot @BobEUnIimitedd
250 Followers 3K Following CIO at @UnlimitedFnds | PM of $HFND | Fmr IC @Bridgewater | Described as one of the few "sane" voices on #fintwit | Comments are not investment advice
Haoran LI @TeaPotLiid
54 Followers 124 Following
Alignment Lab AI @alignment_lab
13K Followers 4K Following Devoted to addressing alignment. We develop state of the art open sourced AI. https://t.co/oANsMnut7V https://t.co/6aJDLUvuU5
Siru Ouyang @Siru_Ouyang
906 Followers 879 Following CS PhD candidate @IllinoisCDS. Alumni @sjtu1896.
Jinhao Jiang @Boru80914053
435 Followers 783 Following I'm a phd candidate in RUC AIBOX team, a NLPer and just studying!
Adrienne @p_adrienne23
283 Followers 3K Following
Shannon @shannon_beman_
392 Followers 3K Following
Kimi.ai @Kimi_Moonshot
50K Followers 98 Following Built by Moonshot AI to empower everyone to be superhuman.
Tongyi Lab @Ali_TongyiLab
4K Followers 18 Following We advance the development of AGI and foster open source collaboration towards a smarter future.
Dayoon Ko @dayoon12161
78 Followers 104 Following M.S/Ph.D integrated student in CSE @SeoulNatlUni | Research Intern @LG_AI_Research
鍾馨溪 @zhongxinxi1
39K Followers 111 Following A new Chinese writer and historical researcher 一名中國新銳作家及歷史研究者
CatFly @imyouhu
4K Followers 626 Following ✦Views expressed are my own ✦Operations Manager at https://t.co/U7OvuKre7C ✦Passionately Curious ✦Passionate about AI, Tech, Finance, and Society
Yiheng Xu @yihengxu_
1K Followers 708 Following ai agent research @hkuniversity | scaling agent @Alibaba_Qwen | ex @msftresearch @sfresearch | from automation to autonomy
Fan Zhou @FaZhou_998
1K Followers 833 Following Qwen Coding @Alibaba_Qwen. Prev: Core member @XLangNLP, Intern @MSFTResearch.
Alex Yang @himseIf_65
4K Followers 216 Following building @better_auth. @nodejs @wakujs @jotaijs member, make JavaScript better. Opinions are my own. Prev @llama_index, @AFFiNEOfficial. CN @himself_65
Zihan Wang - on RAGEN @wzihanw
23K Followers 609 Following PhD Student @NorthwesternU. Intern @yutori_ai. I study PhysiCS of LLM. Ex @deepseek_ai @uiuc_nlp @RUC. RAGEN | Chain-of-Experts | ESFT.
yan5xu @yan5xu
6K Followers 333 Following 🤖 AI 野生研究员 | ex @ManusAI_HQ & @hey_im_monica 推特内容仅代表个人观点,和公司无关
海拉鲁编程客 @hylarucoder
18K Followers 999 Following 🖥️ Indie Maker 🛠️ AI 能力边缘疯狂试探者 📌 油管「海拉鲁编程客」 🌸 沦为程序员的段子手/猫咪
AppSail.dev @AppSaildotDEV
6K Followers 893 Following Co-founder of https://t.co/6hXCeF5q98 https://t.co/o08c9N03DK https://t.co/bJvmBtFVLU #独立开发者 #AI出海 #境外公司 #境外手机号 #全球收款 #港卡 #美卡 #数字游民
Claude @claudeai
110K Followers 1 Following Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8dz3D or download the app.
yv @yvbbrjdr
6K Followers 389 Following exists as 451; opinions are my own; Creator of @LANDropApp, @AthenaAGI, LMRouter; MTS @MicrosoftAI LLM training infra; Ex-@NVIDIA RISC-V security
Mirella Lapata @mlapata
462 Followers 5 Following
Cognition @cognition
147K Followers 32 Following Makers of Devin, the first AI software engineer. We are an applied AI lab building end-to-end software agents. Join us: https://t.co/JZDd4Vik4P
天猪 TZ @atian25
6K Followers 277 Following - Currently focused on @Trae_ai IDE. - Used to be a core developer of open source projects EggJS/CNPM. - ByteDance Engineer / ex-Alipay.
Kol Tregaskes @koltregaskes
14K Followers 6K Following AI, Tech & Science News Curator🔬 | AI Art Creator 🎨 | AI Video Producer 🎬🤖 | AI Music Composer 🎵 | Alt-ego of @axylusion
Xianjun Yang✈️ICM... @xianjun_agi
988 Followers 1K Following RS @AIatMeta. GenAI safety, data-centric AI. Previously Phd @ucsbnlp, BEng @tsinghua_uni. Opinions are my own.
Xiaonan Li @yyyjtrzj
170 Followers 242 Following Research Scientist at Reka-AI; PhD @Fudan_University | Intern @Microsoft NLP, LLM, RAG
FBI Houston @FBIHouston
111K Followers 116 Following Official FBI Houston X. Submit tips at https://t.co/kh5Pr0t47e. Public info may be used for authorized purposes: https://t.co/NiiH1SfKO1.
The White House @WhiteHouse
2.5M Followers 6 Following The Golden Age of America Begins Right Now. 📱 Text USA to 45470 to receive alerts.
Sysinternals @Sysinternals
19K Followers 154 Following Created by Mark Russinovich and Bryce Cogswell and later acquired by Microsoft, Sysinternals utilities help you troubleshoot and manage your Windows systems.
International Cyber D... @IntCyberDigest
5K Followers 3K Following Your weekly go-to cybersecurity newsletter, curated and commented on by our senior analysts. Got tips? Signal: IntCyberDigest.17
Pushmeet Kohli @pushmeet
17K Followers 90 Following Computer Scientist, Leading Science and Strategic Initiatives @ Google DeepMind.
马东锡 NLP @dongxi_nlp
14K Followers 764 Following Prev. PhD @Stockholm_Uni | Alumni @KTHuniversity @uppsalauni Sharing insights on AI, autonomous agents, and large language & reasoning models
jianlin.su @Jianlin_S
3K Followers 14 Following Grad is all you need @Kimi_Moonshot Blog: https://t.co/YVxsWylklA , Cool Papers: https://t.co/scS1n1oyaO
Nick Sweeting @thesquashSH
3K Followers 5K Following hacking on browsers @browser_use ⑊ learning about brains ⑊ internet archiving @ArchiveBoxApp ⑊ 🚴‍♂️🏍🎵🗻 ⑊ 沪老外 ⑊ @MonadicalHQ ⑊ @RecurseCenter '14
後醍醐天豚(建... @kashihara28612
2K Followers 721 Following 东京外资职场不分享|东雪莲单推人|交友文学、前女友文学|发发做的饭和威士忌|消費税増税支持、所得税・社会保険料を無くせ
Hongjin Su @hongjin_su
640 Followers 569 Following Ph.D. student of @HKUniversity, following @taoyds, NLP group 2022. #NLProc
Yung-Sung Chuang @YungSungChuang
1K Followers 679 Following PhD student @MIT_CSAIL | Intern @MetaAI @Microsoft @MITIBMLab | BS @NTU_SPML in #Taiwan
Kristina Gligorić @krisgligoric
4K Followers 1K Following Incoming Assistant Professor of Computer Science @JohnsHopkins / Postdoc @StanfordNLP / Computer Science PhD @EPFL_en . Computational Social Science, NLP, AI
karminski-牙医 @karminski3
24K Followers 1K Following A coder, road bike rider, server fortune teller, electronic waste collector, co-founder of KCORES, ex-director at IllaSoft, KingsoftOffice, Juejin.
Ásgeir Thor Johnson @asgeirtj
480 Followers 381 Following
Jack McKechnie @JackMcK1999
185 Followers 163 Following PhD student at the University of Glasgow. @TerrierTeam @IR_Glasgow
SemiAnalysis @SemiAnalysis_
34K Followers 16 Following
Niloofar (✈️ ACL) @niloofar_mire
7K Followers 2K Following Niloofar Mireshghallah — incoming asst. prof @LTIatCMU @CMU_EPP, RS in @AIatMeta, postdoc @uwcse, Ph.D. @ucsd_cse, former @MSFTResearch -Privacy, ML, NLP