Xin Zhang | 张鑫 @xinzhangai
NLP | LLM, PhD student. Qwen3-Embed/GTE/GME. Past: intern @Ali_TongyiLab . Living w/ Long-Covid. izhx.github.io Hong Kong Joined August 2018-
Tweets182
-
Followers228
-
Following459
-
Likes1K
LoRA makes fine-tuning more accessible, but it's unclear how it compares to full fine-tuning. We find that the performance often matches closely---more often than you might expect. In our latest Connectionism post, we share our experimental results and recommendations for LoRA.…
@jxmnop Search will always be necessary. If you wanted to teach a human to be as smart as possible, you'd teach them how to find information and critical thinking, instead of trying to teach them everything about everything.
Two headache things I had with the existing Search API: 1) Needing an extra crawl API to get full context. 2) not very accurate filtering over date, which leads to search time contamination for time-sensitive data. Perplexity Search API looks like a good replacement.
Two headache things I had with the existing Search API: 1) Needing an extra crawl API to get full context. 2) not very accurate filtering over date, which leads to search time contamination for time-sensitive data. Perplexity Search API looks like a good replacement.
Should you fine-tune your embedding model? (Spoiler: probably not 𝘺𝘦𝘵) 𝘉𝘦𝘧𝘰𝘳𝘦 jumping into fine-tuning, ask yourself: is your retrieval pipeline actually failing because of domain-specific knowledge gaps, or could it be something simpler? Here's what to check first:…
Really excited to share that FreshStack has been accepted at #neurips25 D&B Track (poster)! 🥁🥁 Huge congratulations to all my @DbrxMosaicAI co-authors! Time to see you in San Diego! 🍻
-2016 (classic era): focus on data efficiency 2017-2025 (pretraining era): focus on compute efficiency 2026-: focus on data efficiency (again) The standard Transformer paradigm is optimized for compute efficiency. As we look at data efficiency, we'll see very different design…
-2016 (classic era): focus on data efficiency 2017-2025 (pretraining era): focus on compute efficiency 2026-: focus on data efficiency (again) The standard Transformer paradigm is optimized for compute efficiency. As we look at data efficiency, we'll see very different design…
Impressive models! 🚀 Congrats!!! We now have new SOTA encoder backbone. Our mGTE-MLM-base outperforms XLM-R-base in 2024 summer (or jan by training), its a "semi-modern" encoder 😃. Glad to see it stands as one of only two baselines, alongside XLM-R.
Impressive models! 🚀 Congrats!!! We now have new SOTA encoder backbone. Our mGTE-MLM-base outperforms XLM-R-base in 2024 summer (or jan by training), its a "semi-modern" encoder 😃. Glad to see it stands as one of only two baselines, alongside XLM-R.
New open-weight small embedding model surpass mGTE !! With Matryoshka Embedding 🪆
We benchmarked 10 optimizers and found that the recent new optimizers still have limited speed up (~10%) over Adam at a "larger" scale (1.2B, 8x data than Chinchilla optimal). I guess that means more research to be done in this area!
We benchmarked 10 optimizers and found that the recent new optimizers still have limited speed up (~10%) over Adam at a "larger" scale (1.2B, 8x data than Chinchilla optimal). I guess that means more research to be done in this area!
What BrowseComp+ confirms: (1) Better retrievers remain crucial in agentic search settings and just "grepping" is mostly a suboptimal strategy. Semantic embeddings enable getting to the answer faster and more often w.r.t BM25. (2) Having said that, agentic search is clearly the…
🚀 Introducing BrowseComp-Plus: A More Fair and Transparent Evaluation Benchmark of Deep-Research Agent. It is a new Deep-Research evaluation benchmark built on top of BrowseComp. It features - 📚 a fixed, carefully curated corpus of web documents - ✅ human-verified positive…
If you're a researcher or engineer releasing open science papers & open models and datasets, I bow to you 🙇🙇🙇 From what I'm hearing, doing so, especially in US big tech, often means fighting your manager and colleagues, going through countless legal meetings, threatening to…
Finally, a 45 page literature review of text embedding model, datasets, evaluation and training methods: arxiv.org/abs/2507.20783
Poster Booth 168, welcome and chat!!! #ACL2025NLP
Poster Booth 168, welcome and chat!!! #ACL2025NLP https://t.co/RuzqdBQkBy
We noticed in community discussions that when using Qwen3-embedding's GGUF models, some developers are not appending the special token <|endoftext|> at the end of the context. This can significantly hurt model accuracy. Check our Model Card (huggingface.co/Qwen/Qwen3-Emb…) for more…
‼️Sentence Transformers v5.0 is out! The biggest update yet introduces Sparse Embedding models, encode methods improvements, Router module for asymmetric models & much more. Sparse + Dense = 🔥 hybrid search performance! Details in 🧵

Siru Ouyang @Siru_Ouyang
1K Followers 956 Following CS PhD candidate @UofIllinois. Alumni @sjtu1896. Interned at @GoogleAI @TencentGlobal @MSFTResearch. LLMs, Agents, Reasoning
Amr Kayid @amr_kayid
107 Followers 4K Following 🪼🪄prev: @runwayml @Cohere 🐳 Research FORai / @CohereForAI 🧙♂️ @ManifoldRG @OpenMinedOrg 🕵 @TU_Muenchen 🤖🇩🇪🧠
Gregory @GRRRRRegor
7 Followers 354 Following
Khaime @BuildWithKhaime
127 Followers 228 Following AI Explorer | Engineer-in-Training 🛠️ | Web Designer | Documenting my journey in tech, AI, and innovation ✍🏽💡
云创兽Ai @Guumhaug438370
0 Followers 115 Following 🔍 dream chaser diving deep into stock investing! eager for pro tips. DM me for market news tips! 🎯 #Investing #Trading
Nandan Thakur @beirmug
2K Followers 3K Following CS PhD student @uwaterloo • previously intern @DbrxMosaicAI @GoogleAI, RA @UKPLab • IR+NLP research (https://t.co/kxQprYr7Xn, https://t.co/YVvVjSyXOS, TREC-RAG and FreshStack)
yu.zhang @yuzhang04174605
8 Followers 253 Following
Shengyao Zhuang @ShengyaoZhuang
307 Followers 294 Following Applied scientist at @amazon Working on information retrieval, NLP.
Mohamed Ismail @_medo_mi
64 Followers 348 Following Web Designer / Developer, SEO Specialist, Internet Professional, software engineer
Укрась прощ... @Mac_Jei
18 Followers 1K Following «Nisi solis nobis scripsimus» / Пишем только для нас самих. 🗣️ 🇷🇺 🇬🇪 🇦🇿 🇹🇷 🇺🇸 🔧 Big Data & AI stuff (DS/DA/DE/DevOps)
Fatih⏩⤴️ @taskinfatih
676 Followers 6K Following Lover of all novel and hard concepts: especially machine learning and systems theory
Yiqi Wang @YiqiWang119050
0 Followers 161 Following
ccrd11 @yihongliu272424
2 Followers 230 Following
Cannon Chen @cannon0102
2 Followers 369 Following
竹然枫 @heybro23333
1 Followers 474 Following
Juan Pablo Balarini @jpbalarini
451 Followers 1K Following Passionate about building stuff - @eagerworks co-founder. If you want to build a product, talk to me!
zzzzzzoo00oo @zzzzzzoo00oo
260 Followers 5K Following
Ticklish @ticklishgorilla
191 Followers 3K Following Ticklish about AI 🖖 | Cybersecurity explorer | Retro tech lover 📼 chasing shiny gadgets | Talking AI, tools and tech shaping tomorrow, one byte at a time.
AIStrikeSec ֎ @AIStrikeSec
124 Followers 2K Following AIStrikeSec : Empowering offensive security with cutting-edge AI. Smarter penetration testing & threat simulation.
Eric Tchirnhausen @tchirnhaus20039
30 Followers 5K Following Like to try new things you never know; trying to prove all software can be automated 😅 😅 😅 | ML/AI, | C++/Java/Go | GitHub : Dyl777
Robert Scoble @Scobleizer
543K Followers 24K Following The best from ML/AI community | Ex-Microsoft, Rackspace, Fast Company | Wrote eight books about the future | Silicon Valley robots, holodecks, BCIs, & startups.
Nico Nico @crist1an001
211 Followers 2K Following
ibm @i18nbigmouse
0 Followers 2K Following
Quetzal @Quetzalcoatl517
27 Followers 7K Following
Gill @Quqi21
307 Followers 5K Following
Sergio Soage @Sergio_Soage
876 Followers 6K Following artificial intelligence, math. Random stuff @ https://t.co/tqV9OIPsWE
zffl @zffl
14 Followers 846 Following
Bamya @0xBamya
26 Followers 643 Following Building transparent systems, from genesis 🧬 block to cloud | decrypting finance in nodes ⛓️
Jaymari Chua @JaymariChua
81 Followers 777 Following generative AI safety research; ex-Amazon big data engineer
Rishi Swami @rishi_swami
177 Followers 2K Following Data, AI/ML | Learning iteration # 1.1e4| Helping orgs extract value from data
Vedant @code_mesh15
20 Followers 490 Following Final year undergrad at @IITKgp. Prev: Undergraduate Research Fellow @PKU1898 , CMI , @TIFRScience
Jaidev Shah @JaidevShah4
519 Followers 3K Following @amazonscience | @microsoft AI | @columbia | agents, search and personalization
WX J @ppangxuan
2 Followers 125 Following
Raphaël Sourty @raphaelsrty
746 Followers 782 Following Language Models, Knowledge Bases, Knowledge Distillation PhD | AI @LightonIO
Siru Ouyang @Siru_Ouyang
1K Followers 956 Following CS PhD candidate @UofIllinois. Alumni @sjtu1896. Interned at @GoogleAI @TencentGlobal @MSFTResearch. LLMs, Agents, Reasoning
XiaomiMiMo @XiaomiMiMo
793 Followers 3 Following
hazyresearch @HazyResearch
9K Followers 1K Following A research group in @StanfordAILab working on the foundations of machine learning & systems. https://t.co/JHK58TDorG Ostensibly supervised by Chris Ré
Ant Ling @AntLingAGI
2K Followers 115 Following A series of open-source large models from Ant Group, Ling for LLM, Ring for Reasoning LLM, Ming for MLLM. See us at inclusionAI.
Zhihu Frontier @ZhihuFrontier
894 Followers 79 Following 🚀Bringing China's AI & tech trends, voices and perspectives to the global stage. ⚡️Powered by 知乎/https://t.co/OkIemRZdcj, China's leading knowledge community.
DatologyAI @datologyai
2K Followers 12 Following DatologyAI builds tools to automatically select and optimize the best data on which to train AI models, leading to better, smaller models which train faster.
HKUNLP @hkunlp2020
118 Followers 85 Following We are a group of researchers working on natural language processing in the Department of Computer Science at the University of Hong Kong.
Shengyao Zhuang @ShengyaoZhuang
307 Followers 294 Following Applied scientist at @amazon Working on information retrieval, NLP.
Manuel Faysse @ManuelFaysse
2K Followers 408 Following NLP Research, interning at FAIR @AIatMeta + PhD Candidate @CentraleSupelec Prev: @imperialcollege, @epfl
Antoine Chaffin @antoine_chaffin
2K Followers 590 Following 28, French CS Engineer 💻, PhD in ML 🎓🤖 — Guiding generative models for better synthetic data and building multimodal representations @LightOnIO — 🇫🇷🇬🇧
jietang @jietang
3K Followers 108 Following Professor @ Tsinghua University Artificial General Intelligence, Large Language Model
Imperial NLP @imperial_nlp
1K Followers 962 Following We are the Natural Language Processing community at @imperialcollege
JHU CLSP @jhuclsp
7K Followers 6K Following Center for Language and Speech Processing at @JohnsHopkins #NLProc #MachineLearning #AI https://t.co/6IXR5OSQtw @[email protected]
Hongyuan Mei @RoverHM
1K Followers 162 Following Core Contributor to Grok 4 & Grok 4 Heavy. Member of Technical Staff @xAI. Training knowledgeable AI reasoners. ex-@GoogleDeepMind, @TTIC_Connect, @jhuclsp.
Albert Gu @_albertgu
18K Followers 88 Following assistant prof @mldcmu. chief scientist @cartesia_ai. leading the ssm revolution.
Saining Xie @sainingxie
23K Followers 1K Following researcher in #deeplearning #computervision | assistant prof at @nyu_courant | rs @googledeepmind | past: rs @meta (FAIR) @ucsandiego | ynwa
Pascale Fung @pascalefung
3K Followers 46 Following Senior Director of A.I. Research, Meta-FAIR. Chair Professor of ECE, HKUST. Fellow of AAAI, ACL, IEEE, ISCA.
Google Research @GoogleResearch
23K Followers 16 Following Impossible? Let’s see. From algorithms to neuroscience to AI, Google Research strives to progress science, advance society & improve billions of people’s lives.
Peng Qi @qi2peng2
4K Followers 391 Following Research Lead @OrbyAI. Previously: @AWS AI, $JD AI, PhD @stanfordnlp, UG @Tsinghua_Uni. He/him. Opinions my own.
Tongyi Lab @Ali_TongyiLab
9K Followers 20 Following We advance the development of AGI and foster open source collaboration towards a smarter future.
jianlin.su @Jianlin_S
3K Followers 14 Following Grad&Clip is all you need @Kimi_Moonshot Blog: https://t.co/YVxsWykMw2 , Cool Papers: https://t.co/scS1n1o0lg
Jingtao Zhan @Jingtao_Zhan
262 Followers 203 Following PhD student at @Tsinghua_Uni @thuir_lab. #WebSearch | #NeuralRanking | #NLP.
Xinyu Crystina Zhang @crystina_z
667 Followers 694 Following PhD @UWaterloo ugrad @HKUST | prev. Google DM @cohere #CLOVA, MPI | Multilingual | IR | author of https://t.co/cPiuWIg8pW Mr. TyDi
Kaichao You @KaichaoYou
4K Followers 134 Following phd student in tsinghua university, working on @vllm_project
sean lee @xmlee97
155 Followers 404 Following 🧑🏫@ polyu | developing multimodal IR @mixedbreadai | ex-@alipay | Hiking | Semantics & Information Retrieval
Unitree @UnitreeRobotics
92K Followers 307 Following High performance civilian robot manufacturer. Please everyone be sure to use the robot in a Friendly and Safe manner. https://t.co/hI6LafokVm
Yu Su (hiring postdoc... @ysu_nlp
11K Followers 963 Following cooking something new | prof. @osunlp | sloan fellow | intelligence and agents | author of Mind2Web, SeeAct, MMMU, HippoRAG, BioCLIP, UGround.
xAI @xai
1.8M Followers 38 Following
Xiao Liu (Shaw) @ShawLiu12
574 Followers 167 Following PhD @Tsinghua @THUKEG Developing P-Tuning, ChatGLM, AgentBench, and AutoGLM. 📖 Sharing paper digest on LLMs.
Salesforce AI Researc... @SFResearch
18K Followers 382 Following We advance state-of-the-art #AI techniques paving the path for innovative products at @Salesforce. Focus areas: #AIAgents, #EnterpriseAI, #EGI, and #TrustedAI.
DeepSeek @deepseek_ai
972K Followers 0 Following Unravel the mystery of AGI with curiosity. Answer the essential question with long-termism.
Manling Li @ManlingLi_
8K Followers 760 Following Assistant Professor@NU, Amazon Scholar, Postdoc@Stanford, PhD@UIUC #NLP #CV Language+Vision/EmbodiedAI, Reasoning, Planning, Compositionality, Trustworthiness
wing.nus @wing_nus
580 Followers 301 Following Web IR / NLP Group at the National University of Singapore
Jiayi Pan @jiayi_pirate
13K Followers 2K Following 🧑🍳 Reasoning Agents @xAI | PhD on Leave @Berkeley_AI | Views Are My Own
Ningyu Zhang@ZJU @zxlzr
3K Followers 2K Following Associate Professor @ZJU_China. Research interests include NLP, LLM, KG, Agent, Knowledge Editing.
Yizhe Zhang @YizheZhangNLP
1K Followers 535 Following Research Scientist at Apple MLR | ex-researcher @ Microsoft Research, Meta AI | PhD @ Duke University
Tom Hosking @tomhosking
932 Followers 632 Following Model merging lead for Command A @cohere. Prev: PhD student in NLP @EdinburghNLP @Edin_CDT_NLP, @BloomsburyAI @UCL @DRWTrading
Luke Zettlemoyer @LukeZettlemoyer
10K Followers 2K Following
Violet Peng @VioletNPeng
7K Followers 518 Following Associated Professor@UCLA-CS. Research NLP, AI creativity, controllable generation, model evaluation, computational journalism, event. (she/her/hers)
Yunyao Li @yunyao_li
6K Followers 717 Following Bring GenAI and Knowledge Graph to enterprise systems. | Director of ML @Adobe Experience Platform | Previously @Apple @IBMResearch. Tweets are all mine.
Wang shuai (Dylan) @dylan_wangs
221 Followers 204 Following Postdoc and Finishing PhD student at @Ielabgroup in @UQSchoolEECS | @UWAEMS alumni RAG, IR, NLP, Model Efficiency BlueSky: dylanshuaiwang