Senior Researcher @TencentGlobal, working on LLMs.
Ph.D. at @UniMelb; Ex @BytedanceTalk, @MSFTResearch
timhuang1.github.io
Melbourne, Australia
Joined June 2016
🌺GPT-4o’s image generation is stunning — but how well does it handle complex scenarios? 🤔
We introduce 🚀CIGEVAL🚀, a novel method to evaluate models' capabilities in Conditional Image Generation 🖼️➕🖼️🟰🖼️. Find out how top models perform when conditions get truly…
These findings resonate with my impressions. AFAIC, structured prompting outperforms CoT & ICL by steering LLMs through workflows.
Great to see this ‘rebuttal’ backed by such rigorous analysis — reminds me of the insights in LLMs Cannot Self-Correct. We need more like this!
To Code, or Not To Code?
Exploring Impact of Code in Pre-training
discuss: huggingface.co/papers/2408.10…
Including code in the pre-training data mixture, even for models not specifically designed for code, has become a common practice in LLMs pre-training. While there has been…
🚀 A game-changer benchmark: LLM-Uncertainty-Bench 🌟
📚 We introduce "Benchmarking LLMs via Uncertainty Quantification", which challenges the status quo in LLM evaluation.
💡 Uncertainty matters too: we propose a novel uncertainty-aware metric, which tests 8 LLMs across 5…
FuseChat
Knowledge Fusion of Chat Models
While training large language models (LLMs) from scratch can indeed lead to models with distinct capabilities and strengths, this approach incurs substantial costs and may lead to potential redundancy in competencies. An alternative…
15K Followers 7K Following: I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
320 Followers 520 Following: Researcher at the Alibaba DAMO Academy, Singapore R&D Center | Former Visiting Postdoc Researcher at UIUC @uiuc_nlp | NLP PhD from CUHK @CUHKofficial
2K Followers 2K Following: Ph.D. Student @PrincetonCS. Prev @Stanford @UW @pika_labs @MSFTResearch @UofIllinois. I used to work on computer vision, but it's not all I do.
2K Followers 479 Following: Ph.D. student @HKUSTGuangZhou, Researcher @MetaGPT_, Cofounder of OpenManus, previously at RUC, Lenovo Research AI Lab, Zhipu AI.
227 Followers 570 Following: Second year PhD @UW | Post-Training, LLM reasoning and synthetic dataset.
https://t.co/cYAkbnCsCp
Open to chat and collaborate!
14K Followers 3K Following: research @MIT_CSAIL @thinkymachines. work on scalable and principled algorithms in #LLM and #MLSys. in open-sourcing I trust 🐳. she/her/hers
171 Followers 1K Following: Journal of Contemporary Eastern Asia (ISSN 2383-9449) is a refereed biannual journal that takes a lead on a new scholarship in Asia. Tweet by @zhang_dechun
408 Followers 1K Following: As long as I offer an abundance of solutions in artificial intelligence, so long I’m alive; a lack of solutions will foreshadow my extinction.
95K Followers 207 Following: LMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / @lmsysorg. We’re hiring: https://t.co/1OkfLq2n0I
614 Followers 205 Following: 🎓phd in Tsinghua University. Focus on RL, Embodied AI, and MLLM. 📖Author of limit-of-RLVR, phyworld, DeeR-VLA. 💼Seek a visit currently.
1K Followers 1K Following: RS @AIatMeta. AI search/reasoning/agent/safety. Previously Phd @ucsbnlp, BEng @tsinghua_uni. Opinions are my own
Fast learner with strong intellectual curiosity
583 Followers 552 Following: CS Ph.D. at National University of Singapore (🇸🇬NUS-PhDing)
CS & STAT B.S. at University of Illinois Urbana-Champaign (🇺🇸UIUC-BS)
792 Followers 960 Following: Multidisciplinary artist. Pushing creative boundaries with AI & photography. Exploring a wide range of topics. Capturing stunning visuals & creating magic