A third-year master's student at Wuhan University, focusing on natural language processing. I'm actively looking for a PhD position for Fall 2025.
cooper12121.github.io · China · Joined December 2022
Since everyone wants to learn RL for language models now post DeepSeek, reminder that I've been working on this book quietly in the background for months.
Policy gradient chapter is coming together. Plugging away at the book every day now.
rlhfbook dot com
The Top ML Papers of the Week (May 27 - June 2):
- SimPO
- GNN-RAG
- Attention as an RNN
- Abacus Embeddings
- Symbolic Chain-of-Thought
- Contextual Position Encoding
...
I've been thinking about the many, MANY, DPO spinoff methods we've been seeing recently for rlhf.
IPO, D2PO, CPO, ORPO, SPO, sDPO, KTO, DNO...
Most claim they're "the best" but don't properly compare to related work. What do we do in alignment research?
Thread 📚
🎉 Exciting News! 🎉 Just open-sourced my latest project: Llama3-based 8x8b-MoE model! 🚀 Extends llama3-8B-Instruct model with MoE architecture. Check it out & give it a star! github.com/cooper12121/ll…
To close out 2023, here are 10 of the most interesting AI research advancements we shared on our feed this year — and where you can find more details on the work.
1️⃣ Segment Anything (SAM)
A step toward the first foundation model for image segmentation.
Details:…
Key vulnerabilities of GPT-4:
1. Fine-tuning API can remove or diminish safety guardrails, causing the model to produce harmful outputs or assist with dangerous requests
2. Fine-tuning can make the model generate targeted misinformation against public figures
3. Fine-tuning…
Foundation models are transforming society: in the past month alone, we've seen a flurry of releases!
GPT-4, Claude, PaLM API, Alpaca, Dolly, Jurassic-2, PaLM-E, GPT4All, Cerebras-GPT, OpenFlamingo, ...
We built Ecosystem Graphs to track their footprint:
crfm.stanford.edu/ecosystem-grap…
4 essential books anyone should read:
• Machine Learning with PyTorch and Scikit-Learn
• Transformers for NLP
• Deep Learning with Python
• Designing Machine Learning Systems
23 Followers · 138 Following · Research lab from UC Davis (@ucdavis) specializing in #NLP, #Multimodal, and #AI4Science (particularly on #LLMs and #VLMs). Directed by Prof. @lifu_huang
3K Followers · 6K Following · Biomedical Engineering researcher turned Systems Designer, Machine learning, AI + Robotics, cryptography etc. I fall in love with Machine learning every week :)
397 Followers · 663 Following · NLP PhD at @NTUsg | Previous @Cornell_CS | Working on LLMs for Scientific Discovery & Reasoning | Former Intern at @MSFTResearch | MOOSE & MOOSE-Chem series
613 Followers · 439 Following · PhD student at the University of Amsterdam / ILLC, interested in computational linguistics and (mechanistic) interpretability. Current Anthropic Fellow.
876K Followers · 52 Following · we invest in software eating the world
https://t.co/A9eTFq6plZ
https://t.co/MXGUBJoesw
Watch "The Ben & Marc Show": https://t.co/eRuDhx7kpe
21K Followers · 97 Following · The #1 AI Engineering podcast & newsletter. Technical insights and news today you will use at work tomorrow! Hosted by @swyx and @fanahova
6K Followers · 15 Following · AI researcher @deepseek_ai. Interested in reasoning ability of LLMs. The long-term research goal is to develop artificial general intelligence.
1K Followers · 749 Following · AI / NLP Researcher
Incoming faculty at @UBC_CS and @CAIDA_UBC
Postdoctoral fellow at @StanfordHAI @stanfordnlp
Former PhD student at @uwcse @uwnlp
he/him
4K Followers · 313 Following · University of Copenhagen Natural Language Understanding research group, led by @IAugenstein #NLProc #ML #dlearn
Funded by @ERC_Research @DFF_raad @VILLUMFONDEN
56K Followers · 854 Following · Figuring out AI @allen_ai, open models, RLHF, fine-tuning, etc
Contact via email.
Writes @interconnectsai
Wrote The RLHF Book
Mountain runner
9K Followers · 101 Following · Member of Technical Staff at Anthropic · AlphaGo, AlphaZero, MuZero, AlphaCode, AlphaTensor, AlphaProof · Gemini RL · Prev Principal Research Engineer at DeepMind
14K Followers · 2K Following · This Week in #MachineLearning & #AI (podcast) brings you the most interesting and important stories from the world of #ML and artificial intelligence.
11K Followers · 1K Following · I like tokens! I lead the OLMo data team at @allen_ai w/ @kylelostat. Open source is fun 🤖☕️🍕🏳️‍🌈 Opinions are sampled from my own stochastic parrot
711 Followers · 23 Following · Research team @allen_ai working on AI, HCI, ML, NLP, accessibility, and comp. social science in support of @SemanticScholar's mission of accelerating science.
15K Followers · 6K Following · I build tough benchmarks for LMs and then I get the LMs to solve them. SWE-bench & SWE-agent. Postdoc @Princeton. PhD @nlpnoah @UW.
20K Followers · 1K Following · @OpenAI Language agents (ReAct, Reflexion, Tree of Thoughts, SWE-agent, CoALA) for digital automation (WebShop, SWE-bench, tau-bench)
4K Followers · 2K Following · Research Scientist @NVIDIA focusing on efficient post-training of LLMs. Finetuning your own LLMs with LMFlow: https://t.co/UTykmQBwFr Views are my own.