Machine Learning PhD student @mldcmu
Duke '19 BS in Math and CS
Student researcher @google
Past: applied scientist intern at Amazon @awscloud Pittsburgh, PAJoined February 2015
Many LLM fine-tuning methods. Unclear what you should use & why?
In our new paper, we did an extensive study of on-policy RL, supervised & offline contrastive methods (DPO, IPO) to answer this... 🧵⬇️
On-policy > offline, mode-seeking > mode-covering
understanding-rlhf.github.io
This has all the trappings of becoming a permanent reference. I have greatly admired the work of one of the authors for a long time. Nice to see this in book form! arxiv.org/abs/2403.14606
Just got back from vacation, and super excited to finally release Griffin - a new hybrid LLM mixing RNN layers with Local Attention - scaled up to 14B params!
arxiv.org/abs/2402.19427
My co-authors have already posted about our amazing results, so here's a 🧵on how we got there!
I'm excited to be back in the classroom for CMU 11-711 Advanced NLP this semester! phontron.com/class/anlp2024/
We revamped the curriculum to take into account recent advances in LLMs, and we have a new assignment "build-your-own-LLaMa".
We'll be posting slides/videos going forward.
1K Followers 5K FollowingThe DAO investor. Early @Aleph__zero inv. Decentralization. Born on Vikings island called Jomsborg. Applied math. My posts are not financial advise.
39 Followers 5K FollowingHi I'm Rais. I'm mainly focussing on Math and Science lifelong. There is a lot to discover in these fields and my mind is always blown by all the cool things.
1K Followers 1K FollowingI support Recommendations CoreML team at @meta, and build large scale recommender systems(model & infra) for FB Reels, IG Reels and In Feed Recommendations.
87 Followers 458 FollowingLearning enthusist ,full of boyish charm and verbal puffery, inspired by buck mulligan, stephan dedulas and all the other fearful jesuits.
2K Followers 776 FollowingPhD at IDSIA with @SchmidhuberAI. Working on self-improving AI that generalizes (MetaGenRL, VSML, GPICL). @DeepMind @GoogleAI intern, @UCL, @HPI_DE alumnus.
45 Followers 208 FollowingPhD student at 🤖 @whi_rl and @flair_ox 🤖 First Class MEng from Oxford 🎓 I love machine learning, artificial intelligence and fantasy books 🐉🧙
2K Followers 2K FollowingAssistant professor in computer science, University of Toronto | @UofTRobotics @VectorInst | Working on robotics, vision, and machine learning.
2K Followers 2K FollowingPhD in machine learning & optimization, now postdoc at EPFL. Julia language enthusiast. Amateur songwriter (aka PianoHamster). OCD survivor.
2K Followers 575 FollowingResearcher at @GoogleDeepMind. PhD student at @VectorInst / @UofT. Building tools to study neural nets and find out what they know. He/him.
6K Followers 227 FollowingLlama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.
2K Followers 615 FollowingAssociate Prof. at SJTU, leading GAIR Lab (https://t.co/Nfd8KmZx3B) Co-founder of Inspired Cognition, Postdoc at @LTIatCMU, Previously FNLP, @MILAMontreal,
10K Followers 3K FollowingI build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.
5K Followers 22 Following@Harvard Professor of MCB & Physics and Director of Swartz Program in Theoretical Neuroscience;
@HebrewU Professor of Physics and Neuroscience (Emeritus)
39K Followers 96 FollowingThe AI development platform - From idea to AI, Lightning fast ⚡️. Creators of AI Studio, PyTorch Lightning... Get help: https://t.co/a69wnEARV9