my machine learning research account where i tell you abt all my sick experiments | pfp: me w/ https://t.co/XWwMkEg1a1 | personal account: @maxisawesome538maxisawesome.github.io San Francisco, CAJoined December 2022
Interesting trend in AI: the best results are increasingly obtained by compound systems, not monolithic models.
AlphaCode, ChatGPT+, Gemini are examples.
In this post, we discuss why this is and emerging research on designing & optimizing such systems.
bair.berkeley.edu/blog/2024/02/1…
Thrilled to announce Aya 🌿, a massively multilingual instruction-tuned LLM, featuring 101 languages and the largest collection of multilingual instruction datasets. Over half of these languages are under-resourced. A monumental effort from @CohereForAI and Aya team 🚀
Thrilled to announce Aya 🌿, a massively multilingual instruction-tuned LLM, featuring 101 languages and the largest collection of multilingual instruction datasets. Over half of these languages are under-resourced. A monumental effort from @CohereForAI and Aya team 🚀
LLMs improved using available data from the noisy Internet.
@CohereForAI researchers achieved unexpected results by pruning data.
Their research suggests removing most pretraining data while maintaining performance!
In 2022, we Launched the Cohere For AI Scholars Program to help close the gap between research experience and opportunity. In our inaugural year, we welcomed 6 talented researchers - @luizapzbn, @lekeonilude, @maxdoesresearch, @aahmadian_, @tedzadouri and Meriem Boubdir.
Really proud of our work led by @maxdoesresearch w @ahmetustun89@luizapzbn@W4ngatang@mziizm 🎉
LM datasets are huge. Is all text needed? How can we measure data quality in this setting? Enter data pruning: removing subsets least valuable while preserving performance.
Really proud of our work led by @maxdoesresearch w @ahmetustun89@luizapzbn@W4ngatang@mziizm 🎉
LM datasets are huge. Is all text needed? How can we measure data quality in this setting? Enter data pruning: removing subsets least valuable while preserving performance.
2K Followers 8K FollowingHuman&Digital Rights Activist
Telecommunications Engineer+CISCO Eng
Medoola Innovation&Policy Institute
Mechatronics Engineering Nelson Mandela University
371 Followers 5K FollowingUnderstanding people and instructing machines.
Thoughts and tweets are not mine or my employer's or my neighbour's. Probably they are yours.
2K Followers 3K FollowingPhD @uwaterloo🌲 IR & NLP | I like good evals🔎 l Prev: intern @DbrxMosaicAI @GoogleAI & RA @UKPLab | https://t.co/kxQprYr7Xn, https://t.co/YVvVjSyXOS, TREC-RAG and FreshStack! ✨
166 Followers 213 Followingresearcher @DBRXMosaicAI - i develop synthetic data and RL methods to test and improve agents. Ex @MSFTResearch and @Livermore_Lab. Ph.D. @PurdueECE
44K Followers 1K FollowingCTO at @Databricks and CS prof at @UCBerkeley. Working on data+AI, including @ApacheSpark, @DeltaLakeOSS, @MLflow, https://t.co/94gROE5Xa0. https://t.co/nmRYAKG0LZ
615 Followers 686 FollowingPerception for "embodied AI" at StackAV. Visiting Researcher @CMU_robotics. Formerly @motionaldrive @argoai. Opinions are my own.
1K Followers 2K FollowingInterested in making LLMs go brrrrr
x+1: MS @LTIatCMU
x: LLM @Zomato
x-N: https://t.co/ht5ObQh7RV & Program Synthesis with LLMs @ProseMsft
3K Followers 3K Followingfounder of rag startup, ex Pinterest Search / Homefeed, https://t.co/0VwMvjB9Xh, Altiscale, Google Ads, Search, Google Code Jam organizer
24K Followers 706 FollowingMember of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.
707 Followers 3K FollowingAI / ML / RL research @Mila_Quebec / @UMontreal, prev. research @Ualberta, @AmiiThinks, @rlai_lab. Open science community lead @Cohere_Labs .
2K Followers 597 FollowingOpen-endedness, Data-centric AI @LilaSciences
Previously: RS @synth_labs, PhD @ucsbNLP, Internships @AIatMeta @MSFTResearch
All puns are my own
46K Followers 1K FollowingWriter https://t.co/TquuQXlLOJ. O'Reilly Author https://t.co/Fl3uPAZHLg. LLM Builder @Cohere. Visualizing AI one concept at a time.
12K Followers 816 FollowingAuthor of Grokking Machine Learning, ML and QC popularizer, YouTuber: https://t.co/MGCYjf6M9K, Opinions are my own https://t.co/S1jvXnAa8U
No recent Favorites. New Favorites will appear here.