Priya Goyal @priy2201
Founding member @datologyai, ex-Google Deepmind, ex-Facebook AI Research (FAIR). prigoyal.github.io New York, USA Joined January 2012-
Tweets201
-
Followers1K
-
Following499
-
Likes773
🎉 Thrilled to announce that DatologyAI has been named to the CB Insights AI 100 list! 🏆 The DatologyAI team is committed to continuing to advance the field of AI and empowering organizations with high-quality data. Stay tuned for more exciting updates! 😀
Reminder about the ✨MARCH 25✨ submission deadline for the Responsible GenAI Workshop @CVPR!! We welcome 4-page papers at the intersection of responsible AI and generative AI. See full list of topics here: sites.google.com/view/cvpr-resp…
Introducing @datologyai — Making models better through better data, automatically! techcrunch.com/2024/02/22/dat…
A new startup by former FAIRies
Announcing our investment in @datologyai! Led by AI pioneers @arimorcos, @hurrycane & @leavittron, Datology is a data curation platform to reduce training costs & improve model performance. @sarahcat21 shares why the future of AI just got brighter: amplifypartners.com/blog-posts/dat…
Our mission @datologyai is to enable anyone to train powerful AI models by making data curation and optimization easy for everyone. Hear more about our mission here: datologyai.com/post/introduci…
I'm incredibly excited to announce our new company, @datologyai! Training models is hard and identifying the right data is the most important and difficult part -- our goal @datologyai to make optimizing training data at scale easy and automatic across modalities.…
I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks,…
Introducing the Perception Test, a new multimodal benchmark using real-world videos to help evaluate the perception capabilities of a model: dpmd.ai/dm-perception-… 1/
I learned a lot from my first #FAccT22 session this morning! Thanks to the authors — @priy2201, @ninamarkl, @noagarciad, Yusuke Hirota, and their teams — for the thought provoking papers & discussion!
"Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision" from FAIR Paris & NYC Large-scale experiment with 10 billion param RegNet pre-trained by SSL with SwAV on 1 billion random public Instagram photos. arxiv.org/abs/2202.08360
Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision abs: arxiv.org/abs/2202.08360 10B parameters dense model, outperforms sota models (supervised and self-supervised) trained on ImageNet on 20 out 25 image classification tasks
Self-supervised learning is really pushing the boundaries of what's possible with deep learning these days. This new paper showcases some of those applications; from improving visual representations to better model robustness and generalization. arxiv.org/abs/2202.08360
Insightful post from @priy2201 showing that self-supervised learning can allow us to avoid the western-bias of labelled datasets ai.facebook.com/blog/seer-an-i…
Soumith Chintala @soumithchintala
185K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.AI at Meta @AIatMeta
530K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.Lucas Beyer (bl16) @giffmana
56K Followers 445 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]elvis @omarsar0
188K Followers 482 Following Building with LLMs @dair_ai • Prev: Meta AI, Galactica LLM, PapersWithCode, Elastic, PhD • Creator of the Prompting Guide (~4M learners)Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pDelip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈Thomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-scienceJeff Dean (@🏡) @JeffDean
296K Followers 6K Following Chief Scientist, Google DeepMind and Google Research. Co-designer/implementor of things like @TensorFlow, MapReduce, Bigtable, Spanner, Gemini .. (he/him)Joelle Pineau @jpineau1
10K Followers 352 Following AI researcher. VP AI Research (FAIR), @AIatMeta. Professor of Computer Science, @mcgillu. Core academic member, @Mila_QuebecPensé FFun @inftyCategory
120 Followers 5K FollowingKyla Kelly @_Kyla_Kelly_
28 Followers 173 Following AI Enthusiast - Opinions and statements are my own.Me__lanie @lanie91441
4 Followers 970 Followingmikail khona @KhonaMikail
1K Followers 1K Following Incoming intern @nvidia | ex-intern @NttResearch studying LLMs | Physics PhD candidate, comp. neuro and deep learning, @MIT @mitbrainandcog @MIT_PhysicsVelvet_Vista @velvet_vis58205
1 Followers 201 Following Nice to meet you. My hobbies are reading, food and sports. I like cats😘 I like to meet new friends while traveling🎉🎉🎉Jacob Portes @JacobianNeuro
681 Followers 1K Following Research Scientist @MosaicMLxDatabricks. I like it when neuroscience inspires AI 🧠+🖥️Noatha @Noatha29899
1 Followers 469 FollowingExpedition_Elle @ElleExpedi76790
0 Followers 507 FollowingArif Ahmad @ArifAhm92263086
209 Followers 6K Following All things AI, Computer Science and Circuits! Prev. @GoogleAIvibha @vibhamasti
468 Followers 1K Following she/her. Master’s @LTIatCMU. Prev: ML @ Apple India. Bachelor’s @PESUniversity. Not a professional account.Tipheigh @tipheigh66076
0 Followers 441 FollowingNathan Benaich @nathanbenaich
51K Followers 32K Following solo member of investment staff @airstreet, brewing ambition @airstreetcafe, next token predictor @airstreetpressEliza_beth8 @Beth8Eliza57759
2 Followers 1K FollowingDebesh Jha @debesh_jha
509 Followers 2K Following Senior Research Associate | Researching on AI in Medicine @NURadiology @NorthwesternU | PhD in computer science @simula_research @UiTNorgesarktis | @chosun_univEthrRpl_93 @ethrrpl81333
0 Followers 607 Following_Review @Review1221817
4 Followers 913 FollowingAva_Garcia @AvaGarcia162861
4 Followers 1K FollowingJosh Wills @josh_wills
18K Followers 2K Following Engineering at @datologyai; @duckdb enthusiast, ex-@slackhqDmytro Dzhulgakov @dzhulgakov
3K Followers 572 Following Co-founder and CTO @FireworksAI_HQ. PyTorch core maintainer. Previously FB Ads. Ex-Pro Competitive ProgrammerPavel Izmailov @Pavel_Izmailov
6K Followers 1K Following Working on LLM reasoning @OpenAI 🤖 Incoming Assistant Professor @nyuniversity 🏙️ #StopWar 🇺🇦Saurabh Shah @saurabh_shah2
504 Followers 980 Following ML Engineer @Apple /Siri NLU, prev @allen_ai @Penn …. 🎤dabbler in standup comedy and music 🎸… 🐈⬛enjoyer of cats 🐈 and mountains🏔️ …he/himAkash Gokul @AkashGokul_
8 Followers 1K Following3_JessW_JAS8 @3Jas83647
27 Followers 347 FollowingCoen Mouton @CoenMouton
81 Followers 504 Following 🎓 PhD Student in South Africa. Research focus is on decision boundaries in DNNs - generalization and adversarial robustness.Bartosz Cywinski @bartoszcyw
20 Followers 446 FollowingTosothe @tosothe64749
4 Followers 235 Following 我是汽車貿易商,專營各大廠牌汽車進口,每輛車都擁有美國當地第三方認證證書,嚴謹把關品質、出港船運、進港報關、專業車測領牌皆一手包辦,一條龍的服務可以替您簡省許多成本Jordan Gong @jordan__gong
39 Followers 2K FollowingHarshay Shah @harshays_
406 Followers 445 Following ML PhD student at MIT, advised by @aleks_madry Previously: @googleai @msftresearch @illinoiscsAlexander D'Amour (al.. @alexdamour
4K Followers 1K Following Research Scientist at Google Brain. Statistics, Data Science, ML, causality, fairness. Prev at Harvard (PhD), UC Berkeley (VAP). Opinions my own. he/him.Angéline Pouget @angelinepouget
25 Followers 82 Following Student Researcher @ Google DeepMind | Data Science MSc @ ETH Zürich | 2022 Excellence ScholarIbrahim Alabdulmohsin.. @ibomohsin
907 Followers 582 Following AI Research Scientist at @GoogleDeepmindMeetasl @meetasl91756
6 Followers 226 Following 我是汽車貿易商,專營各大廠牌汽車進口,每輛車都擁有美國當地第三方認證證書,嚴謹把關品質、出港船運、進港報關、專業車測領牌皆一手包辦,一條龍的服務可以替您簡省許多成本Aflah 🍉🕊️ @Aflah02101
180 Followers 981 Following Researching @mpi_sws_, @lcs2lab & @AiEleuther • Prev @GoldmanSachs • GSoC @TensorFlow • Senior @IIITDelhi • #CEASEFIRENOW 🕊️Mayank maurya @maurya7mayank
51 Followers 158 Following https://t.co/HN01B3mXSb.. LLB. constitution ( favourite subject) Relationship ..❣️... SingleGAO Hongnan @gaohongnan
37 Followers 297 FollowingPraveen — e/acc @pravnx
364 Followers 1K Following Software Engineer; Interests: HPC, AI, Product Management, EntrepreneurshipMakya @Makya12345678
5 Followers 960 FollowingAditya Kusupati @adityakusupati
3K Followers 2K Following 🔬PhD.. @uwcse: @RAIVNLab; Been places..... Done things....Krish Dasgupta @officialKrishD
868 Followers 4K Following Forever Learner | Building Reinforcement Learning Systems | Healthcare | Robots and Brains | Graph ML for HealthLuke McDermott @lukemcdermotttt
117 Followers 430 Following AI Researcher at Modern Intelligence, focusing on Efficient Deep Learning & Adaptive AI | Incoming PhD @UCSanDiegoAlon Albalak @AlbalakAlon
878 Followers 461 Following CS PhD candidate at @ucsbNLP. Research: Data-centric AI, Efficiency in ML, NLP.Sumukh Aithal @sumukhaithal6
44 Followers 453 Following Graduate Student at CMU. Interested in solving research problems in machine learning.Navreet Kaur @navreeetkaur
169 Followers 551 Following Research in NLP, Responsible AI, HCI, AI Ethics. She/herSenaBeren @findingmerit
287 Followers 3K FollowingYann LeCun @ylecun
709K Followers 718 Following Professor at NYU. Chief AI Scientist at Meta. Researcher in AI, Machine Learning, Robotics, etc. ACM Turing Award Laureate.Andrej Karpathy @karpathy
977K Followers 904 Following 🧑🍳. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets 🧠🤖💥AK @_akhaliq
309K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80GxSoumith Chintala @soumithchintala
185K Followers 877 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.AI at Meta @AIatMeta
530K Followers 255 Following Together with the AI community, we are pushing the boundaries of what’s possible through open science to create a more connected world.Google DeepMind @GoogleDeepMind
942K Followers 275 Following We’re a team of scientists, engineers, ethicists and more, committed to solving intelligence, to advance science and benefit humanity.Lucas Beyer (bl16) @giffmana
56K Followers 445 Following Researcher (Google DeepMind/Brain in Zürich, ex-RWTH Aachen), Gamer, Hacker, Belgian. Mostly gave up trying mastodon as [email protected]elvis @omarsar0
188K Followers 482 Following Building with LLMs @dair_ai • Prev: Meta AI, Galactica LLM, PapersWithCode, Elastic, PhD • Creator of the Prompting Guide (~4M learners)PyTorch @PyTorch
379K Followers 77 Following Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundationPaul Graham @paulg
1.9M Followers 772 FollowingKyunghyun Cho @kchonyc
60K Followers 2K Following a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).rohan anil @_arohan_
12K Followers 2K Following Principal Engineer, @GoogleDeepMind Gemini. prev PaLM-2. Tinkering with optimization and distributed systems. opinions are my own.François Fleuret @francoisfleuret
30K Followers 473 Following Prof. @Unige_en, Adjunct Prof. @EPFL_en, Research Fellow @idiap_ch, co-founder @nc_shape. AI and machine learning since 1994. I like reality.Eric Jang @ericjang11
69K Followers 3K Following physical AGI at 1X. Author of "AI is Good for You" https://t.co/eFg4WXhg0pAndrew Ng @AndrewYNg
1.0M Followers 909 Following Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs(((ل()(ل() 'yoav))).. @yoavgo
46K Followers 2K FollowingKosta Derpanis @CSProfKGD
48K Followers 198 Following #CS Associate Prof @YorkUniversity, #ComputerVision Scientist Samsung #AI, @VectorInst Faculty Affiliate, TPAMI AE, #CVPR2024/#ECCV2024 Publicity Co-chairPedro Domingos @pmddomingos
78K Followers 167 Following Professor of computer science at UW and author of 'The Master Algorithm' and '2040'. Into machine learning, AI, and anything that makes me curious.Dwarkesh Patel @dwarkesh_sp
54K Followers 700 Following Being pretrained Host of Dwarkesh Podcast https://t.co/3SXlu7fy6N https://t.co/rEhnfYywXY https://t.co/hQfIWdM1UnJacob Portes @JacobianNeuro
681 Followers 1K Following Research Scientist @MosaicMLxDatabricks. I like it when neuroscience inspires AI 🧠+🖥️lmsys.org @lmsysorg
36K Followers 172 Following Large Model Systems Organization. We created Vicuna and Chatbot Arena! Compare 30+ LLMs (GPT-4/Claude/Llamas) side-by-side at https://t.co/IDFeIDIOtmRyan Lowe @ryan_t_lowe
5K Followers 357 Following what is the place from which we are creating? ❤️✨🤠❤️Andreas Kirsch 🇮�.. @BlackHC
9K Followers 5K Following Past: 🧑🎓 DPhil @AIMS_oxford @ExeterCollegeOx @UniofOxford (4.5yr) 🧙♂️ RE @DeepMind (1yr) 📺 SWE @Google (3yrs) 🎓 @TU_Muenchen 👤 Fellow @nwspkSasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGzColin Raffel @colinraffel
30K Followers 655 Following nonbayesian parameterics, sweet lessons, and random birds. Friend of @srush_nlpPiotr Padlewski @PiotrPadlewski
1K Followers 319 Following Chief Meme Officer @ https://t.co/CtBrcKmliI, ex-Google Deepmind/Brain ZurichJosh Wills @josh_wills
18K Followers 2K Following Engineering at @datologyai; @duckdb enthusiast, ex-@slackhqDmytro Dzhulgakov @dzhulgakov
3K Followers 572 Following Co-founder and CTO @FireworksAI_HQ. PyTorch core maintainer. Previously FB Ads. Ex-Pro Competitive Programmerclem 🤗 @ClementDelangue
90K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform to build machine learningPercy Liang @percyliang
49K Followers 408 Following Associate Professor in computer science @Stanford @StanfordHAI @StanfordCRFM @StanfordAILab @stanfordnlp | cofounder @togethercompute | Pianistswyx @swyx
91K Followers 3K Following Anti-ego ideas for anti-ergodic life. Founder, @smolmodels ▹ Listen: @latentspacepod ▹ Read: @coding_career ▹ Join: @aiDotengineerPavel Izmailov @Pavel_Izmailov
6K Followers 1K Following Working on LLM reasoning @OpenAI 🤖 Incoming Assistant Professor @nyuniversity 🏙️ #StopWar 🇺🇦Adam Wolff @dmwlff
8K Followers 496 Following Engineering @ElectricCapital ⚡️ Formerly Facebook & Robinhood Avid cook, dedicated snow person Views are my own, not investment adviceSarah Catanzaro @sarahcat21
12K Followers 1K Following “All methods are sacred if they are internally necessary” (GP @amplifypartners, prev @canvasvc; Head of Data @Mattermark; @palantirtech; @c4ads)Ofir Press @OfirPress
10K Followers 3K Following I build tough benchmarks for LMs and then I get the LMs to solve them. Postdoc @Princeton. PhD from @nlpnoah @UW. Ex-visiting researcher @MetaAI & @MosaicML.Mike Lewis @ml_perception
6K Followers 227 Following Llama3 pre-training lead. Partially to blame for things like the Cicero Diplomacy bot, BART, RoBERTa, kNN-LM, top-k sampling & Deal Or No Deal.Ari Morcos @arimorcos
6K Followers 2K Following CEO and Co-founder @datologyai working to make it easy for anyone to make the most of their data. Former: RS @AIatMeta (FAIR), RS @DeepMind, PhD @PiN_Harvard.sarah guo // convicti.. @saranormous
91K Followers 3K Following startup investor and builder, founder @w_conviction. accelerating AI adoption, interested in progress. tech podcast: @nopriorspodJack Urbanek @JackUrbs
370 Followers 15 Following I'm a Founding Member of DatologyAI's technical staff, working to solve automated data curation for ML training at scale. Formerly worked at FAIR.Pratyush Maini @pratyushmaini
1K Followers 336 Following Trustworthy ML | PhD student @mldcmu | Founding Member @datologyai | Prev. Comp Sc @iitdelhiGroq Inc @GroqInc
44K Followers 466 Following Creator of the LPU™ Inference Engine, providing the fastest speed for AI applications, designed & engineered in N. America https://t.co/DsEqVAC5DpMagic.dev @magicailabs
10K Followers 3 Following Magic is working on frontier-scale code models to build a coworker, not just a copilot. Come join us: https://t.co/hGZKtUzsR3ElevenLabs @elevenlabsio
64K Followers 11 Following Research lab exploring new frontiers of Voice AI. Building tools for long-form speech synthesis, voice cloning and dubbing.Johnny Ho @randomjohnnyh
3K Followers 175 Following Cofounder, chief strategy officer @perplexity_ai. Former high frequency trader, competitive programmer.josh @j_mcgraph
522 Followers 801 Following founding member of @datologyai | PhD student @dsg_uwaterloo | poodle enthusiastBogdan Gaza @hurrycane
2K Followers 2K Following co-founder & CTO @DatologyAI working to make it easy for anyone to make the most of their data, hax0r, ex-@Twitter & Amazon EngineeringDatologyAI @datologyai
961 Followers 17 Following DatologyAI builds tools to automatically select and optimize the best data on which to train AI models, leading to better models which train faster.Ledell Wu @LedellWu
696 Followers 235 Following AI Research Scientist (Generative AI/LLM/Multimodal) Co-founder @CreatifyLab, Past: FAIR @MetaAI, @BAAIBeijing Recipient of ICML 2023 Test-of-Time AwardDan Shipper 📧 @danshipper
46K Followers 2K Following co-founder / ceo @every | | how to think, create, and relate with @ChatGPTappJacob Austin @jacobaustin132
3K Followers 796 Following @Google @DeepMind researcher. AI for math and science. Coding. Gemini. I also play piano. NYC. Opinions my ownJan Leike @janleike
44K Followers 322 Following ML Researcher, co-leading Superalignment @OpenAI. Optimizing for a post-AGI future where humanity flourishes.OpenAI Developers @OpenAIDevs
71K Followers 0 Following Official @OpenAI account for anyone building on our APIs. Join us in building the future of AI. We ❤️ developers!Thomas Scialom @ThomasScialom
6K Followers 227 Following AGI Researcher @MetaAI -- Lead Llama 2 and Postraining Llama 3. Also CodeLlama, Galactica, Toolformer, Bloom, Nougat, GAIA, ..Jascha Sohl-Dickstein @jaschasd
19K Followers 622 Following Member of the technical staff @ Anthropic. Most (in)famous for inventing diffusion models. AI + physics + neuroscience + dynamics.ok, there we have it from @elonmusk on tesla's earnings call today new h100 count = 35,000 and coming up to 85,000 by year end
Apple dropping absolute FACTS! Give them kudos: huggingface.co/papers/2404.14…
this is exciting! hasn't crossed the truly open line until all the data and stuff is actually available, but promising model and tons to learn from it.
Introducing Snowflake Arctic. An efficiently intelligent and truly open LLM built by Snowflake.
Doing an AI infra startup is like the moon mission: you calculate the burn carefully to get into orbit, you maneuver well to land a good exploration spot, and you make sure you have enough supplies to carry you back to earth to call it a success.
The GPT4 of datasets took down Hugging Face, sorry all 😅😅😅
It's a great week for open source AI! Data is among the highest impact work to push the field forward. Bravo to 🤗
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
15T tokens DataLoader, you're welcome
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
Its genuinely hard to believe a 70B model is up there with the 1.8T GPT4? I guess training data really is everything
🧵 A nice LLAMA-3 summary by @natolambert. In short, performance is better, but largely due to the sheer increase in the pre-training scale. Training data is a secret. ↩️
Instead of leaf blowers, I want a quiet little robot that picks leaves up one at a time and puts them in a bag, at night while I'm sleeping.
🎉 Thrilled to announce that DatologyAI has been named to the CB Insights AI 100 list! 🏆 The DatologyAI team is committed to continuing to advance the field of AI and empowering organizations with high-quality data. Stay tuned for more exciting updates! 😀
Penzai is one of the coolest ML libraries out there. Not only can you inspect every weight matrix and attention head in a Colab, you can trivially knock out heads, skip or repeat layers, or extract intermediates with a one line change. A beautiful tool for interpretability.
Excited to share Penzai, a JAX research toolkit from @GoogleDeepMind for building, editing, and visualizing neural networks! Penzai makes it easy to see model internals and lets you inject custom logic anywhere. Check it out on GitHub: github.com/google-deepmin…
The model card has some more interesting info too: github.com/meta-llama/lla… Note that Llama 3 8B is actually somewhere in the territory of Llama 2 70B, depending on where you look. This might seem confusing at first but note that the former was trained for 15T tokens, while the…
@dwarkesh_sp can never see myself move to the Bay; despite the AI/Tech density :) New York is an addiction
Today Meta released Llama 3! Congrats to the team. In their blog post they wrote that, "the curation of a large, high-quality training dataset is paramount", while providing almost no information about how it was made, how it was filtered, or its contents.
Congrats to @AIatMeta on Llama 3 release!! 🎉 ai.meta.com/blog/meta-llam… Notes: Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ @lmsysorg :)) 400B is still training, but already encroaching…
AI cloud.
Pitch your service/product in 2 words. This one is a bit more difficult.