Quentin Lhoest @qlhoest
Open Source ML Engineer @huggingface | Maintainer of 🤗Datasets huggingface.co/lhoestq Joined September 2013
897 Tweets, 3K Followers, 227 Following, 2K Likes
Can you guess where these photos were taken? @geoguessr players have taken this skill to the extreme, but how good can an AI perform? Introducing OpenStreetView-5M 🌍, the first open-access and global-scale dataset of street view images. 🔗 Links and 🤖 demo below 👇 #CVPR2024
Let's go!! Common Voice 17 - now on the Hub! 🔥 With 31,000 hours of audio (& transcriptions) across 124 languages. *sound on 🎶* 847 hours of data were added in CV 17, along with 493 hours of validated data. Four new languages have been added to this edition: Haitian…
🌎 Better AI is better data, and for better data we need expertise! As part of the 'Data is Better Together' project in collaboration with Hugging Face, we bring the Domain Specific Datasets. You can read more in this post: huggingface.co/blog/burtensha… 🤗
I published my filtered and uncensored dataset for Dolphin-2.9 on @huggingface so if you wanna make your own spin on Dolphin, or just see how Dolphin is created, you can check it out. Thanks to all the upstream dataset creators for open source data! huggingface.co/datasets/cogni…
ZeroGPU is free distributed GPUs in HF Spaces 🔥 ⬇️ will give access to 100 new people in the next hours
Announcing that we are on our way to solve a long standing issue of document processing: correction of OCR mistakes. @pleiasfr publishes the largest dataset to date with automated OCR correction, 1 billion words in English, French, German and Italian huggingface.co/datasets/PleIA…
Llama3 was trained on 15 trillion tokens of public data. But where can you find such datasets and recipes?? Here comes the first release of 🍷Fineweb. A high quality large scale filtered web dataset out-performing all current datasets of its scale. We trained 200+ ablation…
We have just released 🍷 FineWeb: 15 trillion tokens of high quality web data. We filtered and deduplicated all CommonCrawl between 2013 and 2024. Models trained on FineWeb outperform RefinedWeb, C4, DolmaV1.6, The Pile and SlimPajama!
AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx
merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papers
clem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
Omar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽
Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique
Hugging Face @huggingface
345K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhate
Thomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-science
Nate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningai
Sasha Luccioni, PhD.. @SashaMTL
19K Followers 4K Following AI & Climate @HuggingFace, Board Member of @WiMLworkshop and @ClimateChangeAI. @techreview 35 Innovators under 35, @TEDTalks speaker. She/her/Dr/ 🦋
Lewis Tunstall @_lewtun
9K Followers 425 Following 🤗 LLM engineering & research @huggingface 📖 Co-author of "NLP with Transformers" book 💥 Ex-particle physicist 🤘 Occasional guitarist 🇦🇺 in 🇨🇭
MMitchell @mmitchell_ai
80K Followers 1K Following Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Same content in the Sky, Threads, & the Prehistoric Elephant
Niels Rogge @NielsRogge
10K Followers 690 Following ML Engineer @ML6team, part-time at @huggingface. @KU_Leuven grad. General interest in machine learning, deep learning. Making AI more accessible for everyone!
abhishek @abhi1thakur
81K Followers 663 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub Star
Delip Rao e/σ @deliprao
46K Followers 5K Following Busy inventing the shipwreck. @Penn. Past: @johnshopkins, @UCSC, @Amazon, @Twitter ||Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈
Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGz
Zach Mueller @TheZachMueller
10K Followers 393 Following 🤗 Technical Lead for the Accelerate Project | Passionate about Open Source | Nerd who enjoys touching the grass | #ADHD | He/Him
Abubakar Abid @abidlabs
12K Followers 1K Following Hind Rajab. 5 yrs old. She + 14,000 children killed by Israeli forces. PLEASE don't be silent. Take 5 min to call your reps and urge peace (link in bio)
Leandro von Werra @lvwerra
6K Followers 310 Following Machine learning @huggingface: co-lead of @bigcodeproject and maintainer of TRL.
Ross Wightman @wightmanr
18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.
Florent Daudens @fdaudens
11K Followers 6K Following Press Lead @HuggingFace / Passionate about AI & news / Previously @radiocanadainfo @ledevoir & co
Loic Landrieu @captnloic
196 Followers 200 Following Machine learning and computer vision researcher. I focus on problems with large-scale geospatial structure.
Zhaoyang Chu @zhaoyang_c68411
8 Followers 365 Following CS Master@HUST. Interested in SE+ML, specifically focusing on building trustworthy and reliable AI-based software systems. Seeking PhD starting in 2025 Fall.
Tiezhen WANG @Xianbao_QIAN
917 Followers 355 Following Engineer at HuggingFace, ex-Googler on TFLite / micro. Ideas are my own.
Milin Bhade @MilinBhade
59 Followers 1K Following Post Grad Student at IISc, Bangalore Masters in Computer Science & Automation
yianan @yianan
40 Followers 1K Following
安餒啊 @qiu48939
2 Followers 19 Following
ontocord @ontocord
355 Followers 130 Following We dedicate ourselves to bringing lawful and effective data to AI training so that everyone can benefit from human knowledge. https://t.co/j0WMJCJzVB
Cheng Yang @yangcheng
291 Followers 2K Following co-founder https://t.co/O4DxnTK18e ex Scale AWS Uber LinkedIn
Julien Le Dem @J_
4K Followers 2K Following Architect, Founder, Angel, Advisor, OSS: @OpenLineage @MarquezProject, ASF: Parquet Arrow Iceberg 🐖. 🦋 https://t.co/4VQUXaZ5vu . he/him
Yenting Lin @yentinglin56
228 Followers 1K Following Research intern at @Nvidia Incoming research intern at @AIatMeta GenAI Previous scientist intern at @Amazon CS Ph.D. candidate at National Taiwan University
Volodymyr Kyrylov @darkproger
2K Followers 2K Following AI student at USI/ETH. Donate https://t.co/GDSkWG2tak
Vitor Zucher | ויט.. @vmzucher
273 Followers 797 Following 2x Founder (1x Bootstrapped, 1x Seed $5M) - Acquired by IC 23' I do sales, marketing, code, data, product & growth. Zionist. Tech-Optimist. e/acc.
Ferdinand Mom @FerdinandMom
129 Followers 558 Following Large scale training @HuggingFace. Average CPU & CUDA optimization enjoyer
~wa kimani @ngarawakimani
38 Followers 536 Following Software Engineer / Software Craftsman Github: https://t.co/0qUXMUFjgj
Prashant Dixit @Prashant_Dixit0
172 Followers 775 Following AI/Computer Vision/LLM Researcher | Open-source ML | Building cool and exciting Stuff Connect- https://t.co/8wrqNPc2kP
Lynncc @Lynncc6
210 Followers 500 Following Actively seeking AI industry positions | Marketing | Contributing to ML community | LLM Research
lucacadalora @lucaxyzz
7K Followers 6K Following Admin https://t.co/oBJ2sf2RgV. ChatGPT Plus 100rb/bulan https://t.co/mkLkWLY9xd.
Shukant Pal शुक.. @ShukantP
360 Followers 1K Following Machine Learning @getlindy. @getfacade. @PixiJS. @UTAustin. @OhioState. Previously @getTeamflow.
shulei @qinshulei
38 Followers 675 Following
Christophe Cerisara @ccerisara
125 Followers 358 Following CNRS researcher in computer science, speech recognition and natural language processing
Jonah Turner @drexalt
328 Followers 939 Following grinding ml, current master's student 🇫🇷 e/acc - gpu kernels - computer vision
Tony Carter @xtremesecurity
705 Followers 5K Following
Deping Zhang @joebradly
94 Followers 3K Following
Eter Griffin @EterGriffinthor
239 Followers 3K Following Nōn nōbīs, Domine, nōn nōbīs, sed nōminī tuō dā glōriam
Vishwas Karhade @vkarhade
84 Followers 1K Following Director Success Architect at https://t.co/uPcdJ7TsLI, Salesforce CTA, Passionate about Technology and Engineering. All opinions are my personal opinions.
MrDee@SOG 🫡 @sog_on_bird_app
1K Followers 780 Following Zeroth principle thinking. LLM enjoyor. The easiest person to fool is yourself. There is no absolute. The open world awaits you. #ProofOfWork
Max Zanoga @zanoga
210 Followers 935 Following 🚀 AI Enthusiast | 🔍 Exploring the frontiers of LLMs & ML | 📚 Lifelong learner & tech optimist
Alwin K Lonappan @AlwinKLonappan
8 Followers 103 Following
Juanjo do Olmo - .. @Claxterix
496 Followers 2K Following Please don't take my retweets too serious. - Twitter is a social experiment 😇. Medical AI Researcher @foundation29feb / BSc Pharma / AI from @iia_es
Abdulrahman Tabaza @embed_dim
4 Followers 809 Following enjoyer of various vector spaces, encoders and modalities
AK @_akhaliq
310K Followers 3K Following AI research paper tweets, ML @Gradio (acq. by @HuggingFace 🤗) dm for promo follow on Hugging Face: https://t.co/q2Qoey80Gx
merve @mervenoyann
56K Followers 4K Following open-sourceress at @huggingface 🧙🏻♀️ proud mediterrenean 🍋 I do TL;DR on ML papers
clem 🤗 @ClementDelangue
91K Followers 5K Following Co-founder & CEO @HuggingFace 🤗, the open and collaborative platform for AI builders
Omar Sanseviero @osanseviero
32K Followers 2K Following Chief Llama Officer @huggingface 🦙 Founder @AI_Learners. Xoogler (SWE @Google Assistant, 20% PM TF Graphics). 100% Hacker Llama🇵🇪🇲🇽
Julien Chaumond @julien_c
47K Followers 1K Following Co-founder and CTO at @huggingface 🤗. ML/AI for everyone, building products to propel communities fwd. @Stanford + @Polytechnique
Hugging Face @huggingface
345K Followers 189 Following The AI community building the future. https://t.co/VkRPD0VKaZ #BlackLivesMatter #stopasianhate
Thomas Wolf @Thom_Wolf
68K Followers 4K Following Co-founder and CSO @HuggingFace - open-source and open-science
Nate Raw @_nateraw
7K Followers 1K Following machine learning hacker. previously @huggingface @lightningai
Sasha Luccioni, PhD.. @SashaMTL
19K Followers 4K Following AI & Climate @HuggingFace, Board Member of @WiMLworkshop and @ClimateChangeAI. @techreview 35 Innovators under 35, @TEDTalks speaker. She/her/Dr/ 🦋
PyTorch @PyTorch
380K Followers 77 Following Tensors and neural networks in Python with strong hardware acceleration. PyTorch is an open source project at the Linux Foundation. #PyTorchFoundation
Lewis Tunstall @_lewtun
9K Followers 425 Following 🤗 LLM engineering & research @huggingface 📖 Co-author of "NLP with Transformers" book 💥 Ex-particle physicist 🤘 Occasional guitarist 🇦🇺 in 🇨🇭
Soumith Chintala @soumithchintala
187K Followers 883 Following Cofounded and lead @PyTorch at Meta. Also dabble in robotics at NYU. AI is delicious when it is accessible and open-source.
MMitchell @mmitchell_ai
80K Followers 1K Following Interdisciplinary researcher focused on shaping AI towards long-term positive goals. ML & Ethics. Same content in the Sky, Threads, & the Prehistoric Elephant
Niels Rogge @NielsRogge
10K Followers 690 Following ML Engineer @ML6team, part-time at @huggingface. @KU_Leuven grad. General interest in machine learning, deep learning. Making AI more accessible for everyone!
Jeremy Howard @jeremyphoward
222K Followers 5K Following 🇦🇺 Co-founder: @AnswerDotAI & @FastDotAI ; Hon Professor: @UQSchoolITEE ; Digital Fellow: @Stanford
abhishek @abhi1thakur
81K Followers 663 Following 🤗 I build AutoTrain @huggingface 👨🏽💻 World's First 4x Grand Master @kaggle 🎥 YouTube 100k+: https://t.co/BHnem8fTu5 ⭐ GitHub Star
Sasha Rush @srush_nlp
52K Followers 464 Following Professor, Programmer in NYC. Cornell Tech, Hugging Face 🤗 https://t.co/cZl0wTfqGz
Zach Mueller @TheZachMueller
10K Followers 393 Following 🤗 Technical Lead for the Accelerate Project | Passionate about Open Source | Nerd who enjoys touching the grass | #ADHD | He/Him
Kawactus @ho_andrew
114 Followers 245 Following Not a real scientist. Currently in SF, previously in Toronto and Vancouver.
Julien Le Dem @J_
4K Followers 2K Following Architect, Founder, Angel, Advisor, OSS: @OpenLineage @MarquezProject, ASF: Parquet Arrow Iceberg 🐖. 🦋 https://t.co/4VQUXaZ5vu . he/him
Michal Valko @misovalko
5K Followers 2K Following Llama @AIatMeta Paris & Inria & MVA - Ex: Gemini and BYOL @GoogleDeepMind
Konrad Szafer @KonradSzafer
73 Followers 214 Following LLM Eval intern research @ Hugging Face | research assistant intern @ CMU AutonLab
OlivierD @OlivierDehaene
113 Followers 9 Following
tomaarsen @tomaarsen
692 Followers 122 Following Sentence Transformers, SetFit & NLTK maintainer Machine Learning Engineer at 🤗 Hugging Face
Till Döhmen @tdoehmen
121 Followers 83 Following 🤖 AI/ML 🦆 @motherduck, part-time PhD stud. in Databases/ML https://t.co/g76bBKI9J8
Rémi 〰️ @remilouf
6K Followers 1K Following LLMs & structured generation @dottxtai. @OutlinesOSS 〰️ . Alumni @ENS_ULM & @UniOfOxford. I wander.
Julien Veron Vialard @veron_vialard
19 Followers 138 Following
Mikolaj Czerkawski @mikonvergence
378 Followers 559 Following Research Fellow at @ESA Φ-lab - Computer Vision, Remote Sensing, Signal Processing
Remi Cadene @RemiCadene
8K Followers 587 Following Robotics at Hugging Face Ex-Tesla Autopilot Optimus Postdoc Brown, PhD Sorbonne
Pierre Colombo @PierreColombo6
448 Followers 1K Following Associate Professor at Université Paris Saclay - CentraleSupelec - NLP - GenAI
Quentin Gallouédec @QGallouedec
325 Followers 417 Following Research engineer @huggingface 🤗 PhD in RL Member of Stable-Baselines team: https://t.co/eX7JDWqc9F
Mehdi Ouazza @mehd_io
1K Followers 544 Following Data Engineer based in Berlin Writer on Substack, do videos on Youtube
Moritz Laurer @MoritzLaurer
2K Followers 1K Following 🤗 Machine Learning Engineer @HuggingFace. PhD researcher @VUAmsterdam
Ilyas Moutawwakil @IlysMoutawwakil
555 Followers 189 Following All benchmarks are wrong, some will cost you less than the others. MLE @HuggingFace 🤗 MEng @CentraleSupelec 🧑🎓
Danielle Bitterman, M.. @dbittermanmd
836 Followers 623 Following Assistant Professor of Radiation Oncology | NLP/Informatics | She/Her | ☢️ | Harvard Medical School @BrighamWomens @DanaFarber
Linoy Tsaban🎗️ @linoy_tsaban
2K Followers 893 Following Exploring the world of AI Art as a ML engineer @HuggingFace 🤗 | ✡️ & 🇮🇱 #BringThemHome 🎗️
Daniel van Strien @vanstriendaniel
3K Followers 1K Following Machine Learning Librarian @huggingface 🤗 | Championing Open Science & ML | Sharing the latest ML datasets 🌟 | Tips for mastering the HF Hub
Clément Chadebec @CChadebec
555 Followers 111 Following Research Scientist @StabilityAI | Ph.D in Machine Learning (Generative Models) @Inria. I also maintain python packages democratizing Deep Generative Models.
Katie Link @katieelink
6K Followers 906 Following Machine learning for health. Previously @huggingface, @nyulangone, @Google @theteamatx. Views my own.
Andrea Soria Jimenez @andrejanysa
106 Followers 668 Following
Pablo Montalvo @m_olbap
485 Followers 316 Following ML Engineer @HuggingFace. Previously ML R&D @ Rakuten. Computer vision and NLP mixer, ex-physicist. Dice thrower, dreamer, learner. He/him. Usually friendly :)
Mistral AI @MistralAI
91K Followers 0 Following Fast, open-source and secure language models. Join us https://t.co/INALdNGvCP
Xenova @xenovacom
6K Followers 284 Following Bringing the power of machine learning to the web. Currently working on Transformers.js (@huggingface 🤗)
Wanrong Zhu @ZhuWanrong
665 Followers 215 Following PhD Student @UCSB on #NLProc|former research intern @allen_ai, @GoogleAI|@PKU1898 alumni
Sheon Han @sheonhan
6K Followers 457 Following • pronounced "Sean" • words: @WIRED @NewYorker @TheAtlantic @QuantaMagazine @verge @NYTmag @Longreads • newsletter: https://t.co/UjLjAFEP1Z
dylan @dylan_ebert_
6K Followers 173 Following Developer Advocate @HuggingFace, IndividualKex on TikTok/YT, PhD
nicolas lhoest @nicolas_lhoest
206 Followers 183 Following interventional cardiologist#CTO#ComplexPCI
Brigitte 🤗 @BrigitteTousi
2K Followers 2K Following Not an engineer @huggingface | Comms 🤗 | ex @Mila_Quebec | Aspiring 🍄 forager | She/Her/Elle
Bertrand Chevrier @kramp
832 Followers 3K Following Dead-end developer, front-end @huggingface Everything in the Browser, Free Software, Vim, Svelte. https://t.co/tY2n0xlc7D. 🎸 apprentice & metalhead
Vaibhav (VB) Srivasta.. @reach_vb
11K Followers 169 Following GPU poor @Huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my own
Sanchit Gandhi @sanchitgandhi99
4K Followers 37 Following Open-source speech @huggingface 🤗. Previously Masters' at @Cambridge_Uni.
DuckDB @duckdb
13K Followers 3 Following DuckDB is an in-process SQL OLAP database management system. "DuckDB" and the DuckDB logo are registered trademarks of the DuckDB Foundation.
Hannes Mühleisen @hfmuehleisen
5K Followers 937 Following I like databases. Co-creator of @duckdb, Co-Founder and CEO @duckdblabs. Professor of Data Engineering @Radboud_Uni
helen @mathemakitten
3K Followers 210 Following 💫🌷 my secret superpower is that i code while i cry better than anyone does
Alara Dirik @alaradirik
1K Followers 242 Following PhD candidate and @GoogleDeepMind scholar at @imperialcollege, previously at @huggingface and @unibogazici
Wauplin @Wauplin
679 Followers 52 Following Software engineer at Hugging Face. Maintainer of 🤗/huggingface_hub.
Abubakar Abid @abidlabs
12K Followers 1K Following Hind Rajab. 5 yrs old. She + 14,000 children killed by Israeli forces. PLEASE don't be silent. Take 5 min to call your reps and urge peace (link in bio)
Hugo Laurençon @HugoLaurencon
568 Followers 184 Following ML research engineer @huggingface Les yeux rivés sur la loss
Ross Wightman @wightmanr
18K Followers 1K Following Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.
Stanford AI Lab @StanfordAILab
137K Followers 318 Following The Stanford Artificial Intelligence Laboratory (SAIL), a leading #AI lab since 1963. ⛵️🤖 Emmy-winning video: https://t.co/lV9smZTC1m
dataset-tldr-preference-dpo is a DPO/ORPO dataset for training models to produce concise tl;dr summaries of machine learning datasets based on their dataset cards. Created using @huggingface Inference APIs and @argilla_io's Distilabel. huggingface.co/datasets/davan…
PSA: All PRO users now have access to ZeroGPU. Here's a visual of the dynamic allocation of GPUs on HF Spaces 😵
👱♀️ Daar is ze dan, there she is! Fietje has arrived, a small and powerful #LLM for #Dutch. 🇧🇪🇳🇱 huggingface.co/spaces/BramVan… Fietje, based on @MSFTResearch @Phi2, is 2.5x smaller than models like GEITje 7B Ultra, but manages to match their performance in benchmarks. 🚀 Thread 👇
We released StarCoder2 Instruct, which is self-aligned, transparent, and fully permissive! It even beats versions of StarCoder2 trained on GPT-4 distilled data on several benchmarks. huggingface.co/blog/sc2-instr…
THANK YOU @BigCodeProject for sharing your StarCoder2 Instruct model, your data generation pipeline code, and your data! This is TRUE open source AI, the way it should be! I love you all! 😍
All this and so much more in our paper! 🎉 Accepted at #CVPR2024 📜 Paper: arxiv.org/abs/2404.18873 🌐 Web: imagine.enpc.fr/~ioannis.sigli… 💽 Data: huggingface.co/datasets/osv5m… 🤖 Demo (can you beat our AI?): huggingface.co/spaces/osv5m/p…
Releasing StarCoder2 Instruct! 🚀 Achieves 72% HumanEval score using only self-generated content without any GPT-3.5/4 data. This work demonstrates that self-instruct works already well at the 15B scale without data from proprietary models! Read more: huggingface.co/blog/sc2-instr…
Contributing datasets will be a top way to support machine learning in 2024. @Dorialexander and @ana_stasenko are providing some of the best examples of this kind of work via @pleiasfr, as demonstrated by how regularly their datasets trend on the Hub.
I have just published one of my latest datasets on huggingface. "Kleiner Astronaut" contains 30k german child adventure stories which were synthetically generated. Variations were encouraged by injecting inspiration sentences. huggingface.co/datasets/Jotsc…
meanwhile I’m still on free @GoogleColab
First @nvidia DGX H200 in the world, hand-delivered to OpenAI and dedicated by Jensen "to advance AI, computing, and humanity":
@BAAIBeijing 🎭 Multi-turn Role Play Chat: huggingface.co/datasets/BAAI/… 👩🏫 Instruction Data: huggingface.co/datasets/BAAI/…
@BAAIBeijing 📚Chinese Corpora Internet: Large pre-train corpora on News, Legal, Novels, & Medical huggingface.co/datasets/BAAI/… huggingface.co/datasets/BAAI/…
So there this new thing called @tailwindcss i think its gonna be big. Heard people might like to use it to build @Gradio Custom Components.
@Gradio and @tailwindcss joined forces in version 4.28.0! So pumped we shipped support for tailwind in custom components. Lots of other good stuff in the release. See the release notes here: gradio.app/changelog#4-28…