🔵New paper!🔵 Our latest work on Pyramid Vector Quantization for LLMs achieves state-of-the-art post-training quantization with a Pareto-optimal trade-off between performance, bits per weight, and bits per activation. A thread. 👇 1/15
Video models like Sora and Gen 3 can generate realistic videos, but can they produce useful synthetic data for planning/RL?
Our work (AVID) explores how pretrained image-to-video models can be adapted to accurate action-conditioned world models.
1/n
Congrats to the Causica team @MSFTResearchCam . We got two papers accepted in #ICML 24, contributing to our efforts on integrating #Causality with modern #FoundationModels. 1/3
TL;DR: For the potential outcome framework, self-attention = causal inference via optimal balancing…
Updated GPU recommendations for the new Ampere RTX 30 series are live! Performance benchmarks, architecture details, Q&A of frequently asked questions, and detailed explanations of how GPUs and Tensor Cores work for those that want to learn more: timdettmers.com/2020/09/07/whi…
I want to get an accurate picture of GPU resources that PhD students have access to. PhD students, please respond and share with other students.
"What is the largest GPU system that you have access to?"
Please pick option +16 GPU only if your cluster has a +50 Gb/s interconnect
We've got new master thesis positions at @peltarion_ai within deep learning. Our previous students have often focused on nlp and audio, but we're working in many areas, hit us up!
Apply here: peltarion.teamtailor.com/jobs/873400-ma…
Check out our side project hn-timemachine.com.
It displays current HackerNews stories along with similar past ones.
@nilpath and me built this to qualitatively explore the potential and limitations of semantic similarity search.
Let me know what you think. #NLProc #NLP
How can you successfully train transformers on small datasets like PTB and WikiText-2? Are LSTMs better on small datasets? I ran 339 experiments worth 568 GPU hours and came up with some answers. I do not have time to write a blog post, so here a twitter thread instead. 1/n
Let's join the fight! I created a dead simple Docker image for running folding@home on NVIDIA GPUs with OpenCL, for those of us with GPU access: github.com/agrinh/folding…
Let's join the fight! I created a dead simple Docker image for running folding@home on NVIDIA GPUs with OpenCL, for those of us with GPU access: github.com/agrinh/folding…
AAAI'20 is seeking (self-)nominations of qualified individuals to serve as program committee members. With this call for help, we hope to expand and diversify the PC. See tinyurl.com/y32xd8cj for details and tinyurl.com/y24wppyx to nominate. (Please retweet!) @RealAAAI
A great guide for applying to PhD positions in deep learning by @Tim_Dettmers. Goes through everything from what makes a strong application and how to ask for recommendations to selecting positions. Highly recommended if you're applying.
timdettmers.com/2018/11/26/phd…
178 Followers 787 FollowingI help Product Managers build careers they love | Career Coach for Product Managers | 15+ years - $1B RevGen | Bold Decisive Action Invents the Future
633 Followers 442 FollowingPhD Student at @spcl_eth, focused on High-Performance Computing and Large Scale Deep Learning | Prev. intern at @Apple, @Microsoft, and @MSFTResearch
61 Followers 191 FollowingPhysicist & Research Software Engineer | Passionate about accelerating science with high-performance computing and sustainable code | Research at @UU_University
3K Followers 5K FollowingComputer Vision and Deep Learning. Computer vision and Machine learning engineer at #Rivian & Content writer at #datahacker.rs
3K Followers 1K FollowingBio & DMs: https://t.co/zvemKOHsiD | she/her | reevaluating my relationship with Twitter, now at https://t.co/cvoBYdKKeA | Posts/typos are personal views
8K Followers 2K FollowingAI research-eneur. Hiring eng: https://t.co/fv5QBjsv90. Was Research Scientist @ Google Brain / DeepMind, language model research. 🇨🇦🇺🇸
1.4M Followers 85K FollowingOfficial British Airways X account. For help, please get in touch with us on Facebook, Instagram or visit: https://t.co/GpmEQmqJkr
108K Followers 4 FollowingCohere builds secure, scalable, and private enterprise-grade AI solutions for real-world business problems. Join us: https://t.co/Yb2xItMObl
637K Followers 35 FollowingWe're an AI safety and research company that builds reliable, interpretable, and steerable AI systems. Talk to our AI assistant @claudeai on https://t.co/FhDI3KQh0n.
39K Followers 1 FollowingFastAPI framework, high performance, easy to learn, fast to code, ready for production. 🚀
Web APIs with Python type hints. 🐍
By @tiangolo 🤓
2K Followers 25 FollowingImprove your ML products with analytics, alerting, human feedback and more.
Newsletter for ML practitioners: https://t.co/nCCS7VMgcf
188 Followers 135 FollowingEn bot som tweetar den senaste live-datan och ljudbokstrenderna på Storytel i Sverige. Följ för att ta del av aktuella trender i Storytel-appen.
3K Followers 657 FollowingStorytel är Nordens största streamingtjänst för ljudböcker & e-böcker. Följ oss för nyheter och inspiration🎧Support:https://t.co/Fw0OAsU844
2K Followers 2K FollowingResearcher building Large Language Models from Sweden.
Also sharing artifacts from the weights.
AI Nordics discord: https://t.co/EEZxFT1QFo
79K Followers 424 FollowingDirector @PIK_climate. Also Professor @unipotsdam, @sthlmresilience. Internationally recognised Earth scientist on global sustainability, #PlanetaryBoundaries.
606K Followers 608 FollowingInternet Rocket Scientist, Gamer, Astronomer, Dad, Scotsman, Pilot. Makes videos about space and science https://t.co/mLfUsogKq5
264K Followers 670 FollowingBuilding with AI agents @dair_ai • Prev: Meta AI, Galactica LLM, Elastic, PaperswithCode, PhD • I share insights on how to build with AI Agents ↓
7K Followers 935 FollowingCofounder @ClipdropApp, acquired by @stabilityai
AI x Images @googlearts
Created GoogleCardboard @Google
SVP for @heyjasperai
No recent Favorites. New Favorites will appear here.