Tan Minh Dinh @tanmdinh
Ho Chi Minh, Vietnam Joined March 2019-
Tweets93
-
Followers143
-
Following752
-
Likes546
VideoJAM is our new framework for improved motion generation from @AIatMeta We show that video generators struggle with motion because the training objective favors appearance over dynamics. VideoJAM directly adresses this **without any extra data or scaling** 👇🧵
Finally, a method that successfully personalizes me without compromising the model's prior! On the right, is the result of Nested Attention, which clearly outperforms other methods by adhering to "pointillism" while still capturing my likeness—confirmed by my dear wife!
*Shortcut models* are a plug-and-play replacement for diffusion models that can generate in a single step (or more). This speeds up inference by up to 128x. Shortcut models are trained end-to-end, and do not require a separate distillation phase or learning schedules.
Do we still need codebook/quantization for scalable autoregressive visual generation? No! Thrilled to share our latest work on scaling w/ continuous tokens. We observe power-law scaling behavior on val loss, and obtain SOTA coco FID and GenEval score. arxiv.org/abs/2410.13863
🎥 Today we’re premiering Meta Movie Gen: the most advanced media foundation models to-date. Developed by AI research teams at Meta, Movie Gen delivers state-of-the-art results across a range of capabilities. We’re excited for the potential of this line of research to usher in…
Gemini 1.5 Model Family: Technical Report updates now published In the report we present the latest models of the Gemini family – Gemini 1.5 Pro and Gemini 1.5 Flash, two highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information…
Here’s an early preview of ElevenLabs Music. All of the songs in this thread were generated from a single text prompt with no edits. Title: It Started to Sing Style: “Pop pop-rock, country, top charts song.”
# explaining llm.c in layman terms Training Large Language Models (LLMs), like ChatGPT, involves a large amount of code and complexity. For example, a typical LLM training project might use the PyTorch deep learning library. PyTorch is quite complex because it implements a very…
⚡️SD3-Turbo: Fast High-Resolution Image Synthesis with Latent Adversarial Diffusion Distillation Following Stable Diffusion 3, my ex-colleagues have published a preprint on SD3 distillation using 4-step, while maintaining quality. The new method – Latent Adversarial Diffusion…
Snap presents MyVLM Personalizing VLMs for User-Specific Queries Recent large-scale vision-language models (VLMs) have demonstrated remarkable capabilities in understanding and generating textual descriptions for visual content. However, these models lack an understanding of
Stabillity AI presents Stable Diffusion 3 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Diffusion models create data from noise by inverting the forward paths of data towards noise and have emerged as a powerful generative modeling technique for…
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The model is trained to generate videos of realistic or imaginative scenes from text instructions and…
I am excited and honored to co-organize #SyntaGen - a #CVPR2024 #workshop on Harnessing Generative Models for Synthetic Visual Dataset. We are calling for cutting-edge paper submissions. Detail information can find below or in our website syntagen.github.io
Announcing FlashAttention-2! We released FlashAttention a year ago, making attn 2-4 faster and is now widely used in most LLM libraries. Recently I’ve been working on the next version: 2x faster than v1, 5-9x vs standard attn, reaching 225 TFLOPs/s training speed on A100. 1/
Oops haven't tweeted too much recently; I'm mostly watching with interest the open source LLM ecosystem experiencing early signs of a cambrian explosion. Roughly speaking the story as of now: 1. Pretraining LLM base models remains very expensive. Think: supercomputer + months.…
Scaling up GANs for Text-to-Image Synthesis present our 1B-parameter GigaGAN, achieving lower FID than Stable Diffusion v1.5, DALL·E 2, and Parti-750M. It generates 512px outputs at 0.13s, orders of magnitude faster than diffusion and autoregressive mingukkang.github.io/GigaGAN/…
Stable Diffusion generates beautiful images, but can it be used for open-world recognition? Try Demo! huggingface.co/spaces/xvjiaru… Our #CVPR2023 paper shows that the pre-trained diffusion model indeed is a good image parser, allows for open-vocabulary segmentation and detection.
MEGANE: Morphable Eyeglass and Avatar Network abs: arxiv.org/abs/2302.04868 project page: junxuan-li.github.io/megane/
3DAvatarGAN: Bridging Domains for Personalized Editable Avatars abs: arxiv.org/abs/2301.02700 project page: rameenabdal.github.io/3DAvatarGAN/
This looks great! Simple idea, nice clean code implementation... Takes a key idea from diffusion (iterative improvement over multiple steps) but with a new (far less mathy) approach. Well done to the authors :)
This looks great! Simple idea, nice clean code implementation... Takes a key idea from diffusion (iterative improvement over multiple steps) but with a new (far less mathy) approach. Well done to the authors :)

Sarsex @Sarsex9045
28 Followers 947 Following
Yiheng Xu @yihengxu_
1K Followers 711 Following ai agent research @hkuniversity | scaling agent @Alibaba_Qwen | ex @msftresearch @sfresearch | from automation to autonomy
Frercu @Frercu27529
65 Followers 3K Following
Hin Nguyễn @HinNguy74966995
1 Followers 173 Following
Rexan Wongs. @rexan_wnog
120 Followers 1K Following 17 y/o high schooler. Working on https://t.co/WGnOzF5lej (300K users) & https://t.co/yyVtA08n3K. Finalist at @ethglobal. @apple ssc scholar ‘23
Vy Do @donhuvy
8 Followers 489 Following
Vũ Anh Tuấn @TuanAhVu
19 Followers 241 Following
Trung Nguyen @trungn1234
27 Followers 422 Following CS PhD @ UT Austin | Researching activation spaces in large language models
jordan @jontonp
16 Followers 972 Following
Sachin Iyer @ripebananamango
7 Followers 481 Following
Aakash Dhal ଆକା... @iamakdu
47 Followers 992 Following Machines are to be taught. I will teach them.
Lê Tuấn Nguyễn @LTunNguyn270988
0 Followers 29 Following
Sweaty Starup @_NickHuber
135 Followers 5K Following I buy real estate and start companies. Owner of Somewhere, Bolt Storage, RE Cost Seg, Titan Risk, BoldSEO, AdRhino, WebRun, RecruitJet, Spidexx.
Duck Anh Tran @duck_tran
40 Followers 720 Following Sleepy student, clumsy wanderer. Fulbright University Vietnam
Viet Nguyen @vng_sofw
13 Followers 212 Following ECE Ph.D. @JohnsHopkins | ex-Research Intern @Qualcomm
truongpdd @DanTruongPhanD1
8 Followers 236 Following
Quang Nguyễn @QuangNg19434493
7 Followers 169 Following
Hargan @khaigaming68
2 Followers 99 Following
PopularAi @PopularAiltd
237 Followers 588 Following PopularAi: Specialized AI platform offering content generation, speech-to-text, voiceover, image creation, and advanced analysis tools.
Generative AI @generativeaihub
21K Followers 19K Following Inspired by Algorithms, Powered by Imagination: Unleashing the Potential of Generative AI. #GenerativeAI #deeplearning #AI #MachineLearning
Hai Pham @HaiPham64109565
19 Followers 374 Following Incoming MVA Master @ ENS. Intern @ Qualcomm AI Research. Working on 3D computer vision and generative models.
Paylz @Paylzq
331 Followers 7K Following The best online market for digital downloads with cheap prices.
Lindamary @Lindamary427980
3 Followers 207 Following
khanh @khanhln13
0 Followers 22 Following
Lewis Walker ➲ @lewiswalkerai
8K Followers 3K Following Level up with Generative AI Enterprise. Deloitte AI, Goldman Sachs. 67k LinkedIn
Nguyễn Trọng Anh @NguynTrngAnh11
10 Followers 236 Following
Hieu Nguyen @JunHill9961
13 Followers 167 Following
Mouad JABRANE @MouaadJabrane
93 Followers 8K Following Geo-EDGE AI PhD Candidate @Iav_Hassan_I | Geospatial Data Scientist
William Sun @williamsun2020
56 Followers 1K Following
Bang-Dang Pham @pbdang2000
45 Followers 196 Following 🎓 A CS PhD Student @UWMadison | CV, Generative Models, Image Restoration
Arif Ahmad @arif_ahmad_py
536 Followers 7K Following We are in the world model era now. Prev. @GoogleDeepMind and @Nvidia
Trung Dao @trungdt880
46 Followers 187 Following
Dang Nguyen @dangnth97
336 Followers 1K Following PhD Candidate @CS_UCLA | IMO 2015 Silver | Prev: @GoogleResearch and @Cisco
Khoi Nguyen @ngducminhkhoi
223 Followers 102 Following Khoi Nguyen is a PhD in Computer Science graduating from Oregon State University. His research interests include computer vision and machine learning.
Minh-Quan Le @lmquancs
131 Followers 769 Following Ph.D. Student in CS @ Stony Brook University, working on likelihood-based generative models. Research Intern @ Microsoft.
Miguel Carranza @elwatto
11K Followers 823 Following https://t.co/UVId0JehXP™ Hombre Silla/co-founder/CTO. 💻 nerd – 🏄♂️ kook – 🎸 punk rock poser – 👨👩👧👧 twin dad. From Sevilla, 🇪🇸 in Encinitas, CA 🇺🇸
Dustin Tran @dustinvtran
54K Followers 695 Following I work on reasoning & posttraining at xAI. ex-google
Christian Cantrell @cantrell
14K Followers 614 Following Founder and CPO at @reve. Former VP of Product at Stability AI. Ex-Adobe. Creator of the Stable Diffusion Photoshop plugin. Writer repped by Gersh.
George Stock @georgesttock
31K Followers 785 Following Founder @makeugcai Create UGC with AI - https://t.co/DqyovofZdA
Google Gemini App @GeminiApp
226K Followers 38 Following The Gemini app turns research into reality, bringing frontier AI experiences like Veo 3, Deep Think, Nano Banana and more to hundreds of millions of people.
Nano Banana @NanoBanana
44K Followers 1 Following Nano Banana 🍌 the world's most powerful image editing and generation model! Try it for free in the @GeminiApp
Adam Pietrasiak @pie6k
34K Followers 430 Following I design through code. Building https://t.co/6ceZFejl4s (@screenstudio). Support → please reach out at [email protected] instead of DMs.
Screen Studio @screenstudio
20K Followers 3 Following Create beautiful screen recordings in minutes. #buildinpublic journey at @pie6k.
Jerry Li @JerryLiJiaming
705 Followers 277 Following
Hacker Residency Grou... @HackerResidency
479 Followers 9 Following HRG is an experimental new residency for world-class indie hackers... our first batch is taking place in vietnam this november, 2025 🔥
SaaS Gallery @saas_gallery
1K Followers 2 Following Unlock 50+ SaaS products’ insights from the SaaS Gallery that are now earning $100 to $1B per month, and get ideas, and build products to make money...
Jacky Chou (buying on... @indexsy
50K Followers 316 Following We buy and operate incredible online businesses in public.
Zhiting Hu @ZhitingHu
5K Followers 437 Following Assist. Prof. at UC San Diego; Artificial Intelligence, Machine Learning, Natural Language Processing
Fan Zhou @FaZhou_998
1K Followers 837 Following Qwen Coding @Alibaba_Qwen. Prev: Core member @XLangNLP, Intern @MSFTResearch.
Yiheng Xu @yihengxu_
1K Followers 711 Following ai agent research @hkuniversity | scaling agent @Alibaba_Qwen | ex @msftresearch @sfresearch | from automation to autonomy
Binyuan Hui @huybery
35K Followers 662 Following 🥝 Building Qwen @Alibaba_Qwen. Focus on CodeLLM (Pre-training and Post-training) / Reasoning / Agent. Ideas my own.
Denny Zhou @denny_zhou
22K Followers 540 Following Founded the Reasoning Team in Google Brain (now in the Gemini Core team of Google DeepMind). Build LLMs to reason. Opinions my own.
Claude @claudeai
139K Followers 1 Following Claude is an AI assistant built by @anthropicai to be safe, accurate, and secure. Talk to Claude on https://t.co/ZhTwG8dz3D or download the app.
👋 Jan @jandotai
11K Followers 979 Following Jan is the open-source ChatGPT replacement. We're building Open Superintelligence together. Community: https://t.co/NIyIbR60qQ
Umar Jamil @hkproj
15K Followers 1K Following AI @MistralAI - Join the best AI community on Discord: https://t.co/zYH1DlgdbW - Opinions my own
Kfir Aberman @AbermanKfir
2K Followers 287 Following Founding Member @DecartAI | Research Scientist | ex-@Snap | ex-@Google | Personalized Generative AI | DreamBooth
Daniel Garibi @DanielGaribi
172 Followers 63 Following
dadabots @dadabots
10K Followers 9K Following ∿Music hackers. Making music with code. 24/7 infinite livestreams. Prompt jockeys. StableAudio open models @harmonai_org @NoiseDAO @artblocks_io @braindrops_art
WaveSpeedAI @wavespeed_ai
3K Followers 31 Following Building the Fastest Inference System for Media Generative AI https://t.co/71kgEQVYEV
Calvin French-Owen @calvinfo
15K Followers 498 Following Making things, trail running. Prev: Codex @OpenAI, https://t.co/4qWGncHOAX, co-founder @Segment, @MIT
Vu Nguyen @VuNguyenDev
244 Followers 109 Following Building https://t.co/XOunaMEVZg: A time tracking & productivity app for Mac
Tongyi Lab @Ali_TongyiLab
9K Followers 20 Following We advance the development of AGI and foster open source collaboration towards a smarter future.
Decart @DecartAI
17K Followers 2 Following A new era of real-time generative experiences, enabled by cutting-edge AI efficiency
Yoav HaCohen @yoavhacohen
2K Followers 784 Following Lead of LTX-Video @ Lightricks. PhD Computer Science. Researcher. Entrepreneur.
Peter Tong @himtkw
3K Followers 330 Following https://t.co/zKumMbGKvd great products with @desmondhth growing https://t.co/UkHcFaIunn
Desmond @desmondhth
25K Followers 566 Following CEO https://t.co/dKKy0nRkCR (1M+ downloads). Bootstrapped with @himtkw. Generally good. Be a giver.
Trapit Bansal @TrapitBansal
32K Followers 250 Following AI Research @Meta | Co-Creator of OpenAI o1 | Previously @OpenAI, @MSFTResearch, @GoogleAI, @facebook, @iiscbangalore, and undergrad @IITKanpur
Jiahui Yu @jhyuxm
18K Followers 933 Following Perception @OpenAI; previously co-led Gemini Multimodal @GoogleDeepMind. opinions are my own.
enigmatic_e @8bit_e
10K Followers 589 Following Content Creator | Specializing in VFX, Animation, and AI-Driven Video Solutions. Business Contact: [email protected]
Yusu Qian @sueqian111
1K Followers 450 Following multimodal research at Apple, previously at NYU @nyuniversity and NJU @njuniversity
Matt Corey @Matt1Corey
995 Followers 1K Following Dad, Husband, Developer of @HomeMadeAuto, @BillsToBudget and other things. You can find me at @[email protected] most days.
Rob Hallam @robj3d3
27K Followers 963 Following Shipping like a machine 🤖 while I travel the world 🌍️ 🔥 https://t.co/myyLdxVzcg - Grow faster on X ($8K/m) 🚀 https://t.co/5fSvUeZJ8z ($38K) 🤝 https://t.co/l4mcaViGk7 🕵 https://t.co/Cwvsl9cPsS
Romain Torres @rom1trs
26K Followers 591 Following Building https://t.co/NCdxkHvPnn Make top performing ads with AI
Inbar Mosseri @inbar_mosseri
366 Followers 118 Following Veo Capabilities co-lead | Research lead at Google DeepMind