One weird trick for better diffusion models: concatenate some DINOv2 features to your latent channels!
Combining latents with PCA components extracted from DINOv2 features yields faster training and better samples. Also enables a new guidance strategy. Simple and effective!
One weird trick for better diffusion models: concatenate some DINOv2 features to your latent channels!
Combining latents with PCA components extracted from DINOv2 features yields faster training and better samples. Also enables a new guidance strategy. Simple and effective!
Why does Adam outperform SGD in LLMs training? Adaptive step sizes alone don't fully explain this, as Adam also surpasses adaptive SGD.
Is coordinate-wise adaptivity the secret? Not entirely—Adam actually struggles in the rotated parameter space! 🧵 (1/6)
arxiv.org/abs/2410.08198
🚨 Thrilled to share one of my main PhD projects!
We built an in silico evolution platform that couples a solenoid discriminator network with AlphaFold2 as an oracle, using a genetic algorithm for sequence update. 🧬✨ (1/9)
Excited to share our newest preprint! Glad to see how this evolved from a serendipitous finding from a rotation project. Led by the brilliant graduate student @Yuchen8314, see his thread for details
Excited to share our newest preprint! Glad to see how this evolved from a serendipitous finding from a rotation project. Led by the brilliant graduate student @Yuchen8314, see his thread for details
🔥 Benchmark Alert! MotifBench sets a new standard for evaluating protein design methods for motif scaffolding.
Why does this matter? Reproducibility & consistent evaluation have been lacking—until now.
Paper: arxiv.org/abs/2502.12479 | Repo: github.com/blt2114/MotifB…
A thread ⬇️
A DNA language model based on multispecies alignment predicts the effects of genome-wide variants @NatureBiotech
1. GPN-MSA, a DNA language model leveraging multispecies alignment (MSA), sets a new benchmark in predicting variant deleteriousness for both coding and noncoding…
Wrote about some of my favourite papers over the past year or so and some research directions that I am excited about in 2025
As a bonus, I think it's a good overview for someone to catch up on the current state of the art :)
Since everyone wants to learn RL for language models now post DeepSeek, reminder that I've been working on this book quietly in the background for months.
Policy gradient chapter is coming together. Plugging away at the book every day now.
rlhfbook dot com
753 Followers 439 FollowingDynamic, diverse department of bioinformaticians, systems biologists, geneticists, statisticians, cell & mol biologists, microscopy experts, AI engineers.
4K Followers 2K FollowingAssistant Prof. @ Stanford BASE, Genetics & Computer Science (courtesy). Lead the predictive genomics lab of ML & single cell/spatial genomics, focus on heart
422 Followers 548 FollowingProfessor in bioinformatics specializing in single-cell and spatial multi-omics, gene regulatory mechanisms, and Immuno-Oncology Informatics.
1K Followers 1K FollowingAssistant Professor@MD Anderson Bioinfo and Comp Bio (joint Systems Biology); NIH/NHGRI K99; Statistics; Epigenomics and 3D Genomics; Cancer Studies
19K Followers 7K FollowingBiotech exec advancing genomics and multiomics. Driving precision medicine through strategic marketing, real-world impact, and customer-focused execution.
551 Followers 665 FollowingPostdoc @ Srivastava Lab at the Gladstone Institutes. Buenrostro Lab Alumni. Interested in gene regulation, computational biology, aging, and human diseases.
1.4M Followers 1K FollowingBuilding @EurekaLabsAI. Previously Director of AI @ Tesla, founding team @ OpenAI, CS231n/PhD @ Stanford. I like to train large deep neural nets.
393 Followers 383 FollowingPhD student in machine learning for healthcare & biology at @MIT_CSAIL and @broadinstitute - on the industry job market!
https://t.co/kYUwRbflco
34K Followers 1K FollowingA Python developer at day A Java developer at night PyCon China organizer @pythonhunter__ co-founder @containerd CTL maintainer. Super fan of @yurucamp_anime
163 Followers 564 Following:)
#Bioinformatics Postdoc @vanvanka123 |
#rstats Epigenomics & ML |
@RWTH Aachen University |
🤓⌨️ |
eSports fan |
Decent home cook on a good day |
he/him
16K Followers 4K FollowingScientist, Assistant Professor @MITBiology, #FirstGen, ProteinBERTologist, 🇺🇦
No Human is illegal.
Moving to: https://t.co/sow6IRD3jj
756 Followers 804 FollowingResearch intern @nvidia; Ph.D. student at @Mila_Quebec. Interested in deep generative model, drug discovery and protein science.
8K Followers 560 FollowingScientist at DKFZ and EMBL in Heidelberg, loving stats, genomics and genetics. @[email protected]. For group news see @StatGenomics.