Qiang Gao @gaoqiang_nlp

a third year master student at Wuhan University, focusing on the natural language processing area. I'm actively looking for a PhD position for 2025Fall. cooper12121.github.io China Joined December 2022

Tweets

30
Followers

26
Following

175
Likes

869

Nathan Lambert @natolambert

8 months ago

Since everyone wants to learn RL for language models now post DeepSeek, reminder that I've been working on this book quietly in the background for months. Policy gradient chapter is coming together. Plugging away at the book every day now. rlhfbook dot com

23 168 1K 108K 2K

Download Image

Jason Kneen @jasonkneen

12 months ago

The new chatGPT 4o with Canvas system prompt:

73 339 3K 478K 5K

Download Image

internet hall of fame @InternetH0F

a year ago

This is the best use for of a drone I've ever seen 😭

3K 26K 194K 24.4M 31K

Download Video

harry law (hopfield network truther) @lawhsw

a year ago

don’t worry about it babe it’s just a brand

95 413 8K 471K 915

Download Image

MIT Media Lab @medialab

a year ago

Congratulations to the new faculty members in @MITEngineering, including Media Lab alum Anna Huang (@huangcza)! Huang will hold a joint appointment in @MITEECS and MIT Music and Theater Arts. news.mit.edu/2024/school-en…

0 4 44 9K 2

Qiang Gao @gaoqiang_nlp

a year ago

generated by #DreamMachine

0 0 0 364 0

Download Video

DAIR.AI @dair_ai

a year ago

The Top ML Papers of the Week (May 27 - June 2): - SimPO - GNN-RAG - Attention as an RNN - Abacus Embeddings - Symbolic Chain-of-Thought - Contextual Position Encoding ...

4 66 356 52K 259

Nathan Lambert @natolambert

a year ago

I've been thinking about the many, MANY, DPO spinoff methods we've been seeing recently for rlhf. IPO, D2PO, CPO, ORPO, SPO, sDPO, KTO, DNO... Most claim they're "the best" but doesn't properly compare to related work. What do we do in alignment research? Thread 📚

5 29 253 55K 241

Qiang Gao @gaoqiang_nlp

a year ago

🎉 Exciting News! 🎉 Just open-sourced my latest project: Llama3-based 8x8b-MoE model! 🚀 Extends llama3-8B-Instruct model with MoE architecture. Check it out & give it a star! github.com/cooper12121/ll…

0 0 1 111 0

Download Image

BURKOV @burkov

2 years ago

If you really want to do something useful in AI, instead of training another tiny llama, pick up this project hazyresearch.stanford.edu/blog/2024-01-1… and train a 1B-parameter multilingual BERT with 32k input size. The code is here github.com/HazyResearch/m2. The data is all over @huggingface. The…

8 78 485 77K 609

AI at Meta @AIatMeta

2 years ago

To close out 2023, here are 10 of the most interesting AI research advancements we shared on our feed this year — and where you can find more details on the work. 1️⃣ Segment Anything (SAM) A step toward the first foundation model for image segmentation. Details:…

32 337 1K 287K 724

Download Video

AK @_akhaliq

2 years ago

Alibaba releases DreaMoving demo on Hugging Face A Human Video Generation Framework based on Diffusion Models demo: huggingface.co/spaces/jiayong…

23 340 2K 183K 846

Download Video

Carlos E. Perez @IntuitMachine

2 years ago

Key vulnerabilities of GPT-4: 1. Fine-tuning API can remove or diminish safety guardrails, causing the model to produce harmful outputs or assist with dangerous requests 2. Fine-tuning can make the model generate targeted misinformation against public figures 3. Fine-tuning…

15 88 437 103K 522

Download Image

The Rundown AI @TheRundownAI

2 years ago

If you're not using AI, you're falling behind.

222 642 4K 3.3M 1K

ひさだん @hisadan

2 years ago

//#つぶやきProcessing int n=999999,p[]=new int[n],i,j; float t=1; void setup(){ size(800,800); for(i=2;i<n;i++)if(p[i]==0)for(j=i+i;j<n;j+=i)p[j]=i; } void draw(){ clear(); stroke(-1); for(i=2;i<n;i++)if(p[i]==0)circle(i*sin(i*t)/99+400,i*cos(i*t)/99+400,2); t+=1e-7; }

28 331 2K 160K 511

Download Video

rishi @RishiBommasani

2 years ago

Foundation models are transforming society: in the past month alone, we've seen a flurry of releases! GPT-4, Claude, PaLM API, Alpaca, Dolly, Jurassic-2, PaLM-E, GPT4All, Cerebras-GPT, OpenFlamingo, ... We built Ecosystem Graphs to track their footprint: crfm.stanford.edu/ecosystem-grap…

9 132 373 105K 164

Download Image

Vrdoljak J @Vrda82073569

3 years ago

@DrJimFan huggingface.co/spaces/JavaFXp… you should check this out

0 1 0 311 0

Abacus.AI @abacusai

3 years ago

4 essential books anyone should read: • Machine Learning with PyTorch and Scikit-Learn • Transformers for NLP • Deep Learning with Python • Designing Machine Learning Systems