While instruction tuning is clearly necessary for producing usable interfaces like ChatGPT, the "magic" of language models comes from self-supervised learning on broad data, which enables emergent behavior like in-context learning and chain-of-thought.
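(For concreteness, a minimal sketch of what "in-context learning" and "chain-of-thought" prompts look like. The `complete()` stub below is hypothetical and stands in for any base-model completion endpoint; only the prompt formats come from the thread's subject matter.)

```python
# Minimal sketch of in-context learning and chain-of-thought prompting.
# `complete` is a hypothetical stand-in for a base-model completion call;
# here it just returns a placeholder so the script runs on its own.

def complete(prompt: str) -> str:
    # Placeholder: a real implementation would send `prompt` to a language model.
    return "<model completion would appear here>"

# In-context learning: a few input/output demonstrations, then a new input.
# No gradient updates happen; the pattern is inferred from the prompt alone.
icl_prompt = (
    "English: cheese -> French: fromage\n"
    "English: apple -> French: pomme\n"
    "English: house -> French:"
)

# Chain-of-thought: prompting the model to reason step by step before answering.
cot_prompt = (
    "Q: Roger has 5 tennis balls. He buys 2 cans of 3 balls each. "
    "How many balls does he have now?\n"
    "A: Let's think step by step."
)

print(complete(icl_prompt))
print(complete(cot_prompt))
```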
@percyliang Bingo. You still need a great base model to start with.
@percyliang 💯 this behavior comes built in with gpt-3, pretty amazing imo
@percyliang Tishby (2017) unravels deep learning. Recognizing the essentials is an information bottleneck, and the most important part of learning is forgetting: compressing away details that carry no recurring meaning.
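(Context for the reply above: Tishby's information bottleneck casts learning as compressing the input $X$ into a representation $T$ that keeps only what predicts the target $Y$, with $\beta$ trading compression against prediction. A standard statement of the objective:)

$\min_{p(t|x)} \; I(X;T) - \beta\, I(T;Y)$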
@percyliang Exactly. I think training Codex was a breakthrough moment. It forced the model to build internal representations that help it handle causality, logic, and chained reasoning. This is the way.
@percyliang Train models on math, logic, code, first-principles philosophy, and physics, and they will better "interpret" new training data from fiction, opinion, dialog, news, etc.
@percyliang Agree. Though emergent behaviors such as in-context learning and CoT reasoning are not fully understood yet, DNN-based language models are basically instilled with broad data (human knowledge/experience); the "magic" is their ability to retrieve it efficiently.