I am absolutely begging AI researchers to learn the data processing inequality.
Discovering new knowledge from synthetic data generated from existing knowledge is the information equivalent of a perpetual motion machine.
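For readers unfamiliar with the inequality being invoked: for any Markov chain X → Y → Z (i.e., Z is computed from Y alone), the data processing inequality states I(X;Z) ≤ I(X;Y) — no amount of processing can increase the information Y carries about X. A minimal numerical sketch, with illustrative hand-picked distributions (all probabilities here are hypothetical, chosen only to form a valid chain):

```python
# Numerical check of the data processing inequality (DPI):
# for a Markov chain X -> Y -> Z, I(X;Z) <= I(X;Y).
# All distributions below are hypothetical, chosen by hand for illustration.
from math import log2

# p(x): prior over X, two symbols
p_x = {0: 0.5, 1: 0.5}
# p(y|x): a noisy channel from X to Y
p_y_given_x = {0: {0: 0.9, 1: 0.1}, 1: {0: 0.2, 1: 0.8}}
# p(z|y): further processing of Y, independent of X given Y
p_z_given_y = {0: {0: 0.7, 1: 0.3}, 1: {0: 0.4, 1: 0.6}}

def mutual_information(p_a, p_b_given_a):
    """I(A;B) in bits for a discrete source p(a) and channel p(b|a)."""
    # marginal p(b)
    p_b = {}
    for a, pa in p_a.items():
        for b, pba in p_b_given_a[a].items():
            p_b[b] = p_b.get(b, 0.0) + pa * pba
    mi = 0.0
    for a, pa in p_a.items():
        for b, pba in p_b_given_a[a].items():
            if pa * pba > 0:
                mi += pa * pba * log2(pba / p_b[b])
    return mi

# compose the channels: p(z|x) = sum_y p(y|x) * p(z|y)
p_z_given_x = {
    x: {z: sum(p_y_given_x[x][y] * p_z_given_y[y][z] for y in (0, 1))
        for z in (0, 1)}
    for x in p_x
}

i_xy = mutual_information(p_x, p_y_given_x)
i_xz = mutual_information(p_x, p_z_given_x)
assert i_xz <= i_xy  # DPI: processing cannot create information about X
```

Swapping in any other channel for p(z|y) preserves the inequality; the replies below dispute not the inequality itself but whether it constrains what models can usefully *learn*.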
@pfau Worked wonders for humans — we learned from the knowledge our ancestors built on top of the knowledge of theirs.
@pfau Agree for standard ML, but I think LLMs are weird. You could improve immediate-answer ability by training on synthetic examples of final answers arrived at via chain of thought. The same goes for other kinds of consistency checks. That wouldn't be naive tuning on stale data.
@pfau I think you just described Maths as a perpetual motion machine.
@pfau That isn't quite right. You lose information in a formal sense at each stage of processing, but that doesn't mean much with respect to what you can learn. Most of the information in high-dimensional data is irrelevant anyway.
@pfau Maybe if you’re talking about language models. For visual models you’re 100% wrong: the synthetic data generated by 3D models is genuinely new.