I propose that we adopt the term "Large Self-Supervised Models (LSSMs)" as a replacement for "Foundation Models" and "LLMs". "LLM" doesn't capture non-linguistic data, and "Foundation Model" is too grandiose. Thoughts? @percyliang
@tdietterich The beauty of language is that you can have multiple terms that highlight different aspects of the same object. You don't have to choose. I use "LLM" to talk about LLMs, "self-supervised" for their construction, and "foundation model" for their function. No replacement is needed.
@percyliang Yes, but as you know, "Foundation" is too close to "Foundational", and many of us find that troubling. That is why I'm proposing a more neutral term. For now, maybe we could just call them "Upstream models".
@tdietterich @percyliang Or, hear me out, pre-trained models!
@giffmana @tdietterich @percyliang The self-supervision matters.
@francoisfleuret @tdietterich @percyliang I don't think so at all.
@ggdupont @francoisfleuret @tdietterich @percyliang Funny you tell me that, because I have several papers doing exactly that...
@giffmana @ggdupont @tdietterich @percyliang You do not think the best strategy to train models for image understanding will be eventually mostly self-supervised?
@francoisfleuret @ggdupont @tdietterich @percyliang Only time will tell, but currently, this strategy performs comparatively poorly.
@giffmana @francoisfleuret @ggdupont @percyliang My impression was that self-supervised is competitive with supervised in computer vision. Is this wrong? In particular, doesn't self-supervised permit training on much more data?