1/3 All these methods look the same to you? That's the point of this paper! Simply adding the losses works just as well as any fancy multi-task method, if one tunes the baseline properly. This matches my experience, and fits my philosophy: tune the simplest possible method -> win.
2/3 I've tried fancy multi-task methods almost every year, but they never outperformed my well-tuned "just add the losses". I never thought much of it, but this paper actually explores both theoretically and empirically why that is!
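For context, a minimal sketch of what "just add the losses" means in practice, assuming a PyTorch-style setup with two hypothetical task heads (head_a, head_b) and batch keys (labels_a, targets_b) that are not from the thread:

```python
import torch
import torch.nn.functional as F

def multitask_loss(shared_features, batch, head_a, head_b):
    # Each task has its own head and its own loss term.
    loss_a = F.cross_entropy(head_a(shared_features), batch["labels_a"])
    loss_b = F.mse_loss(head_b(shared_features), batch["targets_b"])
    # The well-tuned baseline: the joint objective is simply the sum,
    # with tuning effort spent on learning rate / loss weights rather
    # than on extra multi-task machinery.
    return loss_a + loss_b
```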
@giffmana I think it's true in other fields of deep learning as well, such as optimizers, augmentations, and more. It's very hard to improve on strong simple baselines, and newer tricks often fail to show improvements in a fair, well-tuned comparison.
@giffmana After the function matching paper and having tried it out myself, I am fully convinced of your philosophy.