"linear layers" are affine and all the deep learning literature should be corrected.
@francoisfleuret Just pretend b is part of W and X has a 1
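A minimal NumPy sketch of that trick (shapes here are arbitrary, chosen only for illustration): absorb b as an extra column of W and append a constant 1 to x, and the affine map becomes purely linear in the augmented coordinates.

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((3, 4))   # weights
b = rng.standard_normal(3)        # bias
x = rng.standard_normal(4)        # input

y_affine = W @ x + b              # the usual affine "linear layer"

# Bias-absorption trick: [W | b] @ [x; 1] is linear in the augmented space.
W_aug = np.hstack([W, b[:, None]])  # shape (3, 5)
x_aug = np.append(x, 1.0)           # shape (5,)
y_linear = W_aug @ x_aug

assert np.allclose(y_affine, y_linear)
```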
@francoisfleuret Yes but it doesn’t roll off the tongue as nicely. I also just call it a weighted sum, people usually know what is meant :P
@francoisfleuret Linear versus Affine is a distinction @JustinDomke semi-regularly brought up in his ML class and I really appreciated it
@francoisfleuret The same issue comes up when teaching or discussing calculus. I really want to say “the key idea of calculus is to take a nonlinear function (difficult) and approximate it locally by a linear function (easy)”. But technically, we’re approximating by an affine function.
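Spelled out (a standard statement, not from the thread): the tangent approximation at a point a is affine in x, with a linear part plus a constant offset that vanishes only when the tangent line passes through the origin.

```latex
f(x) \;\approx\; \underbrace{f(a) + f'(a)\,(x - a)}_{\text{affine in } x}
\;=\; \underbrace{f'(a)}_{\text{linear part}} x
\;+\; \underbrace{f(a) - f'(a)\,a}_{\text{constant offset}}
```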
@francoisfleuret Just wait until you hear what the softargmax is commonly called… 🥴🥴🥴
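For anyone outside the joke: "softmax" is arguably a misnomer, since the function is a smooth approximation of (one-hot) argmax, not of max. A quick sketch with a temperature parameter (variable names are ours):

```python
import numpy as np

def softmax(z, temperature=1.0):
    """Numerically stable softmax with a temperature knob."""
    z = np.asarray(z, dtype=float) / temperature
    e = np.exp(z - z.max())
    return e / e.sum()

z = np.array([1.0, 3.0, 2.0])

# As temperature -> 0, the output approaches the one-hot encoding of
# argmax(z) -- hence "softargmax" would be the more honest name.
for t in (1.0, 0.1, 0.01):
    print(t, softmax(z, t).round(3))
# Approximately: [0.09, 0.665, 0.245], then [0, 1, 0], then [0, 1, 0].
```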
@francoisfleuret In most deep learning literature a linear layer comprises an affine transformation followed by a nonlinear activation function. So a linear layer is more than just an affine transformation
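Worth noting that common frameworks side with the stricter reading: in PyTorch, for instance, nn.Linear applies only the affine map, and any nonlinearity has to be composed separately. A quick check (assuming PyTorch is installed):

```python
import torch
import torch.nn as nn

layer = nn.Linear(4, 3)   # computes y = x @ W.T + b, nothing more
x = torch.randn(2, 4)

# No activation is applied by nn.Linear itself; the literature's
# "affine + activation" block must be built explicitly:
block = nn.Sequential(nn.Linear(4, 3), nn.ReLU())

y = layer(x)
assert torch.allclose(y, x @ layer.weight.T + layer.bias)
```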