Currently, precision decisions in ML are made at the layer or entire-model level, but the underlying tensor cores operate on relatively small chunks, so it should be possible to optimize "mixed precision" inside a single weight matrix by permuting the weights to put ok-for-low-precision weights together in the same tensor block. Maybe even getting parts down to 4 bits. More practically, permuting weights could probably allow the Ampere sparse matrix optimization to work better by distributing the zeros more evenly.
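As a toy illustration of the sparsity idea, here is a hedged sketch (NumPy, all names hypothetical) of one possible permutation strategy for a single weight row: sort the weights by magnitude so zeros fall to the end, then deal them round-robin across the 4-wide groups that Ampere's 2:4 structured sparsity operates on, so clustered zeros get spread out.

```python
import numpy as np

def spread_zeros(w, group=4):
    """Toy sketch: return a permutation of w that deals weights
    round-robin (sorted by magnitude, zeros last) into groups of
    `group`, spreading zeros evenly across groups.
    A real kernel would apply the same permutation to the matching
    activation dimension so the matmul result is unchanged."""
    n = len(w)
    assert n % group == 0
    n_groups = n // group
    order = np.argsort(-np.abs(w), kind="stable")  # zeros sort last
    perm = np.empty(n, dtype=int)
    for j, idx in enumerate(order):
        g = j % n_groups       # which group this weight is dealt into
        slot = j // n_groups   # position within that group
        perm[g * group + slot] = idx
    return perm

def max_nonzeros_per_group(w, group=4):
    """Worst-case nonzeros in any contiguous group; 2:4 sparsity
    needs this to be <= 2."""
    return max(np.count_nonzero(w[i:i + group])
               for i in range(0, len(w), group))

# Zeros clustered in the second group: the first group is fully dense,
# so 2:4 sparsity cannot represent it without dropping weights.
w = np.array([1., 2., 3., 4., 0., 0., 0., 0.])
perm = spread_zeros(w)
print(max_nonzeros_per_group(w))        # 4: one group is fully dense
print(max_nonzeros_per_group(w[perm]))  # 2: now satisfies 2:4
```

This is only a per-row greedy sketch; for a full matrix, one permutation must serve every row at once, which is where the real optimization problem lives.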