I've been told timm has a lot of hidden features. Yes, the docs need improving, that's a WIP! Curious about one of those features I've been using a lot lately in CLIP ViT fine-tuning? Every model in timm, when used with the optimizer factory, supports layer-wise LR decay.
Also known as discriminative LR decay, this applies a decaying LR to the model params as you move away from the head. It's very useful when fine-tuning from a large pretraining dataset (or semi-/unsupervised pretraining -> supervised) without blowing away properties learned during pretraining.
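A minimal sketch of the idea, assuming the usual geometric schedule (in timm this is exposed through the optimizer factory, e.g. a `layer_decay` argument to `create_optimizer_v2`; the helper below is a hypothetical illustration, not timm's implementation):

```python
def layer_wise_lrs(base_lr, num_layers, decay=0.75):
    """Per-layer learning rates for layer-wise (discriminative) LR decay.

    The last layer group (the head, closest to the loss) gets the full
    base_lr; each group further from the head is multiplied by `decay`
    once more, so the earliest layers move slowest and keep their
    pretrained features.
    """
    return [base_lr * decay ** (num_layers - 1 - i) for i in range(num_layers)]

# A 4-group model with base LR 1e-3 and decay 0.5 would get
# LRs of 1.25e-4, 2.5e-4, 5e-4, 1e-3 from earliest group to head.
print(layer_wise_lrs(1e-3, 4, decay=0.5))
```

In practice each entry would become the `lr` of one optimizer param group covering that block's parameters.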
@wightmanr Out of curiosity, which script are you using to fine-tune models like CLIP ViT? There's a JAX script to train CLIP in huggingface/examples/research_projects but it does not seem to rely on timm...
@wightmanr I doubt I'm skilled enough, but it is Hacktoberfest, if you have some baby doc issues that need fixing.
@wightmanr I would really love a collection of building blocks with @wightmanr quality. We were discussing this with @benjamin_warner today: high-quality building blocks (attention layers, ResBlocks, upsample) that are torchscriptable and as fast as possible. You have everything already.