Skip to content

Commit c1425ca

Browse files
committed
Update hparams.mdx
1 parent a0fd395 commit c1425ca

File tree

1 file changed

+4
-2
lines changed

1 file changed

+4
-2
lines changed

hfdocs/source/hparams.mdx

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
1-
Over the years, many `timm` models have been trained with various hyper-parameters as the libraries and models evolved. I don't have a record of every instance, but have recorded instances of many that can serve as a very good starting point.a
1+
# HParams
2+
Over the years, many `timm` models have been trained with various hyper-parameters as the libraries and models evolved. I don't have a record of every instance, but have recorded instances of many that can serve as a very good starting point.
23

4+
## Tags
35
Most `timm` trained models have an identifier in their pretrained tag that relates them (roughly) to a family / version of hparams I've used over the years.
46

57
| Tag(s) | Description | Optimizer | LR Schedule | Other Notes |
@@ -18,6 +20,7 @@ Most `timm` trained models have an identifier in their pretrained tag that relat
1820
| `ram` | AugMix (with RandAugment) recipe | SGD (Nesterov) with JSD loss | Cosine with warmup ||
1921
| `bt` | Bag-of-Tricks recipe | SGD (Nesterov) | Cosine with warmup ||
2022

23+
## Config File Gists
2124
I've collected several of the hparam families in a series of gists. These can be downloaded and used with the `--config hparam.yaml` argument with the `timm` train script. Some adjustment is always required for the LR vs effective global batch size.
2225

2326
| Tag | Key Model Architectures | Gist Link |
@@ -28,4 +31,3 @@ I've collected several of the hparam families in a series of gists. These can be
2831
| `sw` | ViT, ConvNeXt, CoAtNet, MaxViT | [Link](https://gist.github.com/rwightman/943c0fe59293b44024bbd2d5d23e6303) |
2932
| `sbb` | ViT | [Link](https://gist.github.com/rwightman/fb37c339efd2334177ff99a8083ebbc4) |
3033
|| Tiny Test Models | [Link](https://gist.github.com/rwightman/9ba8efc39a546426e99055720d2f705f) |
31-

0 commit comments

Comments
 (0)