Add CSATv2 Models #2624

gusdlf93 · 2025-12-06T15:18:06Z

Hello,
As mentioned in the related issue (#2622),
the CSATv2 model worked only in the HuggingFace environment and failed to load in a standard timm setup.

I updated the code, so that CSATv2 loads correctly through the timm registry.

Changed

Ensured that timm.create_model("csatv2") works without errors

Regarding model definition / pretrained_cfg / bits

I reviewed the maintainer’s comment and updated these parts to align with the timm API as best as I understood.

Validation

Model loads successfully in timm environment
train.py, validate.py works without any issues

Result (Imagenet 1K)

Model	Acc@1	Acc@5	FLOPs#G	MACs#G	Params#M
csatv2	80.02%	94.9	2.77	1.38	11.1 M

If further adjustments are needed, I’m happy to revise the PR.
Thank you!

HuggingFaceDocBuilderDev · 2025-12-06T15:42:04Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

gusdlf93 · 2025-12-07T07:38:52Z

Fix JIT compilation failure by replacing axis with dim

The PyTorch JIT compiler does not support the axis argument alias for torch.cat (and other operations), causing CI tests to fail with Keyword argument axis unknown.

I have replaced all instances of axis with dim to ensure the model is scriptable and compatible with the test suite.

gusdlf93 · 2025-12-08T09:26:41Z

[Fix] Enable JIT compilation support for CSATv2

Several RuntimeError and jit.script compatibility issues in the CSATv2 model have been fixed.
The model now successfully passes the torch.jit.script(model) test and produces the same output as before.

Note to Reviewers

Apologies for the multiple fixes required in this area. This is my first time working with TorchScript/JIT compatibility, so I missed some of the strict static analysis requirements in the initial implementation. I have verified the fix with a local test script.

Detailed Changes

TransformerBlock:

Fixed a logic error in init where nested if statements prevented the else block from executing.

Explicitly initialized unused attributes (self.proj, self.pool1, etc.) as nn.Identity() when downsample=False. This fixes the "Module has no attribute" error during static analysis.

LayerNorm:

Changed the elif block to else in forward. This guarantees that the function always returns a Tensor, resolving the Expected Tensor but found Optional[Tensor] error.

Block:

Replaced dynamic instantiation of nn.UpsamplingBilinear2d inside forward with F.interpolate. JIT does not support creating module instances within script functions.

PreNorm:

Removed **kwargs from forward to comply with JIT's strict argument typing.

Attention:

Replaced map and lambda with standard torch operations and explicit loops/reshaping, as JIT does not support Python lambdas.

rwightman · 2025-12-08T17:52:47Z

FWIW you can run the tests locally on just the models with

pytest -vv tests/test_models.py -k csatv2

gusdlf93 added 2 commits December 6, 2025 23:44

Upload CSATv2

2cdeef8

Update csatv2.py

4f03468

Update csatv2.py

09d3d25

gusdlf93 added 4 commits December 8, 2025 18:16

Add files via upload

1b4d9b9

Add files via upload

2112c31

Delete csatv2.py

9903813

Add files via upload

10733d9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add CSATv2 Models #2624

Add CSATv2 Models #2624

gusdlf93 commented Dec 6, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Dec 6, 2025

Uh oh!

gusdlf93 commented Dec 7, 2025

Uh oh!

gusdlf93 commented Dec 8, 2025

Uh oh!

rwightman commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Add CSATv2 Models #2624

Are you sure you want to change the base?

Add CSATv2 Models #2624

Conversation

gusdlf93 commented Dec 6, 2025

Changed

Regarding model definition / pretrained_cfg / bits

Validation

Result (Imagenet 1K)

Uh oh!

HuggingFaceDocBuilderDev commented Dec 6, 2025

Uh oh!

gusdlf93 commented Dec 7, 2025

Fix JIT compilation failure by replacing axis with dim

Uh oh!

gusdlf93 commented Dec 8, 2025

[Fix] Enable JIT compilation support for CSATv2

Note to Reviewers

Detailed Changes

TransformerBlock:

LayerNorm:

Block:

PreNorm:

Attention:

Uh oh!

rwightman commented Dec 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants