try to support gemma4#1656

Open
wenhuach21 wants to merge 14 commits into main from support_gemma4

Conversation

@wenhuach21
Contributor

Description

Please briefly describe your main changes and the motivation.

Type of Change

  • Bug fix
  • New feature
  • Documentation update
  • Performance improvement
  • Code refactoring
  • Other (please specify):

Related Issues

Fixes or relates to #

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.

Copilot AI review requested due to automatic review settings April 3, 2026 07:50
Contributor

Copilot AI left a comment


Pull request overview

This PR aims to add initial support for the gemma4 model family by adapting the block input caching/quantization flow to handle variable block shapes and extra cached inputs.

Changes:

  • Allow wrapper blocks to forward positional args through to decoder layers.
  • Add a predefined fixed-attribute lookup for special model types (gemma4).
  • Extend caching/quantization to support variable-shaped block groupings and extra per-block cached inputs.
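
The positional-arg forwarding change can be sketched roughly as follows. This is a minimal illustration, not the actual `WrapperMultiblock` code from `auto_round/wrapper.py`; the class body and the tuple-unwrapping convention are assumptions based on typical HuggingFace decoder layers:

```python
import torch.nn as nn


class WrapperMultiblock(nn.Module):
    """Minimal sketch: wraps a list of decoder layers and forwards
    positional args (e.g. position embeddings) through each layer
    instead of passing only hidden_states and keyword args."""

    def __init__(self, module_list):
        super().__init__()
        self.layers = nn.ModuleList(module_list)

    def forward(self, hidden_states, *args, **kwargs):
        for layer in self.layers:
            out = layer(hidden_states, *args, **kwargs)
            # decoder layers often return a tuple; keep the hidden states
            hidden_states = out[0] if isinstance(out, tuple) else out
        return hidden_states
```

Forwarding `*args` unchanged lets models whose layers take extra positional inputs (as gemma4 apparently does) run through the wrapper without a per-model signature.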

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 9 comments.

File descriptions:

  • auto_round/wrapper.py: Forwards positional args through WrapperMultiblock.forward to improve model compatibility.
  • auto_round/special_model_handler.py: Introduces predefined fixed attributes (e.g., gemma4) retrievable from model.config.model_type.
  • auto_round/compressors/base.py: Uses fixed attributes to alter block caching/quantization for variable block shapes and additional cached inputs.
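
The fixed-attribute lookup described for auto_round/special_model_handler.py might look something like this. All names here (the table, the attribute keys, the helper) are hypothetical; only the idea of keying on model.config.model_type comes from the PR:

```python
# Hypothetical lookup table: model_type -> attributes that override the
# generic block caching/quantization behavior (keys are illustrative only).
PREDEFINED_FIXED_ATTRS = {
    "gemma4": {
        "variable_block_shapes": True,  # blocks may receive differently shaped inputs
        "extra_cached_inputs": True,    # cache additional per-block inputs
    },
}


def get_fixed_attrs(model):
    """Return the special-case attributes for a model, or an empty dict."""
    model_type = getattr(getattr(model, "config", None), "model_type", None)
    return PREDEFINED_FIXED_ATTRS.get(model_type, {})
```

The compressor can then consult this dict to decide whether to take the variable-shape caching path, leaving all other model types on the default path.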

wenhuach21 and others added 6 commits April 3, 2026 16:08
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
@wenhuach21
Contributor Author

wenhuach21 commented Apr 3, 2026

TODO @n1ck-guo I'll leave it to you:

1. Consolidate with your PR. While this PR is more general, it uses a large amount of VRAM during calibration.

2. Add an argument to the API to let users configure this, since it's not easy to determine whether a model has variable block inputs. One possible approach is to probe with sample data, but that would require loading all the blocks, which is costly.
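
One way the proposed opt-in argument could look, purely as a sketch of the tri-state behavior described above; the function name, parameter name, and the special-case set are all invented for illustration and are not the actual AutoRound API:

```python
def resolve_variable_block_inputs(model, variable_block_inputs=None):
    """Hypothetical resolution of a user-facing flag.

    variable_block_inputs:
        True  -> use the general (higher-VRAM) caching path
        False -> use the standard fixed-shape path
        None  -> fall back to a predefined per-model-type table,
                 since probing with sample data would require
                 loading all blocks, which is costly
    """
    if variable_block_inputs is None:
        model_type = getattr(getattr(model, "config", None), "model_type", None)
        variable_block_inputs = model_type in {"gemma4"}  # assumed special-case set
    return variable_block_inputs
```

An explicit user value always wins; the table is only a default, which keeps the costly probing approach out of the common path.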
