Add linear_attn entries to Qwen3.5 base_model_tp_plan by ZAID646 · Pull Request #47009 · huggingface/transformers

ZAID646 · 2026-07-01T18:24:33Z

Qwen3.5 is a hybrid model: ~75% of decoder layers use linear_attention (Gated DeltaNet) with their own projection matrices (in_proj_qkv, in_proj_z, in_proj_b, in_proj_a, out_proj). These were missing from base_model_tp_plan, causing:

OOM at TP>1 (weights not sharded)
RuntimeError in model.generate() (Conv1d channel mismatch after in_proj_qkv)

Fix: Add all linear_attn.* projections with "colwise_gather_output" pattern, which shards the weight matrix (fixing OOM) and all-gathers activations before the depthwise Conv1d (fixing the shape mismatch).

Updated both modular_qwen3_5.py (source) and configuration_qwen3_5.py (auto-generated).

github-actions · 2026-07-01T18:25:43Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: qwen3_5

github-actions · 2026-07-01T18:33:15Z

CI recap

Dashboard: View test results in Grafana
Latest run: 28538882882:2
Result: success | Jobs: 2 | Tests: 10 | Failures: 0 | Duration: 45s

Rocketknight1 · 2026-07-02T11:36:30Z

cc @Cyrilvallez for TP!

Add linear_attn entries to Qwen3.5 base_model_tp_plan

1938f9a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add linear_attn entries to Qwen3.5 base_model_tp_plan#47009

Add linear_attn entries to Qwen3.5 base_model_tp_plan#47009
ZAID646 wants to merge 1 commit into
huggingface:mainfrom
ZAID646:fix/qwen3_5-tp-plan

ZAID646 commented Jul 1, 2026 •

edited by github-actions Bot

Loading

Uh oh!

github-actions Bot commented Jul 1, 2026

Uh oh!

github-actions Bot commented Jul 1, 2026

Uh oh!

Rocketknight1 commented Jul 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ZAID646 commented Jul 1, 2026 • edited by github-actions Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions Bot commented Jul 1, 2026

Uh oh!

github-actions Bot commented Jul 1, 2026

CI recap

Uh oh!

Rocketknight1 commented Jul 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ZAID646 commented Jul 1, 2026 •

edited by github-actions Bot

Loading