Add chunked prefill with prefix KV cache reuse to Tunix Gemma 4. by copybara-service[bot] · Pull Request #1597 · google/tunix

copybara-service · 2026-06-16T18:58:59Z

Add chunked prefill with prefix KV cache reuse to Tunix Gemma 4.

Refactor attention into focused methods for KV projection,
cache update with prefix concatenation, and mask construction.
Support sliding window (ring buffer) and linear cache layouts.
Handle PAD masking, KV sharing, and flash attention fallback.
Add comprehensive tests for chunked prefill, decode, KV sharing,
and PAD masking.
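
For readers unfamiliar with the technique, here is a minimal sketch of how an attention mask for one prefill chunk could be built when a cached prefix is concatenated in front of the chunk's keys. It is not the code from this change: the function name `chunk_attention_mask`, its parameters, and the choice of `pad_id=0` are hypothetical, and plain Python lists stand in for JAX arrays. It also assumes the cached prefix contains only real (non-PAD) tokens.

```python
def chunk_attention_mask(prefix_len, chunk_tokens, pad_id=0, window=None):
    """Build a boolean mask of shape [chunk_len][prefix_len + chunk_len].

    True means the query (row) may attend to the key (column).
    Keys are laid out as: cached prefix first, then the current chunk.
    Hypothetical helper; assumes the cached prefix holds no PAD tokens.
    """
    chunk_len = len(chunk_tokens)
    total = prefix_len + chunk_len
    mask = []
    for i in range(chunk_len):
        q_pos = prefix_len + i  # absolute position of query i
        row = []
        for k_pos in range(total):
            # Causal: a query never sees keys at later positions.
            causal = k_pos <= q_pos
            # Optional sliding window: only the last `window` positions.
            in_window = window is None or q_pos - k_pos < window
            # PAD keys inside the current chunk are never attended to.
            is_pad = (
                k_pos >= prefix_len
                and chunk_tokens[k_pos - prefix_len] == pad_id
            )
            row.append(causal and in_window and not is_pad)
        mask.append(row)
    return mask


# Usage: a 2-token cached prefix, then a 3-token chunk whose last token is PAD.
full_mask = chunk_attention_mask(prefix_len=2, chunk_tokens=[5, 7, 0])
local_mask = chunk_attention_mask(prefix_len=2, chunk_tokens=[5, 7, 0], window=2)
```

Offsetting query positions by `prefix_len` is what lets each chunk attend to the reused prefix without recomputing it; the same mask shape covers single-token decode when the chunk has length 1.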

google-cla · 2026-06-16T18:59:16Z

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

- Refactor attention into focused methods for KV projection, cache update with prefix concatenation, and mask construction.
- Support sliding window (ring buffer) and linear cache layouts.
- Handle PAD masking, KV sharing, and flash attention fallback.
- Comprehensive tests for chunked prefill, decode, KV sharing, and PAD masking.

PiperOrigin-RevId: 933189977
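
The two cache layouts named in the commit message differ mainly in where a new token's key/value entry is written. The sketch below illustrates that difference under stated assumptions: `cache_slots` and `write_chunk` are hypothetical names, a Python list stands in for the KV buffer, and the overflow behavior of the linear layout is a guess rather than what the change implements.

```python
def cache_slots(start_pos, num_tokens, cache_size, ring=False):
    """Return the cache slot that each new token is written to.

    Linear layout: slot == absolute position (cache must be large enough).
    Ring buffer: slot == position modulo cache size, so old entries that
    fall outside the sliding window are overwritten.
    Hypothetical helper for illustration only.
    """
    positions = range(start_pos, start_pos + num_tokens)
    if ring:
        return [p % cache_size for p in positions]
    if start_pos + num_tokens > cache_size:
        # Assumed behavior: a linear cache cannot wrap around.
        raise ValueError("linear cache overflow")
    return list(positions)


def write_chunk(cache, start_pos, chunk_kv, ring=False):
    """Write one chunk of KV entries into `cache` in place and return it."""
    slots = cache_slots(start_pos, len(chunk_kv), len(cache), ring)
    for slot, kv in zip(slots, chunk_kv):
        cache[slot] = kv
    return cache


# Usage: a 4-slot ring buffer receiving 3 entries starting at position 2.
ring_cache = write_chunk([None] * 4, start_pos=2, chunk_kv=["a", "b", "c"], ring=True)
```

With a ring buffer the slot order no longer matches position order, so mask construction has to work from absolute positions rather than slot indices.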

copybara-service Bot requested review from abheesht17, hgao327, jiangyangmu, lc5211, s-noghabi, sizhit2, tianshub and wang2yn84 as code owners June 16, 2026 18:59

github-actions Bot assigned abheesht17 Jun 16, 2026

copybara-service Bot had a problem deploying to testing June 16, 2026 18:59 Error

copybara-service Bot force-pushed the test_933189977 branch from 0ee18f7 to 5d7b355 Compare June 16, 2026 19:02

copybara-service Bot temporarily deployed to testing June 16, 2026 19:03 Inactive

copybara-service[bot] wants to merge 1 commit into main from test_933189977
