You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Part of WS1 — Full Batch-Invariant Forward Chain (epic: #)
Why
Rollout generates token-by-token through the decode path; training re-runs the same sequence through prefill. If the two paths reduce in different orders, the same token gets different logprobs in rollout vs training — a classic and high-impact rollout-vs-training drift source. WS1 invariance work tends to focus on chunked-prefill; the decode stage must be covered explicitly.
Scope
Ensure the prefill and decode paths produce the same reductions for the same effective context.
Verify that attention over a cached context (decode: one query against N cached KV) reduces in the same fixed order as the equivalent prefill over the full sequence.
Cover the decode-stage path explicitly in tests, not only chunked-prefill.
Confirm cache writes/reads (layout, dtype of stored KV) do not introduce a precision difference between the path that wrote the cache and the path that consumes it.
Part of WS1 — Full Batch-Invariant Forward Chain (epic: #)
Why
Rollout generates token-by-token through the decode path; training re-runs the same sequence through prefill. If the two paths reduce in different orders, the same token gets different logprobs in rollout vs training — a classic and high-impact rollout-vs-training drift source. WS1 invariance work tends to focus on chunked-prefill; the decode stage must be covered explicitly.
Scope
Ensure the prefill and decode paths produce the same reductions for the same effective context.
Out of scope
Acceptance criteria
Notes
Planned PRs