Fix example max_steps calculations by fallintoplace · Pull Request #1629 · google/tunix

fallintoplace · 2026-06-28T20:02:39Z

What changed

This updates the example launch scripts that were computing max_steps as batch_size * num_batches * num_train_epochs * train_fraction.

Those scripts now compute max_steps from num_batches * num_train_epochs * train_fraction, which matches the CLI's step-based semantics. Since warmup_steps and decay_steps are derived from max_steps in these scripts, they now stay aligned as well.

Why

tunix.cli.grpo_main treats max_steps as optimizer/training steps and caps it against num_batches * num_train_epochs * train_fraction.

A handful of example scripts were multiplying by batch_size, which effectively turned the value into a sample count. That could overrun training by batch_sizex and skew warmup/decay schedules relative to the actual number of optimizer steps.

Impact

The affected examples now match the repo's documented and implemented max_steps behavior.

Validation

rg -n '\$batch_size \* \$num_batches \* \$num_train_epochs \* \$train_fraction' . -g '*.sh'
bash -n on the 8 updated scripts
git diff --check

Fix max_steps formulas in launch scripts

89b02d6

fallintoplace requested review from abheesht17, hgao327, jiangyangmu, lc5211, s-noghabi, sizhit2, tianshub and wang2yn84 as code owners June 28, 2026 20:02

github-actions Bot assigned tianshub Jun 28, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix example max_steps calculations#1629

Fix example max_steps calculations#1629
fallintoplace wants to merge 1 commit into
google:mainfrom
fallintoplace:fix/max-steps-formulas

fallintoplace commented Jun 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

fallintoplace commented Jun 28, 2026

What changed

Why

Impact

Validation

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants