Conversation

@rka97
Contributor

@rka97 rka97 commented Jan 27, 2026

This is the PR for the LM workload with mixed-precision training on 4xA100.

@rka97 rka97 requested a review from a team as a code owner January 27, 2026 19:51
@github-actions

github-actions bot commented Jan 27, 2026

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

@rka97 rka97 requested a review from priyakasimbeg January 27, 2026 19:52
@priyakasimbeg
Contributor

This branch has the mixed-precision code changes for all of the workloads. We have an existing PR open from the lm_workload_base branch that has only the LM workload changes. Note that I did not do a final test with mixed precision for just the LM workload after increasing the number of evals to get better timing estimates. At this point we may just opt to do the LM workload with TF32 for the new release, since we don't have any more bandwidth to test changes.

@github-actions github-actions bot locked and limited conversation to collaborators Jan 29, 2026

4 participants