
Upgrade GLM-5 image to v0.5.10#1023

Open
chunfangamd wants to merge 8 commits into main from chun/glm5

Conversation

@chunfangamd
Collaborator

@chunfangamd chunfangamd commented Apr 10, 2026

Upgrade the GLM5-FP8-MI355X-SGLang image to v0.5.10rc0-rocm720-mi35x-20260413 after fixing the Slurm shared-cache issue. This upstream daily image ships the new aiter 0.1.12.post1, which improves performance.

    # Frameworks that use the shared cache skip the home-directory mount
    if [[ "$FRAMEWORK" == "atom" ]] || [[ "$FRAMEWORK" == "sglang" ]]; then
        SLURM_HOME_MOUNT=""
    else
        SLURM_HOME_MOUNT=" --container-mount-home "
    fi
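For reference, the same branch logic can be exercised in isolation. The sketch below wraps it in a helper function; the name `select_home_mount` is an illustration for this comment, not code from the repo:

```shell
# Hypothetical helper mirroring the PR's mount-flag selection.
select_home_mount() {
  local framework="$1"
  # atom and sglang rely on the shared Slurm cache, so no home mount is added
  if [[ "$framework" == "atom" || "$framework" == "sglang" ]]; then
    printf '%s' ""
  else
    printf '%s' " --container-mount-home "
  fi
}

select_home_mount sglang   # empty: no extra srun flag
select_home_mount vllm     # emits " --container-mount-home "
```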

Co-authored with: @1am9trash @zhentaocc
Thanks to @cquil11 @billishyahao

@github-actions
Contributor

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes are similar to the official vLLM recipes and/or the SGLang cookbook.

If they are not, please create a PR there first before we can merge your PR into the master branch. Let's ensure that the documentation is first class so that the entire ML community can benefit from your hard work! Thank you.

@chunfangamd
Collaborator Author

@claude please activate the e2e test with test-config --config-files .github/configs/amd-master.yaml --config-keys glm5-fp8-mi355x-sglang

@Klaud-Cold
Collaborator

Klaud-Cold commented Apr 10, 2026

Claude finished @chunfangamd's task in 3s —— View job


E2E Test Triggered for GLM-5

  • Understand the request
  • Trigger e2e test workflow with test-config --config-files .github/configs/amd-master.yaml --config-keys glm5-fp8-mi355x-sglang --conc 4
  • Monitor workflow run — jobs queued, waiting for MI355X runners
  • Analyze benchmark results (pending runner availability)

Workflow triggered: Run #24243787005 (glm5-fp8-mi355x-sglang) on branch chun/glm5

Jobs created:

| Job | Status |
| --- | --- |
| get-jobs | ✅ Success |
| glm5_1k1k fp8 mi355x sglang \| tp=8 ep=1 conc-4 | ⌛ Queued (waiting for MI355X runner) |
| glm5_8k1k fp8 mi355x sglang \| tp=8 ep=1 conc-4 | ⌛ Queued (waiting for MI355X runner) |
| glm5_8k1k fp8 mi355x sglang \| tp=8 ep=1 conc-4 \| eval-only | ⌛ Queued (waiting for MI355X runner) |
| multi-node | ⏩ Skipped (not applicable) |

The workflow has been successfully dispatched. All 3 MI355X benchmark/eval jobs are queued and waiting for self-hosted MI355X runners to become available (~40min in queue so far). Once a runner picks up the jobs, the benchmarks will run automatically. Please monitor the run directly via the link above. You can tag @claude again once the run completes if you'd like me to analyze the results.

--- | View workflow run | Branch: chun/glm5

@chunfangamd chunfangamd self-assigned this Apr 10, 2026
@chunfangamd
Collaborator Author

@claude, I canceled the job https://github.com/SemiAnalysisAI/InferenceX/actions/runs/24243787005.

@Klaud-Cold
Collaborator

Klaud-Cold commented Apr 10, 2026

Claude finished @chunfangamd's task in 15s —— View job


Acknowledged Job Cancellation

The e2e test run for glm5-fp8-mi355x-sglang has been canceled as requested.

If you'd like me to re-trigger the tests or help with anything else on this PR, just let me know!

--- | Branch: chun/glm5

@billishyahao
Collaborator

Hi @chunfangamd, here is a new sweep based on the sgl-dev nightly build:
https://inferencex.semianalysis.com/inference?unofficialRun=24379639570

@cquil11
Collaborator

cquil11 commented Apr 14, 2026

@billishyahao @chunfangamd Nightly is fine for now, since this is a new model and the pace of development is rapid. Feel free to change this PR to the nightly image.

