[ATOM/ROCm] atom deepseek r1 fp4 mtp3 on mi355x by seungrokj · Pull Request #1028 · SemiAnalysisAI/InferenceX

seungrokj · 2026-04-14T13:44:19Z

hi,

This is deepseek r1 fp4 on atom framework which supports mtp 3 tokens.

Recipe link: https://github.com/ROCm/ATOM/blob/main/recipes/DeepSeek-R1.md#mxfp4-with-mtp

Regards,
Seungrok

Signed-off-by: seungrokj <seungrok.jung@amd.com>

github-actions · 2026-04-14T13:44:29Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

github-actions · 2026-04-14T13:44:30Z

Thanks for the contribution! For vLLM & SGLang, please ensure that your recipes is similar to the official vLLM recipes and/or the SGLang cookbook

If it is not, please create a PR first before we can merge your PR into the master branch. Let's ensure that the documentation is first class such that the entire ML community can benefit from your hard work! Thank you

Signed-off-by: seungrokj <seungrok.jung@amd.com>

seungrokj · 2026-04-14T15:04:09Z

hi @functionstackx @cquil11
can you please review this PR?

e2e perf: https://github.com/SemiAnalysisAI/InferenceX/actions/runs/24402497975

cc. @ChuanLi1101 @andyluo7 @chunfangamd

cquil11

amd/DeepSeek-R1-0528-MXFP4-MTP-MoEFP4
wait what is this?

functionstackx · 2026-04-14T21:07:53Z

do yall support this model in sglang...

seungrokj · 2026-04-15T01:21:42Z

amd/DeepSeek-R1-0528-MXFP4-MTP-MoEFP4 wait what is this?

hi @cquil11

new model 1) below, quantized shared & routed experts of layers 61.
old model 2) didn't quantized the entire layers 61.

==

https://huggingface.co/amd/DeepSeek-R1-0528-MXFP4-MTP-MoEFP4
->
export exclude_layers="mlp.gate. *lm_head model.layers.61.eh_proj model.layers.61.shared_head.head model.layers.61.embed_tokens"
vs
https://huggingface.co/amd/DeepSeek-R1-0528-MXFP4
->
exclude_layers="self_attn mlp.gate. lm_head model.layers.61."

seungrokj · 2026-04-15T01:26:05Z

@functionstackx haven't checked mtp3 + 'native' sglang yet. (atom-oot-sglang, rocm/atom-dev:sglang-latest supports though; but same perf as atom). will check this with other folks and get back to you soon :D

functionstackx · 2026-04-15T02:56:24Z

@functionstackx haven't checked mtp3 + 'native' sglang yet. (atom-oot-sglang, rocm/atom-dev:sglang-latest supports though; but same perf as atom). will check this with other folks and get back to you soon :D

Yes, plz let me know when this model ckpt is supported in upstream sglang and we can accept this PR for this atom model kpt

cquil11 · 2026-04-15T13:25:14Z

to reiterate what @functionstackx said, we think it is a good general rule of thumb to only allow this model once it is able to run on upstream SGLang image

atom deepseek r1 fp4 mtp3

283c99f

Signed-off-by: seungrokj <seungrok.jung@amd.com>

seungrokj requested a review from a team April 14, 2026 13:44

seungrokj requested review from 1am9trash, billishyahao, chunfangamd and yctseng0211 as code owners April 14, 2026 13:44

github-project-automation bot added this to InferenceMAX Board Apr 14, 2026

claude bot reviewed Apr 14, 2026

View reviewed changes

Comment thread perf-changelog.yaml Outdated

seungrokj mentioned this pull request Apr 14, 2026

[recipe] ds r1 fp4 mtp3 model change ROCm/ATOM#563

Merged

atom deepseek r1 fp4 mtp3

83f7e74

Signed-off-by: seungrokj <seungrok.jung@amd.com>

cquil11 approved these changes Apr 14, 2026

View reviewed changes

cquil11 requested changes Apr 14, 2026

View reviewed changes

seungrokj added the AMD label Apr 15, 2026

seungrokj requested a review from cquil11 April 15, 2026 01:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ATOM/ROCm] atom deepseek r1 fp4 mtp3 on mi355x#1028

[ATOM/ROCm] atom deepseek r1 fp4 mtp3 on mi355x#1028
seungrokj wants to merge 2 commits intomainfrom
srok/atom_dsr1_fp4_mtp3

seungrokj commented Apr 14, 2026

Uh oh!

github-actions bot commented Apr 14, 2026

Uh oh!

github-actions bot commented Apr 14, 2026

Uh oh!

Uh oh!

seungrokj commented Apr 14, 2026

Uh oh!

cquil11 left a comment

Uh oh!

functionstackx commented Apr 14, 2026

Uh oh!

seungrokj commented Apr 15, 2026

Uh oh!

seungrokj commented Apr 15, 2026

Uh oh!

functionstackx commented Apr 15, 2026 •

edited

Loading

Uh oh!

cquil11 commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

seungrokj commented Apr 14, 2026

Uh oh!

github-actions bot commented Apr 14, 2026

Uh oh!

github-actions bot commented Apr 14, 2026

Uh oh!

Uh oh!

seungrokj commented Apr 14, 2026

Uh oh!

cquil11 left a comment

Choose a reason for hiding this comment

Uh oh!

functionstackx commented Apr 14, 2026

Uh oh!

seungrokj commented Apr 15, 2026

Uh oh!

seungrokj commented Apr 15, 2026

Uh oh!

functionstackx commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cquil11 commented Apr 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

functionstackx commented Apr 15, 2026 •

edited

Loading