Skip to content

[Feat] remove flatten in atom sglang mla like atom vllm mla#525

Open
ZLkanyo009 wants to merge 2 commits intomainfrom
lingzha/remove_flatten
Open

[Feat] remove flatten in atom sglang mla like atom vllm mla#525
ZLkanyo009 wants to merge 2 commits intomainfrom
lingzha/remove_flatten

Conversation

@ZLkanyo009
Copy link
Copy Markdown

Motivation

In the decode process of DeepSeek-R1, compared to atom-vLLM, atom-sglang has an extra flatten operator. This PR removes that flatten operator based on the handling in atom-vLLM, thereby accelerating the decode phase.

Performance

before:
image

after:
image

@ZLkanyo009 ZLkanyo009 requested a review from zhuyuhua-v April 13, 2026 03:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant