Add a SP e2e test. #1209
Conversation
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
…allelism.py Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
token_num = x.shape[0]
# NOTE(chengjiyao): make sure the sharded token_num is larger than TPU_SECOND_LAST_MINOR
if token_num // self.mesh.shape["model"] >= TPU_SECOND_LAST_MINOR:
    logger.info(
In our e2e test, should we check the log to verify that SP is actually enabled? Another way is to check the final optimized graph, but that's more difficult.
I checked manually that this line (76) is executed. But I think you meant whether we can check it in the test. I'm not sure how we can do that in the test; let me know if you have some ideas.
That said, I added a long prompt (line 24, "Three Rings...") to ensure token_num // 8 >= 8 is triggered. Also, the precompilation phase uses a very large num_tokens, so this case is triggered there as well.
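For reference, a tiny sketch of the arithmetic behind that threshold (the shard count and constant here are illustrative; the real values come from the mesh and the source):

```python
# Illustrative numbers only: assume 8 model-parallel shards and
# TPU_SECOND_LAST_MINOR == 8; the real values live in the mesh/config.
MODEL_SHARDS = 8
TPU_SECOND_LAST_MINOR = 8

def sp_branch_taken(token_num: int) -> bool:
    # Mirrors the gating condition quoted above:
    # token_num // mesh.shape["model"] >= TPU_SECOND_LAST_MINOR
    return token_num // MODEL_SHARDS >= TPU_SECOND_LAST_MINOR

assert not sp_branch_taken(63)  # 63 // 8 == 7 -> SP branch skipped
assert sp_branch_taken(64)      # 64 // 8 == 8 -> SP branch taken
```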
Can we output the log to a file and check the file's contents later?
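Something along these lines could work (a rough sketch; the logger name is an assumption about this codebase, and the stand-in log call marks where the real SP workload would run):

```python
import logging

def test_sp_log_line_emitted(tmp_path):
    log_file = tmp_path / "sp.log"
    handler = logging.FileHandler(log_file)
    # Hypothetical logger name; use whichever module actually emits the SP line.
    sp_logger = logging.getLogger("tpu_inference")
    sp_logger.setLevel(logging.INFO)
    sp_logger.addHandler(handler)
    try:
        # ... run the SP workload here (the generate() call from this test) ...
        sp_logger.info("sequence parallelism enabled")  # stand-in for the real log line
    finally:
        handler.close()
        sp_logger.removeHandler(handler)
    assert "sequence parallelism" in log_file.read_text().lower()
```

pytest's built-in caplog fixture is another option, provided the project's logger propagates to the root logger.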
Could it be logger.debug instead of logger.info?
No luck for now; somehow I couldn't capture the logger. I agree that this may be the easiest way to test, but if someone writes a similar logging string, the test may not work as intended. Other parallelism modes seem to have the same issue: it's hard to examine in a test how each layer is sharded.
I've been thinking about whether there is a better way to test. Since SP's main benefit is reducing memory, we could check whether memory usage is indeed reduced with SP enabled, but I couldn't find a JAX API that lets me check the memory usage (one possible avenue is sketched below).
How about we merge this PR so that it verifies the test runs to completion with "enable_sequence_parallelism=True" and "tensor_parallelism=8", since that is the intended way to enable SP? Then, when we do the integration, we can improve the test. Wdyt?
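On the memory-check idea, a possible sketch, assuming the TPU runtime populates jax's per-device memory statistics (whether memory_stats() and the peak_bytes_in_use key are available here is an assumption):

```python
import jax

def peak_device_bytes() -> int:
    # memory_stats() returns None on runtimes that don't report memory usage.
    stats = jax.local_devices()[0].memory_stats()
    assert stats is not None, "memory stats not available on this runtime"
    return stats["peak_bytes_in_use"]

# Rough idea (pseudocode): run the same prompt with and without SP and expect
# the SP run to peak lower; the exact margin is workload-dependent.
#   baseline = measure(enable_sequence_parallelism=False)
#   with_sp  = measure(enable_sequence_parallelism=True)
#   assert with_sp < baseline
```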
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Signed-off-by: Xiongfei Wei <isaacwxf23@gmail.com>
Description
This PR adds an SP e2e test and wires it into the CI.
Tests
pytest -s -vv tests/e2e/test_sequence_parallelism.py
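For reference, a minimal sketch of what such a test might look like (the vLLM entry point, model name, and the way the SP flag is plumbed through are assumptions; the actual test file in this PR is the source of truth):

```python
from vllm import LLM, SamplingParams

def test_sequence_parallelism_e2e():
    # A long prompt so the sharded token count clears the SP threshold
    # discussed above (token_num // 8 >= TPU_SECOND_LAST_MINOR).
    prompt = "Three Rings for the Elven-kings under the sky, " * 8
    llm = LLM(
        model="meta-llama/Llama-3.1-8B-Instruct",  # hypothetical model choice for CI
        tensor_parallel_size=8,
        # Hypothetical plumbing; the PR enables SP via enable_sequence_parallelism=True.
        additional_config={"enable_sequence_parallelism": True},
    )
    outputs = llm.generate([prompt], SamplingParams(max_tokens=16))
    assert outputs and outputs[0].outputs[0].text
```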
Checklist
Before submitting this PR, please make sure: