用例测试#115

Open

kerer-ai wants to merge 44 commits intoAscend:v2.7.1from

kerer-ai:v2.7.1

Collaborator

kerer-ai commented Apr 7, 2026

用例测试

wangsike added 2 commits

April 7, 2026 20:03


          Add NPU full test workflow for PyTorch v2.7.1

c72f26b

This workflow enables running PyTorch upstream tests with NPU patches:
- Clones PyTorch v2.7.1 official repository for test source
- Applies test_upsteam patches from current repository
- Runs pytest with NPU device support
- Supports shard-based parallel execution (default 3 shards)
- Triggers on push/PR/schedule/workflow_dispatch


          Update NPU full test workflow configuration

df0e9ec

- Change runner from [self-hosted, npu-910b] to linux-aarch64-a3-2
- Set NUM_SHARDS to 40 (each shard ~2.5% of tests)
- Enable concurrent execution of all 40 shards (max-parallel: 40)

ascend-robot added the ascend-cla/yes label

ascend-robot commented Apr 7, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Simplify container options to only --user root

09a96d6

ascend-robot commented Apr 7, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Increase test shards from 40 to 100 for finer test distribution

3e49b19

Each shard now contains ~1% of tests instead of ~2.5%, reducing
the chance of a single shard containing multiple problematic test files.

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          修改bug

6848ba6

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          修复bug

5eab442

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          修复bug

e2634ae

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          在测试依赖阶段补装 zstandard

be5ddd9

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Refine NPU full test workflow summaries

d793e24

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          修复bug

8cac3a7

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          修改最后的说明

644fd3e

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          增加几个用例

996926b

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          重构

0f583fb

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Fix NPU full test shard stats reporting

d6b3c49

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Fix NPU full test runtime dependencies

11f7d7c

ascend-robot commented Apr 8, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Update CRASHED.yml with 126 crashed test files from workflow run 2428…

fc6c43c

…5667405

- Identified 21 shards that crashed during test execution
- Total 126 unique test files causing process crashes (SIGSEGV/SIGABRT)
- Categories: distributed, dynamo, functorch, nn, profiler, quantization, etc.

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Revert disabled_testcases.json and add test_proxy_tensor.py to CRASHE…

3fa5551

…D.yml

- Revert commit 148e92b changes to disabled_testcases.json
- Add test/test_proxy_tensor.py to CRASHED.yml blacklist
- test_make_fx_exhaustive__native_batch_norm_legit_npu causes segfault

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Add upload_torch_npu_wheel job to re-upload wheel artifact for download

9c72f7f

- Add new job 'upload_torch_npu_wheel' that downloads wheel from build job
  and re-uploads with clearer artifact name 'torch-npu-wheel-2.7.1'
- Increase retention days from 7 to 30 for easier access
- Update test and report job dependencies accordingly

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Update test report: show all shards with total/pass counts

4cb09c2

- Change "Non-Passing Shards" to "分片任务详情"
- Show all shards in detail table instead of only failed ones
- Add "总用例数" and "通过用例数" columns for better visibility

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Fix torch installation for ARM64 runners

1cfb60e

- Remove --index-url for PyTorch CPU wheels
- Use default PyPI which has aarch64 wheels for torch 2.7.1
- PyTorch CPU index only provides x86_64 wheels, not ARM64

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Temporarily limit test shards to 1-10 for validation

a9a7c03

- Change default shard_end from 100 to 10
- This is a temporary change to validate the report format changes
- Will restore to 100 after validation passes

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Use run_test.py for file-level parallel test execution

22475dc

- Change from direct pytest execution to run_test.py invocation
- Add --parallel parameter (default 2) to control NUM_PARALLEL_PROCS
- Execute from test directory (run_test.py expects this working dir)
- Strip 'test/' prefix from paths for run_test.py -i argument
- run_test.py automatically handles distributed tests as serial

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Add per-test-file statistics in shard detail report

d5ba2ea

- Parse JUnit XML files to extract testsuite-level statistics
- Display test file name, passed/failed/error counts, and duration
- Format: "test_file.py: 5 passed, 2 failed, 1 error, 3.2s"
- Replace "Scope" column with "测试文件详情" column

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Add pytest-rerunfailures and pytest-flakefinder dependencies

717c21f

run_test.py check_pip_packages() requires these packages:
- pytest-rerunfailures
- pytest-flakefinder
- pytest-xdist (already installed)

Without these, run_test.py exits with error code 1.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Fix run_test.py test name format: strip .py suffix

166c135

run_test.py expects test names without the .py extension.
For example, it expects 'custom_backend/test_custom_backend'
not 'custom_backend/test_custom_backend.py'.

The strip_test_prefix function now removes both 'test/' prefix
and '.py' suffix from test paths before passing to run_test.py -i.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Handle tests not recognized by run_test.py via pytest fallback

710c908

run_test.py has a predefined TESTS list and rejects tests from
directories like custom_backend/ and custom_operator/.

Solution:
- Validate tests against known unrecognized prefixes
- Run valid tests via run_test.py (file-level parallel)
- Run unrecognized tests via direct pytest fallback

This allows shard 1 tests (custom_backend, custom_operator) to run
while still benefiting from run_test.py parallelism for other tests.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Fix pytest path for unrecognized tests: strip test/ prefix

Phase 2 pytest fallback was passing full paths like
'test/custom_backend/test_custom_backend.py' but pytest runs from
test_dir, so paths should be relative (e.g. 'custom_backend/test_custom_backend.py').

The strip_test_prefix function removes both 'test/' and '.py', so
we add '.py' back for pytest which expects file paths with extensions.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 13, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Change test runner from linux-aarch64-a3-2 to linux-aarch64-a3-8

e4cbfe5

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 14, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Fix report generation: capture Phase 1 run_test.py JUnit XMLs

c12990a

Changes:
1. Workflow: Upload both test-reports/ and pytorch-test-src/test/test-reports/
   to capture Phase 1 run_test.py output
2. Report generator: Improved testsuite aggregation that:
   - Parses ALL XML files (not just shard-specific)
   - Filters by planned test files using test identifier matching
   - Handles both Phase 1 (run_test.py) and Phase 2 (pytest) results

This fixes:
- INCOMPLETE status when run_test.py produced XMLs in different directory
- Empty "测试文件详情" column (now shows per-test-file statistics)
- Note column now properly shows test results summary

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 14, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍


          Fix report generation to properly aggregate Phase 1 XML stats

50069fa

The Phase 1 run_test.py output XMLs are stored in nested directories:
pytorch-test-src/test/test-reports/python-pytest/{test_identifier}/

Each directory contains multiple XML files (one per worker due to parallel
execution), and the testsuite name is "pytest" (generic), not the specific
test file identifier.

This fix:
- Uses the parent directory name as the test identifier for Phase 1 XMLs
- Aggregates stats from all XML files in the same directory
- Overrides INCOMPLETE status when Phase 1 XMLs exist with test results
- Parses testcase file attribute for Phase 2 XMLs to identify test files

This resolves the issue where shards showed INCOMPLETE status and empty
"测试文件详情" column even when Phase 1 XMLs with test results existed.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

ascend-robot commented Apr 14, 2026

CLA Signature Pass

kerer-ai, thanks for your pull request. All authors of the commits have signed the CLA. 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels