Re-implement benchmarks in pytest by pipliggins · Pull Request #5630 · pybamm-team/PyBaMM

pipliggins · 2026-06-26T01:00:11Z

Description

Moves PyBaMM's speed and peak memory tests away from asv and into the pytest ecosystem, using pytest-benchmark and pytest-memray.

Adds a new workflow to check that the speed benchmarks in a PR do not regress compared to main.

To discuss: What to do with pybamm-bench. This stores the asv benchmarks over time and provides a gh-pages website to display them. The workflow to populate the repo hasn't run since April 2024 so the site is fairly out of date.
One option is to use the https://github.com/benchmark-action/github-action-benchmark, which should be able to provide a similar workflow and display plots in a similar manner to asv, should we want to keep some record.

Full suite is fast enough without casadi to not need a separate 'slow-bench' split.

codecov · 2026-06-26T01:26:16Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 98.09%. Comparing base (8013d49) to head (54d25d1).

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #5630      +/-   ##
==========================================
- Coverage   98.19%   98.09%   -0.10%     
==========================================
  Files         339      339              
  Lines       32069    32069              
==========================================
- Hits        31489    31459      -30     
- Misses        580      610      +30

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

martinjrobins

thanks @pipliggins, this looks good, a few questions below. https://github.com/benchmark-action/github-action-benchmark looks really good, can you please add these plots? (good find, I think I might add this to diffsol too!)

agriyakhetarpal

Thanks! I only did a quick pass just yet; it would be useful for us to set up these new benchmarks in such a way that we ensure they are as reliable as possible. In conda, we used CodSpeed (https://codspeed.io) until very recently – it is a good solution to eliminate statistical outliers from GitHub Actions runners and has a free tier. There is also https://bencher.dev, which is also free, which we've started using. I would recommend setting up either of those to go with this PR, for a start (I am not opposed to https://github.com/benchmark-action/github-action-benchmark at all, just that it might make sense to adopt these solutions in addition for their low variability).

We already have a 301 redirect set up for the GitHub Pages website at https://pybamm.org/benchmarks, so I can add that in the footer somewhere or change the link wherever our new location will be.

pipliggins · 2026-06-30T22:44:10Z

Thanks! I only did a quick pass just yet; it would be useful for us to set up these new benchmarks in such a way that we ensure they are as reliable as possible. In conda, we used CodSpeed (https://codspeed.io) until very recently – it is a good solution to eliminate statistical outliers from GitHub Actions runners and has a free tier. There is also https://bencher.dev, which is also free, which we've started using. I would recommend setting up either of those to go with this PR, for a start (I am not opposed to https://github.com/benchmark-action/github-action-benchmark at all, just that it might make sense to adopt these solutions in addition for their low variability).

We already have a 301 redirect set up for the GitHub Pages website at https://pybamm.org/benchmarks, so I can add that in the footer somewhere or change the link wherever our new location will be.

I saw Bencher come up a lot as I was googling around this, it looks good. We'd need an account associated with pybamm to 'claim' the project and view results - @brosaplanella is that something you could do? A potential issue with the free tier is it limits you to a 5m job timeout. On my mac it takes 4.30 - 5 mins to run the full suite, so we're likely to be close to/over that limit.

pipliggins added 7 commits June 25, 2026 18:02

Re-implement speed benchmarks in pytest-benchmark

ffb0465

Remove casadi from benchmarks

df45dab

Add .benchmarks to gitignore

bf35d67

Add memory benchmarks

bbbd696

Just mark speed benchmarks

3cc6653

Full suite is fast enough without casadi to not need a separate 'slow-bench' split.

Delete ASV benchmarks

a3c952e

Add workflow to run benchmarks against main

cb190b8

pipliggins force-pushed the benchmarks-to-pytest-2 branch from 4d1ddf4 to cb190b8 Compare June 26, 2026 01:02

add continue on error temporarily to benchmark pr

18e1379

pipliggins mentioned this pull request Jun 26, 2026

Re-implement speed benchmarks in pytest-benchmark #5584

Closed

pipliggins added 2 commits June 25, 2026 18:40

Increase the memory limit for test_parameterise_memory

d95d18b

Bump simulation_setup_memory limit

150461d

pipliggins changed the title ~~Re-implement speed benchmarks in pytest-benchmark~~ Re-implement benchmarks in pytest Jun 26, 2026

pipliggins added 2 commits June 29, 2026 12:26

Stop pytest auto-assignment under benchmarks

0477447

tests: stop pytest collecting benchmarks unless path explicitly passed

aa2e80d

pipliggins marked this pull request as ready for review June 29, 2026 23:17

pipliggins requested a review from a team as a code owner June 29, 2026 23:17

pipliggins requested review from BradyPlanden and martinjrobins June 29, 2026 23:17

martinjrobins requested changes Jun 30, 2026

View reviewed changes

Comment thread .github/workflows/benchmarks_pr.yml

Comment thread packages/pybamm/tests/benchmarks/README.md Outdated

Comment thread packages/pybamm/tests/benchmarks/test_unit_benchmarks.py Outdated

Comment thread packages/pybamm/tests/benchmarks/test_model_options.py Outdated

agriyakhetarpal reviewed Jun 30, 2026

View reviewed changes

pipliggins added 4 commits June 30, 2026 15:48

ci: delete old asv benchmark workflows

c789d6b

benchmarks: switch ScipySolver to IDAKLU in unit benchmarks

56bbdf4

benchmarks: rename speed_bench to time_bench

af8ea95

benchmarks: update readme

54d25d1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Re-implement benchmarks in pytest#5630

Re-implement benchmarks in pytest#5630
pipliggins wants to merge 16 commits into
pybamm-team:mainfrom
pipliggins:benchmarks-to-pytest-2

pipliggins commented Jun 26, 2026 •

edited

Loading

Uh oh!

codecov Bot commented Jun 26, 2026 •

edited

Loading

Uh oh!

martinjrobins left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

agriyakhetarpal left a comment •

edited

Loading

Uh oh!

pipliggins commented Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Uh oh!

Conversation

pipliggins commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

codecov Bot commented Jun 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

martinjrobins left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

agriyakhetarpal left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pipliggins commented Jun 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pipliggins commented Jun 26, 2026 •

edited

Loading

codecov Bot commented Jun 26, 2026 •

edited

Loading

agriyakhetarpal left a comment •

edited

Loading