ci: refactor tests to exclude heavy model benchmarking by default and allow by manual triggering by Mattdl · Pull Request #31 · techwolf-ai/workrb

Mattdl · 2026-01-09T10:45:59Z

Model evaluation is useful for new contributions, since it checks whether a contributed model can reproduce the results from its paper. However, we don't want every PR to trigger a full evaluation of all models. It is therefore disabled by default now and documented in the model contributing guidelines.

Reproducing results for models on a benchmark now lives in a separate GitHub workflow that can be triggered manually. It is excluded from the default test run so that test time does not grow with every model that is added.
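For readers who want a picture of the opt-in pattern, here is a minimal sketch using only the standard library. It is not the code from this PR: the environment variable `RUN_MODEL_BENCHMARKS`, the helper `benchmarks_enabled`, and the test class are all hypothetical names, and the repository may well use a different mechanism (for example test markers). The idea is that the heavy test is skipped unless the flag is set, and a manually triggered workflow would be the place that sets it.

```python
import os
import unittest

# Hypothetical flag name; an assumption for illustration, not taken from the PR.
BENCHMARK_FLAG = "RUN_MODEL_BENCHMARKS"


def benchmarks_enabled(env=None):
    """Return True only when the opt-in flag is explicitly set to a truthy value."""
    env = os.environ if env is None else env
    return env.get(BENCHMARK_FLAG, "").strip().lower() in {"1", "true", "yes"}


class TestModelBenchmark(unittest.TestCase):
    """Stand-in for a heavy 'reproduce the paper results' test."""

    @unittest.skipUnless(benchmarks_enabled(), f"set {BENCHMARK_FLAG}=1 to run")
    def test_reproduces_reported_score(self):
        # Placeholder values; a real test would run the model on the benchmark.
        reported_score = 0.5
        measured_score = 0.5
        self.assertAlmostEqual(measured_score, reported_score, places=2)


if __name__ == "__main__":
    unittest.main()
```

With the flag unset, the default test run reports the benchmark test as skipped. Setting `RUN_MODEL_BENCHMARKS=1` in the environment of a manually dispatched job would opt in to the full evaluation.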

mattdl-techwolf added 3 commits January 9, 2026 11:22

test: refactor of tests to exclude benchmarking validation of models.

102e1cf

Reproducing results for models on a benchmark is now in a separate github workflow that can be manually triggered and is excluded by default to avoid bloating tests with the number of models being added.

docs: CONTRIBUTING model guideline update

687b28b

chore: remove noqa from model tests

3970506

Mattdl merged commit 44dbe7e into main Jan 9, 2026
2 checks passed

Mattdl deleted the refactor-contributions branch January 9, 2026 11:54

Mattdl merged 3 commits into main from refactor-contributions
