
Conversation

@mccarthy-m-g (Collaborator) commented Oct 23, 2025:

This PR adds an (optional) accuracy metric to the results stats summary of run_eval() (#30).

The main challenge for this one was defining the API, since unlike the other error metrics, the user needs to define the absolute and relative error margins that determine whether a prediction is considered accurate. I opted to only calculate accuracy when the user has supplied both error margins (via a new .stats_summ_options argument and corresponding stats_summ_options() function). This lets us avoid having to define universally suitable defaults, which would be difficult to do appropriately.

The API looks like this:

run_eval(..., .stats_summ_options = stats_summ_options(acc_error_abs = 0.5, acc_error_rel = 0.25))
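
For reference, here is a minimal sketch of how an accuracy proportion could be computed from these two margins. This is a hypothetical illustration, not the PR's actual implementation; it assumes a prediction counts as accurate when it falls within either the absolute or the relative error margin, and the function and argument names pred/obs are placeholders.

# Hypothetical sketch, assuming a prediction is accurate when it falls
# within either the absolute or the relative error margin.
calc_accuracy <- function(pred, obs, acc_error_abs, acc_error_rel) {
  abs_error <- abs(pred - obs)
  rel_error <- abs_error / abs(obs)
  # TRUE when the prediction is within at least one margin
  accurate <- abs_error <= acc_error_abs | rel_error <= acc_error_rel
  # Proportion of accurate predictions
  mean(accurate)
}

calc_accuracy(
  pred = c(1.2, 0.7, 2.6),
  obs = c(1.0, 1.0, 2.0),
  acc_error_abs = 0.5,
  acc_error_rel = 0.25
)
#> [1] 0.6666667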

@mccarthy-m-g added the enhancement (New feature or request) label Oct 23, 2025
@mccarthy-m-g linked an issue (#30, "Add accuracy metric to results stats summary") Oct 23, 2025 that may be closed by this pull request
@roninsightrx (Contributor) commented:

> I opted to only calculate accuracy if the user has supplied the absolute and relative error margins

Agree with this approach; otherwise there is too much chance of users just relying on the defaults.

@mccarthy-m-g (Collaborator, Author) commented:

One question for printing: right now we print accuracy as NA for each type when it isn't being calculated. Do you like that, or should we remove the accuracy column from printing when it isn't calculated?

To hide it, we just need to add a select statement to print.mipdeval_results_stats_summ(), like:

library(dplyr)
library(tibble)

tibble(x = rep(NA, times = 5), y = 1:5) |> select(where(\(.x) all(!is.na(.x))))
#> # A tibble: 5 × 1
#>       y
#>   <int>
#> 1     1
#> 2     2
#> 3     3
#> 4     4
#> 5     5
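
For illustration, the hiding logic inside the print method could look roughly like the sketch below. This is hypothetical, not the method body from the PR; note it uses !all(is.na(.x)) rather than all(!is.na(.x)) so that columns that are only partially NA are still printed, and it assumes the object is a tibble subclass.

library(dplyr)

# Hypothetical sketch of the print method; the real
# print.mipdeval_results_stats_summ() will do more than this.
print.mipdeval_results_stats_summ <- function(x, ...) {
  x |>
    # Drop the subclass so print() below dispatches to the tibble method
    tibble::as_tibble() |>
    # Keep only columns that are not entirely NA, so accuracy disappears
    # from the output when it wasn't calculated
    select(where(\(.x) !all(is.na(.x)))) |>
    print()
  invisible(x)
}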

@roninsightrx (Contributor) commented on these lines of the diff:

nrmse = c(0.411, 0.232, 0.199, 0.121, 0.486, 0.232),
mpe = c(-0.339, -0.004, -0.045, -0.002, -0.415, -0.004),
mape = c(0.445, 0.246, 0.166, 0.118, 0.558, 0.246),
accuracy = c(NA_real_, NA_real_, NA_real_, NA_real_, NA_real_, NA_real_)

Would include at least one end-to-end test for run_eval() where we do calculate it.

@mccarthy-m-g (Collaborator, Author) replied:
Added here: 070a7bb
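
For illustration, an end-to-end test along those lines might look roughly like this. This is a hedged sketch: the run_eval() inputs are elided, the results$stats_summ accessor is an assumption, and the actual test added in 070a7bb may differ.

library(testthat)

test_that("run_eval() calculates accuracy when error margins are supplied", {
  results <- run_eval(
    # <model, data, and other run_eval() inputs elided>
    .stats_summ_options = stats_summ_options(
      acc_error_abs = 0.5,
      acc_error_rel = 0.25
    )
  )
  # Assumes the stats summary is exposed as results$stats_summ;
  # the real accessor may differ.
  stats_summ <- results$stats_summ
  # Accuracy should be present, calculated, and a proportion in [0, 1]
  expect_true("accuracy" %in% names(stats_summ))
  expect_false(anyNA(stats_summ$accuracy))
  expect_true(all(stats_summ$accuracy >= 0 & stats_summ$accuracy <= 1))
})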

@roninsightrx self-requested a review October 29, 2025 22:48
@roninsightrx (Contributor) left a comment:
lgtm!

@mccarthy-m-g merged commit 834ef50 into main Nov 10, 2025
2 checks passed
@mccarthy-m-g deleted the stat-accuracy branch November 10, 2025 17:40