Add sampling uncertainty on precision-recall and ROC curves #26192

stephanecollot · Apr 16, 2023

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Add sampling uncertainty on precision-recall and ROC curves.
See more details in the Issue above.

stephanecollot · Apr 16, 2023

Hi,

Here is a first version on the PR, adding this feature on precision-recall only, once we agree on the integration for this one, I will add the ROC in an analogous way, in this PR.
Thank you in advance for having an initial look on sklearn/metrics/_plot/precision_recall_curve.py

I will add unit tests and more function docstrings in sklearn/metrics/_plot/uncertainty.py soon.

@glemaitre @betatim @lorentzenchr

RUrlus · Apr 18, 2023

sklearn/metrics/_plot/uncertainty.py

+"""
+TODO: Documentation
+
+AISTAT 2023 `Sampling uncertainties on the Precision-Recall curve`


Correct title of paper is: Pointwise sampling uncertainties on the Precision-Recall curve
with authors: R.E.Q. Urlus, M.A. Baak, S. Collot, I. Fridman Rojas

RUrlus · Apr 18, 2023

@stephanecollot This implementation does not match the implementation in MMU and is not as described in the paper.

This code creates a grid of a fixed shape for each P,R point and evaluates the chi2 score.
This means that different thresholds have different resolutions, e.g. 100 bins to cover 0.5 - 0.55 whereas other thresholds might only have a region that covers 0.9 - 0.91 with the same number of bins. You draw a region for each threshold but the overlap is no longer comparable as they are computed with different resolutions.

The reference implementation creates a P, R grid with a set number of points per axis.
For each threshold we evaluate the region in the P,R grid to evaluate and for each predetermined point in this region we compute the chi2 score. We only store the minimum chi2 score for each grid point and draw a single contour for the grid.
Not only is this much faster, this is also consistent as the whole curve has the same resolution.

stephanecollot · Apr 18, 2023

@stephanecollot This implementation does not match the implementation in MMU and is not as described in the paper.

This code creates a grid of a fixed shape for each P,R point and evaluates the chi2 score. This means that different thresholds have different resolutions, e.g. 100 bins to cover 0.5 - 0.55 whereas other thresholds might only have a region that covers 0.9 - 0.91 with the same number of bins. You draw a region for each threshold but the overlap is no longer comparable as they are computed with different resolutions.

The reference implementation creates a P, R grid with a set number of points per axis. For each threshold we evaluate the region in the P,R grid to evaluate and for each predetermined point in this region we compute the chi2 score. We only store the minimum chi2 score for each grid point and draw a single contour for the grid. Not only is this much faster, this is also consistent as the whole curve has the same resolution.

That is correct, I tried 3 different methods for the grid, fix number grid point per point (the current one), the paper one and an adaptative grid (i.e. "x point per cm²"). They had different pros and cons, and I picked the best in terms for plotting smoothness and execution time. But it is true that I'm not taking the minimum chi2, which can make the plot look different.
I'm going to have a closer look at this.

lorentzenchr · May 11, 2023

I‘m not sure, but maybe https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.random_table.html#scipy.stats.random_table could be of help.

add sampling uncertainty PR

79a71c3

github-actions bot added the module:metrics label Apr 16, 2023

RUrlus reviewed Apr 18, 2023

View reviewed changes

refine grid definition, this time aligned with the paper

aea06b4

lorentzenchr added the Stalled label Sep 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add sampling uncertainty on precision-recall and ROC curves #26192

Add sampling uncertainty on precision-recall and ROC curves #26192

Uh oh!

stephanecollot commented Apr 16, 2023

Uh oh!

stephanecollot commented Apr 16, 2023

Uh oh!

RUrlus Apr 18, 2023

Uh oh!

RUrlus commented Apr 18, 2023 •

edited

Loading

Uh oh!

stephanecollot commented Apr 18, 2023

Uh oh!

lorentzenchr commented May 11, 2023

Uh oh!

Uh oh!

Search code, repositories, users, issues, pull requests...

Uh oh!

Add sampling uncertainty on precision-recall and ROC curves #26192

Are you sure you want to change the base?

Add sampling uncertainty on precision-recall and ROC curves #26192

Uh oh!

Conversation

stephanecollot commented Apr 16, 2023

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Uh oh!

stephanecollot commented Apr 16, 2023

Uh oh!

RUrlus Apr 18, 2023

Choose a reason for hiding this comment

Uh oh!

RUrlus commented Apr 18, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

stephanecollot commented Apr 18, 2023

Uh oh!

lorentzenchr commented May 11, 2023

Uh oh!

Uh oh!

RUrlus commented Apr 18, 2023 •

edited

Loading