[MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case #19721

joclement · Mar 19, 2021

Reference Issues/PRs

Same changes as #19300, which I accidentally closed, but can not reopen.

What does this implement/fix? Explain your changes.

See #19300 for these changes and review by @thomasjpfan. If wanted, I can copy that content here.

Any other comments?

I'm sorry for this extra work/confusion.

kyleabeauchamp · Mar 31, 2021

Would love to see this one merge, anything needed to make sure it crosses the finish line?

thomasjpfan

Thank you for working on this @flyingdutchman23

doc/whats_new/v1.0.rst

sklearn/metrics/tests/test_ranking.py

thomasjpfan

LGTM!

kyleabeauchamp · Apr 12, 2021

Do you need help resolving the merge conflict? I'm interested in making sure this lands before the next release :).

joclement · Apr 13, 2021

Do you need help resolving the merge conflict? I'm interested in making sure this lands before the next release :).

Thank you for pointing that out, Done.

I further improved a commit description and included a fixup commit in another commit to have a cleaner history.

kyleabeauchamp · Apr 22, 2021

I guess this bugfix is not slated to land in the pending release?

jnothman

This LGTM. @glemaitre should it go in 0.24.2 as a fix to a new feature??

glemaitre · Apr 25, 2021

We can add it in the upcoming release. You only need to move the entry in 0.24.rst instead of 1.0

Currently the last 2 parameters of the added test fail, because the labels are not considered to decide whether the target is "binary" or "multiclass". The labels parameters is only used in later steps.

If a problem is actually "multiclass", and not all classes are contained in the parameter `y_true` , the function fails, because the determined type is "binary". That decision makes sense, if the parameter labels is not passed. The problem is that the function also fails, if the parameter `labels` is passed, although it would be possible to determine the type of and the number of classes in conjunction with this parameter. This commit fixes that, by checking whether the `labels` parameter has been set and contains more than 2 classes, if the type has been determined to be "binary" in the previous step.

This is for the case where `labels` is an `ndarray`. Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

joclement · Apr 26, 2021

@glemaitre thanks for reviewing. The changelog has been moved to v0.24.rst.

glemaitre · Apr 26, 2021

Thanks

…ccuracy_score (scikit-learn#19721)

…ccuracy_score (#19721)

github-actions bot added the module:metrics label Mar 19, 2021

thomasjpfan reviewed Apr 2, 2021

View reviewed changes

doc/whats_new/v1.0.rst Outdated Show resolved Hide resolved

sklearn/metrics/tests/test_ranking.py Show resolved Hide resolved

sklearn/metrics/tests/test_ranking.py Outdated Show resolved Hide resolved

thomasjpfan changed the title ~~[MRG] Fix top_k_accuracy_score ignoring labels for "multiclass" case~~ FIX Fix top_k_accuracy_score ignoring labels for "multiclass" case Apr 2, 2021

thomasjpfan approved these changes Apr 3, 2021

View reviewed changes

joclement changed the title ~~FIX Fix top_k_accuracy_score ignoring labels for "multiclass" case~~ [MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case Apr 4, 2021

thomasjpfan changed the title ~~[MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case~~ FIX top_k_accuracy_score ignoring labels for "multiclass" case Apr 12, 2021

joclement changed the title ~~FIX top_k_accuracy_score ignoring labels for "multiclass" case~~ [MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case Apr 20, 2021

jnothman approved these changes Apr 24, 2021

View reviewed changes

glemaitre added this to the 0.24.2 milestone Apr 25, 2021

glemaitre added the To backport PR merged in master that need a backport to a release branch defined based on the milestone. label Apr 25, 2021

joclement and others added 10 commits April 26, 2021 08:45

Typo

adc32fe

Add test case for multiclass top_k_accuracy_score

6d5b13d

Currently the last 2 parameters of the added test fail, because the labels are not considered to decide whether the target is "binary" or "multiclass". The labels parameters is only used in later steps.

Impmrove check on labels

47a4098

This is for the case where `labels` is an `ndarray`. Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Avoid duplicate tests

ab1dea2

Test strings in labels

9b0322b

Test labels as ndarray

b46b3f0

Add comment explaining test

ca32af9

Add changelog entry

feb62fe

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Improve description of test

cc23ce3

Co-authored-by: Thomas J. Fan <thomasjpfan@gmail.com>

Update v0.24.rst

9c38c23

glemaitre merged commit 6927fa2 into scikit-learn:main Apr 26, 2021

glemaitre pushed a commit to glemaitre/scikit-learn that referenced this pull request Apr 26, 2021

FIX mislabelling multiclass target when labels is provided in top_k_a…

8e27ac5

…ccuracy_score (scikit-learn#19721)

joclement deleted the fix-top-k-accuracy branch April 26, 2021 12:25

glemaitre pushed a commit that referenced this pull request Apr 28, 2021

FIX mislabelling multiclass target when labels is provided in top_k_a…

ab21254

…ccuracy_score (#19721)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case #19721

[MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case #19721

Uh oh!

joclement commented Mar 19, 2021

Uh oh!

kyleabeauchamp commented Mar 31, 2021

Uh oh!

thomasjpfan left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thomasjpfan left a comment

Uh oh!

kyleabeauchamp commented Apr 12, 2021

Uh oh!

joclement commented Apr 13, 2021 •

edited

Loading

Uh oh!

kyleabeauchamp commented Apr 22, 2021 •

edited

Loading

Uh oh!

jnothman left a comment

Uh oh!

glemaitre commented Apr 25, 2021

Uh oh!

joclement commented Apr 26, 2021

Uh oh!

glemaitre commented Apr 26, 2021

Uh oh!

Uh oh!

Search code, repositories, users, issues, pull requests...

Uh oh!

[MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case #19721

[MRG] FIX top_k_accuracy_score ignoring labels for "multiclass" case #19721

Uh oh!

Conversation

joclement commented Mar 19, 2021

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

kyleabeauchamp commented Mar 31, 2021

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thomasjpfan left a comment

Choose a reason for hiding this comment

Uh oh!

kyleabeauchamp commented Apr 12, 2021

Uh oh!

joclement commented Apr 13, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kyleabeauchamp commented Apr 22, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jnothman left a comment

Choose a reason for hiding this comment

Uh oh!

glemaitre commented Apr 25, 2021

Uh oh!

joclement commented Apr 26, 2021

Uh oh!

glemaitre commented Apr 26, 2021

Uh oh!

Uh oh!

joclement commented Apr 13, 2021 •

edited

Loading

kyleabeauchamp commented Apr 22, 2021 •

edited

Loading