MNT Refactor fit method of NearestCentroid. #28072

fkdosilovic · Jan 6, 2024

Reference Issues/PRs

None.

What does this implement/fix? Explain your changes.

This PR refactors the fit method of the NearestCentroid classifier.
Changes include:

decoupling of the computation of class centroids and choosing the function for centroid computation
improvements to References

Any other comments?

None.

github-actions · Jan 6, 2024

✔️ Linting Passed

All linting checks passed. Your pull request is in excellent shape! ☀️

_{Generated for commit: 1a10bc5. Link to the linter CI: here}

glemaitre · Jan 11, 2024

sklearn/neighbors/_nearest_centroid.py

@@ -162,73 +169,86 @@ def fit(self, X, y):
            X, y = self._validate_data(X, y, accept_sparse=["csc"])
        else:
            X, y = self._validate_data(X, y, accept_sparse=["csr", "csc"])
+


Please remove the break line to minimize the diff. They don't break more clarity to the codebase.

glemaitre · Jan 11, 2024

sklearn/neighbors/_nearest_centroid.py

+       multiple cancer types by shrunken centroids of gene expression. Proceedings
+       of the National Academy of Sciences of the United States of America,
+       99(10), 6567-6572. The National Academy of Sciences.
+       <https://www.pnas.org/doi/full/10.1073/pnas.082099299>`_


Nowadays we use :doi: sphinx marker. You can check other occurrences.

glemaitre · Jan 11, 2024

sklearn/neighbors/_nearest_centroid.py

+
+        # Choose the transformation for boolean class mask vector.
+        if is_X_sparse:
+            mask_trf = lambda mask: np.where(mask)[0]  # noqa: E731


We don't use lambda and do not add exception to ruff.

glemaitre · Jan 11, 2024

sklearn/neighbors/_nearest_centroid.py

+        n_samples, n_features = X.shape
+
+        # Compute the centroids.
+        self.centroids_ = np.empty((n_classes, n_features), dtype=np.float64)


Actually, I am wondering if this refactoring is necessary since we are modifying this code in this PR: #26689

I have to check if we can just combine both approaches.

Refactor fit method of NearestCentroid.

1a10bc5

github-actions bot added the module:neighbors label Jan 6, 2024

glemaitre reviewed Jan 11, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

MNT Refactor fit method of NearestCentroid. #28072

MNT Refactor fit method of NearestCentroid. #28072

Uh oh!

fkdosilovic commented Jan 6, 2024 •

edited

Loading

Uh oh!

github-actions bot commented Jan 6, 2024

Uh oh!

glemaitre Jan 11, 2024

Uh oh!

glemaitre Jan 11, 2024

Uh oh!

glemaitre Jan 11, 2024

Uh oh!

glemaitre Jan 11, 2024

Uh oh!

Uh oh!

Search code, repositories, users, issues, pull requests...

Uh oh!

MNT Refactor fit method of NearestCentroid. #28072

Are you sure you want to change the base?

MNT Refactor fit method of NearestCentroid. #28072

Uh oh!

Conversation

fkdosilovic commented Jan 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Any other comments?

Uh oh!

github-actions bot commented Jan 6, 2024

✔️ Linting Passed

Uh oh!

glemaitre Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

glemaitre Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

glemaitre Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

glemaitre Jan 11, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fkdosilovic commented Jan 6, 2024 •

edited

Loading