[MRG] PERF Significant performance improvements in Partial Least Squares (PLS) Regression #23876

Open · wants to merge 73 commits into main
Conversation

@fractionalhare fractionalhare commented Jul 9, 2022

Description of Changes

Motivating Summary

This PR dramatically improves the speed of PLS regression by implementing the modified kernel algorithm of Dayal and MacGregor (Dayal-MacGregor 1997). It yields significant performance improvements over the current default, NIPALS, without sacrificing accuracy or numerical stability.

The Dayal-MacGregor algorithm accurately computes the weight and rotation matrices for the maximal-covariance projection(s) of X without explicitly calculating the X score matrix and without deflating the Y target(s) at all. It is therefore (much) faster than NIPALS in the base case of univariate Y, and becomes asymptotically faster as the number of Y targets increases. Testing shows speed improvements of over 40% in the base case, rising to over 95% when Y is multivariate.

In other words, this PR makes PLS regression 2x - 20x faster for equivalent results. The paper gives detailed numerical stability and accuracy results, which are confirmed by my empirical tests below.
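For readers unfamiliar with the method, here is a minimal NumPy sketch of the modified kernel algorithm, which operates only on the cross-product matrices X'X and X'Y and deflates X'Y rather than Y. This is an illustration, not the PR's actual implementation: the function name is mine, and X and Y are assumed to be already centered/scaled as desired.

```python
import numpy as np

def dayal_macgregor_pls(X, Y, n_components):
    """Illustrative sketch of Dayal-MacGregor modified kernel PLS (1997)."""
    n, m = X.shape
    Y = Y.reshape(n, -1)
    ny = Y.shape[1]
    XtX = X.T @ X
    XtY = X.T @ Y
    P = np.zeros((m, n_components))   # X loadings
    Q = np.zeros((ny, n_components))  # Y loadings
    R = np.zeros((m, n_components))   # X rotations
    for a in range(n_components):
        if ny == 1:
            w = XtY[:, 0]
        else:
            # w is X'Y times the dominant eigenvector of the small
            # (ny x ny) symmetric matrix (X'Y)'(X'Y)
            _, evecs = np.linalg.eigh(XtY.T @ XtY)
            w = XtY @ evecs[:, -1]
        w = w / np.linalg.norm(w)
        r = w - R[:, :a] @ (P[:, :a].T @ w)  # orthogonalize vs earlier loadings
        tt = r @ XtX @ r                     # squared norm of the score t = X @ r
        p = (XtX @ r) / tt                   # X loading
        q = (XtY.T @ r) / tt                 # Y loading
        XtY = XtY - tt * np.outer(p, q)      # deflate X'Y; Y itself is never touched
        P[:, a], Q[:, a], R[:, a] = p, q, r
    return R @ Q.T                           # regression coefficient matrix B

# Sanity check: with as many components as features, PLS coincides with
# least squares, so an exactly linear Y is recovered exactly.
rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))
Y = X @ rng.normal(size=(5, 2))
B = dayal_macgregor_pls(X, Y, n_components=5)
assert np.allclose(X @ B, Y)
```

Note that the per-component cost depends on the number of X features and Y targets only through these small cross-product matrices, which is where the asymptotic advantage over NIPALS for multivariate Y comes from.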

Specific Changes

  • The parameter constraints on the algorithm parameter of the _PLS base class were modified so that a user can select the new algorithm via algorithm="dayalmacgregor" or algorithm="kernel". See the comments below; I'm not attached to having both values refer to the new algorithm, but I've implemented it this way for now.
  • The PLSRegression child class was modified to accept the algorithm parameter at instantiation (previously a user could not select the algorithm, and the regression child class implicitly took the base class default). The argument defaults to algorithm="nipals" for backwards compatibility.
  • The fit method was augmented with the Dayal-MacGregor implementation, inside a conditional branch on the value of the algorithm attribute.
  • A new test was added to cover the new algorithm.

Performance & Accuracy Results

Small X (20 x 10) and Univariate Y (20 x 1) with 5 Components

NIPALS: 344 µs ± 4.77 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

Dayal-MacGregor: 198 µs ± 3.87 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

The output predictions agree to 6 decimal places.

43% faster

Small X (20 x 10) and Multivariate Y (20 x 2) with 5 Components

NIPALS: 700 µs ± 13.7 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

Dayal-MacGregor: 268 µs ± 4.92 µs per loop (mean ± std. dev. of 7 runs, 1,000 loops each)

The output predictions are equal to 4 decimal places.

62% faster

Moderate X (200 x 100) and Univariate Y (200 x 1) with 50 Components

NIPALS: 32.6 ms ± 2.16 ms per loop (mean ± std. dev. of 7 runs, 10 loops each)

Dayal-MacGregor: 15 ms ± 470 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

The output predictions agree to 6 decimal places.

54% faster

Moderate X (200 x 100) and Multivariate Y (200 x 2) with 50 Components

NIPALS: 226 ms ± 21.5 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Dayal-MacGregor: 15.8 ms ± 280 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

The output predictions are equal to 6 decimal places.

94% faster

Moderate X (200 x 100) and Multivariate Y (200 x 3) with 50 Components

NIPALS: 346 ms ± 29.2 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Dayal-MacGregor: 17 ms ± 1.48 ms per loop (mean ± std. dev. of 7 runs, 100 loops each)

The output predictions are equal to 5 decimal places.

95% faster

Large X (2000 x 1000) and Univariate Y (2000 x 1) with 100 Components

NIPALS: 859 ms ± 27.4 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Dayal-MacGregor: 271 ms ± 20.5 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

The output predictions are equal to 6 decimal places.

59% faster

Large X (2000 x 1000) and Multivariate Y (2000 x 2) with 100 Components

NIPALS: 5.71 s ± 390 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Dayal-MacGregor: 233 ms ± 10.8 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

The output predictions are equal to 6 decimal places.

96% faster

Large X (2000 x 1000) and Multivariate Y (2000 x 3) with 100 Components

NIPALS: 6.43 s ± 398 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

Dayal-MacGregor: 236 ms ± 13.5 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

The output predictions agree to 6 decimal places.

97% faster

Testing

Dev Environment & Architecture

My testing environment is as follows:

  • Hardware: MacBook Pro (16-inch, 2021) with Apple M1 Max chip and 64GB RAM
  • OS: macOS Monterey v12.4
  • Python: 3.10.5
  • Installer/Package Manager: Anaconda3 (for Apple Silicon/arm64)
  • Building Compiler: llvm-openmp

I don't think this will materially impact the results, but wanted to note it.

Testing Script

I put together a performance and accuracy testing script.

After creating the environment with conda create -n sklearn-dev -c conda-forge python numpy scipy cython joblib threadpoolctl pytest compilers llvm-openmp, activating it with conda activate sklearn-dev, and running pip install --verbose --no-build-isolation --editable . in the repository directory, the following testing script (run in IPython/Jupyter, since it uses the %%timeit cell magic) should validate the foregoing performance and accuracy assessment. It randomly generates an m x n matrix X and an m x n_targets matrix Y, fits PLS regression models with each of the NIPALS and Dayal-MacGregor algorithms, and then assesses performance and accuracy.

```python
import numpy as np
from numpy.testing import assert_array_almost_equal
from sklearn.cross_decomposition import PLSRegression

#%% Config
m = 1000
n = 200
n_targets = 2

X = np.random.rand(m, n)
y = np.random.rand(m, n_targets)
n_components = 50
scale = True

####
# Performance Testing
####

#%% NIPALS
%%timeit
nipals = PLSRegression(n_components=n_components, scale=scale, algorithm='nipals')
nipals.fit(X=X, Y=y)
nipals.predict(X)

#%% Dayal-MacGregor Kernel
%%timeit
dayal = PLSRegression(n_components=n_components, scale=scale, algorithm='dayalmacgregor')
dayal.fit(X=X, Y=y)
dayal.predict(X)

#%%
####
# Accuracy/Stability Testing
####

control = PLSRegression(n_components=n_components, scale=scale, algorithm='nipals')
test = PLSRegression(n_components=n_components, scale=scale, algorithm='dayalmacgregor')
control.fit(X, y)
test.fit(X, y)
assert_array_almost_equal(test.predict(X), control.predict(X), decimal=4)
```

Comments

  • I kept NIPALS as the default algorithm. If this PR is accepted, we might want to discuss whether NIPALS should remain the default, given that Dayal-MacGregor is strictly superior for PLSRegression. Maybe keep NIPALS as the default, mention the new algorithm prominently in the documentation, and issue a warning that the new algorithm will become the default in a release or two?

  • Given that Dayal-MacGregor doesn't deflate the Y target(s), there are no y_scores_, y_weights_ and y_rotations_ class attributes. This means that Dayal-MacGregor is only implemented for PLS Regression (not PLS Canonical), and a ValueError check is implemented to that effect. Moreover, the PLSRegression class documentation was edited to reflect the fact that these three attributes are only given for NIPALS/SVD.

  • Should this algorithm be available via both algorithm="dayalmacgregor" and algorithm="kernel", or is it better to choose one? I currently have both implemented. I can see arguments either way but I'm not strongly opinionated. The Dayal-MacGregor algorithm is a kernel algorithm, but it's not the only one that exists in the literature (e.g. SIMPLS is also a kernel algorithm, albeit numerically unstable).

@fractionalhare fractionalhare changed the title PERF Implement Dayal-MacGregor Kernel algorithm for significant performance improvements in Partial Least Squares (PLS) Regression ENH / PERF Implement Dayal-MacGregor Kernel algorithm for significant performance improvements in Partial Least Squares (PLS) Regression Jul 10, 2022
@fractionalhare (Author)

@ogrisel yep, done

@ogrisel (Member) left a comment

Thanks for the updates. However #23876 (comment) has still not been addressed. Other than that, LGTM.

doc/whats_new/v1.2.rst (outdated review thread, resolved)
@ogrisel (Member) commented Jul 20, 2022

Another remark: the sign of the extracted coefficients can sometimes flip when switching algorithms (see the referenced plots, not reproduced here).

Would it be possible to use sklearn.utils.extmath.svd_flip or _svd_flip_1d to avoid this artifact? If not easy to achieve, no big deal.

@fractionalhare (Author) commented Jul 20, 2022

@ogrisel Implemented the change covering your last outstanding review comment about attributes.

Would it be possible to use sklearn.utils.extmath.svd_flip or _svd_flip_1d to avoid this artifact? If not easy to achieve, no big deal.

It doesn't seem easy to achieve: we don't actually use both the U and Vt from the decomposition to form the x_weights and y_weights as we do with NIPALS, so we can't iteratively flip the 1-D ndarrays computed in each n_components round with _svd_flip_1d. The computation isn't similar to NIPALS in this sense.

So what ends up happening is that the Dayal-MacGregor coefficient matrix has the right signs to match the coefficient matrix obtained from NIPALS, but the X rotation matrix and the Y loading matrix may have column vectors with opposite signs compared to the corresponding NIPALS matrices.

This quirk is harmless for prediction: the coefficient matrix is x_rotations_ @ y_loadings_.T, so when the signs of individual column vectors are opposite due to the SVD, the coefficient matrices still match.
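To illustrate why the flipped column signs cancel out in the product: flipping the sign of any component's column in both the rotations and the loadings leaves the coefficient matrix unchanged. A toy check with made-up stand-in matrices (not actual fitted attributes):

```python
import numpy as np

rng = np.random.default_rng(42)
x_rotations = rng.normal(size=(10, 3))  # stand-in for x_rotations_ (n_features x n_components)
y_loadings = rng.normal(size=(2, 3))    # stand-in for y_loadings_  (n_targets  x n_components)

signs = np.array([1.0, -1.0, 1.0])      # flip the second component's sign

coef = x_rotations @ y_loadings.T
coef_flipped = (x_rotations * signs) @ (y_loadings * signs).T

# (R D)(Q D)' = R D D Q' = R Q', since D is diagonal with entries +/- 1
assert np.allclose(coef, coef_flipped)
```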

@fractionalhare (Author)

@ogrisel Who else do we need for a review?

@fractionalhare fractionalhare requested a review from ogrisel July 30, 2022 16:11
@fractionalhare (Author)

@ogrisel what is this waiting on?

@ogrisel (Member) commented Aug 12, 2022

@ogrisel what is this waiting on?

Unfortunately the documentation building CI has timed out. Can you please try to push another commit (e.g. yet another merge with the current main) to see if this problem has been resolved? I am not sure about the cause.

It would be great to be able to check out the rendered HTML for the examples with your latest changes.

@ogrisel (Member) left a comment

This PR overall looks good to me but I would like to get the CI working prior to considering a merge.


```python
for k in range(n_components):
    if n_targets == 1:
        w = S
```

Could you please update the tests to also cover the case where n_targets == 1?

```python
)
except StopIteration as e:
    if str(e) != "Y residual is constant":
        raise
```

I think we can drop this if / raise branch: StopIteration can never be raised when the Y residual is not constant. This will also make the coverage report happier.

@ogrisel (Member) commented Aug 12, 2022

This quirk is harmless for prediction: the coefficient matrix is x_rotations_ @ y_loadings_.T, so when the signs of individual column vectors are opposite due to the SVD, the coefficient matrices still match.

I agree, but the fact that the sign of the coefficients changes when switching solvers is practically annoying, e.g. when writing documentation with an explanation of a plot of the latent space: the top right part of the figure could silently become the bottom left if the code is switched from nipals to the new solver.

Since we don't plan to change the default PLS solver right away, this is not necessarily a big problem, as this subtle behavior change will not happen just by updating the version of scikit-learn. Still, if there were a way to enforce this stability, it would be a nice convenience for our future users. See the Principle of Least Astonishment.
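One way such stability could be enforced is sketched below. This is a hypothetical helper, not part of the PR and not sklearn's svd_flip itself, but it mimics the same idea: pick a deterministic sign per component (here, make the largest-magnitude entry of each rotation column positive) and flip the matching loading column, so predictions are unchanged.

```python
import numpy as np

def flip_component_signs(x_rotations, y_loadings):
    """Hypothetical sign convention, in the spirit of
    sklearn.utils.extmath.svd_flip."""
    k = x_rotations.shape[1]
    idx = np.abs(x_rotations).argmax(axis=0)        # row of largest |entry| per column
    signs = np.sign(x_rotations[idx, np.arange(k)])
    signs[signs == 0] = 1.0                         # guard against an exact zero
    # Flip both factors so x_rotations @ y_loadings.T is unchanged.
    return x_rotations * signs, y_loadings * signs

rng = np.random.default_rng(0)
R, Q = rng.normal(size=(10, 3)), rng.normal(size=(2, 3))
R2, Q2 = flip_component_signs(R, Q)
assert np.allclose(R @ Q.T, R2 @ Q2.T)              # predictions unchanged
assert (R2[np.abs(R2).argmax(axis=0), np.arange(3)] > 0).all()
```

Applied to both solvers after fitting, a convention like this would make the signs of the reported rotations and loadings independent of which algorithm produced them.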

@lorentzenchr (Member)

I'm not an expert on PLS, but it seems to me that the proposed Dayal-MacGregor modified kernel algorithm is superior to the current one. Therefore, in the long run, I would not introduce a new solver parameter.
Due to the sign change, the open question is how to run a proper deprecation cycle. Could we introduce a new parameter solver that we deprecate right from the beginning, saying that the default will change and that this parameter will be removed in two versions?

@ogrisel (Member) commented Aug 16, 2022

The new solver does not compute all the attributes that the nipals algorithm can compute. So I would still keep the solver parameter, to be able to select the nipals algorithm should we decide to make Dayal-MacGregor the default in a future version of scikit-learn.

@lorentzenchr (Member)

Conclusion: let's introduce the solver parameter.

A possible (to me quite likely) future path, outside of this PR: change the default to the new solver, and finally remove the solver parameter and get rid of the old solver.

@fractionalhare fractionalhare closed this by deleting the head repository Nov 27, 2022
@lorentzenchr (Member)

@fractionalhare Why did you close?

@lorentzenchr lorentzenchr reopened this Dec 10, 2022
@fractionalhare (Author)

@ogrisel what is required to get this merged now?

@ayaanhossain

@fractionalhare @ogrisel @lorentzenchr @glemaitre hey guys any update on this?

@lorentzenchr (Member)

any update on this?

This PR needs 2 reviews, our scarcest resource. It usually helps a lot if the CI is all green.

@fractionalhare (Author)

Last status was an initial review from @ogrisel - enough time has passed that the review should probably be at least refreshed, and then one other person needs to do a review.

I can update the branch and see what coverage checks are still failing; we had the CI all green before so it should just be a matter of update and then tweaking tests.

@glemaitre (Member)

From what I see, we first need the comments of @ogrisel to be addressed. I can provide an additional review once the CI is working.

@ayaanhossain

any update on this?
