Parallelize init_bound_dense in Elkan algorithm #19052


Merged
5 commits merged into scikit-learn:master on Dec 22, 2020

Conversation

YusukeNagasaka
Contributor

What does this implement/fix?

Parallelize the init_bound_dense function, which is the bounds-initialization step of the Elkan algorithm.
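For illustration, here is a minimal Cython sketch of this kind of change: the per-sample loop of a bounds-initialization routine is turned into a nogil prange loop. The function and variable names are hypothetical and the body is simplified; it shows the general pattern, not the actual scikit-learn implementation.

```cython
# Hypothetical, simplified sketch (not the scikit-learn code): each sample i
# only writes row i of the outputs, so the iterations are independent and
# can safely run in parallel without the GIL.
cimport cython
from cython.parallel import prange
from libc.math cimport sqrt

@cython.boundscheck(False)
@cython.wraparound(False)
def init_bounds_dense_sketch(double[:, ::1] X,             # (n_samples, n_features)
                             double[:, ::1] centers,       # (n_clusters, n_features)
                             double[::1] upper_bounds,     # out: distance to closest center
                             double[:, ::1] lower_bounds,  # out: distance to every center
                             int[::1] labels,              # out: index of closest center
                             int n_threads):
    cdef:
        Py_ssize_t n_samples = X.shape[0]
        Py_ssize_t n_clusters = centers.shape[0]
        Py_ssize_t n_features = X.shape[1]
        Py_ssize_t i, j, k
        double dist, best_dist, diff
        int best_j

    # The outer loop over samples is distributed across threads; dist, diff,
    # best_dist and best_j are private to each thread.
    for i in prange(n_samples, nogil=True, num_threads=n_threads,
                    schedule='static'):
        best_dist = 0
        best_j = 0
        for j in range(n_clusters):
            dist = 0
            for k in range(n_features):
                diff = X[i, k] - centers[j, k]
                dist = dist + diff * diff
            dist = sqrt(dist)
            lower_bounds[i, j] = dist
            if j == 0 or dist < best_dist:
                best_dist = dist
                best_j = j
        upper_bounds[i] = best_dist
        labels[i] = best_j
```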

Any other comments?

This fix does not affect the quality of the clustering itself. The parallelization only reduces the execution time of KMeans.fit on multi-core CPUs, and is even more beneficial on many-core CPUs.
For instance, I tested on a machine with 2 sockets of Intel Xeon CPUs (40 cores in total). The table below shows the time in seconds spent in KMeans.fit overall and in init_bound_dense. The data is generated uniformly at random with n_samples=1M and n_features=100. The KMeans parameters are n_clusters=1000, init='random', algorithm='elkan', max_iter=1000.

|                      | master | PR  |
| -------------------- | ------ | --- |
| total time (s)       | 751    | 692 |
| init_bound_dense (s) | 66     | 2   |
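For reference, here is a rough sketch of how such a benchmark could be set up with the parameters quoted above. The fixed seed and n_init=1 are my additions, not stated in the comment, and timing init_bound_dense itself would require profiling the compiled Cython code, so only the end-to-end fit time is measured here.

```python
# Rough benchmark sketch using the parameters from the comment above.
# Note: this is a heavy run (1M x 100 float64 is ~800 MB, 1000 clusters,
# up to 1000 iterations); scale n_samples down for a quick sanity check.
import time

import numpy as np
from sklearn.cluster import KMeans

rng = np.random.default_rng(0)    # fixed seed: an assumption, not from the comment
X = rng.random((1_000_000, 100))  # uniform random data, n_samples=1M, n_features=100

km = KMeans(
    n_clusters=1000,
    init="random",
    algorithm="elkan",
    max_iter=1000,
    n_init=1,  # assumption: a single initialization to keep the run bounded
)

tic = time.perf_counter()
km.fit(X)
print(f"KMeans.fit: {time.perf_counter() - tic:.1f} s")
```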

@jeremiedbb
Member

Thanks @YusukeNagasaka. Looks good. It would make sense to also make the same change in init_bound_sparse, if you're willing to.

Member

@ogrisel ogrisel left a comment


LGTM. Thanks for the improvement.

@kobaski
Contributor

kobaski commented Dec 21, 2020

This gives a huge performance improvement on the A64FX CPU, even though the improvement on Intel CPUs is not as large. @YusukeNagasaka will change init_bound_sparse too.

@YusukeNagasaka
Contributor Author

The same parallelization is applied to init_bounds_sparse. Of course, this change does not affect the quality of clustering itself :)

The schedule I adopted in prange is static. Dynamic scheduling might perform better on skewed sparse datasets, but it brings some overhead from distributing work to each thread. I think static scheduling works well enough on sparse datasets, too.
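To make the trade-off concrete, here is a small hypothetical Cython sketch contrasting the two prange schedules (illustration only, not code from this PR).

```cython
cimport cython
from cython.parallel import prange

@cython.boundscheck(False)
@cython.wraparound(False)
def schedule_sketch(double[::1] out, int n_threads):
    cdef Py_ssize_t i

    # schedule='static': iterations are split into fixed, even chunks up
    # front, so there is essentially no scheduling overhead.
    for i in prange(out.shape[0], nogil=True, num_threads=n_threads,
                    schedule='static'):
        out[i] = 2 * out[i]

    # schedule='dynamic': threads grab chunks as they finish, which balances
    # skewed per-iteration work (e.g. rows of a sparse matrix with very
    # different numbers of non-zeros) at the cost of some synchronization.
    for i in prange(out.shape[0], nogil=True, num_threads=n_threads,
                    schedule='dynamic', chunksize=1024):
        out[i] = 2 * out[i]
```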

Member

@jeremiedbb jeremiedbb left a comment


lgtm

@ogrisel ogrisel merged commit 3cdfb56 into scikit-learn:master Dec 22, 2020
@ogrisel
Member

ogrisel commented Dec 22, 2020

Merged! Thank you very much @YusukeNagasaka!

@glemaitre glemaitre mentioned this pull request Apr 22, 2021