-
-
Notifications
You must be signed in to change notification settings - Fork 10.9k
MAINT: Optimize numpy.count_nonzero for int types using SIMD operations #18183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 1 commit
Commits
Show all changes
14 commits
Select commit
Hold shift + click to select a range
d2e7768
Added support for SIMD operations for int types in numpy.count_nonzer…
touqir14 c716a12
Added tests for i1,i2,i4,i8 types for numpy.count_nonzero function
touqir14 15cf37d
Merged count_nonzero_int16/int32/int64 into count_nonzero_int and add…
touqir14 2b41cbf
Removed commented out code from PyArray_CountNonzero
touqir14 ed3d080
Merge remote-tracking branch 'upstream/master'
touqir14 87c5d51
Replaced manual sums with horizontal simd sums for count_nonzero_16/64
touqir14 65892ef
fixed CI errors and optimized further simd_16 and simd_32
touqir14 022cc66
some fixes for the build problems
touqir14 6895bab
another attempt to fix build issues
touqir14 89d6e55
removed the target variable and changed the loop as suggested by Saye…
touqir14 534132e
Modified PyArray_CountNonzero to discriminate between types based on …
touqir14 1eb91a3
Ensured overflow does not happen for 16 and 32 bit ints
touqir14 d208702
cleanup
seiko2plus 85e2ce9
fix up
seiko2plus File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Merged count_nonzero_int16/int32/int64 into count_nonzero_int and add…
…ed benchmarks
- Loading branch information
commit 15cf37d5394e69fc1847b1efa8d5253de4890cbe
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.