[mlir][AMDGPU] Implement gpu.subgroup_reduce with DPP intrinsics on AMD GPUs #133204
Merged
28 commits
029b2cc Creates AMDToGPUPass to house a subgroup reduce lowering pattern to DPP (Muzammiluddin-Syed-ECE)
427c817 Fix for numerical issues in MatVec tests (Muzammiluddin-Syed-ECE)
655251b Rewrites pattern to be closer to device lib impl. (Muzammiluddin-Syed-ECE)
081d6f7 Removes AMDToGPUPass, moving pattern into existing pass (Muzammiluddin-Syed-ECE)
0d560c2 Adding permlanex16 and other dpp related ops to mlir dialect (Muzammiluddin-Syed-ECE)
015e9b9 Fixing permlanex16 intrinsic failure (Muzammiluddin-Syed-ECE)
945f0e8 simplify verbose typing (Muzammiluddin-Syed-ECE)
1b356ed testing numerics (Muzammiluddin-Syed-ECE)
7fd30c0 fixing (Muzammiluddin-Syed-ECE)
0c28b4d fixing (Muzammiluddin-Syed-ECE)
bfda712 fixing (Muzammiluddin-Syed-ECE)
54c08ef trying again (Muzammiluddin-Syed-ECE)
6535bda Fixing implementation (Muzammiluddin-Syed-ECE)
85e3b62 Adding DPP test (Muzammiluddin-Syed-ECE)
3392f08 Addressing PR comments (Muzammiluddin-Syed-ECE)
b59922a removing unnecessary header (Muzammiluddin-Syed-ECE)
6431293 Addressing PR comments (Muzammiluddin-Syed-ECE)
ae25fa0 moving permlanex16 changes to another commit (Muzammiluddin-Syed-ECE)
9745098 fixing test (Muzammiluddin-Syed-ECE)
a6c35b3 fixing code formatting (Muzammiluddin-Syed-ECE)
8a9cefb Updating implementation to support gfx 10+ (Muzammiluddin-Syed-ECE)
c395203 Small formatting change (Muzammiluddin-Syed-ECE)
ab15c44 Removing ReadlaneOps from test (Muzammiluddin-Syed-ECE)
55f442e Improve dpp implementation (Muzammiluddin-Syed-ECE)
6442288 fixing formatting (Muzammiluddin-Syed-ECE)
848c6ba Fixing implementation of DPP subgroup reduce (Muzammiluddin-Syed-ECE)
6da1653 Addressing PR comments (Muzammiluddin-Syed-ECE)
e19a615 Fixing Typo in RUN command (Muzammiluddin-Syed-ECE)
fixing code formatting
Signed-off-by: Muzammiluddin Syed <muzasyed@amd.com>
commit a6c35b3a88cc22eb5f01447cdd69f5b1c017fd4a