Fix reduction contig impl offset handling #1458

Merged
oleksandr-pavlyk merged 6 commits into master (IntelPython/dpctl:master) from fix-reduction-contig_impl-offset-handling (IntelPython/dpctl:fix-reduction-contig_impl-offset-handling) on Oct 27, 2023


@oleksandr-pavlyk
Contributor

This PR modifies the computation of the typed pointer so that it takes offsets into account in the reduction implementations for contiguous inputs.

It also adds tests that would fail without this change.
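
To illustrate the class of bug described above, here is a minimal plain-Python sketch. It is not dpctl's kernel code: the function names `sum_contig` and `sum_contig_ignoring_offset`, and the use of a standard-library `array` as a stand-in for a device allocation, are assumptions made for illustration only. The point is that a contiguous view may start at a non-zero element offset inside its allocation, so the reduction has to begin reading at base plus offset rather than at the base.

```python
from array import array


def sum_contig(buf, offset, n):
    """Sum n contiguous elements of buf, starting at element index offset."""
    total = 0
    for i in range(n):
        total += buf[offset + i]  # base position advanced by the view's offset
    return total


def sum_contig_ignoring_offset(buf, offset, n):
    """Buggy variant: the offset argument is accepted but never applied."""
    total = 0
    for i in range(n):
        total += buf[i]  # always reads from the start of the allocation
    return total


# A flat allocation of 10 integers; the "view" covers elements 4, 5 and 6.
allocation = array("q", range(10))
view_offset, view_len = 4, 3

correct = sum_contig(allocation, view_offset, view_len)
buggy = sum_contig_ignoring_offset(allocation, view_offset, view_len)
print(correct, buggy)
```

A test that reduces a view with a zero offset cannot distinguish the two variants, which is presumably why tests using offset inputs are needed to catch this kind of error.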

  • Have you provided a meaningful PR description?
  • Have you added a test, reproducer or referred to an issue with a reproducer?
  • Have you tested your changes locally for CPU and GPU devices?
  • Have you made sure that new changes do not introduce compiler warnings?
  • Have you checked performance impact of proposed changes?
  • If this PR is a work in progress, are you opening the PR as a draft?

@coveralls
Collaborator

coveralls commented Oct 26, 2023

Coverage status: 85.748% (remained the same) when pulling bfba152 on fix-reduction-contig_impl-offset-handling into 03fd737 on master.

@github-actions

Array API standard conformance tests for dpctl=0.15.1dev0=py310ha25a700_68 ran successfully.
Passed: 934
Failed: 66
Skipped: 119

@github-actions

Array API standard conformance tests for dpctl=0.15.1dev0=py310ha25a700_69 ran successfully.
Passed: 935
Failed: 65
Skipped: 119

The optimization should not use the maximum work-group size, so that some of the shared local memory (SLM) remains available to the runtime (RT).

@github-actions

Array API standard conformance tests for dpctl=0.15.1dev0=py310ha25a700_70 ran successfully.
Passed: 935
Failed: 65
Skipped: 119

@oleksandr-pavlyk merged commit d82f3a9 into master on Oct 27, 2023
@oleksandr-pavlyk deleted the fix-reduction-contig_impl-offset-handling branch on October 27, 2023, 00:56