Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Optimization request: improve af::convolve for small kernel sizes #1874

Copy link
Copy link
Open
@pthon

Description

@pthon
Issue body actions

As discussed in arrayfire-python #145, af::convolve is currently falling back to fft+ifft for huger array sizes even if the kernel is very small (e.g. 3x3x3). This results in a performance loss for small sized convolution kernels.

As a benchmark, I timed a convolution with a 3x3x3 kernel against a single fft and plotted the quotient over the number of elements per dimension nn:
cpp_conv_vs_fft
For the code producing this plot please refer to the thread linked above.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions

    Morty Proxy This is a proxified and sanitized view of the page, visit original site.