Vulkan based on #9650 #11835
Conversation
# Conflicts:
#	gpu/gpu.go

# Conflicts:
#	gpu/gpu_linux.go
Making AMD GPUs work on ARM architecture with Vulkan
Fix variable name
I included this commit as a patch because it could cause issues with Flash Attention, which is now enabled by default for certain models: ggml-org/llama.cpp#16365
Update: Looks like I had a stale build on the Windows test systems; I refreshed to your latest commit and the PCI IDs are showing up correctly.
EDIT: Sorry, I forgot to rebuild after pulling. Everything is working fine on my side.
@inforithmics the GGML update is now merged.
@inforithmics I've tested your latest commit and things are looking good. I believe once you rebase on main with the latest GGML update, you should be able to drop the extra vulkan patch you had to carry.

What we'd like to do is merge this soon and bring in Vulkan support in 2 phases. First would be local build support. After we merge your PR, I'll follow up with some minor CI changes so we can disable Vulkan in the official binary release temporarily, and make sure it still builds by default for anyone who checks out and builds locally. Then we can continue to test it and work through any remaining major issues as follow-up commits on main in smaller PRs. Once things are looking solid, we'll undo those CI changes so it gets built in the official releases for Linux and Windows. Thanks for sticking in there!

As far as follow-ups I'm tracking after this merges: my test systems include a selection of AMD iGPUs and Intel integrated and discrete GPUs which use Vulkan, and there are some library models that hit GGML asserts in the Vulkan backend. Additionally, the scheduler will need some adjusting to be iGPU-aware instead of naively favoring the GPU with the most VRAM available.
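To make that last point concrete, here is a minimal Go sketch of what an iGPU-aware selection policy could look like. This is purely illustrative: the `GpuInfo` type, its fields, and `pickDevice` are hypothetical names and not ollama's actual scheduler code.

```go
// Hypothetical sketch of an iGPU-aware device picker. The GpuInfo type and
// its fields are illustrative, not ollama's real scheduler API.
package sched

type GpuInfo struct {
	ID         string
	Integrated bool   // true for iGPUs that share system RAM
	FreeVRAM   uint64 // bytes reported as available
}

// pickDevice prefers a discrete GPU that can hold the whole model, then an
// integrated GPU that can, instead of naively taking the largest FreeVRAM.
func pickDevice(gpus []GpuInfo, modelBytes uint64) *GpuInfo {
	var bestDiscrete, bestIntegrated *GpuInfo
	for i := range gpus {
		g := &gpus[i]
		if g.FreeVRAM < modelBytes {
			continue // does not fit; partial offload not considered here
		}
		if g.Integrated {
			if bestIntegrated == nil || g.FreeVRAM > bestIntegrated.FreeVRAM {
				bestIntegrated = g
			}
		} else {
			if bestDiscrete == nil || g.FreeVRAM > bestDiscrete.FreeVRAM {
				bestDiscrete = g
			}
		}
	}
	if bestDiscrete != nil {
		return bestDiscrete
	}
	return bestIntegrated // may be nil if nothing fits
}
```

The idea is simply to treat "fits on a discrete GPU" as the first preference rather than raw free-VRAM size, so a large iGPU memory carve-out does not win over a faster discrete card.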
A pull request based on #9650, updated with the newest patches from main.
Known issues:
- Without this, OLLAMA_KV_CACHE_TYPE=f16 has to be set, or else the llama runner crashes.
- On llama.cpp, Vulkan and ROCm are sometimes in the same performance range; for NVIDIA I cannot say.
- Filter out Vulkan devices whose ID already exists as a ROCm or CUDA device (see the sketch below).
- Vulkan iGPU device selection overhaul and PCI ID API support: ggml-org/llama.cpp#15947
The Vulkan builds ran successfully here: Vulkan Builds on CI inforithmics/ollama#7
For example, GGML_VK_VISIBLE_DEVICES
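To illustrate the device-filtering item above, here is a minimal sketch of the idea: drop any Vulkan device whose PCI ID is already claimed by a ROCm or CUDA device, so the same physical GPU is not offered twice; the remaining devices could then also be narrowed with a mechanism like GGML_VK_VISIBLE_DEVICES. The `device` type and `filterDuplicateVulkan` function below are hypothetical illustrations, not the actual gpu/gpu.go code.

```go
// Hypothetical sketch of filtering out Vulkan devices that duplicate a GPU
// already discovered via ROCm or CUDA; the types here are illustrative.
package discover

type device struct {
	Backend string // "vulkan", "rocm", or "cuda"
	PCIID   string // e.g. "0000:03:00.0"
}

// filterDuplicateVulkan keeps every ROCm/CUDA device and only those Vulkan
// devices whose PCI ID was not already seen on another backend.
func filterDuplicateVulkan(devs []device) []device {
	seen := make(map[string]bool)
	for _, d := range devs {
		if d.Backend != "vulkan" && d.PCIID != "" {
			seen[d.PCIID] = true
		}
	}
	out := make([]device, 0, len(devs))
	for _, d := range devs {
		if d.Backend == "vulkan" && d.PCIID != "" && seen[d.PCIID] {
			continue // same physical GPU already handled by ROCm/CUDA
		}
		out = append(out, d)
	}
	return out
}
```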
Version 12.5
OllamaSetup.zip
Build with build_windows.ps1:
Some interesting links:
Vulkan vs ROCm on Linux:
https://www.phoronix.com/review/llama-cpp-windows-linux/5
https://www.phoronix.com/review/amd-rocm-7-strix-halo/3