cuda : synchronize graph capture and cublas handle destruction #14288


Merged · 1 commit merged into master on Jun 20, 2025

Conversation

@slaren (Member) commented Jun 19, 2025

Works around an issue that may cause CUDA graph capture to fail when a cuBLAS handle is destroyed in a different thread.

Should fix #13990

@github-actions bot added labels Nvidia GPU (Issues specific to Nvidia GPUs) and ggml (changes relating to the ggml tensor library for machine learning) on Jun 19, 2025
@slaren force-pushed the sl/cuda-cublas-graph-sync branch from 77d208b to 87a4f95 on June 19, 2025 at 21:28
Workarounds an issue that may cause CUDA graph capture to fail when a cuBLAS handle is destroyed in a different thread

ggml-ci
@slaren force-pushed the sl/cuda-cublas-graph-sync branch from 87a4f95 to 319f734 on June 19, 2025 at 21:35
@slaren merged commit e28c1b9 into master on Jun 20, 2025
55 checks passed
@slaren deleted the sl/cuda-cublas-graph-sync branch on June 20, 2025 at 11:57
Nexesenex pushed a commit to Nexesenex/croco.cpp that referenced this pull request Jun 20, 2025
…org#14288)

Workarounds an issue that may cause CUDA graph capture to fail when a cuBLAS handle is destroyed in a different thread
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Jun 20, 2025
* mamba2-sync: (24 commits)
sync : ggml
Add `ggml_roll` (ggml/1274)
docs : fix the link to llama.h (ggml-org#14293)
CUDA: add conv_2d_transpose (ggml-org#14287)
lint : remove trailing whitepace (ggml-org#14304)
vocab : prevent tokenizer overflow (ggml-org#14301)
sycl: add usage of enqueue_functions extension (ggml-org#14244)
Implement GGML_CPU_ALL_VARIANTS for PowerPC (ggml-org#14286)
llama : improve sep token handling (ggml-org#14272)
cuda : synchronize graph capture and cublas handle destruction (ggml-org#14288)
ggml : fix repack work size for mul_mat_id (ggml-org#14292)
ggml: Update KleidiAI to v1.9.0 (ggml-org#14277)
model : more uniform output id handling (ggml-org#14275)
ubatch : new splitting logic (ggml-org#14217)
CUDA: add conv_2d_dw (ggml-org#14265)
ggml-cpu : remove unnecesary arm feature detection (ggml-org#14281)
gguf-py : make sentencepiece optional (ggml-org#14200)
server : add server parameters for draft model cache type (ggml-org#13782)
build : suppress gcc15 compile warnings (ggml-org#14261)
sycl: Cleanup codepaths in Get Rows in sycl backend (ggml-org#14215)
...
Labels
ggml (changes relating to the ggml tensor library for machine learning), Nvidia GPU (Issues specific to Nvidia GPUs)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Eval bug: Abort is called in a thread from a custom thread pool during a llama_decode call
2 participants