Tags · ishandutta2007/llama.cpp

b6788

ggml : fix SpaceMit IME array out-of-bounds in task assignment (ggml-…

…org#16629)

Fix incorrect task-to-batch index calculation in the quantization phase.

The bug caused out-of-bounds access to qnbitgemm_args array when
compute_idx exceeded per_gemm_block_count_m, leading to invalid
pointer dereferences and SIGBUS errors.

Correctly map tasks to batches by dividing compute_idx by
per_gemm_block_count_m instead of block_size_m.

Example:
  batch_feature=1, gemm_m=30, block_size_m=4
  per_gemm_block_count_m = 8, task_count = 8

  Old: gemm_idx = 4/4 = 1 (out of bounds  New: gemm_idx = 4/8 = 0 (correct)

Tested on SpaceMit K1 RISC-V64 with qwen2.5:0.5b model.

Co-authored-by: muggle <mingjun.rong@spacemit.com>

Oct 17, 2025
342c728
zip
tar.gz
Downloads

b6783

SYCL SET operator optimized for F32 tensors (ggml-org#16350)

* SYCL/SET: implement operator + wire-up; docs/ops updates; element_wise & ggml-sycl changes

* sycl(SET): re-apply post-rebase; revert manual docs/ops.md; style cleanups

* move SET op to standalone file, GPU-only implementation

* Update SYCL SET operator for F32

* ci: fix editorconfig issues (LF endings, trailing spaces, final newline)

* fixed ggml-sycl.cpp

---------

Co-authored-by: Gitty Burstein <gitty@example.com>

Oct 17, 2025
ceff6bb
zip
tar.gz
Downloads

b6782

mtmd : support home-cooked Mistral Small Omni (ggml-org#14928)

Oct 16, 2025
1bb4f43
zip
tar.gz
Downloads

b6781

fix: added a normalization step for MathJax-style \[\] and \(\) delim…

…iters (ggml-org#16599)

* fix: added a normalization step for MathJax-style \[\] and \(\) delimiters

So inline and block equations are converted before KaTeX rendering,
enabling proper display of model-generated LaTeX in the WebUI

* chore: update webui build output

Oct 16, 2025
683fa6b
zip
tar.gz
Downloads

b6779

CANN: format code using .clang-format (ggml-org#15863)

This commit applies .clang-format rules to all source files under the
ggml-cann directory to ensure consistent coding style and readability.
The .clang-format option `SortIncludes: false` has been set to disable
automatic reordering of include directives.
No functional changes are introduced.

Co-authored-by: hipudding <huafengchun@gmail.com>

Oct 16, 2025
7a50cf3
zip
tar.gz
Downloads

b6776

SYCL: Add GGML_OP_MEAN operator support (ggml-org#16009)

* SYCL: Add GGML_OP_MEAN operator support

* SYCL: Fix formatting for GGML_OP_MEAN case

* Update ggml/src/ggml-sycl/ggml-sycl.cpp

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

Oct 16, 2025
ee50ee1
zip
tar.gz
Downloads

b6775

gguf-py : add support for endian conversion of BF16 data (ggml-org#16594

)

BF16 requires special handling in this script
while it's a 2-bytes data, but view is 1-byte by default.
Switch to correct view before attempting byteswapping.

With this change correctly byteswapping models like
Meta-Llama-3-8B-Instruct-bf16-GGUF
should be possible.

Oct 15, 2025
7adc79c
zip
tar.gz
Downloads

b6771

Add server-driven parameter defaults and syncing (ggml-org#16515)

Oct 15, 2025
f9fb33f
zip
tar.gz
Downloads

b6766

server : fix mtmd checkpoints (ggml-org#16591)

Oct 15, 2025
554fd57
zip
tar.gz
Downloads

b6765

metal : avoid using Metal's gpuAddress property (ggml-org#16576)

* metal : avoid using Metal's gpuAddress property

* metal : fix rope kernels buffer check

Oct 14, 2025
fa882fd
zip
tar.gz
Downloads

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

b6788

b6783

b6782

b6781

b6779

b6776

b6775

b6771

b6766

b6765

Search code, repositories, users, issues, pull requests...

Tags: ishandutta2007/llama.cpp