Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Tags: ishandutta2007/llama.cpp

Tags

b6788

Toggle b6788's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
ggml : fix SpaceMit IME array out-of-bounds in task assignment (ggml-…

…org#16629)

Fix incorrect task-to-batch index calculation in the quantization phase.

The bug caused out-of-bounds access to qnbitgemm_args array when
compute_idx exceeded per_gemm_block_count_m, leading to invalid
pointer dereferences and SIGBUS errors.

Correctly map tasks to batches by dividing compute_idx by
per_gemm_block_count_m instead of block_size_m.

Example:
  batch_feature=1, gemm_m=30, block_size_m=4
  per_gemm_block_count_m = 8, task_count = 8

  Old: gemm_idx = 4/4 = 1 (out of bounds  New: gemm_idx = 4/8 = 0 (correct)

Tested on SpaceMit K1 RISC-V64 with qwen2.5:0.5b model.

Co-authored-by: muggle <mingjun.rong@spacemit.com>

b6783

Toggle b6783's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
SYCL SET operator optimized for F32 tensors (ggml-org#16350)

* SYCL/SET: implement operator + wire-up; docs/ops updates; element_wise & ggml-sycl changes

* sycl(SET): re-apply post-rebase; revert manual docs/ops.md; style cleanups

* move SET op to standalone file, GPU-only implementation

* Update SYCL SET operator for F32

* ci: fix editorconfig issues (LF endings, trailing spaces, final newline)

* fixed ggml-sycl.cpp

---------

Co-authored-by: Gitty Burstein <gitty@example.com>

b6782

Toggle b6782's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
mtmd : support home-cooked Mistral Small Omni (ggml-org#14928)

b6781

Toggle b6781's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
fix: added a normalization step for MathJax-style \[\] and \(\) delim…

…iters (ggml-org#16599)

* fix: added a normalization step for MathJax-style \[\] and \(\) delimiters

So inline and block equations are converted before KaTeX rendering,
enabling proper display of model-generated LaTeX in the WebUI

* chore: update webui build output

b6779

Toggle b6779's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
CANN: format code using .clang-format (ggml-org#15863)

This commit applies .clang-format rules to all source files under the
ggml-cann directory to ensure consistent coding style and readability.
The .clang-format option `SortIncludes: false` has been set to disable
automatic reordering of include directives.
No functional changes are introduced.

Co-authored-by: hipudding <huafengchun@gmail.com>

b6776

Toggle b6776's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
SYCL: Add GGML_OP_MEAN operator support (ggml-org#16009)

* SYCL: Add GGML_OP_MEAN operator support

* SYCL: Fix formatting for GGML_OP_MEAN case

* Update ggml/src/ggml-sycl/ggml-sycl.cpp

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

---------

Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

b6775

Toggle b6775's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
gguf-py : add support for endian conversion of BF16 data (ggml-org#16594

)

BF16 requires special handling in this script
while it's a 2-bytes data, but view is 1-byte by default.
Switch to correct view before attempting byteswapping.

With this change correctly byteswapping models like
Meta-Llama-3-8B-Instruct-bf16-GGUF
should be possible.

b6771

Toggle b6771's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Add server-driven parameter defaults and syncing (ggml-org#16515)

b6766

Toggle b6766's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
server : fix mtmd checkpoints (ggml-org#16591)

b6765

Toggle b6765's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
metal : avoid using Metal's gpuAddress property (ggml-org#16576)

* metal : avoid using Metal's gpuAddress property

* metal : fix rope kernels buffer check
Morty Proxy This is a proxified and sanitized view of the page, visit original site.