Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

Issues: ggml-org/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15
tutorials : list for llama.cpp
#13523 opened May 14, 2025 by ggerganov
Open 3
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Issues list

server : support audio input examples python python script changes server
#13714 opened May 22, 2025 by ngxson Loading…
model : jina-embeddings-v3 support python python script changes
#13693 opened May 21, 2025 by CISC Draft
3 of 6 tasks
2
4
server : separate the notion of position and KV tokens, remove prompt truncation breaking change Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility. examples python python script changes server
#13576 opened May 15, 2025 by ngxson Loading…
Update python verions examples python python script changes server
#13574 opened May 15, 2025 by robbiemu Loading…
Granite Four Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning python python script changes testing Everything test related
#13550 opened May 14, 2025 by gabe-l-hart Draft
2 tasks
convert: Swap GLM4 EOS / EOT token python python script changes
#13505 opened May 13, 2025 by henk717 Loading…
gguf-py: Optimize GGUFReader read-only mode performance python python script changes
#13378 opened May 8, 2025 by Isotr0py Loading…
llama: Fix typos in multiple files ggml changes relating to the ggml tensor library for machine learning Kompute https://github.com/KomputeProject/kompute/ python python script changes SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13369 opened May 8, 2025 by co63oc Loading…
python : bump transformers version python python script changes
#13351 opened May 7, 2025 by ngxson Loading…
Introduce New Lookup-Table(LUT)-Based Matrix Multiplication Method (TMAC) ggml changes relating to the ggml tensor library for machine learning python python script changes
#13206 opened Apr 30, 2025 by QingtaoLi1 Loading…
2 of 4 tasks
Fix ChatGLMModel for glm-4-9b cannot find tokenizer merges in model file python python script changes
#13058 opened Apr 22, 2025 by glide-the Loading…
Bitnet: directly use scale instead of inverting it twice python python script changes
#13026 opened Apr 19, 2025 by viraatdas Loading…
Fix convert script for non-hf GLM4 checkpoints python python script changes
#12992 opened Apr 17, 2025 by Tianyue-Zhao Loading…
2 of 4 tasks
gguf-py: byteswapping improvements python python script changes
#12851 opened Apr 9, 2025 by AlekseiNikiforovIBM Loading…
convert : write tensors in parallel performance Speed related topics python python script changes
#12837 opened Apr 8, 2025 by compilade Loading…
3 of 6 tasks
Support for OuteTTS 1.0 examples python python script changes
#12794 opened Apr 7, 2025 by edwko Draft
WIP: Add support for CogAgent examples python python script changes server
#12679 opened Mar 31, 2025 by Tianyue-Zhao Draft
tts : implement sesame CSM + Mimi decoder examples python python script changes
#12648 opened Mar 29, 2025 by ngxson Loading…
(draft) tts: Orpheus support examples ggml changes relating to the ggml tensor library for machine learning python python script changes
#12487 opened Mar 21, 2025 by jamorphy Draft
server: streaming of tool calls and thoughts when --jinja is on documentation Improvements or additions to documentation examples python python script changes script Script related server testing Everything test related tool calling
#12379 opened Mar 14, 2025 by ochafik Draft
5 of 10 tasks
tool-call: Phi-4 support android Issues specific to Android Apple Metal https://en.wikipedia.org/wiki/Metal_(API) devops improvements to build systems and github actions documentation Improvements or additions to documentation examples ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes server SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#12288 opened Mar 9, 2025 by jpohhhh Loading…
ProTip! Updated in the last three days: updated:>2025-05-19.
Morty Proxy This is a proxified and sanitized view of the page, visit original site.