-
Notifications
You must be signed in to change notification settings - Fork 11.9k
Issues: ggml-org/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
server : separate the notion of position and KV tokens, remove prompt truncation
breaking change
Changes that break ABIs, APIs, file formats, or other forms of backwards compatibility.
examples
python
python script changes
server
#13576
opened May 15, 2025 by
ngxson
Loading…
gguf-py : add support for sub_type (in arrays) in GGUFWriter add_key_value method
python
python script changes
#13561
opened May 15, 2025 by
CISC
Loading…
Granite Four
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
testing
Everything test related
#13550
opened May 14, 2025 by
gabe-l-hart
•
Draft
2 tasks
convert: Swap GLM4 EOS / EOT token
python
python script changes
#13505
opened May 13, 2025 by
henk717
Loading…
gguf-py: Optimize python script changes
GGUFReader
read-only mode performance
python
#13378
opened May 8, 2025 by
Isotr0py
Loading…
python : bump transformers version
python
python script changes
#13351
opened May 7, 2025 by
ngxson
Loading…
Support start strings, the opposite of stop tokens.
examples
python
python script changes
server
#13214
opened Apr 30, 2025 by
matteoserva
•
Draft
Introduce New Lookup-Table(LUT)-Based Matrix Multiplication Method (TMAC)
ggml
changes relating to the ggml tensor library for machine learning
python
python script changes
#13206
opened Apr 30, 2025 by
QingtaoLi1
Loading…
2 of 4 tasks
Fix ChatGLMModel for glm-4-9b cannot find tokenizer merges in model file
python
python script changes
#13058
opened Apr 22, 2025 by
glide-the
Loading…
Bitnet: directly use scale instead of inverting it twice
python
python script changes
#13026
opened Apr 19, 2025 by
viraatdas
Loading…
Fix convert script for non-hf GLM4 checkpoints
python
python script changes
#12992
opened Apr 17, 2025 by
Tianyue-Zhao
Loading…
2 of 4 tasks
gguf-py: byteswapping improvements
python
python script changes
#12851
opened Apr 9, 2025 by
AlekseiNikiforovIBM
Loading…
convert : write tensors in parallel
performance
Speed related topics
python
python script changes
#12837
opened Apr 8, 2025 by
compilade
Loading…
3 of 6 tasks
WIP: Add support for CogAgent
examples
python
python script changes
server
#12679
opened Mar 31, 2025 by
Tianyue-Zhao
•
Draft
tts : implement sesame CSM + Mimi decoder
examples
python
python script changes
#12648
opened Mar 29, 2025 by
ngxson
Loading…
server
: streaming of tool calls and thoughts when --jinja
is on
documentation
tool-call
: Phi-4 support
android
#12288
opened Mar 9, 2025 by
jpohhhh
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2025-05-19.