Commits
Commit History
Commits on Jun 1, 2026
fix: clear prompt for recurrent / hybrid models when only a partial prefix matches (#2108)
Commit cdb7a75
Commits on May 8, 2026
fix: configure n_seq_max for batched embeddings (#2206)
Commit 128c331
Commits on May 4, 2026
fix(_internals): use n_tokens0 offset when enabling last-token logits in add_sequence (#2205)
Commit 90e8df9
Commits on Mar 25, 2026
fix: handle embedding models without KV memory (#2160)
Commit ac59e5a
Commits on Mar 23, 2026
fix: Qwen 3.5 support (#2152)
Commit 11e7a55
feat: Update llama.cpp to ggerganov/llama.cpp@49bfddeca18e62fa3d39114a23e9fcbdf8a22388 (#2151)
Commit 18aa31e
Commits on Mar 22, 2026
misc: Add Ruff formatting (#2148)
Commit a9b4a06
Commits on Jul 5, 2025
Commits on Sep 19, 2024
fix: Fix memory allocation of ndarray (#1704)
Commit 22cedad
feat: Update sampling API for llama.cpp (#1742)
Commit f8fcb3e
Commits on Jun 4, 2024
fix: Avoid duplicate special tokens in chat formats (#1439)
Commit 027f7bc
Commits on Apr 30, 2024
Commits on Feb 26, 2024
Commits on Feb 21, 2024
Commits on Feb 8, 2024
Commits on Jan 31, 2024
Add speculative decoding (#1120)
Commit fb762a6
Commits on Jan 29, 2024
Commits on Jan 22, 2024
Commits on Jan 19, 2024
Commits on Jan 17, 2024
Integration of Jinja2 Templating (#875)
Commit 6bfe98b
Commits on Jan 15, 2024
Commits on Dec 16, 2023
Fix logits_to_logprobs for 2-D and 3-D logits (#1002)
Commit 5a89446
Commits on Nov 22, 2023
Commits on Nov 21, 2023
Do not set `grammar` to `None` for new `LlamaGrammar` objects (#834)
Commit c21edb6
Commits on Nov 20, 2023
Commits on Nov 6, 2023
Commits on Nov 3, 2023
Migrate inference to llama_batch and llama_decode api (#795)
Commit ab028cb
Commits on Nov 2, 2023
fix: tokenization of special characters: (#850)
Commit 4d4e0f1