Commits
Branch selector
User selector
Commit History
Commits on May 8, 2026
- authored
fix: configure n_seq_max for batched embeddings (#2206)
Show description for 128c331authored
Commits on May 4, 2026
fix(_internals): use n_tokens0 offset when enabling last-token logits in add_sequence (#2205)
Show description for 90e8df9authored
Commits on May 2, 2026
- authored
- authored
- authored
- authored
- authored
feat(ci): re-enable Windows CUDA wheels (#2198)
Show description for d2113a1authored- authored
Commits on Apr 27, 2026
- authored
- authored
- authored
- authored
fix(ci): Build one arm64 py3 release wheel (#2191)
Show description for 511b3f4authoredfeat: Update llama.cpp to ggerganov/llama.cpp@f53577432 (#2189)
Show description for d87bf08authored
Commits on Apr 13, 2026
Commits on Apr 8, 2026
feat: Update llama.cpp to ggerganov/llama.cpp@3bd9aa1f9 (#2176)
Show description for 1bcc5bcauthored
Commits on Apr 3, 2026
Commits on Mar 30, 2026
Commits on Mar 29, 2026
ci: publish release wheels as py3-none (#2166)
Show description for 7613acaauthoredfix(ci): publish distinct manylinux and musllinux cpu wheels (#2165)
Show description for fcd932aauthored
Commits on Mar 25, 2026
- authored
feat: Update llama.cpp to ggerganov/llama.cpp@c0159f9c1f874da15e94f371d136f5920b4b5335 (#2161)
Show description for c670222authoredfix: handle embedding models without KV memory (#2160)
Show description for ac59e5aauthoredfix(ci): reduce CUDA binary wheel size only including cubins for current arches and one PTX target for forward compatibility (#2158)
Show description for 5f9c231authored
Commits on Mar 24, 2026
- authored
feat: expose attention_type parameter in Llama.__init__ (#2143)
Show description for 7b38c31authored
fix(ci): docker build workflow (#2156)
Show description for ccc6bc0authoredfix(ci): cuda wheel workflow (#2155)
Show description for 909ebf1authoredfix(ci): release wheel workflow (#2154)
Show description for f0391c5authored
Commits on Mar 23, 2026
- authored
fix: Qwen 3.5 support (#2152)
Show description for 11e7a55authored