Commits
Branch selector
User selector
Commit History
Commits on Apr 8, 2026
feat: Update llama.cpp to ggerganov/llama.cpp@3bd9aa1f9 (#2176)
Show description for 1bcc5bcauthored
Commits on Apr 3, 2026
Commits on Mar 30, 2026
Commits on Mar 29, 2026
ci: publish release wheels as py3-none (#2166)
Show description for 7613acaauthoredfix(ci): publish distinct manylinux and musllinux cpu wheels (#2165)
Show description for fcd932aauthored
Commits on Mar 25, 2026
- authored
feat: Update llama.cpp to ggerganov/llama.cpp@c0159f9c1f874da15e94f371d136f5920b4b5335 (#2161)
Show description for c670222authoredfix: handle embedding models without KV memory (#2160)
Show description for ac59e5aauthoredfix(ci): reduce CUDA binary wheel size only including cubins for current arches and one PTX target for forward compatibility (#2158)
Show description for 5f9c231authored
Commits on Mar 24, 2026
- authored
feat: expose attention_type parameter in Llama.__init__ (#2143)
Show description for 7b38c31authored
fix(ci): docker build workflow (#2156)
Show description for ccc6bc0authoredfix(ci): cuda wheel workflow (#2155)
Show description for 909ebf1authoredfix(ci): release wheel workflow (#2154)
Show description for f0391c5authored
Commits on Mar 23, 2026
- authored
fix: Qwen 3.5 support (#2152)
Show description for 11e7a55authoredci: add riscv64 wheel builds to release workflow (#2139)
Show description for e1f8ac0andauthoredfeat: Update llama.cpp to ggerganov/llama.cpp@49bfddeca18e62fa3d39114a23e9fcbdf8a22388 (#2151)
Show description for 18aa31eauthored
Commits on Mar 22, 2026
misc: Add Ruff formatting (#2148)
Show description for a9b4a06authoredfix(ci): Fix macos tests, support both Intel and Apple Silicon testing (#2150)
Show description for 9f661ffauthoredfix(ci): Rename `huggingface-cli` to `hf` (#2149)
Show description for ca3b00aauthored
Commits on Aug 15, 2025
- committed
- committed
Commits on Aug 7, 2025
- committed
- authored
- committed
- committed
- committed
- committed
Commits on Jul 18, 2025
- committed
Commits on Jul 16, 2025
- committed