Commits
Commit History
Commits on Jul 3, 2025
Commits on Sep 20, 2024
Commits on Aug 29, 2024
Commits on Jul 17, 2024
fix(server): Use split_mode from model settings (#1594)
Commit 66d5cdd
Commits on Jun 13, 2024
feat: Add `.close()` method to `Llama` class to explicitly free model from memory (#1513)
Commit 320a5d7
Commits on Jun 4, 2024
feat: adding `rpc_servers` parameter to `Llama` class (#1477)
Commit d634efc
Commits on May 29, 2024
Commits on May 3, 2024
Commits on May 2, 2024
Commits on Apr 30, 2024
feat: Generic Chat Formats, Tool Calling, and Huggingface Pull Support for Multimodal Models (Obsidian, LLaVA1.6, Moondream) (#1147)
Commit fe2da09
Commits on Apr 1, 2024
feat: add support for KV cache quantization options (#1307)
Commit f165048
Commits on Feb 28, 2024
Commits on Feb 26, 2024
feat(server): Add support for pulling models from Huggingface Hub (#1222)
Commit 4d574bd
Commits on Feb 8, 2024
feat: Integrate functionary v1.4 and v2 models + add custom tokenizer support to Llama class (#1078)
Commit 9018270
Commits on Jan 31, 2024
Add speculative decoding (#1120)
Commit fb762a6
Commits on Jan 21, 2024
Commits on Jan 19, 2024
Commits on Jan 15, 2024
Implement GGUF metadata KV overrides (#1011)
Commit 76aafa6
Commits on Dec 22, 2023
[Feat] Multi model support (#931)
Commit 12b7f2f