Commits
User selector
Commit History
Commits on Aug 7, 2025
- committed
Commits on Jul 1, 2025
- committed
Commits on Jan 29, 2025
- committed
Commits on Dec 19, 2024
- committed
Commits on Oct 31, 2024
- committed
Commits on Sep 19, 2024
- committed
feat: Update sampling API for llama.cpp (#1742)
Show description for f8fcb3eauthored
Commits on Aug 15, 2024
- committed
Commits on Aug 4, 2024
fix: llama_grammar_accept_token arg order (#1649)
Show description for 5575fedauthored
Commits on Jul 18, 2024
fix: Change repeat_penalty to 1.0 to match llama.cpp defaults (#1590)
Show description for 0700476andauthored
Commits on Jul 9, 2024
- committed
- committed
Commits on Jul 6, 2024
- committed
Commits on Jun 19, 2024
Commits on Jun 13, 2024
feat: Add `.close()` method to `Llama` class to explicitly free model from memory (#1513)
Show description for 320a5d7andauthoredfeat: Support SPM infill (#1492)
Show description for dbcf64candauthored
Commits on Jun 7, 2024
- committed
Commits on Jun 4, 2024
fix: Avoid duplicate special tokens in chat formats (#1439)
Show description for 027f7bcandauthored
Commits on Jun 1, 2024
Commits on May 4, 2024
fix: detokenization case where first token does not start with a leading space (#1375)
Show description for e0d7674andauthored
Commits on Apr 30, 2024
Commits on Apr 26, 2024
feat: Allow for possibly non-pooled embeddings (#1380)
Show description for f6ed21fandauthored
Commits on Apr 22, 2024
- committed
Commits on Apr 1, 2024
fix: last tokens passing to sample_repetition_penalties function (#1295)
Show description for 62aad61authored
Commits on Feb 28, 2024
Commits on Feb 26, 2024
- committed
Commits on Feb 23, 2024
Commits on Feb 22, 2024
Commits on Feb 21, 2024
Commits on Feb 14, 2024
feat: Support batch embeddings (#1186)
Show description for d7a6791andauthored- committed
Commits on Feb 6, 2024
Commits on Jan 29, 2024
Automatically set chat format from gguf (#1110)
Show description for da003d8authored
Commits on Jan 19, 2024
Commits on Jan 17, 2024
Loading