Commits
User selector
Commit History
Commits on Jun 21, 2024
Commits on Jun 19, 2024
Commits on Jun 17, 2024
- committed
Commits on Jun 13, 2024
feat: Add `.close()` method to `Llama` class to explicitly free model from memory (#1513)
Show description for 320a5d7andauthoredfeat: Support SPM infill (#1492)
Show description for dbcf64candauthored
Commits on Jun 10, 2024
- committed
Commits on Jun 7, 2024
- committed
Commits on Jun 4, 2024
feat: adding `rpc_servers` parameter to `Llama` class (#1477)
Show description for d634efcandauthoredfix: fix logprobs when BOS is not present (#1471)
Show description for 6e0642cauthoredfix: Avoid duplicate special tokens in chat formats (#1439)
Show description for 027f7bcandauthored- committed
- committed
Commits on Jun 3, 2024
Commits on Jun 1, 2024
Commits on May 29, 2024
Commits on May 27, 2024
- committed
Commits on May 24, 2024
- committed
feat: Improve Llama.eval performance by avoiding list conversion (#1476)
Show description for 5cae104andauthored- committed
Commits on May 22, 2024
- committed
Commits on May 16, 2024
Commits on May 14, 2024
feat: add MinTokensLogitProcessor and min_tokens argument to server (#1333)
Show description for 5212fb0authoredmisc: Remove unnecessary metadata lookups (#1448)
Show description for 389e09cauthored- committed
Commits on May 12, 2024
- committed
Commits on May 10, 2024
- committed
- committed
- committed
fix(security): Render all jinja templates in immutable sandbox (#1441)
Show description for 561e880andauthoredMerge pull request from GHSA-56xg-wfcc-g829
Show description for b454f40andauthored