Commits
User selector
Commit History
Commits on Apr 3, 2024
Commits on Apr 1, 2024
fix: last tokens passing to sample_repetition_penalties function (#1295)
Show description for 62aad61authored
- committed
feat: add support for KV cache quantization options (#1307)
Show description for f165048andauthored
Commits on Mar 31, 2024
feat: Add logprobs support to chat completions (#1311)
Show description for aa9f1aeandauthored
Commits on Mar 28, 2024
- committed
Commits on Mar 27, 2024
- committed
Commits on Mar 23, 2024
- committed
- committed
- committed
Commits on Mar 19, 2024
- committed
feat: Add tools/functions variables to Jinja2ChatFormatter, add function response formatting for all simple chat formats (#1273)
Show description for 60d8498andauthored
Commits on Mar 18, 2024
- committed
- committed
fix: Fix and optimize functionary chat handler (#1282)
Show description for 8a60c7bandauthored- committed
Commits on Mar 15, 2024
- committed
- committed
Commits on Mar 14, 2024
Commits on Mar 13, 2024
- committed
Commits on Mar 11, 2024
- committed
Commits on Mar 9, 2024
- committed
- committed
- committed
feat: Add endpoints for tokenize, detokenize and count tokens (#1136)
Show description for c139f8bandauthored- authored
feat: Switch embed to llama_get_embeddings_seq (#1263)
Show description for 2811014andauthored- committed
Commits on Mar 6, 2024
- committed
Commits on Mar 3, 2024
- committed
- committed
- committed