How to add KV cache quantization options? #1220

Closed Unanswered
limour-blog asked this question in Q&A
server : add KV cache quantization options

Replies: 1 comment

I see llama_context_params, but how do I pass it to the class Llama?
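For anyone landing here with the same question, a minimal sketch of one possible approach follows. It assumes an installed llama-cpp-python release whose Llama constructor accepts type_k and type_v keyword arguments; that is an assumption about your installed version, not something confirmed in this thread, so the code checks the constructor signature before using them. The helper names kv_cache_kwargs and load_model are hypothetical, and the integer values are the ggml type enum values from ggml.h.

```python
import inspect

# ggml type enum values as defined in ggml.h (llama.cpp).
GGML_TYPES = {
    "f32": 0,
    "f16": 1,
    "q4_0": 2,
    "q4_1": 3,
    "q5_0": 6,
    "q5_1": 7,
    "q8_0": 8,
}


def kv_cache_kwargs(cache_type_k="f16", cache_type_v="f16"):
    """Hypothetical helper: map cache type names to constructor kwargs."""
    return {
        "type_k": GGML_TYPES[cache_type_k],
        "type_v": GGML_TYPES[cache_type_v],
    }


def load_model(model_path, **kv):
    """Hypothetical helper: build a Llama with the given KV cache types.

    Assumes the installed Llama constructor exposes type_k / type_v;
    raises instead of silently ignoring them if it does not.
    """
    from llama_cpp import Llama  # imported lazily so the helpers above work without it

    supported = inspect.signature(Llama.__init__).parameters
    missing = [name for name in kv if name not in supported]
    if missing:
        raise RuntimeError(
            f"Installed llama-cpp-python does not accept: {missing}"
        )
    return Llama(model_path=model_path, **kv)
```

Usage would look like load_model("path/to/model.gguf", **kv_cache_kwargs("q8_0", "q8_0")), where the model path is a placeholder. The signature check is there because Llama.__init__ also takes extra keyword arguments, so an unsupported name might otherwise be accepted without having any effect.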
