Issues: ggml-org/llama.cpp

examples : add configuration presets
#10932 opened Dec 21, 2024 by ggerganov
Open 5
changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 9
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 15

Issues list

server : separate the notion of position and KV tokens, remove prompt truncation
Labels: breaking change, examples, python, server
#13576 opened May 15, 2025 by ngxson
Update python verions
Labels: examples, python, server
#13574 opened May 15, 2025 by robbiemu
Webui dynamic config
Labels: examples, server
#13429 opened May 10, 2025 by ServeurpersoCom
kv-cache : add SWA support
Labels: examples, server
#13194 opened Apr 29, 2025 by ggerganov
15 of 22 tasks
server : crash when -b > -ub with embeddings
Labels: bug, embeddings, good first issue, server
#12836 opened Apr 8, 2025 by ggerganov
WIP: Add support for CogAgent
Labels: examples, python, server
#12679 opened Mar 31, 2025 by Tianyue-Zhao Draft
server: streaming of tool calls and thoughts when --jinja is on
Labels: documentation, examples, python, script, server, testing, tool calling
#12379 opened Mar 14, 2025 by ochafik Draft
5 of 10 tasks
tool-call: Phi-4 support
Labels: android, Apple Metal, devops, documentation, examples, ggml, Nvidia GPU, python, server, SYCL, testing, Vulkan
#12288 opened Mar 9, 2025 by jpohhhh
Server: openai-style lookup decoding
Labels: examples, python, server
#12127 opened Mar 1, 2025 by eeroel Draft
Cache based tokenization for the server input prompts
Labels: demo, examples, server
#12067 opened Feb 25, 2025 by vnicolici
server webui easy config selection
Labels: demo, examples, server
#12031 opened Feb 22, 2025 by poulphunter
llama : add llama_batch_ext
Labels: android, examples, python, server
#11875 opened Feb 14, 2025 by ngxson
Update CMakeLists.txt
Labels: examples, server
#11558 opened Jan 31, 2025 by magicse