Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

llama : create llamax library #5215

Copy link
Copy link
Open
@ggerganov

Description

@ggerganov
Issue body actions

Depends on: #5214

The llamax library will wrap llama and expose common high-level functionality. The main goal is to ease the integration of llama.cpp into 3rd party projects. Ideally, most projects would interface through the llamax API for all common use cases, while still have the option to use the low-level llama API for more uncommon applications that require finer control of the state.

A simple way to think about llamax is that it will simplify all of the existing examples in llama.cpp by hiding the low-level stuff, such as managing the KV cache and batching requests.

Roughly, llamax will require it's own state object and a run-loop function.

The specifics of the API are yet to be determined - suggestions are welcome.

ngxson, lin72h, AshD, iceychris, Raphy42 and 14 morejoseph777111, lin7sh, HanClinto, manyoso, shakfu and 2 morejoseph777111, lin7sh, manyoso and borgoatfreelerobot, joseph777111, manyoso and borgoat

Metadata

Metadata

Assignees

No one assigned

    Labels

    refactoringRefactoringRefactoringroadmapPart of a roadmap projectPart of a roadmap project

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      Morty Proxy This is a proxified and sanitized view of the page, visit original site.