@gpustack

GPUStack

Simple, scalable AI model deployment on GPU clusters

Pinned

  1. gpustack (Public)

    Simple, scalable AI model deployment on GPU clusters

    Python · 3.9k stars · 387 forks

  2. gguf-parser-go (Public)

    Review and check GGUF files, and estimate their memory usage and maximum tokens per second.

    Go · 209 stars · 22 forks

  3. llama-box (Public)

    LM inference server implementation based on *.cpp.

    C++ · 282 stars · 26 forks

  4. vox-box (Public)

    A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.

    Python · 166 stars · 25 forks

Repositories

Showing 10 of 12 repositories

People

This organization has no public members. You must be a member to see who is part of this organization.
