Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
-
Updated
Aug 18, 2025 - Go
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp
Android native AI inference library, bringing gguf models and stable-diffusion inference on android devices, powered by llama.cpp and stable-diffusion.cpp
Examples using the llmedge library
Add a description, image, and links to the stable-diffusion-cpp topic page so that developers can more easily learn about it.
To associate your repository with the stable-diffusion-cpp topic, visit your repo's landing page and select "manage topics."