Soniqo

Open-source audio intelligence.

Documentation · HuggingFace (Apple · ONNX & LiteRT) · Blog

📖 English · 中文 · 日本語 · 한국어 · Español · Deutsch · Français · हिन्दी · Português · Русский · العربية · Tiếng Việt · Türkçe · ไทย

Repositories

speech-swift — AI speech models for Apple Silicon. ASR, TTS, speech-to-speech, VAD, diarization, and speech enhancement — all running locally via MLX and CoreML. No cloud, no API keys.

speech-android — On-device speech SDK for Android. ASR, TTS, VAD, and noise cancellation powered by ONNX Runtime with Qualcomm NNAPI acceleration.

speech-core — On-device VAD, streaming STT, TTS, and diarization in C++17 (ONNX + LiteRT) with a voice-agent pipeline state machine. Linux, Windows, Android.

speech-studio — Open-source desktop voice-cloning studio for creators. Tauri + Qwen3-TTS on Apple Silicon.

Documentation

soniqo.audio covers setup, usage, and architecture for all SDKs:

  • Getting Started — Installation via Homebrew, SPM, and Gradle
  • Guides — Per-model walkthroughs: Qwen3-ASR, Parakeet TDT, Qwen3-TTS, CosyVoice, Kokoro, PersonaPlex, VAD, diarization, denoising, and more
  • CLI Reference — All commands and flags
  • API & Protocols — Shared Swift protocols and types
  • Architecture — Module structure, backends, weight formats, and memory tables
  • Benchmarks — RTF, latency, WER, and memory across devices

Community

Join our Discord for questions, support, model requests, and updates.


Need help?

Integrating on-device speech into your app, looking for support, or want your model supported?

Reach out to Ivan.

Pinned

  1. speech-swift (Public)

    AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered by MLX and CoreML

    Swift · 791 stars · 103 forks

  2. speech-android (Public)

    On-device speech SDK for Android — ASR, TTS, VAD, and noise cancellation powered by ONNX Runtime with Qualcomm NNAPI acceleration

    Kotlin · 56 stars · 3 forks

  3. speech-core (Public)

    On-device VAD / streaming STT / TTS / diarization in C++17 (ONNX + LiteRT) with a voice-agent pipeline. Linux, Windows, Android.

    C++ · 29 stars · 2 forks

  4. speech-studio (Public)

    Open-source desktop voice-cloning studio for creators — clone a voice, script lines with emotion markers, synthesize on-device. Tauri + VoxCPM2, runs on macOS, Windows, and Linux.

    TypeScript · 6 stars
