Pinned

  1. VITA Public

    ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    Python 2.3k 169

  2. Long-VITA Public

    ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

    Python 281 28

  3. VITA-Audio Public

    ✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    Python 316 22

  4. Freeze-Omni Public

    ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

    Python 316 20

  5. Woodpecker Public

    ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

    Python 636 30

Repositories

Showing 7 of 7 repositories
  • VITA-Audio Public

    ✨✨VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model

    Python 316 22 9 0 Updated May 17, 2025
  • Long-VITA Public

    ✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

    Python 281 28 4 0 Updated May 14, 2025
  • LUCY Public

    LUCY: Linguistic Understanding and Control Yielding Early Stage of Her

    Python 38 3 10 0 Updated Apr 14, 2025
  • Sparrow Public

    Sparrow: Data-Efficient Video-LLM with Text-to-Image Augmentation

    Jupyter Notebook 29 Apache-2.0 0 0 0 Updated Mar 28, 2025
  • VITA Public

    ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

    Python 2,288 169 53 0 Updated Mar 28, 2025
  • Freeze-Omni Public

    ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM

    Python 316 20 11 2 Updated Jan 2, 2025
  • Woodpecker Public

    ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

    Python 636 30 2 0 Updated Dec 23, 2024