VQ-GAN vs. RVQ (soundstream) #55

WinterStraw started this conversation in Ideas
Would people consider using RVQ as a replacement for VQ? The setup would be similar to the structure of AudioLM. For example, let the LLaMA model predict the shallow RVQ codes first, then predict the deep RVQ codes conditioned on the shallow ones. Finally, pass the RVQ codes to the vocoder to generate the audio.
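
To make the "shallow, then deep" idea concrete, here is a minimal NumPy sketch of residual vector quantization: each codebook quantizes whatever the previous levels left over. This is not code from this repo, SoundStream, or AudioLM; the names `rvq_encode` and `rvq_decode`, the random codebooks, and all sizes are assumptions for illustration only.

```python
import numpy as np


def rvq_encode(x, codebooks):
    """Return one code index per level; each level quantizes the remaining residual."""
    residual = np.asarray(x, dtype=np.float64)
    codes = []
    for cb in codebooks:  # cb has shape (codebook_size, dim)
        dists = np.sum((cb - residual) ** 2, axis=1)  # squared distance to every entry
        idx = int(np.argmin(dists))
        codes.append(idx)
        residual = residual - cb[idx]  # hand the leftover to the next (deeper) level
    return codes


def rvq_decode(codes, codebooks):
    """Sum the selected entries; passing only the first few levels gives a coarse reconstruction."""
    return sum(cb[i] for cb, i in zip(codebooks, codes))


# Hypothetical setup: random codebooks stand in for learned ones.
rng = np.random.default_rng(0)
dim, codebook_size, n_levels = 8, 16, 4
codebooks = [
    rng.normal(scale=0.5 ** level, size=(codebook_size, dim))
    for level in range(n_levels)
]

x = rng.normal(size=dim)
codes = rvq_encode(x, codebooks)

coarse = rvq_decode(codes[:1], codebooks[:1])  # shallow level only
full = rvq_decode(codes, codebooks)            # all levels
```

In the scheme proposed above, the language model would predict `codes[0]` first and the deeper entries of `codes` conditioned on it, and the vocoder would consume the reconstruction built from all levels.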

Replies: 0 comments

1 participant