- SpectroStream for Audio Codec
- MusicCoCa for Audio Emebdding
- Transformer LLM for predicting next audio token with given token and embeddings of previous 10 seconds context window
google/magenta-realtime · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/google/magenta-realtime

Seonglae Cho