Streamable causal architectures for discrete audio tokenizers
Develop causal, streamable architectures for discrete audio tokenizers that can operate in real time while maintaining high perceptual quality and computational efficiency, overcoming the current reliance of many self-supervised learning–based tokenizers on non-causal encoders.
Sponsor
References
Thus, achieving streamability with high-quality and efficient causal architectures remains an open research challenge.
— Discrete Audio Tokens: More Than a Survey!
(2506.10274 - Mousavi et al., 12 Jun 2025) in Section 2.5 (Streamability and Domain Categorization) – Streamability paragraph