DAC-JAX: A JAX Implementation of the Descript Audio Codec (2405.11554v1)

Published 19 May 2024 in cs.SD and eess.AS

Abstract: We present an open-source implementation of the Descript Audio Codec (DAC) using Google's JAX ecosystem of Flax, Optax, Orbax, AUX, and CLU. Our codebase enables the reuse of model weights from the original PyTorch DAC, and we confirm that the two implementations produce equivalent token sequences and decoded audio if given the same input. We provide a training and fine-tuning script which supports device parallelism, although we have only verified it using brief training runs with a small dataset. Even with limited GPU memory, the original DAC can compress or decompress a long audio file by processing it as a sequence of overlapping "chunks." We implement this feature in JAX and benchmark the performance on two types of GPUs. On a consumer-grade GPU, DAC-JAX outperforms the original DAC for compression and decompression at all chunk sizes. However, on a high-performance, cluster-based GPU, DAC-JAX outperforms the original DAC for small chunk sizes but performs worse for large chunks.
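The overlapping-chunk compression described in the abstract can be illustrated with a minimal JAX sketch. This is not the DAC-JAX API: the `encode_chunk` stub, the chunk and hop sizes, and the assumed 512-sample encoder hop and 9 codebooks are illustrative placeholders only.

```python
# Minimal sketch of overlapping-chunk compression for long audio.
# `encode_chunk` is a hypothetical stand-in for the real encoder; all
# sizes below are assumptions chosen for illustration.
import jax
import jax.numpy as jnp

CHUNK = 36_864   # samples processed per call (assumed)
HOP = 32_768     # samples advanced between chunks; CHUNK - HOP overlap

def encode_chunk(chunk: jnp.ndarray) -> jnp.ndarray:
    """Stand-in for the codec's encode step: returns dummy token codes."""
    n_frames = chunk.shape[-1] // 512              # assumed encoder hop
    return jnp.zeros((n_frames, 9), dtype=jnp.int32)  # assumed 9 codebooks

def compress_long_audio(audio: jnp.ndarray) -> jnp.ndarray:
    """Encode a long mono signal chunk by chunk to bound GPU memory use.

    Any tail shorter than a full chunk is dropped for brevity.
    """
    codes = []
    for start in range(0, audio.shape[-1] - CHUNK + 1, HOP):
        chunk = jax.lax.dynamic_slice(audio, (start,), (CHUNK,))
        codes.append(encode_chunk(chunk))
    return jnp.concatenate(codes, axis=0)

if __name__ == "__main__":
    one_minute = jnp.zeros(44_100 * 60)            # silent test signal
    print(compress_long_audio(one_minute).shape)
```

In practice the per-chunk encode step would presumably be jitted once and reused across chunks; the loop above only demonstrates the overlap bookkeeping that keeps peak GPU memory bounded.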
