
Tiny-Sepformer: A Tiny Time-Domain Transformer Network for Speech Separation (2206.13689v2)

Published 28 Jun 2022 in cs.SD and eess.AS

Abstract: Time-domain Transformer neural networks have proven their superiority in speech separation tasks. However, these models usually have a large number of network parameters and thus often encounter the problem of GPU memory explosion. In this paper, we propose Tiny-Sepformer, a tiny version of the Transformer network for speech separation. We present two techniques to reduce the model parameters and memory consumption: (1) the Convolution-Attention (CA) block, which splits the vanilla Transformer into two paths, multi-head attention and 1D depthwise separable convolution, and (2) parameter sharing, which shares the layer parameters within the CA block. In our experiments, Tiny-Sepformer greatly reduces the model size while achieving separation performance comparable to the vanilla Sepformer on the WSJ0-2/3Mix datasets.
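The abstract describes the CA block as splitting a vanilla Transformer layer into two parallel paths, multi-head attention and a 1D depthwise separable convolution, and reducing parameters further by sharing layer weights within the block. The sketch below is a minimal, hypothetical PyTorch illustration of that idea; the module names, the additive fusion of the two paths, and the weight-reuse loop are assumptions for illustration, not the paper's actual implementation.

```python
# Hedged sketch of a Convolution-Attention (CA) block with parameter sharing,
# assuming a PyTorch-style implementation; details here are illustrative only.
import torch
import torch.nn as nn


class DepthwiseSeparableConv1d(nn.Module):
    """1D depthwise separable convolution: depthwise conv followed by pointwise conv."""

    def __init__(self, channels: int, kernel_size: int = 3):
        super().__init__()
        self.depthwise = nn.Conv1d(
            channels, channels, kernel_size,
            padding=kernel_size // 2, groups=channels,
        )
        self.pointwise = nn.Conv1d(channels, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, channels); Conv1d expects (batch, channels, time)
        y = x.transpose(1, 2)
        y = self.pointwise(self.depthwise(y))
        return y.transpose(1, 2)


class CABlock(nn.Module):
    """CA block: attention path and convolution path over the same normalized input."""

    def __init__(self, d_model: int = 256, n_heads: int = 8, kernel_size: int = 3):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.conv = DepthwiseSeparableConv1d(d_model, kernel_size)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        y = self.norm(x)
        attn_out, _ = self.attn(y, y, y)
        conv_out = self.conv(y)
        # Additive fusion of the two paths plus a residual connection (assumption).
        return x + attn_out + conv_out


class SharedCAStack(nn.Module):
    """Applies one CA block repeatedly, so all layers share the same parameters."""

    def __init__(self, n_layers: int = 4, **block_kwargs):
        super().__init__()
        self.block = CABlock(**block_kwargs)  # single set of weights
        self.n_layers = n_layers

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for _ in range(self.n_layers):
            x = self.block(x)  # same parameters reused at every layer
        return x


if __name__ == "__main__":
    frames = torch.randn(2, 100, 256)  # (batch, time, feature)
    model = SharedCAStack(n_layers=4, d_model=256, n_heads=8)
    print(model(frames).shape)  # torch.Size([2, 100, 256])
```

Compared with stacking independently parameterized Transformer layers, reusing one block's weights keeps the parameter count constant as depth grows, which is the source of the memory savings the abstract claims.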

Authors (6)
  1. Jian Luo (66 papers)
  2. Jianzong Wang (144 papers)
  3. Ning Cheng (96 papers)
  4. Edward Xiao (2 papers)
  5. Xulong Zhang (60 papers)
  6. Jing Xiao (267 papers)
Citations (11)
