Deep Active Speech Cancellation with Mamba-Masking Network (2502.01185v2)

Published 3 Feb 2025 in cs.SD, cs.AI, cs.LG, eess.AS, and eess.SP

Abstract: We present a novel deep learning network for Active Speech Cancellation (ASC), advancing beyond Active Noise Cancellation (ANC) methods by effectively canceling both noise and speech signals. The proposed Mamba-Masking architecture introduces a masking mechanism that directly interacts with the encoded reference signal, enabling adaptive and precisely aligned anti-signal generation, even under the rapidly changing, high-frequency conditions commonly found in speech. Complementing this, a multi-band segmentation strategy further improves phase alignment across frequency bands. Additionally, we introduce an optimization-driven loss function that provides near-optimal supervisory signals for anti-signal generation. Experimental results demonstrate substantial performance gains, achieving up to 7.2 dB improvement in ANC scenarios and 6.2 dB in ASC, significantly outperforming existing methods.
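
The abstract's emphasis on precisely aligned anti-signals at high frequencies can be illustrated with a small numerical sketch. This is not the paper's network or loss. It only shows the general principle of cancellation by a phase-inverted copy, and how a fixed timing error hurts more as frequency rises. The function name `residual_db`, the 50 µs delay, the 16 kHz sample rate, and the two test tones are all assumptions chosen for illustration.

```python
import math


def residual_db(freq_hz, delay_s, sample_rate=16000, duration_s=0.1):
    """Residual-to-signal energy ratio (dB) for a pure tone cancelled by an
    inverted copy of itself that arrives `delay_s` seconds late.

    Hypothetical helper for illustration; negative values mean attenuation,
    positive values mean the "anti-signal" made things louder.
    """
    n = int(sample_rate * duration_s)
    signal_energy = 0.0
    residual_energy = 0.0
    for i in range(n):
        t = i / sample_rate
        primary = math.sin(2 * math.pi * freq_hz * t)
        # Ideal anti-signal is -primary; here it is misaligned by delay_s.
        anti = -math.sin(2 * math.pi * freq_hz * (t - delay_s))
        residual = primary + anti
        signal_energy += primary * primary
        residual_energy += residual * residual
    return 10 * math.log10(residual_energy / signal_energy)


if __name__ == "__main__":
    delay = 50e-6  # assumed 50 microsecond alignment error
    for freq in (200, 4000):
        print(f"{freq} Hz: {residual_db(freq, delay):.1f} dB")
```

Under these assumptions the 200 Hz tone is still attenuated by roughly 24 dB, while the 4000 Hz tone ends up slightly louder than with no cancellation at all, which is why phase alignment matters more for speech-band content than for low-frequency noise.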

Authors (3)

GitHub

Deep Active Speech Cancellation with Multi-Band Mamba Network
