Scaling Up Adaptive Filter Optimizers (2403.00977v1)
Abstract: We introduce a new online adaptive filtering method called supervised multi-step adaptive filters (SMS-AF). Our method uses neural networks to control or optimize linear multi-delay or multi-channel frequency-domain filters and can flexibly scale-up performance at the cost of increased compute -- a property rarely addressed in the AF literature, but critical for many applications. To do so, we extend recent work with a set of improvements including feature pruning, a supervised loss, and multiple optimization steps per time-frame. These improvements work in a cohesive manner to unlock scaling. Furthermore, we show how our method relates to Kalman filtering and meta-adaptive filtering, making it seamlessly applicable to a diverse set of AF tasks. We evaluate our method on acoustic echo cancellation (AEC) and multi-channel speech enhancement tasks and compare against several baselines on standard synthetic and real-world datasets. Results show our method performance scales with inference cost and model capacity, yields multi-dB performance gains for both tasks, and is real-time capable on a single CPU core.
- “Adaptive switching circuits,” Tech. Rep., Stanford University, 1960.
- V. John Mathews, “Adaptive polynomial filters,” IEEE Signal Processing Magazine (SPM), 1991.
- QRD-RLS adaptive filtering, Springer, 2009.
- Simon S. Haykin, Adaptive filter theory, Pearson, 2008.
- Theory and application of digital signal processing, Prentice-Hall, 2016.
- “Broken neural scaling laws,” in International Conference on Learning Representations (ICLR), 2022.
- “Training compute-optimal large language models,” arXiv preprint arXiv:2203.15556, 2022.
- “A ConvNet for the 2020s,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- “Scaling up gans for text-to-image synthesis,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
- “NICE-Beam: Neural integrated covariance estimators for time-varying beamformers,” arXiv:2112.04613, 2021.
- “End-to-end deep learning-based adaptation control for frequency-domain adaptive system identification,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
- “Deep learning-based joint control of acoustic echo cancellation, beamforming and postfiltering,” in IEEE European Signal Processing Conference (EUSIPCO), 2022.
- “Deep adaptation control for acoustic echo cancellation,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
- “Neural-afc: Learning-based step-size control for adaptive feedback cancellation with closed-loop model training,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.
- “Low-complexity acoustic echo cancellation with neural kalman filtering,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.
- “Auto-DSP: Learning to optimize acoustic echo cancellers,” in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021.
- “Meta-AF: Meta-learning for adaptive filters,” IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2022.
- “Meta-learning for adaptive filters with higher-order frequency dependencies,” in IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
- “Meta-af echo cancellation for improved keyword spotting,” arXiv:2312.10605, 2023.
- “Learning to learn by gradient descent by gradient descent,” in NeurIPS, 2016.
- “Complex gated recurrent neural networks,” in NeurIPS, 2018.
- “Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones,” Elsevier Signal Processing, 2006.
- “ICASSP 2022 acoustic echo cancellation challenge,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
- “Acoustic echo control,” in Academic press library in signal processing. Elsevier, 2014.
- “The third CHiME speech separation and recognition challenge: Dataset, task and baselines,” in Automatic Speech Recongition and Understanding Workshop (ASRU). IEEE, 2015.
- “SDR–half-baked or well done?,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019.
- “Performance measurement in blind audio source separation,” IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2006.
- “An algorithm for intelligibility prediction of time–frequency weighted noisy speech,” IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2011.