Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
158 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Scaling Up Adaptive Filter Optimizers (2403.00977v1)

Published 1 Mar 2024 in cs.SD and eess.AS

Abstract: We introduce a new online adaptive filtering method called supervised multi-step adaptive filters (SMS-AF). Our method uses neural networks to control or optimize linear multi-delay or multi-channel frequency-domain filters and can flexibly scale-up performance at the cost of increased compute -- a property rarely addressed in the AF literature, but critical for many applications. To do so, we extend recent work with a set of improvements including feature pruning, a supervised loss, and multiple optimization steps per time-frame. These improvements work in a cohesive manner to unlock scaling. Furthermore, we show how our method relates to Kalman filtering and meta-adaptive filtering, making it seamlessly applicable to a diverse set of AF tasks. We evaluate our method on acoustic echo cancellation (AEC) and multi-channel speech enhancement tasks and compare against several baselines on standard synthetic and real-world datasets. Results show our method performance scales with inference cost and model capacity, yields multi-dB performance gains for both tasks, and is real-time capable on a single CPU core.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (28)
  1. “Adaptive switching circuits,” Tech. Rep., Stanford University, 1960.
  2. V. John Mathews, “Adaptive polynomial filters,” IEEE Signal Processing Magazine (SPM), 1991.
  3. QRD-RLS adaptive filtering, Springer, 2009.
  4. Simon S. Haykin, Adaptive filter theory, Pearson, 2008.
  5. Theory and application of digital signal processing, Prentice-Hall, 2016.
  6. “Broken neural scaling laws,” in International Conference on Learning Representations (ICLR), 2022.
  7. “Training compute-optimal large language models,” arXiv preprint arXiv:2203.15556, 2022.
  8. “A ConvNet for the 2020s,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
  9. “Scaling up gans for text-to-image synthesis,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023.
  10. “NICE-Beam: Neural integrated covariance estimators for time-varying beamformers,” arXiv:2112.04613, 2021.
  11. “End-to-end deep learning-based adaptation control for frequency-domain adaptive system identification,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
  12. “Deep learning-based joint control of acoustic echo cancellation, beamforming and postfiltering,” in IEEE European Signal Processing Conference (EUSIPCO), 2022.
  13. “Deep adaptation control for acoustic echo cancellation,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
  14. “Neural-afc: Learning-based step-size control for adaptive feedback cancellation with closed-loop model training,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.
  15. “Low-complexity acoustic echo cancellation with neural kalman filtering,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023.
  16. “Auto-DSP: Learning to optimize acoustic echo cancellers,” in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2021.
  17. “Meta-AF: Meta-learning for adaptive filters,” IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2022.
  18. “Meta-learning for adaptive filters with higher-order frequency dependencies,” in IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), 2022.
  19. “Meta-af echo cancellation for improved keyword spotting,” arXiv:2312.10605, 2023.
  20. “Learning to learn by gradient descent by gradient descent,” in NeurIPS, 2016.
  21. “Complex gated recurrent neural networks,” in NeurIPS, 2018.
  22. “Frequency-domain adaptive Kalman filter for acoustic echo control in hands-free telephones,” Elsevier Signal Processing, 2006.
  23. “ICASSP 2022 acoustic echo cancellation challenge,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022.
  24. “Acoustic echo control,” in Academic press library in signal processing. Elsevier, 2014.
  25. “The third CHiME speech separation and recognition challenge: Dataset, task and baselines,” in Automatic Speech Recongition and Understanding Workshop (ASRU). IEEE, 2015.
  26. “SDR–half-baked or well done?,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2019.
  27. “Performance measurement in blind audio source separation,” IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2006.
  28. “An algorithm for intelligibility prediction of time–frequency weighted noisy speech,” IEEE Transactions on Audio, Speech, and Language Processing (TASLP), 2011.
Citations (1)

Summary

We haven't generated a summary for this paper yet.