Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Survey on Deep Reinforcement Learning for Audio-Based Applications (2101.00240v1)

Published 1 Jan 2021 in cs.SD, cs.LG, and eess.AS

Abstract: Deep reinforcement learning (DRL) is poised to revolutionise the field of AI by endowing autonomous systems with high levels of understanding of the real world. Currently, deep learning (DL) is enabling DRL to effectively solve various intractable problems in various fields. Most importantly, DRL algorithms are also being employed in audio signal processing to learn directly from speech, music and other sound signals in order to create audio-based autonomous systems that have many promising application in the real world. In this article, we conduct a comprehensive survey on the progress of DRL in the audio domain by bringing together the research studies across different speech and music-related areas. We begin with an introduction to the general field of DL and reinforcement learning (RL), then progress to the main DRL methods and their applications in the audio domain. We conclude by presenting challenges faced by audio-based DRL agents and highlighting open areas for future research and investigation.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Heriberto CuayƔhuitl (12 papers)
  2. Siddique Latif (38 papers)
  3. Farrukh Pervez (2 papers)
  4. Fahad Shamshad (21 papers)
  5. Hafiz Shehbaz Ali (3 papers)
  6. Erik Cambria (136 papers)
Citations (67)