Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Few-shot bioacoustic event detection at the DCASE 2023 challenge (2306.09223v1)

Published 15 Jun 2023 in cs.SD, cs.LG, and eess.AS

Abstract: Few-shot bioacoustic event detection consists in detecting sound events of specified types, in varying soundscapes, while having access to only a few examples of the class of interest. This task ran as part of the DCASE challenge for the third time this year with an evaluation set expanded to include new animal species, and a new rule: ensemble models were no longer allowed. The 2023 few shot task received submissions from 6 different teams with F-scores reaching as high as 63% on the evaluation set. Here we describe the task, focusing on describing the elements that differed from previous years. We also take a look back at past editions to describe how the task has evolved. Not only have the F-score results steadily improved (40% to 60% to 63%), but the type of systems proposed have also become more complex. Sound event detection systems are no longer simple variations of the baselines provided: multiple few-shot learning methodologies are still strong contenders for the task.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (12)
  1. A. Mesaros, T. Heittola, A. Eronen, and T. Virtanen, “Acoustic event detection in real life recordings,” in 2010 18th European signal processing conference.   IEEE, 2010, pp. 1267–1271.
  2. D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange, and M. D. Plumbley, “Detection and classification of acoustic scenes and events,” IEEE Transactions on Multimedia, vol. 17, no. 10, pp. 1733–1746, 2015.
  3. N. Turpault, R. Serizel, A. P. Shah, and J. Salamon, “Sound event detection in domestic environments with weakly labeled data and soundscape synthesis,” in Workshop on Detection and Classification of Acoustic Scenes and Events, 2019.
  4. D. Stowell, “Computational bioacoustic scene analysis,” Computational analysis of sound scenes and events, pp. 303–333, 2018.
  5. I. Nolasco, S. Singh, V. Morfi, V. Lostanlen, A. Strandburg-Peshkin, E. Vidaña-Vila, L. Gill, H. Pamuła, H. Whitehead, I. Kiskin, et al., “Learning to detect an animal sound from five examples,” arXiv preprint arXiv:2305.13210, 2023.
  6. D. Stowell, L. Gill, and D. Clayton, “Detailed temporal structure of communication networks in groups of songbirds,” Journal of the Royal Society Interface, vol. 13, no. 119, p. 20160296, 2016.
  7. J. Snell, K. Swersky, and R. Zemel, “Prototypical networks for few-shot learning,” Advances in neural information processing systems, vol. 30, 2017.
  8. Y. Wang, N. J. Bryan, M. Cartwright, J. P. Bello, and J. Salamon, “Few-shot continual learning for audio classification,” in ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).   IEEE, 2021, pp. 321–325.
  9. https://dcase.community/challenge2023/task-few-shot-bioacoustic-event-detection-results, accessed: 10-06-2023.
  10. I. Nolasco, S. Singh, E. Vidana-Vila, E. Grout, J. Morford, M. E. F. Jensen, I. Kiskin, H. Whitehead, A. Strandburg-Peshkin, L. Gill10, et al., “Few-shot bioacoustic event detection at the dcase 2022 challenge.”
  11. J. Tang, Z. Xueyang, T. Gao, D. Liu, X. Fang, J. Pan, Q. Wang, J. Du, K. Xu, and Q. Pan, “Few-shot embedding learning and event filtering for bioacoustic event detection technical report,” DCASE2022 Challenge, Tech. Rep., June 2022.
  12. H. Liu, X. Liu, X. Mei, Q. Kong, W. Wang, and M. D. Plumbley, “Surrey system for dcase 2022 task 5 : Few-shot bioacoustic event detection with segment-level metric learning technical report,” DCASE2022 Challenge, Tech. Rep., June 2022.
Citations (6)

Summary

We haven't generated a summary for this paper yet.