
Self-Supervised Learning for Few-Shot Bird Sound Classification (2312.15824v4)

Published 25 Dec 2023 in cs.SD, cs.LG, and eess.AS

Abstract: Self-supervised learning (SSL) in audio holds significant potential across various domains, particularly in situations where abundant, unlabeled data is readily available at no cost. This is pertinent in bioacoustics, where biologists routinely collect extensive sound datasets from the natural environment. In this study, we demonstrate that SSL is capable of acquiring meaningful representations of bird sounds from audio recordings without the need for annotations. Our experiments showcase that these learned representations exhibit the capacity to generalize to new bird species in few-shot learning (FSL) scenarios. Additionally, we show that selecting windows with high bird activation for self-supervised learning, using a pretrained audio neural network, significantly enhances the quality of the learned representations.
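The two ideas in the abstract, selecting high-bird-activation windows with a pretrained audio network before self-supervised pretraining and then evaluating the frozen representations in a few-shot setting, can be illustrated with a minimal sketch. The function names (`score_bird_activation`, `ssl_embed`), the window and hop lengths, and the prototype-based classifier are illustrative assumptions for this sketch, not details taken from the paper.

```python
# Hypothetical sketch: pick high-activation windows from a long field recording,
# then classify new species few-shot with class prototypes built from frozen
# SSL embeddings. `score_bird_activation` stands in for a pretrained audio
# tagger (e.g. a PANNs-style model); `ssl_embed` stands in for the SSL encoder.
import numpy as np

def frame_windows(waveform: np.ndarray, sr: int, win_s: float = 5.0, hop_s: float = 2.5):
    """Split a 1-D waveform into overlapping fixed-length windows."""
    win, hop = int(win_s * sr), int(hop_s * sr)
    if len(waveform) < win:
        return np.empty((0, win))
    starts = range(0, len(waveform) - win + 1, hop)
    return np.stack([waveform[s:s + win] for s in starts])

def select_active_windows(windows: np.ndarray, score_bird_activation, top_k: int = 8):
    """Keep the top-k windows by the pretrained model's bird activation score."""
    scores = np.array([score_bird_activation(w) for w in windows])  # higher = more bird-like
    keep = np.argsort(scores)[::-1][:top_k]
    return windows[keep]

def prototype_classify(support_x, support_y, query_x, ssl_embed):
    """Few-shot evaluation: nearest class-mean (prototype) in the frozen SSL embedding space."""
    emb_s = np.stack([ssl_embed(x) for x in support_x])
    emb_q = np.stack([ssl_embed(x) for x in query_x])
    classes = sorted(set(support_y))
    protos = np.stack([emb_s[np.array(support_y) == c].mean(axis=0) for c in classes])
    # cosine similarity between each query embedding and each class prototype
    emb_q = emb_q / np.linalg.norm(emb_q, axis=1, keepdims=True)
    protos = protos / np.linalg.norm(protos, axis=1, keepdims=True)
    return [classes[i] for i in (emb_q @ protos.T).argmax(axis=1)]
```

In this reading, only windows that the pretrained tagger scores as bird-like feed the SSL objective, and few-shot generalization is measured without fine-tuning by comparing query clips to per-species prototypes in the learned embedding space.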

