Synthetic data enables context-aware bioacoustic sound event detection (2503.00296v1)
Abstract: We propose a methodology for training foundation models that enhances their in-context learning capabilities within the domain of bioacoustic signal processing. We use synthetically generated training data, introducing a domain-randomization-based pipeline that constructs diverse acoustic scenes with temporally strong labels. We generate over 8,800 hours of strongly labeled audio and train a query-by-example, transformer-based model to perform few-shot bioacoustic sound event detection. Our second contribution is a public benchmark of 13 diverse few-shot bioacoustic tasks. Our model outperforms previously published methods by 49%, and we demonstrate that this gain is due to both model design and data scale. We make our trained model available via an API, providing ecologists and ethologists with a training-free tool for bioacoustic sound event detection.
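To make the data-generation idea concrete, the following is a minimal Python sketch of a domain-randomization pipeline that mixes randomly chosen animal-call clips into a background recording at random times and gains, emitting temporally strong (onset, offset) labels. All names and parameters (`make_scene`, `scene_len_s`, the gain range) are hypothetical illustrations, not the paper's actual pipeline or API.

```python
# Hypothetical sketch of domain-randomized acoustic scene synthesis with strong labels.
import numpy as np

def make_scene(background, event_clips, sr=16000, scene_len_s=10.0, rng=None):
    """Mix randomly chosen event clips into a background at random times and gains.

    Returns the scene waveform and a list of (onset, offset) labels in seconds.
    """
    rng = rng or np.random.default_rng()
    n = int(scene_len_s * sr)

    # Random crop of the background recording (assumed longer than the scene).
    start = rng.integers(0, max(1, len(background) - n))
    scene = background[start:start + n].astype(np.float32).copy()

    labels = []
    for _ in range(rng.integers(1, 6)):              # random number of events per scene
        clip = event_clips[rng.integers(len(event_clips))]
        gain = 10 ** (rng.uniform(-20.0, 0.0) / 20)  # randomized level (dB) for diversity
        onset = int(rng.integers(0, n - len(clip)))
        scene[onset:onset + len(clip)] += gain * clip.astype(np.float32)
        labels.append((onset / sr, (onset + len(clip)) / sr))  # temporally strong label
    return scene, labels
```

In a query-by-example setup, scenes like these would supply both the labeled support examples and the query audio over which the transformer predicts event boundaries; the randomization over clips, timing, and level is what drives scene diversity at scale.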