RADIA -- Radio Advertisement Detection with Intelligent Analytics (2403.03538v1)
Abstract: Radio advertising remains an integral part of modern marketing strategies, with its appeal and potential for targeted reach undeniably effective. However, the dynamic nature of radio airtime and the rising trend of multiple radio spots necessitates an efficient system for monitoring advertisement broadcasts. This study investigates a novel automated radio advertisement detection technique incorporating advanced speech recognition and text classification algorithms. RadIA's approach surpasses traditional methods by eliminating the need for prior knowledge of the broadcast content. This contribution allows for detecting impromptu and newly introduced advertisements, providing a comprehensive solution for advertisement detection in radio broadcasting. Experimental results show that the resulting model, trained on carefully segmented and tagged text data, achieves an F1-macro score of 87.76 against a theoretical maximum of 89.33. This paper provides insights into the choice of hyperparameters and their impact on the model's performance. This study demonstrates its potential to ensure compliance with advertising broadcast contracts and offer competitive surveillance. This groundbreaking research could fundamentally change how radio advertising is monitored and open new doors for marketing optimization.
- InfoAdex. Resumen de los estudios InfoAdex. https://www.infoadex.es/home/resumen-estudios-infoadex, 2023. [Online; accessed 19-July-2023].
- Oblicua. Publicidad en radio. https://www.oblicua.es/publicidad/publicidad-radio.htm, 2023. [Online; accessed 19-July-2023].
- Feng Rong. Audio Classification Method Based on Machine Learning. In 2016 International Conference on Intelligent Transportation, Big Data & Smart City (ICITBS), pages 81–84. IEEE Computer Society, December 2016.
- Trends in audio signal feature extraction methods. Applied Acoustics, 158:107020, January 2020.
- Tara N. Sainath and Bo Li. Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks. In Interspeech 2016, pages 813–817. ISCA, September 2016.
- CNN architectures for large-scale audio classification. In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 131–135, March 2017. ISSN: 2379-190X.
- Avery Wang. The Shazam music recognition service. Communications of the ACM, 49(8):44–48, August 2006.
- Will Drevo. Dejavu: open-source audio fingerprinting project. https://pypi.org/project/PyDejavu/, 2014. [Online; accessed 19-July-2023].
- Content-based advertisement detection. In Interspeech 2010, pages 2214–2217. ISCA, September 2010.
- Alexandre Storelli. Adblock Radio. https://www.adblockradio.com, 2021. [Online; accessed 19-July-2023].
- Advertisement detection in commercial radio channels. In 2015 IEEE 10th International Conference on Industrial and Information Systems (ICIIS), pages 272–277, Peradeniya, Sri Lanka, December 2015. IEEE.
- Detection of repeating items in audio streams using data-driven ALISP sequencing. In 2014 1st International Conference on Advanced Technologies for Signal and Image Processing (ATSIP), pages 446–451, Sousse, Tunisia, March 2014. IEEE.
- A deep hybrid model for advertisements detection in broadcast TV and radio content. International Journal of Computational Vision and Robotics, 12(4):397, 2022.
- Comparison of semi-supervised deep learning algorithms for audio classification. EURASIP Journal on Audio, Speech, and Music Processing, 2022(1):23, September 2022.
- wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations, October 2020. arXiv:2006.11477 [cs, eess].
- Automatic speech recognition: a survey. Multimedia Tools and Applications, 80(6):9411–9457, March 2021.
- SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network, April 2021. arXiv:2104.02133 [cs].
- Robust Speech Recognition via Large-Scale Weak Supervision. arXiv, 2022. Publisher: arXiv Version Number: 1.
- Yuanzhen Guo and M. Iwaihara. Detection of text-based advertising and promotion in Wikipedia by deep learning method. In DEIM Forum D5-1, pages 1–7, 2015.
- Label Studio: Data labeling software, 2020-2022. Open source software available from https://github.com/heartexlabs/label-studio.
- Unsupervised Cross-lingual Representation Learning at Scale, April 2020. arXiv:1911.02116 [cs].
- BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding, May 2019. arXiv:1810.04805 [cs].
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.