ARAUS: A Large-Scale Dataset and Baseline Models of Affective Responses to Augmented Urban Soundscapes (2207.01078v4)
Abstract: Choosing optimal maskers for existing soundscapes to effect a desired perceptual change via soundscape augmentation is non-trivial due to extensive varieties of maskers and a dearth of benchmark datasets with which to compare and develop soundscape augmentation models. To address this problem, we make publicly available the ARAUS (Affective Responses to Augmented Urban Soundscapes) dataset, which comprises a five-fold cross-validation set and independent test set totaling 25,440 unique subjective perceptual responses to augmented soundscapes presented as audio-visual stimuli. Each augmented soundscape is made by digitally adding "maskers" (bird, water, wind, traffic, construction, or silence) to urban soundscape recordings at fixed soundscape-to-masker ratios. Responses were then collected by asking participants to rate how pleasant, annoying, eventful, uneventful, vibrant, monotonous, chaotic, calm, and appropriate each augmented soundscape was, in accordance with ISO 12913-2:2018. Participants also provided relevant demographic information and completed standard psychological questionnaires. We perform exploratory and statistical analysis of the responses obtained to verify internal consistency and agreement with known results in the literature. Finally, we demonstrate the benchmarking capability of the dataset by training and comparing four baseline models for urban soundscape pleasantness: a low-parameter regression model, a high-parameter convolutional neural network, and two attention-based networks in the literature.
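The mixing step described in the abstract, adding a masker to a soundscape recording at a fixed soundscape-to-masker ratio, can be illustrated with a minimal sketch. It assumes the ratio is expressed in decibels and is computed from the RMS amplitudes of two equal-length mono signals. The abstract does not specify how ARAUS defines or calibrates the ratio, so the function name `mix_at_smr`, the helper `rms`, and the RMS-based definition are illustrative assumptions rather than the authors' procedure.

```python
import numpy as np


def rms(x):
    """Root-mean-square amplitude of a signal."""
    return float(np.sqrt(np.mean(np.square(x))))


def mix_at_smr(soundscape, masker, smr_db):
    """Add `masker` to `soundscape` at a target soundscape-to-masker ratio.

    Assumption (not taken from the paper): the ratio is
    20*log10(rms(soundscape) / rms(scaled masker)) in dB.
    Returns the mixed signal and the gain applied to the masker.
    """
    soundscape = np.asarray(soundscape, dtype=float)
    masker = np.asarray(masker, dtype=float)
    masker_rms = rms(masker)
    if masker_rms == 0.0:
        # A silent masker leaves the soundscape unchanged.
        return soundscape.copy(), 0.0
    # Gain that brings the masker to the RMS level implied by smr_db.
    gain = rms(soundscape) / (masker_rms * 10.0 ** (smr_db / 20.0))
    return soundscape + gain * masker, gain
```

Under this definition, a positive `smr_db` leaves the masker quieter than the original soundscape, and a negative value makes it louder. A perceptually calibrated pipeline would more likely work with calibrated sound pressure levels than with raw sample amplitudes.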