Impact of Non-Babble Noise Types on XLAVS-R Robustness
Determine the impact of noise types other than babble on the performance and robustness of the XLAVS-R cross-lingual audio-visual speech representation when evaluated on noisy inputs, by assessing how different noise conditions affect accuracy and error rates during inference.
References
For instance, we simulate noisy environments only with the “babble” sound in testing experimental setup, and it remains to be seen how other types of noise might impact our model.
                — XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception
                
                (2403.14402 - Han et al., 21 Mar 2024) in Section Limitations