Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan (2404.09342v3)

Published 14 Apr 2024 in cs.CV, cs.SD, and eess.AS

Abstract: Advances in technology have led to the use of multimodal systems in various real-world applications, among which audio-visual systems are some of the most widely used. In recent years, associating a person's face and voice has gained attention due to the unique correlation between them. The Face-voice Association in Multilingual Environments (FAME) Challenge 2024 focuses on exploring face-voice association under the unique condition of a multilingual scenario. This condition is inspired by the fact that half of the world's population is bilingual, and people most often communicate in multilingual settings. The challenge uses the Multilingual Audio-Visual (MAV-Celeb) dataset for exploring face-voice association in multilingual environments. This report provides the details of the challenge, dataset, baselines, and tasks for the FAME Challenge.
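To make the task concrete, below is a minimal sketch of how face-voice association is typically scored in a verification setup: face and voice embeddings from separate encoders are compared with cosine similarity, and a threshold decides whether the pair belongs to the same identity. The encoders, embedding dimensionality, and threshold here are illustrative assumptions, not details taken from the FAME evaluation plan.

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def same_identity(face_emb: np.ndarray, voice_emb: np.ndarray,
                  threshold: float = 0.5) -> bool:
    """Accept the face-voice pair as one identity if similarity
    exceeds a threshold. Sweeping the threshold over a trial list
    yields the equal error rate (EER), the usual verification metric."""
    return cosine_similarity(face_emb, voice_emb) >= threshold

# Hypothetical usage: in practice the embeddings would come from
# pretrained face and speaker encoders projected into a shared space.
rng = np.random.default_rng(0)
face = rng.standard_normal(512)   # stand-in for a face embedding
voice = rng.standard_normal(512)  # stand-in for a voice embedding
print(same_identity(face, voice))
```

In the multilingual condition the challenge targets, the trial lists would pair faces with voices recorded in a language unseen during training, which is what makes the MAV-Celeb setting distinctive.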
