SBAAM! Eliminating Transcript Dependency in Automatic Subtitling (2405.10741v1)

Published 17 May 2024 in cs.CL

Abstract: Subtitling plays a crucial role in enhancing the accessibility of audiovisual content and encompasses three primary subtasks: translating spoken dialogue, segmenting translations into concise textual units, and estimating timestamps that govern their on-screen duration. Past attempts to automate this process rely, to varying degrees, on automatic transcripts, employed diversely for the three subtasks. In response to the acknowledged limitations associated with this reliance on transcripts, recent research has shifted towards transcription-free solutions for translation and segmentation, leaving the direct generation of timestamps as uncharted territory. To fill this gap, we introduce the first direct model capable of producing automatic subtitles, entirely eliminating any dependence on intermediate transcripts also for timestamp prediction. Experimental results, backed by manual evaluation, showcase our solution's new state-of-the-art performance across multiple language pairs and diverse conditions.

PDF HTML Abstract

Summarize Bookmark Chat (Pro)

References (83)

Authors (5)

Marco Gaido (47 papers)
Sara Papi (33 papers)
Matteo Negri (93 papers)
Mauro Cettolo (20 papers)
Luisa Bentivogli (38 papers)

Citations (1)

View on Semantic Scholar

SBAAM! Eliminating Transcript Dependency in Automatic Subtitling (2405.10741v1)

Related Papers

Tweets