Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Between Flexibility and Consistency: Joint Generation of Captions and Subtitles (2107.06246v1)

Published 13 Jul 2021 in cs.CL

Abstract: Speech translation (ST) has lately received growing interest for the generation of subtitles without the need for an intermediate source language transcription and timing (i.e. captions). However, the joint generation of source captions and target subtitles does not only bring potential output quality advantages when the two decoding processes inform each other, but it is also often required in multilingual scenarios. In this work, we focus on ST models which generate consistent captions-subtitles in terms of structure and lexical content. We further introduce new metrics for evaluating subtitling consistency. Our findings show that joint decoding leads to increased performance and consistency between the generated captions and subtitles while still allowing for sufficient flexibility to produce subtitles conforming to language-specific needs and norms.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Alina Karakanta (9 papers)
  2. Marco Gaido (47 papers)
  3. Matteo Negri (93 papers)
  4. Marco Turchi (51 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.