Template-free Data-to-Text Generation of Finnish Sports News (1910.01863v1)

Published 4 Oct 2019 in cs.CL

Abstract: News articles such as sports game reports are often thought to closely follow the underlying game statistics, but in practice they contain a notable amount of background knowledge, interpretation, insight into the game, and quotes that are not present in the official statistics. This poses a challenge for automated data-to-text news generation with real-world news corpora as training data. We report on the development of a corpus of Finnish ice hockey news, edited to be suitable for training end-to-end news generation methods, and demonstrate the generation of text that journalists judged to be relatively close to a viable product. The new dataset and system source code are available for research purposes at https://github.com/scoopmatic/finnish-hockey-news-generation-paper.

Authors (5)
  1. Jenna Kanerva (17 papers)
  2. Samuel Rönnqvist (14 papers)
  3. Riina Kekki (1 paper)
  4. Tapio Salakoski (9 papers)
  5. Filip Ginter (28 papers)
Citations (18)
