Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Template-free Data-to-Text Generation of Finnish Sports News (1910.01863v1)

Published 4 Oct 2019 in cs.CL

Abstract: News articles such as sports game reports are often thought to closely follow the underlying game statistics, but in practice they contain a notable amount of background knowledge, interpretation, insight into the game, and quotes that are not present in the official statistics. This poses a challenge for automated data-to-text news generation with real-world news corpora as training data. We report on the development of a corpus of Finnish ice hockey news, edited to be suitable for training of end-to-end news generation methods, as well as demonstrate generation of text, which was judged by journalists to be relatively close to a viable product. The new dataset and system source code are available for research purposes at https://github.com/scoopmatic/finnish-hockey-news-generation-paper.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Jenna Kanerva (17 papers)
  2. Samuel Rönnqvist (14 papers)
  3. Riina Kekki (1 paper)
  4. Tapio Salakoski (9 papers)
  5. Filip Ginter (28 papers)
Citations (18)