
WikiMulti: a Corpus for Cross-Lingual Summarization (2204.11104v1)

Published 23 Apr 2022 in cs.CL

Abstract: Cross-lingual summarization (CLS) is the task of producing a summary in one language for a source document written in a different language. We introduce WikiMulti, a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages. As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our dataset. We make our dataset publicly available here: https://github.com/tikhonovpavel/wikimulti

Authors (2)
  1. Pavel Tikhonov (5 papers)
  2. Valentin Malykh (24 papers)
Citations (3)