2000 character limit reached
WikiMulti: a Corpus for Cross-Lingual Summarization (2204.11104v1)
Published 23 Apr 2022 in cs.CL
Abstract: Cross-lingual summarization (CLS) is the task to produce a summary in one particular language for a source document in a different language. We introduce WikiMulti - a new dataset for cross-lingual summarization based on Wikipedia articles in 15 languages. As a set of baselines for further studies, we evaluate the performance of existing cross-lingual abstractive summarization methods on our dataset. We make our dataset publicly available here: https://github.com/tikhonovpavel/wikimulti
- Pavel Tikhonov (5 papers)
- Valentin Malykh (24 papers)