Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Improving Low-resource Reading Comprehension via Cross-lingual Transposition Rethinking (2107.05002v2)

Published 11 Jul 2021 in cs.CL and cs.AI

Abstract: Extractive Reading Comprehension (ERC) has made tremendous advances enabled by the availability of large-scale high-quality ERC training data. Despite of such rapid progress and widespread application, the datasets in languages other than high-resource languages such as English remain scarce. To address this issue, we propose a Cross-Lingual Transposition ReThinking (XLTT) model by modelling existing high-quality extractive reading comprehension datasets in a multilingual environment. To be specific, we present multilingual adaptive attention (MAA) to combine intra-attention and inter-attention to learn more general generalizable semantic and lexical knowledge from each pair of language families. Furthermore, to make full use of existing datasets, we adopt a new training framework to train our model by calculating task-level similarities between each existing dataset and target dataset. The experimental results show that our XLTT model surpasses six baselines on two multilingual ERC benchmarks, especially more effective for low-resource languages with 3.9 and 4.1 average improvement in F1 and EM, respectively.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Gaochen Wu (5 papers)
  2. Bin Xu (192 papers)
  3. Yuxin Qin (3 papers)
  4. Fei Kong (39 papers)
  5. Bangchang Liu (4 papers)
  6. Hongwen Zhao (2 papers)
  7. Dejie Chang (4 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.