2000 character limit reached
ABSent: Cross-Lingual Sentence Representation Mapping with Bidirectional GANs (2001.11121v1)
Published 29 Jan 2020 in cs.CL and cs.LG
Abstract: A number of cross-lingual transfer learning approaches based on neural networks have been proposed for the case when large amounts of parallel text are at our disposal. However, in many real-world settings, the size of parallel annotated training data is restricted. Additionally, prior cross-lingual mapping research has mainly focused on the word level. This raises the question of whether such techniques can also be applied to effortlessly obtain cross-lingually aligned sentence representations. To this end, we propose an Adversarial Bi-directional Sentence Embedding Mapping (ABSent) framework, which learns mappings of cross-lingual sentence representations from limited quantities of parallel data.
- Zuohui Fu (28 papers)
- Yikun Xian (12 papers)
- Shijie Geng (33 papers)
- Yingqiang Ge (36 papers)
- Yuting Wang (112 papers)
- Xin Dong (90 papers)
- Guang Wang (21 papers)
- Gerard de Melo (78 papers)