2000 character limit reached
Bootstrapping Multilingual AMR with Contextual Word Alignments (2102.02189v1)
Published 3 Feb 2021 in cs.CL and cs.AI
Abstract: We develop high performance multilingualAbstract Meaning Representation (AMR) sys-tems by projecting English AMR annotationsto other languages with weak supervision. Weachieve this goal by bootstrapping transformer-based multilingual word embeddings, in partic-ular those from cross-lingual RoBERTa (XLM-R large). We develop a novel technique forforeign-text-to-English AMR alignment, usingthe contextual word alignment between En-glish and foreign language tokens. This wordalignment is weakly supervised and relies onthe contextualized XLM-R word embeddings.We achieve a highly competitive performancethat surpasses the best published results forGerman, Italian, Spanish and Chinese.
- Janaki Sheth (5 papers)
- Young-Suk Lee (17 papers)
- Tahira Naseem (27 papers)
- Radu Florian (54 papers)
- Salim Roukos (41 papers)
- Todd Ward (4 papers)
- Ramon Fernandez Astudillo (11 papers)