Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction (2106.03084v2)

Published 6 Jun 2021 in cs.CL

Abstract: Bilingual Lexicon Induction (BLI) aims to map words in one language to their translations in another, and is typically through learning linear projections to align monolingual word representation spaces. Two classes of word representations have been explored for BLI: static word embeddings and contextual representations, but there is no studies to combine both. In this paper, we propose a simple yet effective mechanism to combine the static word embeddings and the contextual representations to utilize the advantages of both paradigms. We test the combination mechanism on various language pairs under the supervised and unsupervised BLI benchmark settings. Experiments show that our mechanism consistently improves performances over robust BLI baselines on all language pairs by averagely improving 3.2 points in the supervised setting, and 3.1 points in the unsupervised setting.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Jinpeng Zhang (11 papers)
  2. Baijun Ji (5 papers)
  3. Nini Xiao (4 papers)
  4. Xiangyu Duan (10 papers)
  5. Min Zhang (630 papers)
  6. Yangbin Shi (2 papers)
  7. Weihua Luo (63 papers)
Citations (20)