2000 character limit reached
UWB @ DIACR-Ita: Lexical Semantic Change Detection with CCA and Orthogonal Transformation (2011.14678v1)
Published 30 Nov 2020 in cs.CL
Abstract: In this paper, we describe our method for detection of lexical semantic change (i.e., word sense changes over time) for the DIACR-Ita shared task, where we ranked $1{st}$. We examine semantic differences between specific words in two Italian corpora, chosen from different time periods. Our method is fully unsupervised and language independent. It consists of preparing a semantic vector space for each corpus, earlier and later. Then we compute a linear transformation between earlier and later spaces, using CCA and Orthogonal Transformation. Finally, we measure the cosines between the transformed vectors.