Capturing Evolution in Word Usage: Just Add More Clusters? (2001.06629v2)

Published 18 Jan 2020 in cs.CL

Abstract: The way words are used evolves through time, mirroring the cultural and technological evolution of society. Semantic change detection is the task of detecting and analysing word evolution in textual data, even over short periods of time. In this paper we focus on a new set of methods relying on contextualised embeddings, a type of semantic modelling that has recently revolutionised the NLP field. We leverage the ability of the transformer-based BERT model to generate contextualised embeddings capable of detecting semantic change of words across time. Several approaches are compared in a common setting in order to establish the strengths and weaknesses of each. We also propose several ideas for improvement, which drastically improve the performance of existing approaches.
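
To make the idea of detecting semantic change from contextualised embeddings concrete, here is a minimal sketch. It is not the authors' method: it assumes each usage of a word in a time period has already been turned into a vector (for example by a BERT-style model), averages those vectors per period, and scores change as the cosine distance between the two averages. The function names and the synthetic vectors are hypothetical and used only for illustration; the clustering-based approaches hinted at in the title are not implemented here.

```python
import math
import random


def mean_vector(vectors):
    """Average a list of equal-length usage vectors into one vector."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine_distance(a, b):
    """Return 1 - cosine similarity between two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def semantic_change_score(usages_t1, usages_t2):
    """Hypothetical score: distance between the averaged usage vectors
    of one word in two time periods (higher means more change)."""
    return cosine_distance(mean_vector(usages_t1), mean_vector(usages_t2))


def synthetic_usages(center, n, noise, rng):
    """Stand-in for real contextualised embeddings: noisy copies of a
    centre vector. Real embeddings would come from a language model."""
    return [[c + rng.gauss(0.0, noise) for c in center] for _ in range(n)]


rng = random.Random(0)

# A word whose usage stays the same across both periods.
stable_t1 = synthetic_usages([1.0, 0.0, 0.0], 50, 0.05, rng)
stable_t2 = synthetic_usages([1.0, 0.0, 0.0], 50, 0.05, rng)

# A word whose usage moves to a different region of the space.
shifted_t2 = synthetic_usages([0.0, 1.0, 0.0], 50, 0.05, rng)

stable_score = semantic_change_score(stable_t1, stable_t2)
shifted_score = semantic_change_score(stable_t1, shifted_t2)

print(f"stable word:  {stable_score:.3f}")
print(f"shifted word: {shifted_score:.3f}")
```

Averaging collapses all usages in a period into a single point, so it cannot separate a word that gains a new sense from one whose existing senses merely change in frequency; grouping usages into clusters and comparing the cluster distributions across periods is one way to keep that information.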

Authors (4)
  1. Matej Martinc (14 papers)
  2. Syrielle Montariol (22 papers)
  3. Elaine Zosa (9 papers)
  4. Lidia Pivovarova (6 papers)
Citations (47)
