Improving Distantly-supervised Entity Typing with Compact Latent Space Clustering (1904.06475v1)

Published 13 Apr 2019 in cs.CL and cs.AI

Abstract: Recently, distant supervision has achieved great success in Fine-grained Entity Typing (FET). Despite its efficiency in reducing manual labeling effort, it also brings the challenge of dealing with false entity type labels, as distant supervision assigns labels in a context-agnostic manner. Existing work alleviates this issue with a partial-label loss but usually suffers from confirmation bias: the classifier fits a pseudo data distribution that it produced itself. In this work, we propose to regularize distantly supervised models with Compact Latent Space Clustering (CLSC) to bypass this problem while still making effective use of noisy data. Our proposed method first dynamically constructs a similarity graph of different entity mentions and then infers the labels of noisy instances via label propagation. Based on the inferred labels, mention embeddings are updated to encourage entity mentions with close semantics to form a compact cluster in the embedding space, thus leading to better classification performance. Extensive experiments on standard benchmarks show that our CLSC model consistently outperforms state-of-the-art distantly supervised entity typing systems by a significant margin.
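
The graph-construction and label-propagation steps mentioned in the abstract can be illustrated with a minimal, standard-library Python sketch. This is not the authors' implementation: the cosine-similarity k-nearest-neighbor graph, the fixed number of propagation steps, and the names `build_transition` and `propagate_labels` are all assumptions chosen for illustration, and the clustering-based embedding update is not shown.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def build_transition(embeddings, k=2):
    """Row-normalized k-nearest-neighbor similarity graph (an assumed design)."""
    n = len(embeddings)
    rows = []
    for i in range(n):
        # Clip negative similarities to zero so edge weights stay non-negative.
        sims = [(max(cosine(embeddings[i], embeddings[j]), 0.0), j)
                for j in range(n) if j != i]
        sims.sort(reverse=True)
        row = [0.0] * n
        for s, j in sims[:k]:
            row[j] = s
        total = sum(row)
        rows.append([w / total for w in row] if total else row)
    return rows


def propagate_labels(transition, seed_labels, num_classes, steps=20):
    """Spread labels from clean mentions (int) to noisy ones (None)."""
    n = len(transition)
    uniform = [1.0 / num_classes] * num_classes

    def one_hot(c):
        return [1.0 if i == c else 0.0 for i in range(num_classes)]

    dist = [one_hot(l) if l is not None else uniform[:] for l in seed_labels]
    for _ in range(steps):
        new_dist = []
        for i in range(n):
            if seed_labels[i] is not None:
                # Clean mentions keep their label throughout propagation.
                new_dist.append(dist[i])
                continue
            mixed = [sum(transition[i][j] * dist[j][c] for j in range(n))
                     for c in range(num_classes)]
            total = sum(mixed)
            new_dist.append([m / total for m in mixed] if total else uniform[:])
        dist = new_dist
    # Return the most probable class index for every mention.
    return [max(range(num_classes), key=lambda c: d[c]) for d in dist]


# Hypothetical toy data: four 2-d mention embeddings, two of them unlabeled.
embeddings = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
seeds = [0, None, 1, None]
inferred = propagate_labels(build_transition(embeddings), seeds, num_classes=2)
```

In this toy setup each unlabeled mention inherits the label of the clean mention it sits closest to in the embedding space, which is the intuition behind pulling semantically close mentions into one compact cluster.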

Authors (7)
  1. Bo Chen (309 papers)
  2. Xiaotao Gu (32 papers)
  3. Yufeng Hu (6 papers)
  4. Siliang Tang (116 papers)
  5. Guoping Hu (39 papers)
  6. Yueting Zhuang (164 papers)
  7. Xiang Ren (194 papers)
Citations (18)
