Ultra-Fine Entity Typing (1807.04905v1)

Published 13 Jul 2018 in cs.CL, cs.AI, and cs.LG

Abstract: We introduce a new entity typing task: given a sentence with an entity mention, the goal is to predict a set of free-form phrases (e.g. skyscraper, songwriter, or criminal) that describe appropriate types for the target entity. This formulation allows us to use a new type of distant supervision at large scale: head words, which indicate the type of the noun phrases they appear in. We show that these ultra-fine types can be crowd-sourced, and introduce new evaluation sets that are much more diverse and fine-grained than existing benchmarks. We present a model that can predict open types, and is trained using a multitask objective that pools our new head-word supervision with prior supervision from entity linking. Experimental results demonstrate that our model is effective in predicting entity types at varying granularity; it achieves state of the art performance on an existing fine-grained entity typing benchmark, and sets baselines for our newly-introduced datasets. Our data and model can be downloaded from: http://nlp.cs.washington.edu/entity_type

Authors (4)
  1. Eunsol Choi (76 papers)
  2. Omer Levy (70 papers)
  3. Yejin Choi (287 papers)
  4. Luke Zettlemoyer (225 papers)
Citations (201)

Summary

Ultra-Fine Entity Typing

The paper "Ultra-Fine Entity Typing" by Eunsol Choi, Omer Levy, Yejin Choi, and Luke Zettlemoyer explores the intricate challenge of entity typing at a granularity beyond traditional approaches in natural language processing. Entity typing involves categorizing entities mentioned in text into predefined categories. Traditional systems have typically employed a limited set of coarse-grained categories. This research proposes and explores a more nuanced approach by introducing ultra-fine entity types that extend existing labels to a much finer granularity, ultimately aiming to improve the precision and applicability of entity typing in various domain-specific tasks.

In this work, the authors introduce a new crowd-sourced dataset in which entity mentions are annotated with types ranging from coarse to ultra-fine granularity. This dataset serves as a benchmark for evaluating systems designed to handle a highly granular, open type vocabulary. Because the labels are free-form noun phrases rather than a small fixed inventory, the label space is far more diverse than in existing benchmarks; to train at scale, the paper pairs the crowd-sourced data with a new form of distant supervision from head words, which indicate the type of the noun phrase they appear in (see the sketch below).
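A minimal sketch of that head-word signal, assuming spaCy for parsing; the paper's extraction pipeline is not specified here, so treat this as an approximation rather than the authors' method.

```python
# Head-word distant supervision: the syntactic head of a noun phrase is
# a plausible free-form type label for the phrase itself.
import spacy

nlp = spacy.load("en_core_web_sm")

def head_word_labels(sentence: str):
    """Yield (noun_phrase, head_word) pairs as distant type labels."""
    doc = nlp(sentence)
    for np in doc.noun_chunks:
        # np.root is the syntactic head of the noun chunk, e.g.
        # "the famous Seattle-based songwriter" -> "songwriter".
        yield np.text, np.root.lemma_.lower()

for phrase, label in head_word_labels(
    "The famous Seattle-based songwriter released a new album."
):
    print(f"{phrase!r} -> type {label!r}")
```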

A notable contribution of the paper is a model that treats open type prediction as large-scale multi-label classification. The model combines a representation of the mention itself with an encoding of its sentential context; the authors experiment with attentive neural networks that weight context words by their relevance to the mention, and they highlight the challenge of balancing precision and recall under such a detailed classification schema. A simplified sketch follows.
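This sketch shows one way such an attentive multi-label typer can be wired up, assuming PyTorch; the dimensions, attention form, and mention encoder are illustrative rather than the paper's exact architecture.

```python
import torch
import torch.nn as nn

class AttentiveTyper(nn.Module):
    def __init__(self, vocab_size, num_types, emb_dim=100, hid_dim=100):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.context_enc = nn.LSTM(emb_dim, hid_dim, bidirectional=True,
                                   batch_first=True)
        self.attn = nn.Linear(2 * hid_dim, 1)  # scores each context token
        self.classifier = nn.Linear(2 * hid_dim + emb_dim, num_types)

    def forward(self, context_ids, mention_ids):
        # Encode the sentence, then attend over the encoded tokens.
        h, _ = self.context_enc(self.embed(context_ids))        # (B, T, 2H)
        weights = torch.softmax(self.attn(h).squeeze(-1), dim=-1)  # (B, T)
        context_vec = (weights.unsqueeze(-1) * h).sum(dim=1)       # (B, 2H)
        # Represent the mention as the mean of its word embeddings.
        mention_vec = self.embed(mention_ids).mean(dim=1)          # (B, E)
        return self.classifier(torch.cat([context_vec, mention_vec], dim=-1))

# Multi-label setup: one sigmoid per type, binary cross-entropy loss.
model = AttentiveTyper(vocab_size=50_000, num_types=10_000)
logits = model(torch.randint(0, 50_000, (4, 20)),   # context token ids
               torch.randint(0, 50_000, (4, 3)))    # mention token ids
loss = nn.BCEWithLogitsLoss()(logits, torch.zeros(4, 10_000))
```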

The paper reports strong numerical results. Trained with a multitask objective that pools the new head-word supervision with prior supervision from entity linking, the model predicts types effectively at varying granularity: it achieves state-of-the-art performance on an existing fine-grained entity typing benchmark and sets baselines on the newly introduced datasets. The combination of neural attention mechanisms and multitask learning yields substantial improvements in correctly identifying and classifying entities into ultra-fine categories.
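One plausible way to pool such heterogeneous supervision into a single objective is to mask the multi-label loss to whatever slice of the type space a given source annotates. This is a hedged sketch in PyTorch, not the paper's exact loss; the slice boundaries below are hypothetical.

```python
import torch
import torch.nn.functional as F

def multitask_loss(logits, targets, label_mask):
    """Binary cross-entropy restricted to types this source annotates.

    logits, targets: (batch, num_types); label_mask: (num_types,) with
    1.0 where the supervision source is informative, 0.0 elsewhere.
    """
    per_type = F.binary_cross_entropy_with_logits(
        logits, targets, reduction="none")
    return (per_type * label_mask).sum() / label_mask.sum().clamp(min=1)

# Example: a head-word batch supervises only the ultra-fine slice of
# the label space; crowd-annotated batches would use a full mask.
num_types = 10_000
mask = torch.zeros(num_types)
mask[130:] = 1.0  # hypothetical: indices >= 130 are ultra-fine types
logits = torch.randn(4, num_types)
targets = torch.zeros(4, num_types)
loss = multitask_loss(logits, targets, mask)
```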

A key implication of this research is the enhancement of information extraction systems by offering more specific and contextually relevant types, thereby improving downstream applications such as question answering and knowledge base population. From a theoretical standpoint, the expansion to ultra-fine types challenges existing modeling assumptions and pushes for advancements in both model architectures and training strategies to cope with the increased label complexity.

Looking ahead, ultra-fine entity typing opens avenues for more richly annotated corpora that could power more context-aware AI systems, leading to further improvements in language understanding and in the performance of AI systems on real-world applications. Continued advances can be expected as machine learning models become more adept at handling ultra-fine granularity in natural language processing.