Unified Structure Generation for Universal Information Extraction (2203.12277v1)

Published 23 Mar 2022 in cs.CL

Abstract: Information extraction suffers from its varying targets, heterogeneous structures, and demand-specific schemas. In this paper, we propose a unified text-to-structure generation framework, namely UIE, which can universally model different IE tasks, adaptively generate targeted structures, and collaboratively learn general IE abilities from different knowledge sources. Specifically, UIE uniformly encodes different extraction structures via a structured extraction language, adaptively generates target extractions via a schema-based prompt mechanism - structural schema instructor, and captures the common IE abilities via a large-scale pre-trained text-to-structure model. Experiments show that UIE achieved the state-of-the-art performance on 4 IE tasks, 13 datasets, and on all supervised, low-resource, and few-shot settings for a wide range of entity, relation, event and sentiment extraction tasks and their unification. These results verified the effectiveness, universality, and transferability of UIE.

PDF Abstract

Insightful Overview of "Unified Structure Generation for Universal Information Extraction"

The paper "Unified Structure Generation for Universal Information Extraction" introduces a conceptual breakthrough in Information Extraction (IE) through the proposal of Unified Information Extraction (UIE), a universal architecture designed to overcome the fragmentation that characterizes traditional IE systems. Traditional IE methods have been typified by their dependency on task-specific models, resulting in intricate architectures and isolated model training processes for varying IE tasks such as entity, relation, event, and sentiment extraction. The paper posits that such specialization constrains effective knowledge transfer and fails to promote efficient adaptation to novel scenarios.

The UIE framework suggested in this research strives to streamline the IE task landscape by presenting a universal model capable of text-to-structure generation across multiple IE tasks. This unified approach is made feasible through leveraging a Structural Extraction Language (SEL) and a novel schema-based prompting mechanism termed the Structural Schema Instructor (SSI), which guide the generation of diverse targeted structures. SEL allows the encoding of various structures into a homogeneous representation, thus simplifying the IE task into a uniform text-to-structure transformation problem. By employing SSI, the UIE model manages to control the extraction process through schema-based prompts which specify the desired spotting and associating tasks.

The framework is further enhanced by a large-scale pre-trained text-to-structure model which captures the general IE capabilities, providing a robust foundation for adapting the model to different tasks efficiently. The UIE framework has been thoroughly evaluated across an extensive set of benchmarks comprising 13 datasets and 4 distinct tasks. The results showcase UIE achieving state-of-the-art performance, with an average improvement of 1.42% F1-score over specialized systems. Particularly noteworthy is the robustness of UIE under low-resource and few-shot settings, highlighting its adaptability and transferability.

Technically, UIE provides an efficient solution for the joint extraction of entities and their relationships by integrating different pipeline tasks into a single framework. The SSI's schema-based prompts, guiding which labels to associate and which structural transformations to generate, are a pivotal innovation enabling this flexible extraction process across tasks with varying schema specifications. Moreover, by pre-training a text-to-structure model capable of managing both structured and unstructured data, the framework presents a complete, accessible, and comprehensive outlook on unified information extraction.

This fusion of unification and adaptability sets a trajectory for future work in areas such as KB-aware tasks, including entity linking and coreference resolution within the IE field. The promise shown by UIE in efficiently adapting to different extraction tasks without extensive task-specific engineering underscores a significant milestone toward generalist AI models capable of human-like comprehension and structure generation across diverse data forms. This achievement in eliminating cross-task variability within IE is poised to streamline knowledge extraction processes from vast and heterogeneously structured datasets, which are ubiquitous in real-world applications such as business intelligence, sentiment analysis, and event monitoring.

PDF Markdown Bookmark Chat (Pro)

Authors (8)

Yaojie Lu (61 papers)
Qing Liu (196 papers)
Dai Dai (3 papers)
Xinyan Xiao (41 papers)
Hongyu Lin (94 papers)
Xianpei Han (103 papers)
Le Sun (111 papers)
Hua Wu (191 papers)

Citations (359)

View on Semantic Scholar

Related Papers

Find Related Papers

GitHub

Unified Structure Generation for Universal Information Extraction | universal-ie.github.io