Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

QA4IE: A Question Answering based Framework for Information Extraction (1804.03396v2)

Published 10 Apr 2018 in cs.IR, cs.AI, and cs.CL

Abstract: Information Extraction (IE) refers to automatically extracting structured relation tuples from unstructured texts. Common IE solutions, including Relation Extraction (RE) and open IE systems, can hardly handle cross-sentence tuples, and are severely restricted by limited relation types as well as informal relation specifications (e.g., free-text based relation tuples). In order to overcome these weaknesses, we propose a novel IE framework named QA4IE, which leverages the flexible question answering (QA) approaches to produce high quality relation triples across sentences. Based on the framework, we develop a large IE benchmark with high quality human evaluation. This benchmark contains 293K documents, 2M golden relation triples, and 636 relation types. We compare our system with some IE baselines on our benchmark and the results show that our system achieves great improvements.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (10)
  1. Lin Qiu (47 papers)
  2. Hao Zhou (351 papers)
  3. Yanru Qu (19 papers)
  4. Weinan Zhang (322 papers)
  5. Suoheng Li (1 paper)
  6. Shu Rong (3 papers)
  7. Dongyu Ru (11 papers)
  8. Lihua Qian (8 papers)
  9. Kewei Tu (74 papers)
  10. Yong Yu (219 papers)
Citations (18)

Summary

We haven't generated a summary for this paper yet.