Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Entity-centered Cross-document Relation Extraction (2210.16541v1)

Published 29 Oct 2022 in cs.CL

Abstract: Relation Extraction (RE) is a fundamental task of information extraction, which has attracted a large amount of research attention. Previous studies focus on extracting the relations within a sentence or document, while currently researchers begin to explore cross-document RE. However, current cross-document RE methods directly utilize text snippets surrounding target entities in multiple given documents, which brings considerable noisy and non-relevant sentences. Moreover, they utilize all the text paths in a document bag in a coarse-grained way, without considering the connections between these text paths.In this paper, we aim to address both of these shortages and push the state-of-the-art for cross-document RE. First, we focus on input construction for our RE model and propose an entity-based document-context filter to retain useful information in the given documents by using the bridge entities in the text paths. Second, we propose a cross-document RE model based on cross-path entity relation attention, which allow the entity relations across text paths to interact with each other. We compare our cross-document RE method with the state-of-the-art methods in the dataset CodRED. Our method outperforms them by at least 10% in F1, thus demonstrating its effectiveness.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Fengqi Wang (1 paper)
  2. Fei Li (232 papers)
  3. Hao Fei (105 papers)
  4. Jingye Li (15 papers)
  5. Shengqiong Wu (36 papers)
  6. Fangfang Su (3 papers)
  7. Wenxuan Shi (7 papers)
  8. Donghong Ji (50 papers)
  9. Bo Cai (5 papers)
Citations (24)