Reward-based Input Construction for Cross-document Relation Extraction (2405.20649v1)

Published 31 May 2024 in cs.CL and cs.LG

Abstract: Relation extraction (RE) is a fundamental task in natural language processing, aiming to identify relations between target entities in text. While many RE methods are designed for a single sentence or document, cross-document RE has emerged to address relations across multiple long documents. Given the nature of long documents in cross-document RE, extracting document embeddings is challenging due to the length constraints of pre-trained LLMs. Therefore, we propose REward-based Input Construction (REIC), the first learning-based sentence selector for cross-document RE. REIC extracts sentences based on relational evidence, enabling the RE module to effectively infer relations. Since supervision of evidence sentences is generally unavailable, we train REIC using reinforcement learning with RE prediction scores as rewards. Experimental results demonstrate the superiority of our method over heuristic methods for different RE structures and backbones in cross-document RE. Our code is publicly available at https://github.com/aailabkaist/REIC.

PDF HTML Abstract

Summarize Bookmark Chat (Pro)

Authors (4)

Byeonghu Na (12 papers)
Suhyeon Jo (1 paper)
Yeongmin Kim (11 papers)
Il-Chul Moon (39 papers)

GitHub

GitHub - aailabkaist/REIC: Official PyTorch implementation for Reward-based Input Construction for Cross-document Relation Extraction (REIC) in ACL 2024. (9 stars)

Reward-based Input Construction for Cross-document Relation Extraction (2405.20649v1)

Related Papers

GitHub