
Sentence Bag Graph Formulation for Biomedical Distant Supervision Relation Extraction (2310.18912v1)

Published 29 Oct 2023 in cs.LG and cs.CL

Abstract: We introduce a novel graph-based framework for alleviating key challenges in distantly-supervised relation extraction and demonstrate its effectiveness in the challenging and important domain of biomedical data. Specifically, we propose a graph view of sentence bags referring to an entity pair, which enables message-passing based aggregation of information related to the entity pair over the sentence bag. The proposed framework alleviates the common problem of noisy labeling in distantly supervised relation extraction and also effectively incorporates inter-dependencies between sentences within a bag. Extensive experiments on two large-scale biomedical relation datasets and the widely utilized NYT dataset demonstrate that our proposed framework significantly outperforms the state-of-the-art methods for biomedical distant supervision relation extraction while also providing excellent performance for relation extraction in the general text mining domain.
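The abstract's central idea, treating the sentences in a bag as graph nodes and aggregating over them by message passing, can be illustrated with a toy sketch. This is not the authors' model. It assumes a fully connected graph over the bag, fixed sentence vectors, and a simple averaging update with no learned weights; the names `message_passing_round` and `aggregate_bag` are hypothetical and chosen only for illustration.

```python
from typing import List

Vector = List[float]


def mean(vectors: List[Vector]) -> Vector:
    """Element-wise mean of a non-empty list of equal-length vectors."""
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def message_passing_round(nodes: List[Vector]) -> List[Vector]:
    """One round over a fully connected sentence graph (an assumption).

    Each sentence node receives the mean of all other nodes' vectors as its
    message and mixes it equally with its own vector.
    """
    if len(nodes) == 1:
        return [list(nodes[0])]  # a single-sentence bag has no neighbours
    updated = []
    for i, node in enumerate(nodes):
        neighbours = [v for j, v in enumerate(nodes) if j != i]
        message = mean(neighbours)
        updated.append([0.5 * a + 0.5 * b for a, b in zip(node, message)])
    return updated


def aggregate_bag(sentence_vectors: List[Vector], rounds: int = 1) -> Vector:
    """Run message passing, then mean-pool the nodes into one bag vector."""
    nodes = [list(v) for v in sentence_vectors]
    for _ in range(rounds):
        nodes = message_passing_round(nodes)
    return mean(nodes)


# Toy bag: three sentences mentioning the same entity pair, as 2-d vectors.
bag = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
bag_vector = aggregate_bag(bag)
```

In a relation extraction setting the pooled bag vector would feed a relation classifier; a learned model would replace the fixed averaging with trainable, possibly attention-weighted, updates so that noisy sentences contribute less.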

Authors (7)
  1. Hao Zhang
  2. Yang Liu
  3. Xiaoyan Liu
  4. Tianming Liang
  5. Gaurav Sharma
  6. Liang Xue
  7. Maozu Guo