Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Semantic Frame Parsing for Information Extraction : the CALOR corpus (1812.08039v1)

Published 19 Dec 2018 in cs.CL and cs.AI

Abstract: This paper presents a publicly available corpus of French encyclopedic history texts annotated according to the Berkeley FrameNet formalism. The main difference in our approach compared to previous works on semantic parsing with FrameNet is that we are not interested here in full text parsing but rather on partial parsing. The goal is to select from the FrameNet resources the minimal set of frames that are going to be useful for the applicative framework targeted, in our case Information Extraction from encyclopedic documents. Such an approach leverages the manual annotation of larger corpora than those obtained through full text parsing and therefore opens the door to alternative methods for Frame parsing than those used so far on the FrameNet 1.5 benchmark corpus. The approaches compared in this study rely on an integrated sequence labeling model which jointly optimizes frame identification and semantic role segmentation and identification. The models compared are CRFs and multitasks bi-LSTMs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Gabriel Marzinotto (6 papers)
  2. Jeremy Auguste (1 paper)
  3. Alexis Nasr (7 papers)
  4. Frederic Bechet (6 papers)
  5. Géraldine Damnati (7 papers)
Citations (17)

Summary

We haven't generated a summary for this paper yet.