GoLLIE: Annotation Guidelines improve Zero-Shot Information Extraction (2310.03668v5)

Published 5 Oct 2023 in cs.CL

Abstract: LLMs combined with instruction tuning have made significant progress when generalizing to unseen tasks. However, they have been less successful in Information Extraction (IE), lagging behind task-specific models. Typically, IE tasks are characterized by complex annotation guidelines that describe the task and give examples to humans. Previous attempts to leverage such information have failed, even with the largest models, as they are not able to follow the guidelines out of the box. In this paper, we propose GoLLIE (Guideline-following LLM for IE), a model able to improve zero-shot results on unseen IE tasks by virtue of being fine-tuned to comply with annotation guidelines. Comprehensive evaluation empirically demonstrates that GoLLIE is able to generalize to and follow unseen guidelines, outperforming previous attempts at zero-shot information extraction. The ablation study shows that detailed guidelines are key for good results.
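The core idea is to feed the model the task's annotation guidelines, represented as schema definitions, and fine-tune it to produce outputs that conform to them. The sketch below illustrates the guideline-as-code style the paper builds on: class docstrings carry the guideline text, and the expected output is a list of class instances. The class names, docstrings, and example sentence here are illustrative assumptions, not the paper's exact prompt format.

```python
from dataclasses import dataclass

# Guidelines are expressed as class definitions; the docstring holds the
# annotation guideline a human annotator would read. (Illustrative schema,
# not the paper's exact prompt.)

@dataclass
class Person:
    """Names of people, real or fictional. Include titles only when they
    are part of the name (e.g. 'Queen Victoria')."""
    span: str

@dataclass
class Location:
    """Geographic places such as cities, countries, and landmarks."""
    span: str

text = "Ada Lovelace was born in London."

# A guideline-following model is prompted with the class definitions plus
# the input text, and is expected to emit instances of those classes:
result = [Person(span="Ada Lovelace"), Location(span="London")]

print(result)
```

Because the schema is ordinary code, an unseen task only requires writing new class definitions with their guideline docstrings; no retraining is needed for the zero-shot setting.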

Authors (6)
  1. Oscar Sainz (14 papers)
  2. Iker García-Ferrero (14 papers)
  3. Rodrigo Agerri (41 papers)
  4. Oier Lopez de Lacalle (19 papers)
  5. German Rigau (30 papers)
  6. Eneko Agirre (53 papers)
Citations (56)