Introduction to Generative IE
Generative Information Extraction (IE) is the task of deriving structured knowledge, such as entities, relations, and events, from unstructured text. It converts free text into a structured form that serves downstream applications such as knowledge graph construction, question answering, and knowledge reasoning. Large language models (LLMs), with their strong capabilities in text understanding and generation, have opened the way for new IE methodologies that generate structured information as output text rather than merely extracting spans from the input.
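To make the target concrete, the toy record below sketches the kind of structured form a generative IE system is expected to emit for a single sentence; the schema (field names and type labels) is chosen purely for illustration and does not correspond to any particular benchmark.

```python
# Illustrative structured output for one sentence; schema and labels are invented.
sentence = "Apple acquired Beats Electronics in 2014."

extraction = {
    "entities": [
        {"text": "Apple", "type": "ORGANIZATION"},
        {"text": "Beats Electronics", "type": "ORGANIZATION"},
        {"text": "2014", "type": "DATE"},
    ],
    "relations": [
        {"head": "Apple", "type": "acquired", "tail": "Beats Electronics"},
    ],
    "events": [
        {
            "trigger": "acquired",
            "type": "Acquisition",
            "arguments": {"acquirer": "Apple",
                          "acquired": "Beats Electronics",
                          "time": "2014"},
        },
    ],
}

print(extraction)
```

In the generative setting, the model is asked to produce a serialization of such a record (for example, as JSON) directly as its output text.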
Recent Developments in LLMs for IE
LLMs have been integrated into IE tasks with impressive results, and a variety of learning paradigms have been explored to improve their effectiveness, including supervised fine-tuning, few-shot learning, and zero-shot learning. Each paradigm brings its own strategies for strengthening LLMs' performance on IE. Supervised fine-tuning adapts a model by training it on existing annotated datasets; few-shot learning relies on only a small number of labeled examples; and zero-shot learning asks the model to generalize to new tasks without any labeled examples at all. Another notable development is the use of prompts, which recast the extraction process as a query-answering task that guides the LLM toward the desired structured output.
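A minimal sketch of this prompt-based formulation is shown below, assuming a hypothetical call_llm helper that stands in for any chat-completion API; the instruction, in-context examples, and entity labels are all illustrative. The task is phrased as an instruction followed by a few worked examples, and the model's generated reply is parsed as the structured result.

```python
import json

def call_llm(prompt: str) -> str:
    """Hypothetical helper standing in for whatever LLM client is available."""
    raise NotImplementedError("plug in an actual LLM call here")

# In-context examples that recast NER as text generation (few-shot setting;
# drop the examples for a zero-shot prompt).
FEW_SHOT_EXAMPLES = [
    ("Barack Obama was born in Hawaii.",
     '{"PERSON": ["Barack Obama"], "LOCATION": ["Hawaii"]}'),
]

def build_prompt(text: str) -> str:
    parts = ["Extract all PERSON and LOCATION entities from the text and answer in JSON."]
    for example_text, example_answer in FEW_SHOT_EXAMPLES:
        parts.append(f"Text: {example_text}\nEntities: {example_answer}")
    parts.append(f"Text: {text}\nEntities:")
    return "\n\n".join(parts)

def extract_entities(text: str) -> dict:
    reply = call_llm(build_prompt(text))
    # The generated string itself is the structured output.
    return json.loads(reply)
```

The same pattern extends to relation and event extraction by changing the instruction, the in-context examples, and the expected output schema.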
The Future of LLMs in IE
This survey has identified several promising directions for future research, emphasizing the potential of universal frameworks that can handle diverse IE tasks and domains. Despite recent advances, current methods sometimes struggle with long input contexts and with producing structured outputs that stay consistent with the formats seen during training. Improving the robustness and versatility of these methods is key to unlocking broader applications. Future research might also explore more effective prompt design strategies that help LLMs understand the task and produce reliable outputs. Lastly, open information extraction (OpenIE) poses its own challenges, suggesting that further investigation is needed to better leverage the knowledge and reasoning abilities of LLMs.
Conclusion
This survey contributes to the understanding of how LLMs are shaping the field of generative IE. By categorizing recent studies according to their learning paradigms and IE tasks, it offers insights into how these models are fine-tuned, how few-shot and zero-shot learning are applied, and how data augmentation affects performance. The research suggests that LLMs hold considerable promise for IE, but it also underscores the need for continued exploration to overcome current limitations and realize the full capabilities of these models.