Task-specific Pre-training and Prompt Decomposition for Knowledge Graph Population with Language Models (2208.12539v2)
Published 26 Aug 2022 in cs.CL
Abstract: We present a system for knowledge graph population with language models, evaluated on the Knowledge Base Construction from Pre-trained Language Models (LM-KBC) challenge at ISWC 2022. Our system involves task-specific pre-training to improve the LM's representation of the masked object tokens, prompt decomposition for progressive generation of candidate objects, and other methods for higher-quality retrieval. Our system won track 1 of the LM-KBC challenge, which is based on the BERT LM, achieving a 55.0% F1 score on the challenge's hidden test set.
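As a rough illustration of the retrieval pattern the abstract alludes to, the sketch below queries a BERT-style masked LM for candidate objects of a (subject, relation) pair, and re-prompts with previously accepted objects to generate further candidates. This is a minimal sketch, not the authors' implementation: the relation template, the score threshold, and the progressive re-prompting loop are illustrative assumptions about what "prompt decomposition for progressive generation" might look like.

```python
# Minimal sketch: masked-LM object retrieval with BERT (illustrative, not the paper's system).
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-cased")
MASK = fill_mask.tokenizer.mask_token  # "[MASK]" for BERT


def candidate_objects(prompt: str, top_k: int = 5, threshold: float = 0.05):
    """Return masked-token predictions above a (hypothetical) score threshold."""
    return [
        pred["token_str"].strip()
        for pred in fill_mask(prompt, top_k=top_k)
        if pred["score"] >= threshold
    ]


def progressive_candidates(subject: str, rounds: int = 3):
    """Assumed re-prompting loop: feed accepted objects back into the prompt
    so the LM proposes additional, distinct objects each round."""
    found: list[str] = []
    # Hypothetical relation template; LM-KBC provides its own per-relation templates.
    for _ in range(rounds):
        context = (", ".join(found) + " and ") if found else ""
        prompt = f"{subject} shares a border with {context}{MASK}."
        new = [c for c in candidate_objects(prompt) if c not in found]
        if not new:
            break
        found.extend(new)
    return found


print(progressive_candidates("France"))
```

The design point this illustrates is that a single masked prompt caps how many objects a BERT-style LM can emit at once; decomposing the query into successive prompts conditioned on earlier answers lets the system accumulate a multi-object answer set one step at a time.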
- Tianyi Li
- Wenyu Huang
- Nikos Papasarantopoulos
- Pavlos Vougiouklis
- Jeff Z. Pan