Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Textual Entailment for Effective Triple Validation in Object Prediction (2401.16293v1)

Published 29 Jan 2024 in cs.CL, cs.AI, and cs.DL

Abstract: Knowledge base population seeks to expand knowledge graphs with facts that are typically extracted from a text corpus. Recently, LLMs pretrained on large corpora have been shown to contain factual knowledge that can be retrieved using cloze-style strategies. Such approach enables zero-shot recall of facts, showing competitive results in object prediction compared to supervised baselines. However, prompt-based fact retrieval can be brittle and heavily depend on the prompts and context used, which may produce results that are unintended or hallucinatory.We propose to use textual entailment to validate facts extracted from LLMs through cloze statements. Our results show that triple validation based on textual entailment improves LLM predictions in different training regimes. Furthermore, we show that entailment-based triple validation is also effective to validate candidate facts extracted from other sources including existing knowledge graphs and text passages where named entities are recognized.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)