PullNet: Open Domain Question Answering with Iterative Retrieval on Knowledge Bases and Text (1904.09537v1)

Published 21 Apr 2019 in cs.CL and cs.LG

Abstract: We consider open-domain queston answering (QA) where answers are drawn from either a corpus, a knowledge base (KB), or a combination of both of these. We focus on a setting in which a corpus is supplemented with a large but incomplete KB, and on questions that require non-trivial (e.g., multi-hop'') reasoning. We describe PullNet, an integrated framework for (1) learning what to retrieve (from the KB and/or corpus) and (2) reasoning with this heterogeneous information to find the best answer. PullNet uses an {iterative} process to construct a question-specific subgraph that contains information relevant to the question. In each iteration, a graph convolutional network (graph CNN) is used to identify subgraph nodes that should be expanded using retrieval (orpull'') operations on the corpus and/or KB. After the subgraph is complete, a similar graph CNN is used to extract the answer from the subgraph. This retrieve-and-reason process allows us to answer multi-hop questions using large KBs and corpora. PullNet is weakly supervised, requiring question-answer pairs but not gold inference paths. Experimentally PullNet improves over the prior state-of-the art, and in the setting where a corpus is used with incomplete KB these improvements are often dramatic. PullNet is also often superior to prior systems in a KB-only setting or a text-only setting.

Authors (3)

Haitian Sun (16 papers)
Tania Bedrax-Weiss (7 papers)
William W. Cohen (79 papers)

Citations (317)

View on Semantic Scholar

Summary

PullNet: Open Domain Question Answering with Iterative Retrieval on Knowledge Bases and Text

The paper "PullNet: Open Domain Question Answering with Iterative Retrieval on Knowledge Bases and Text" introduces an innovative framework designed to address open-domain question answering (QA) tasks where answers may come from a corpus, a knowledge base (KB), or an overview of both. The authors focus on a scenario in which the corpus complements a large but incomplete KB and emphasize tackling queries that demand complex reasoning, specifically multi-hop reasoning. PullNet represents a significant step forward in QA architecture by integrating retrieval and reasoning processes, allowing for the accurate answering of multi-hop questions through both text and large-scale KBs.

Core Contributions

The authors present PullNet, an integrated framework that leverages an iterative process to tackle the QA challenge. This process consists of constructing a question-specific subgraph that contains the pertinent information required to answer the question. PullNet employs a graph convolutional network (graph CNN) to iteratively expand this subgraph by identifying nodes — entities from the KB or information from the corpus — that warrant further exploration. This ensures that relevant data from heterogeneous sources are coherently integrated for subsequent reasoning.

PullNet's architecture consists of several key components:

Iterative Subgraph Construction: Initially, the question subgraph is constructed with entities mentioned in the question. Iterative expansions involve classifying nodes for further retrieval using a graph CNN, coupled with pull operations on both the KB and the corpus. This enables relevant information gathering for efficient answer extraction.
Weak Supervision and Training: Unlike many traditional methods, PullNet does not require gold inference paths for supervision but uses weak supervision through question-answer pairs. The training adapts through iterative path expansion, guided by shortest path approximations derived from the KB.

Experimental Results and Implications

The experiments underscore the efficacy of PullNet, demonstrating notable improvements over prior systems in several settings. Notably, the manuscript reports that for MetaQA on 3-hop questions within a movie KB, PullNet achieved a marked increase in hits-at-one performance from 62.5% to 91.4%. Furthermore, in scenarios where the KB was made incomplete, PullNet's integration with corpus data allowed it to outperform systems relying solely on incomplete KBs or corpus data by substantial margins, such as a 7% absolute improvement over pure corpus-based methods and over 25% improvement over pure KB-based methods. These results reveal PullNet's superior capability in navigating complex queries and integrating diverse data sources.

Implications for Future Research

PullNet's approach of iteratively refining the information retrieval process hints at broader applications in AI, particularly for tasks where information spread across diverse databases necessitates selective and efficient retrieval. The model's learning strategy without dependence on explicit inference paths may also inspire innovations in training methodologies across AI domains.

The research invites future exploration into several areas:

Scalability and Efficiency: While the paper focuses on a specific set of benchmarks, examining PullNet's scalability to even larger datasets and KBs could provide insights into its broader applicability in real-world data-intensive environments.
Hybrid Models: The integration of text and KB as parallel knowledge sources opens new avenues for hybrid models that balance between text flexibility and KB structural precision, optimizing for specific types of information queries.
Transfer Learning: Leveraging PullNet for transfer learning applications could explore how its retrieval and reasoning capabilities extrapolate across different domains and content types.

Conclusion

PullNet represents a sophisticated advance in open-domain QA, offering a scalable and effective solution to multi-hop reasoning requirements that leverage information from complex, large-scale, and heterogeneous sources. Its iterative approach to graph construction and retrieval makes it a potent tool for future AI research focused on integrating diverse data types for coherent and accurate information extraction.

PDF Markdown

Related Papers

Find Related Papers