MATTER: Memory-Augmented Transformer Using Heterogeneous Knowledge Sources (2406.04670v1)

Published 7 Jun 2024 in cs.CL and cs.AI

Abstract: Leveraging external knowledge is crucial for achieving high performance in knowledge-intensive tasks such as question answering. The retrieve-and-read approach is widely adopted for integrating external knowledge into an LLM. However, this approach suffers from increased computational cost and latency due to the long context length, which grows proportionally with the amount of retrieved knowledge. Furthermore, existing retrieval-augmented models typically retrieve information from a single type of knowledge source, limiting their scalability to diverse knowledge sources with varying structures. In this work, we introduce an efficient memory-augmented transformer called MATTER, designed to retrieve relevant knowledge from multiple heterogeneous knowledge sources. Specifically, our model retrieves and reads from both unstructured sources (paragraphs) and semi-structured sources (QA pairs) in the form of fixed-length neural memories. We demonstrate that our model outperforms existing efficient retrieval-augmented models on popular QA benchmarks in terms of both accuracy and speed. Furthermore, MATTER achieves competitive results compared to conventional retrieve-and-read models while delivering 100x higher throughput during inference.
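To make the core idea concrete, below is a minimal sketch (not the authors' released code) of the memory-augmented pattern the abstract describes: knowledge items from heterogeneous sources are pre-encoded offline into fixed-length neural memories, so the context attended to at inference time stays constant regardless of how much knowledge is retrieved. All names, dimensions, and the simple dot-product retriever here are illustrative assumptions, not details from the paper.

```python
# Hypothetical sketch of fixed-length neural memories over heterogeneous
# knowledge sources; names and hyperparameters are assumptions.
import torch
import torch.nn.functional as F

D = 256      # hidden size (assumed)
M = 8        # tokens per fixed-length neural memory (assumed)
TOP_K = 4    # memories retrieved per query (assumed)

# Offline: each knowledge item (a paragraph or a QA pair) is encoded once
# into an M x D memory; both source types share the same memory format,
# which is what lets a single reader consume heterogeneous knowledge.
paragraph_memories = torch.randn(1000, M, D)  # stand-in for encoded paragraphs
qa_pair_memories   = torch.randn(500,  M, D)  # stand-in for encoded QA pairs
memories = torch.cat([paragraph_memories, qa_pair_memories], dim=0)

# A coarse retrieval key per memory (here: mean-pooled memory tokens).
memory_keys = memories.mean(dim=1)            # (N, D)

def retrieve_and_read(query_states: torch.Tensor) -> torch.Tensor:
    """query_states: (T, D) hidden states of the question tokens."""
    # Retrieve: score every memory against the pooled query representation.
    q = query_states.mean(dim=0)                   # (D,)
    scores = memory_keys @ q                       # (N,)
    top = scores.topk(TOP_K).indices
    retrieved = memories[top].reshape(-1, D)       # (TOP_K * M, D)

    # Read: cross-attend from query tokens to the retrieved memories.
    # Each memory has fixed length M, so the attended context is always
    # TOP_K * M tokens, independent of the original passage lengths --
    # the source of the throughput gain over retrieve-and-read.
    attn = F.softmax(query_states @ retrieved.T / D**0.5, dim=-1)
    return query_states + attn @ retrieved         # fused states (T, D)

fused = retrieve_and_read(torch.randn(16, D))
print(fused.shape)  # torch.Size([16, 256])
```

In a retrieve-and-read pipeline, the retrieved passages would instead be concatenated as raw text, so attention cost grows with every passage added; compressing each item to M memory tokens keeps inference cost flat.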

Authors (4)
  1. Dongkyu Lee (32 papers)
  2. Chandana Satya Prakash (5 papers)
  3. Jack FitzGerald (11 papers)
  4. Jens Lehmann (80 papers)
