A Memory Efficient Baseline for Open Domain Question Answering (2012.15156v1)

Published 30 Dec 2020 in cs.CL

Abstract: Recently, retrieval systems based on dense representations have led to important improvements in open-domain question answering and related tasks. While very effective, this approach is also memory-intensive, as the dense vectors for the whole knowledge source need to be kept in memory. In this paper, we study how the memory footprint of dense retriever-reader systems can be reduced. We consider three strategies to reduce the index size: dimension reduction, vector quantization and passage filtering. We evaluate our approach on two question answering benchmarks, TriviaQA and NaturalQuestions, showing that it is possible to get competitive systems using less than 6Gb of memory.
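
To make the three strategies concrete, the following is a rough back-of-the-envelope sketch of how each one shrinks a dense index. It is not the authors' code, and every number in it is a hypothetical placeholder chosen for illustration (20 million passages, 768-dimensional float32 vectors, reduction to 256 dimensions, a product-quantization-style code of 64 one-byte sub-codes per vector, and a filter that keeps half of the passages). None of these values come from the paper. The helper names `dense_index_bytes` and `quantized_index_bytes` are likewise made up for this example, and the sketch ignores overheads such as codebooks and the passage text itself.

```python
def dense_index_bytes(num_passages: int, dim: int, bytes_per_value: int = 4) -> int:
    """Size of an uncompressed index: one `dim`-dimensional vector per passage."""
    return num_passages * dim * bytes_per_value


def quantized_index_bytes(num_passages: int, num_subvectors: int, bits_per_code: int = 8) -> int:
    """Size of a quantized index: each vector is stored as `num_subvectors` short codes."""
    return num_passages * num_subvectors * bits_per_code // 8


# Hypothetical settings, chosen only to illustrate the arithmetic.
NUM_PASSAGES = 20_000_000   # assumed corpus size
FULL_DIM = 768              # assumed encoder output dimension
REDUCED_DIM = 256           # assumed dimension after reduction
NUM_SUBVECTORS = 64         # assumed number of one-byte sub-codes per vector
KEPT_PASSAGES = NUM_PASSAGES // 2  # assumed effect of passage filtering

baseline = dense_index_bytes(NUM_PASSAGES, FULL_DIM)
reduced = dense_index_bytes(NUM_PASSAGES, REDUCED_DIM)
quantized = quantized_index_bytes(NUM_PASSAGES, NUM_SUBVECTORS)
filtered = quantized_index_bytes(KEPT_PASSAGES, NUM_SUBVECTORS)

for name, size in [
    ("float32 baseline", baseline),
    ("dimension reduction", reduced),
    ("vector quantization", quantized),
    ("quantization + passage filtering", filtered),
]:
    print(f"{name}: {size / 1e9:.2f} GB")
```

Under these assumed settings the three techniques compose multiplicatively: dimension reduction and quantization shrink the bytes stored per vector, while passage filtering shrinks the number of vectors.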

Authors (6)
  1. Gautier Izacard (17 papers)
  2. Fabio Petroni (37 papers)
  3. Lucas Hosseini (9 papers)
  4. Nicola De Cao (21 papers)
  5. Sebastian Riedel (140 papers)
  6. Edouard Grave (56 papers)
Citations (42)