Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Fast and Light-Weight Answer Text Retrieval in Dialogue Systems (2205.14226v2)

Published 27 May 2022 in cs.IR and cs.CL

Abstract: Dialogue systems can benefit from being able to search through a corpus of text to find information relevant to user requests, especially when encountering a request for which no manually curated response is available. The state-of-the-art technology for neural dense retrieval or re-ranking involves deep learning models with hundreds of millions of parameters. However, it is difficult and expensive to get such models to operate at an industrial scale, especially for cloud services that often need to support a big number of individually customized dialogue systems, each with its own text corpus. We report our work on enabling advanced neural dense retrieval systems to operate effectively at scale on relatively inexpensive hardware. We compare with leading alternative industrial solutions and show that we can provide a solution that is effective, fast, and cost-efficient.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Hui Wan (33 papers)
  2. Siva Sankalp Patel (9 papers)
  3. J. William Murdock (4 papers)
  4. Saloni Potdar (20 papers)
  5. Sachindra Joshi (32 papers)
Citations (1)