Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PaperQA: Retrieval-Augmented Generative Agent for Scientific Research (2312.07559v2)

Published 8 Dec 2023 in cs.CL, cs.AI, and cs.LG

Abstract: LLMs generalize well across language tasks, but suffer from hallucinations and uninterpretability, making it difficult to assess their accuracy without ground-truth. Retrieval-Augmented Generation (RAG) models have been proposed to reduce hallucinations and provide provenance for how an answer was generated. Applying such models to the scientific literature may enable large-scale, systematic processing of scientific knowledge. We present PaperQA, a RAG agent for answering questions over the scientific literature. PaperQA is an agent that performs information retrieval across full-text scientific articles, assesses the relevance of sources and passages, and uses RAG to provide answers. Viewing this agent as a question answering model, we find it exceeds performance of existing LLMs and LLM agents on current science QA benchmarks. To push the field closer to how humans perform research on scientific literature, we also introduce LitQA, a more complex benchmark that requires retrieval and synthesis of information from full-text scientific papers across the literature. Finally, we demonstrate PaperQA's matches expert human researchers on LitQA.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Jakub Lála (2 papers)
  2. Odhran O'Donoghue (5 papers)
  3. Aleksandar Shtedritski (13 papers)
  4. Sam Cox (7 papers)
  5. Samuel G. Rodriques (10 papers)
  6. Andrew D. White (29 papers)
Citations (49)