
RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models (2308.07922v3)

Published 15 Aug 2023 in cs.CL, cs.AI, and cs.LG

Abstract: In this paper, we investigate the in-context learning ability of retrieval-augmented encoder-decoder LLMs. We first conduct a comprehensive analysis of existing models and identify their limitations in in-context learning, primarily due to a mismatch between pretraining and inference, as well as a restricted context length. To address these issues, we propose RAVEN, a model that combines retrieval-augmented masked language modeling and prefix language modeling. We further introduce Fusion-in-Context Learning to enhance the few-shot performance by enabling the model to leverage more in-context examples without requiring additional training. Through extensive experiments, we demonstrate that our simple yet effective design significantly improves performance, achieving results comparable to the most advanced LLMs in certain scenarios, despite having substantially fewer parameters. Our work underscores the potential of retrieval-augmented encoder-decoder LLMs for in-context learning and encourages further research in this direction.
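
The abstract does not spell out how Fusion-in-Context Learning fits more in-context examples into a restricted context. The following is a minimal sketch of one plausible reading, assuming a Fusion-in-Decoder-style model whose encoder processes each retrieved passage as a separate input: each encoder input receives a different small subset of the available examples, so the decoder sees all of them collectively. The function name `build_encoder_inputs`, the prompt template, and the T5-style `<extra_id_0>` mask token are illustrative assumptions, not details taken from the paper.

```python
import random
from typing import List, Tuple


def build_encoder_inputs(
    query: str,
    passages: List[str],
    examples: List[Tuple[str, str]],
    shots_per_input: int,
    seed: int = 0,
) -> List[str]:
    """Pair every retrieved passage with its own slice of in-context examples.

    Hypothetical helper: the template and mask token are assumptions.
    """
    rng = random.Random(seed)
    pool = list(examples)
    rng.shuffle(pool)  # avoid always giving the same examples to the top passage

    inputs = []
    cursor = 0
    for passage in passages:
        # Walk through the shuffled pool, wrapping around if it runs out.
        chosen = (
            [pool[(cursor + i) % len(pool)] for i in range(shots_per_input)]
            if pool
            else []
        )
        cursor += shots_per_input
        demos = " ".join(f"Question: {q} Answer: {a}" for q, a in chosen)
        prompt = f"{demos} Question: {query} Answer: <extra_id_0> Passage: {passage}"
        inputs.append(prompt.strip())
    return inputs


if __name__ == "__main__":
    demo_examples = [(f"q{i}", f"a{i}") for i in range(6)]
    for line in build_encoder_inputs("who?", ["p1", "p2", "p3"], demo_examples, 2):
        print(line)
```

With three passages and two examples per input, all six examples reach the decoder even though no single encoder input holds more than two, and no retraining is involved because only the inputs change.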

Authors (6)
  1. Jie Huang (155 papers)
  2. Wei Ping (51 papers)
  3. Peng Xu (357 papers)
  4. Mohammad Shoeybi (60 papers)
  5. Kevin Chen-Chuan Chang (53 papers)
  6. Bryan Catanzaro (123 papers)
Citations (29)