Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Vortex under Ripplet: An Empirical Study of RAG-enabled Applications (2407.05138v1)

Published 6 Jul 2024 in cs.SE and cs.AI

Abstract: LLMs enhanced by retrieval-augmented generation (RAG) provide effective solutions in various application scenarios. However, developers face challenges in integrating RAG-enhanced LLMs into software systems, due to lack of interface specification, requirements from software context, and complicated system management. In this paper, we manually studied 100 open-source applications that incorporate RAG-enhanced LLMs, and their issue reports. We have found that more than 98% of applications contain multiple integration defects that harm software functionality, efficiency, and security. We have also generalized 19 defect patterns and proposed guidelines to tackle them. We hope this work could aid LLM-enabled software development and motivate future research.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Yuchen Shao (2 papers)
  2. Yuheng Huang (26 papers)
  3. Jiawei Shen (14 papers)
  4. Lei Ma (195 papers)
  5. Ting Su (43 papers)
  6. Chengcheng Wan (14 papers)
X Twitter Logo Streamline Icon: https://streamlinehq.com