Generating realistic, domain‑relevant QA data for RAG systems
Develop methods to generate realistic, domain‑relevant question–answer pairs to support testing and validation of Retrieval‑Augmented Generation (RAG) systems that index unstructured documents in application‑specific domains.
References
"How to generate realistic domain relevant questions and answers remains an open problem."
Source: Barnett et al., Seven Failure Points When Engineering a Retrieval Augmented Generation System, arXiv:2401.05856, 11 Jan 2024, Section 6.3 (Testing and Monitoring RAG systems)