Selecting which data quality assertions to add to LLM pipelines

Determine which specific data quality assertions should be added to a given large language model (LLM) pipeline to effectively catch errors during deployment, balancing usefulness and developer burden for the target application.

Background

The paper motivates the need for data quality assertions in LLM pipelines to catch issues such as instruction non-compliance, format errors, and hallucinations during deployment. While toolkits exist to run assertions, practitioners struggle to decide which assertions to include because of diverse failure modes, imprecise specifications, and developer expertise constraints.

This difficulty is identified explicitly as an open problem that spade aims to address by mining prompt version histories to generate and filter candidate assertions, but the statement highlights the broader challenge of assertion selection in practice.

References

However, determining which assertions to add remains an open problem---and is a big customer painpoint based on our experience at LangChain---a company that helps people build LLM pipelines.

— SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines (2401.03038 - Shankar et al., 5 Jan 2024) in Section 1, Introduction

Selecting which data quality assertions to add to LLM pipelines

Sponsor

Background

References

Related Problems