Choosing labeled examples to best inform assertion selection
Determine which labeled input–output examples for a specific large language model (LLM) pipeline should be selected to best inform the choice among candidate data quality assertions, in order to optimize failure coverage and false failure rate under limited labeling budgets.
Sponsor
References
Determining which labeled examples would help best select from the set of assertions is an open question that is reminiscent of active learning.
— SPADE: Synthesizing Data Quality Assertions for Large Language Model Pipelines
(2401.03038 - Shankar et al., 5 Jan 2024) in Conclusion and Future Work