Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Discover, Explanation, Improvement: An Automatic Slice Detection Framework for Natural Language Processing (2211.04476v2)

Published 8 Nov 2022 in cs.CL, cs.AI, and cs.LG

Abstract: Pretrained NLP models have achieved high overall performance, but they still make systematic errors. Instead of manual error analysis, research on slice detection models (SDM), which automatically identify underperforming groups of datapoints, has caught escalated attention in Computer Vision for both understanding model behaviors and providing insights for future model training and designing. However, little research on SDM and quantitative evaluation of their effectiveness have been conducted on NLP tasks. Our paper fills the gap by proposing a benchmark named "Discover, Explain, Improve (DEIM)" for classification NLP tasks along with a new SDM Edisa. Edisa discovers coherent and underperforming groups of datapoints; DEIM then unites them under human-understandable concepts and provides comprehensive evaluation tasks and corresponding quantitative metrics. The evaluation in DEIM shows that Edisa can accurately select error-prone datapoints with informative semantic features that summarize error patterns. Detecting difficult datapoints directly boosts model performance without tuning any original model parameters, showing that discovered slices are actionable for users.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Wenyue Hua (51 papers)
  2. Lifeng Jin (24 papers)
  3. Linfeng Song (76 papers)
  4. Haitao Mi (56 papers)
  5. Yongfeng Zhang (163 papers)
  6. Dong Yu (329 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.