
ImageNet-X: Understanding Model Mistakes with Factor of Variation Annotations (2211.01866v1)

Published 3 Nov 2022 in cs.CV and cs.LG

Abstract: Deep learning vision systems are widely deployed across applications where reliability is critical. However, even today's best models can fail to recognize an object when its pose, lighting, or background varies. While existing benchmarks surface examples challenging for models, they do not explain why such mistakes arise. To address this need, we introduce ImageNet-X, a set of sixteen human annotations of factors such as pose, background, or lighting for the entire ImageNet-1k validation set as well as a random subset of 12k training images. Equipped with ImageNet-X, we investigate 2,200 current recognition models and study the types of mistakes as a function of a model's (1) architecture, e.g., transformer vs. convolutional, (2) learning paradigm, e.g., supervised vs. self-supervised, and (3) training procedures, e.g., data augmentation. Regardless of these choices, we find models have consistent failure modes across ImageNet-X categories. We also find that while data augmentation can improve robustness to certain factors, it induces spill-over effects on other factors. For example, strong random cropping hurts robustness on smaller objects. Together, these insights suggest that, to advance the robustness of modern vision models, future research should focus on collecting additional data and understanding data augmentation schemes. Along with these insights, we release a toolkit based on ImageNet-X to spur further study into the mistakes image recognition systems make.

Authors (10)
  1. Badr Youbi Idrissi (6 papers)
  2. Diane Bouchacourt (32 papers)
  3. Randall Balestriero (91 papers)
  4. Ivan Evtimov (24 papers)
  5. Caner Hazirbas (19 papers)
  6. Nicolas Ballas (49 papers)
  7. Pascal Vincent (78 papers)
  8. Michal Drozdzal (45 papers)
  9. David Lopez-Paz (48 papers)
  10. Mark Ibrahim (36 papers)
Citations (40)
