Introduction
The field of deep learning has made significant strides in recent years, achieving remarkable success across a range of applications, from speech recognition to autonomous driving. Despite these advances, the opacity of deep neural networks (DNNs) and their unintuitive failure modes remain challenging. A convergent theme across numerous studies is the issue of shortcut learning, where DNNs develop seemingly effective but fundamentally unreliable strategies for tasks such as object classification and natural language processing.
Shortcut Learning Defined
Shortcut learning emerges when a neural network adopts a decision rule that performs well under independent and identically distributed (i.i.d.) test conditions but fails in out-of-distribution (o.o.d.) scenarios. This reflects a deeper issue within the learning process, stemming from both the data presented and the inherent biases of the learning algorithm. For instance, when training data are biased such that cows predominantly appear on grass, a model might treat the context (grass) rather than the subject (cow) as the key feature for recognition, a prime example of shortcut learning.
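To make the cow-on-grass failure mode concrete, here is a minimal synthetic sketch (not taken from any particular study): a logistic regression is trained on data in which a clean "background" feature is spuriously correlated with the label, scores well on an i.i.d. test set, and collapses once that correlation is reversed. The feature names, noise levels, and correlation strengths are all illustrative assumptions.

```python
# Synthetic illustration of shortcut learning: a "background" feature is
# spuriously correlated with the label during training, so a linear classifier
# scores well i.i.d. but collapses when the correlation is broken o.o.d.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

def make_data(n, spurious_corr):
    """Each sample has a weak 'object' feature and an easy 'background' feature.
    `spurious_corr` is the probability that the background agrees with the label."""
    y = rng.integers(0, 2, size=n)                       # 0 = no cow, 1 = cow
    object_feat = y + rng.normal(0, 2.0, size=n)         # truly predictive, but noisy
    agree = rng.random(n) < spurious_corr
    background = np.where(agree, y, 1 - y) + rng.normal(0, 0.1, size=n)  # grass vs. not
    return np.column_stack([object_feat, background]), y

# Training and i.i.d. test data share the spurious correlation (95%);
# the o.o.d. test set reverses it (5%).
X_train, y_train = make_data(5000, spurious_corr=0.95)
X_iid,   y_iid   = make_data(2000, spurious_corr=0.95)
X_ood,   y_ood   = make_data(2000, spurious_corr=0.05)

clf = LogisticRegression().fit(X_train, y_train)
print("i.i.d. accuracy:", clf.score(X_iid, y_iid))    # high: the shortcut still works here
print("o.o.d. accuracy:", clf.score(X_ood, y_ood))    # near or below chance: the shortcut fails
print("learned weights [object, background]:", clf.coef_[0])
```

The learned weights make the shortcut visible: most of the weight lands on the background feature, which is exactly the cue that stops being reliable outside the training distribution.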
Sources and Implications
Shortcut learning can be traced back to two main sources: shortcut opportunities in the data and discriminative feature learning. The former is often due to dataset biases, where certain features correlate with outcomes by artifact rather than true causation. The latter refers to the model's tendency to latch onto the most readily available signals in the training data while ignoring other informative cues. This not only exposes weaknesses in model robustness and adaptability but also has broader implications for AI transparency and reliability in critical applications.
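One way to see the discriminative-learning side of this, and to probe which cues a trained model actually relies on, is a simple permutation-importance check. The sketch below is illustrative only: the two features and their noise levels are assumptions chosen so that the "easy" background cue dominates.

```python
# Hedged sketch: probe feature reliance with permutation importance.
# When one cue is much "easier" (cleaner) than another, discriminative training
# tends to concentrate on it, even though the harder cue is also informative.
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance

rng = np.random.default_rng(1)
n = 4000
y = rng.integers(0, 2, size=n)
object_feat = y + rng.normal(0, 2.0, size=n)        # genuinely informative but noisy
background  = y + rng.normal(0, 0.1, size=n)        # "easy" cue in this dataset
X = np.column_stack([object_feat, background])

model = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)
result = permutation_importance(model, X, y, n_repeats=10, random_state=0)
for name, imp in zip(["object", "background"], result.importances_mean):
    print(f"{name:10s} permutation importance: {imp:.3f}")
# The importance concentrates almost entirely on the easy background cue,
# illustrating how readily available signals crowd out other informative ones.
```

Diagnostics of this kind are also a small step toward the transparency concern raised above: they expose which signals a model's decisions actually hinge on.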
Addressing Shortcut Learning
Efforts to tackle shortcut learning take a multifaceted approach, emphasizing a shift from i.i.d. testing towards rigorous o.o.d. generalization benchmarks. This involves creating datasets and testing protocols that challenge models to generalize beyond the superficial features they pick up from training data. Research also seeks to understand models' inductive biases, including the choice of architecture, data presentation, and optimization techniques, which play pivotal roles in the kinds of solutions deep learning models are disposed to learn.
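Continuing the synthetic sketch from above (and reusing its hypothetical `make_data` helper and trained `clf`), an o.o.d. testing protocol can be as simple as scoring one model against a suite of shifted test sets and reporting per-shift accuracy instead of a single i.i.d. number. The shift names and correlation levels below are illustrative choices, not a standard benchmark.

```python
# Sketch of an o.o.d. evaluation suite: one model, several distribution shifts.
def evaluate_ood_suite(model, test_suites):
    """test_suites: dict mapping a shift name to an (X, y) pair."""
    return {name: model.score(X, y) for name, (X, y) in test_suites.items()}

suite = {
    "iid (corr=0.95)":      make_data(2000, spurious_corr=0.95),
    "weakened (corr=0.50)": make_data(2000, spurious_corr=0.50),
    "reversed (corr=0.05)": make_data(2000, spurious_corr=0.05),
}
for shift, accuracy in evaluate_ood_suite(clf, suite).items():
    print(f"{shift:22s} accuracy: {accuracy:.3f}")
# A shortcut-reliant model degrades steadily as the correlation it exploited
# is weakened and then reversed; a model using the true feature would not.
```

Reporting the full per-shift breakdown is the point of the protocol: a single aggregate i.i.d. score would hide exactly the failure that the shifted test sets reveal.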
Closing Remarks
In reviewing the phenomenon of shortcut learning, it is vital to ground expectations of DNNs in realistic terms. Though these models have shown superhuman performance on specific tasks, under certain conditions they reveal a vulnerability to simplistic and ultimately unreliable strategies. Advancing our understanding of DNNs and aligning their performance with human-like generalization abilities remains a key goal, requiring continuous scrutiny of DNN behavior through o.o.d. generalization testing and an exploration of architectural and data-driven solutions.