Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

On Robustness and Transferability of Convolutional Neural Networks (2007.08558v2)

Published 16 Jul 2020 in cs.CV and cs.LG

Abstract: Modern deep convolutional networks (CNNs) are often criticized for not generalizing under distributional shifts. However, several recent breakthroughs in transfer learning suggest that these networks can cope with severe distribution shifts and successfully adapt to new tasks from a few training examples. In this work we study the interplay between out-of-distribution and transfer performance of modern image classification CNNs for the first time and investigate the impact of the pre-training data size, the model scale, and the data preprocessing pipeline. We find that increasing both the training set and model sizes significantly improve the distributional shift robustness. Furthermore, we show that, perhaps surprisingly, simple changes in the preprocessing such as modifying the image resolution can significantly mitigate robustness issues in some cases. Finally, we outline the shortcomings of existing robustness evaluation datasets and introduce a synthetic dataset SI-Score we use for a systematic analysis across factors of variation common in visual data such as object size and position.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (14)
  1. Josip Djolonga (21 papers)
  2. Jessica Yung (5 papers)
  3. Michael Tschannen (49 papers)
  4. Rob Romijnders (13 papers)
  5. Lucas Beyer (46 papers)
  6. Alexander Kolesnikov (44 papers)
  7. Joan Puigcerver (20 papers)
  8. Matthias Minderer (19 papers)
  9. Alexander D'Amour (37 papers)
  10. Dan Moldovan (12 papers)
  11. Sylvain Gelly (43 papers)
  12. Neil Houlsby (62 papers)
  13. Xiaohua Zhai (51 papers)
  14. Mario Lucic (42 papers)
Citations (146)

Summary

We haven't generated a summary for this paper yet.