Pretrained Visual Uncertainties (2402.16569v2)
Abstract: Accurate uncertainty estimation is vital to trustworthy machine learning, yet uncertainties typically have to be learned for each task anew. This work introduces the first pretrained uncertainty modules for vision models. Similar to standard pretraining this enables the zero-shot transfer of uncertainties learned on a large pretraining dataset to specialized downstream datasets. We enable our large-scale pretraining on ImageNet-21k by solving a gradient conflict in previous uncertainty modules and accelerating the training by up to 180x. We find that the pretrained uncertainties generalize to unseen datasets. In scrutinizing the learned uncertainties, we find that they capture aleatoric uncertainty, disentangled from epistemic components. We demonstrate that this enables safe retrieval and uncertainty-aware dataset visualization. To encourage applications to further problems and domains, we release all pretrained checkpoints and code under https://github.com/mkirchhof/url .
- A gentle introduction to conformal prediction and distribution-free uncertainty quantification. arXiv preprint arXiv:2107.07511, 2022.
- Are we done with ImageNet? arXiv preprint arXiv:2006.07159, 2020.
- Symbolic discovery of optimization algorithms. arXiv preprint arXiv:2302.06675, 2023.
- Chun, S. Improved probabilistic image-text representations. arXiv preprint arXiv:2305.18171, 2023.
- Describing textures in the wild. In Computer Vision and Pattern Recognition (CVPR), 2014.
- Massively scaling heteroscedastic classifiers. arXiv preprint arXiv:2301.12860, 2023.
- Learning sample difficulty from pre-trained models for reliable prediction. In Neural Information Processing Systems (NeurIPS), 2023.
- Scaling vision transformers to 22 billion parameters. In International Conference on Machine Learning (ICML), 2023.
- ImageNet: A large-scale hierarchical image database. In Computer Vision and Pattern Recognition (CVPR), 2009.
- An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations (ICLR), 2021.
- The Faiss library. arXiv preprint arXiv:2401.08281, 2024.
- Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories. Computer Vision and Pattern Recognition Workshop, 2004.
- A framework for benchmarking class-out-of-distribution detection and its application to ImageNet. In The Eleventh International Conference on Learning Representations (ICLR), 2023a.
- What can we learn from the selective prediction and uncertainty estimation performance of 523 ImageNet classifiers? In International Conference on Learning Representations (ICLR), 2023b.
- AugMix: A simple method to improve robustness and uncertainty under data shift. In International Conference on Learning Representations (ICLR), 2020.
- Probabilistic embeddings revisited. arXiv preprint arXiv:2202.06768, 2022.
- Probabilistic concept bottleneck models. In International Conference on Machine Learning (ICML).
- Adam: A method for stochastic optimization. International Conference on Learning Representations (ICLR), 2015.
- A non-isotropic probabilistic take on proxy-based deep metric learning. In European Conference on Computer Vision (ECCV), 2022.
- Probabilistic contrastive learning recovers the correct aleatoric uncertainty of ambiguous inputs. International Conference on Machine Learning (ICML), 2023a.
- Url: A representation learning benchmark for transferable uncertainty estimates. Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks, 2023b.
- 3D object representations for fine-grained categorization. In Conference on Computer Vision and Pattern Recognition (CVPR) Workshop, 2013.
- Krizhevsky, A. Learning multiple layers of features from tiny images. Technical report, 2009.
- DEUP: Direct epistemic uncertainty prediction. Transactions on Machine Learning Research (TMLR), 2023. ISSN 2835-8856.
- Well-calibrated regression uncertainty in medical imaging with deep learning. In Medical Imaging with Deep Learning, pp. 393–412. PMLR, 2020.
- Simple and principled uncertainty estimation with deterministic deep learning via distance awareness. Advances in Neural Information Processing Systems (NeurIPS), 2020.
- Decoupled weight decay regularization. arXiv preprint arXiv:1711.05101, 2017.
- UMAP: Uniform manifold approximation and projection. Journal of Open Source Software, 3(29):861, 2018. doi: 10.21105/joss.00861. URL https://doi.org/10.21105/joss.00861.
- Trustworthy machine learning, 2023.
- Representation uncertainty in self-supervised learning as variational inference. In International Conference on Computer Vision (ICCV), 2023.
- Reading digits in natural images with unsupervised feature learning. 2011.
- Automated flower classification over a large number of classes. In Proceedings of the Indian Conference on Computer Vision, Graphics and Image Processing, 2008.
- Modeling uncertainty with hedged instance embeddings. In International Conference on Learning Representations (ICLR), 2019.
- Can you trust your model’s uncertainty? Evaluating predictive uncertainty under dataset shift. Advances in Neural Information Processing Systems (NeurIPS), 32, 2019.
- Cats and dogs. In Computer Vision and Pattern Recognition (CVPR), 2012.
- On the practicality of deterministic epistemic uncertainty. In International Conference on Machine Learning (ICML), 2022.
- Is one annotation enough? A data-centric image classification benchmark for noisy and ambiguous label estimation. arXiv preprint arXiv:2207.06214, 2022.
- Deep metric learning via lifted structured feature embedding. In Computer Vision and Pattern Recognition (CVPR), 2016.
- How to train your ViT? Data, augmentation, and regularization in vision transformers. arXiv preprint arXiv:2106.10270, 2021.
- TorchVision. Torchvision: Pytorch’s computer vision library. GitHub repository: https://github.com/pytorch/vision, 2016.
- Plex: Towards reliability using pretrained large model extensions. arXiv preprint arXiv:2207.07411, 2022.
- A deeper look into aleatoric and epistemic uncertainty disentanglement. In Computer Vision and Pattern Recognition Workshops (CVPRW), 2022.
- Visualizing data using t-SNE. Journal of Machine Learning Research (JMLR), 9(86):2579–2605, 2008.
- The Caltech-UCSD birds-200-2011 dataset. Technical Report CNS-TR-2011-001, California Institute of Technology, 2011.
- Wightman, R. PyTorch image models, 2019.
- Quantifying aleatoric and epistemic uncertainty in machine learning: Are conditional entropy and mutual information appropriate measures? In Uncertainty in Artificial Intelligence (UAI), 2023.
- Sun database: Large-scale scene recognition from abbey to zoo. In Conference on Computer Vision and Pattern Recognition (CVPR), 2010.
- Learning loss for active learning. In Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- Cutmix: Regularization strategy to train strong classifiers with localizable features. In International Conference on Computer Vision (ICCV), 2019.
- A large-scale study of representation learning with the visual task adaptation benchmark, 2020.
- Mixup: Beyond empirical risk minimization. In International Conference on Learning Representations (ICLR), 2018.