Taming False Positives in Out-of-Distribution Detection with Human Feedback (2404.16954v1)
Abstract: Robustness to out-of-distribution (OOD) samples is crucial for safely deploying machine learning models in the open world. Recent works have focused on designing scoring functions to quantify OOD uncertainty. Setting appropriate thresholds for these scoring functions for OOD detection is challenging as OOD samples are often unavailable up front. Typically, thresholds are set to achieve a desired true positive rate (TPR), e.g., $95\%$ TPR. However, this can lead to very high false positive rates (FPR), ranging from 60 to 96\%, as observed in the Open-OOD benchmark. In safety-critical real-life applications, e.g., medical diagnosis, controlling the FPR is essential when dealing with various OOD samples dynamically. To address these challenges, we propose a mathematically grounded OOD detection framework that leverages expert feedback to \emph{safely} update the threshold on the fly. We provide theoretical results showing that it is guaranteed to meet the FPR constraint at all times while minimizing the use of human feedback. Another key feature of our framework is that it can work with any scoring function for OOD uncertainty quantification. Empirical evaluation of our system on synthetic and benchmark OOD datasets shows that our method can maintain FPR at most $5\%$ while maximizing TPR.
- C. C. Aggarwal. An Introduction to Outlier Analysis, pages 1–34. Springer International Publishing, Cham, 2017. ISBN 978-3-319-47578-3.
- Concrete problems in AI safety. CoRR, abs/1606.06565, 2016.
- A. Balsubramani. Sharp finite-time iterated-logarithm martingale concentration, 2015.
- Testing for outliers with conformal p-values. The Annals of Statistics, 51(1):149 – 178, 2023.
- Gradorth: A simple yet efficient out-of-distribution detection with orthogonal projection of gradients, 2023.
- A. Bendale and T. E. Boult. Towards open set deep networks. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1563–1572, 2015.
- Discriminative out-of-distribution detection for semantic segmentation, 2018.
- C. M. Bishop. Novelty detection and neural network validation. IEE Proceedings-Vision, Image and Signal processing, 141(4):217–222, 1994.
- F. Cai and X. Koutsoukos. Real-time out-of-distribution detection in learning-enabled cyber-physical systems. In 2020 ACM/IEEE 11th International Conference on Cyber-Physical Systems (ICCPS), pages 174–183, 2020.
- A simple framework for contrastive learning of visual representations. In International conference on machine learning, pages 1597–1607. PMLR, 2020.
- D. A. Darling and H. Robbins. Confidence sequences for mean, variance, and median. Proceedings of the National Academy of Sciences, 58(1):66–68, 1967.
- D. A. Darling and H. Robbins. Some nonparametric sequential tests with power one. Proceedings of the National Academy of Sciences of the United States of America, 61(3):804–809, 1968. ISSN 00278424. URL http://www.jstor.org/stable/58954.
- Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee, 2009.
- Extremely simple activation shaping for out-of-distribution detection. In The Eleventh International Conference on Learning Representations, 2023.
- Asymptotic Minimax Character of the Sample Distribution Function and of the Classical Multinomial Estimator. The Annals of Mathematical Statistics, 27(3):642 – 669, 1956.
- A simple test-time method for out-of-distribution detection. arXiv preprint arXiv:2207.08210, 2022.
- Y. Geifman and R. El-Yaniv. SelectiveNet: A deep neural network with an integrated reject option. In Proceedings of the 36th International Conference on Machine Learning, volume 97, pages 2151–2159, 2019.
- D. Hendrycks and K. Gimpel. A baseline for detecting misclassified and out-of-distribution examples in neural networks. In International Conference on Learning Representations, 2017.
- Deep anomaly detection with outlier exposure. In International Conference on Learning Representations, 2019.
- W. Hoeffding. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301), 1963. ISSN 01621459.
- S. R. Howard and A. Ramdas. Sequential estimation of quantiles with applications to A/B testing and best-arm identification. Bernoulli, 28(3):1704 – 1728, 2022. doi: 10.3150/21-BEJ1388.
- On the importance of gradients for detecting distributional shifts in the wild. Advances in Neural Information Processing Systems, 34:677–689, 2021.
- Y. Iwasawa and Y. Matsuo. Test-time classifier adjustment module for model-agnostic domain generalization. Advances in Neural Information Processing Systems, 34:2427–2440, 2021.
- lil’ ucb : An optimal exploration algorithm for multi-armed bandits. In Proceedings of The 27th Conference on Learning Theory, volume 35, pages 423–439. PMLR, 2014.
- T. Jeong and H. Kim. Ood-maml: Meta-learning for few-shot out-of-distribution detection and classification. In Advances in Neural Information Processing Systems, volume 33, pages 3907–3916, 2020.
- R. Johari. Can i take a peek?: Continuous monitoring of online a/b tests. Proceedings of the 24th International Conference on World Wide Web, 2015.
- Training ood detectors in their natural habitats. In International Conference on Machine Learning, 2022.
- On the complexity of best-arm identification in multi-armed bandit models. Journal of Machine Learning Research, 17(1), 2016. ISSN 1532-4435.
- idecode: In-distribution equivariance for conformal out-of-distribution detection. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 7104–7114, 2022.
- A. Khinchine. Über einen satz der wahrscheinlichkeitsrechnung. Fundamenta Mathematicae, 6:9–20, 1924.
- Why normalizing flows fail to detect out-of-distribution data. Advances in neural information processing systems, 33:20578–20589, 2020.
- A. Kolmogorov. Über das gesetz des iterierten logarithmus. Mathematische Annalen, 101:126–135, 1929.
- Learning multiple layers of features from tiny images. 2009.
- T. L. Lai. On Confidence Sequences. The Annals of Statistics, 4(2):265 – 280, 1976.
- R. Laxhammar and G. Falkman. Sequential conformal anomaly detection in trajectories based on hausdorff distance. In 14th International Conference on Information Fusion, pages 1–8, 2011.
- R. Laxhammar and G. Falkman. Inductive conformal anomaly detection for sequential detection of anomalous sub-trajectories. Annals of Mathematics and Artificial Intelligence, 74:67–94, 2015.
- Training confidence-calibrated classifiers for detecting out-of-distribution samples. 2018a.
- A simple unified framework for detecting out-of-distribution samples and adversarial attacks. Advances in neural information processing systems, 31, 2018b.
- Enhancing the reliability of out-of-distribution image detection in neural networks. arXiv preprint arXiv:1706.02690, 2017.
- Enhancing the reliability of out-of-distribution image detection in neural networks. In International Conference on Learning Representations, 2018.
- Open category detection with PAC guarantees. In Proceedings of the 35th International Conference on Machine Learning, volume 80, pages 3169–3178. PMLR, 2018.
- Energy-based out-of-distribution detection. Advances in Neural Information Processing Systems, 33:21464–21475, 2020.
- Anytime-valid confidence sequences in an enterprise a/b testing platform. In Companion Proceedings of the ACM Web Conference 2023, WWW ’23 Companion, page 396–400. Association for Computing Machinery, 2023.
- P. Massart. The Tight Constant in the Dvoretzky-Kiefer-Wolfowitz Inequality. The Annals of Probability, 18(3):1269 – 1283, 1990.
- Cider: Exploiting hyperspherical embeddings for out-of-distribution detection. arXiv preprint arXiv:2203.04450, 2022.
- Self-supervised learning for generalizable out-of-distribution detection. In AAAI Conference on Artificial Intelligence, 2020.
- Hybrid models with deep and invertible features. In International Conference on Machine Learning, pages 4723–4732. PMLR, 2019.
- Deep neural networks are easily fooled: High confidence predictions for unrecognizable images. In CVPR, pages 427–436. IEEE Computer Society, 2015.
- N. Papernot and P. Mcdaniel. Deep k-nearest neighbors: Towards confident, interpretable and robust deep learning. ArXiv, abs/1803.04765, 2018.
- Likelihood ratios for out-of-distribution detection. In Advances in Neural Information Processing Systems, volume 32, 2019.
- Out-of-distribution detection and selective generation for conditional language models. In The Eleventh International Conference on Learning Representations,ICLR 2023, 2023.
- A unified survey on anomaly, novelty, open-set, and out-of-distribution detection: Solutions and future challenges, 2022.
- Understanding anomaly detection with deep invertible networks through hierarchies of distributions and features. Advances in Neural Information Processing Systems, 33:21038–21049, 2020.
- Ssd: A unified framework for self-supervised outlier detection. arXiv preprint arXiv:2103.12051, 2021.
- Input complexity and out-of-distribution detection with likelihood-based generative models. In International Conference on Learning Representations, 2020.
- N. Smirnov. Approximate laws of distribution of random variables from empirical data. Uspekhi Matematicheskikh Nauk, 10:179–206, 1944.
- React: Out-of-distribution detection with rectified activations. Advances in Neural Information Processing Systems, 34:144–157, 2021.
- Out-of-distribution detection with deep nearest neighbors. In International Conference on Machine Learning, pages 20827–20840. PMLR, 2022.
- Qualitative multi-armed bandits: A quantile-based approach. In Proceedings of the 32nd International Conference on Machine Learning, volume 37 of Proceedings of Machine Learning Research, pages 1660–1668, Lille, France, 07–09 Jul 2015. PMLR.
- Csi: Novelty detection via contrastive learning on distributionally shifted instances. In Advances in Neural Information Processing Systems, volume 33, pages 11839–11852. Curran Associates, Inc., 2020.
- Algorithmic Learning in a Random World. Springer-Verlag, Berlin, Heidelberg, 2005. ISBN 0387001522.
- Tent: Fully test-time adaptation by entropy minimization. arXiv preprint arXiv:2006.10726, 2020.
- Can multi-label classification networks know what they don’t know? In Neural Information Processing Systems, 2021.
- Vim: Out-of-distribution with virtual-logit matching. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4921–4930, 2022.
- Mitigating neural network overconfidence with logit normalization. In Proceedings of the 39th International Conference on Machine Learning, volume 162, pages 23631–23644, 2022.
- Contrastive training for improved out-of-distribution detection. ArXiv, abs/2007.05566, 2020.
- Energy-based out-of-distribution detection for graph neural networks. In The Eleventh International Conference on Learning Representations, 2023.
- Likelihood regret: An out-of-distribution detection score for variational auto-encoder. In Advances in Neural Information Processing Systems, volume 33, pages 20685–20696. Curran Associates, Inc., 2020.
- Semantically coherent out-of-distribution detection. 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 8281–8289, 2021a.
- Generalized out-of-distribution detection: A survey. arXiv preprint arXiv:2110.11334, 2021b.
- Openood: Benchmarking generalized out-of-distribution detection, 2022a.
- Generalized out-of-distribution detection: A survey, 2022b.
- Memo: Test time robustness via adaptation and augmentation. Advances in Neural Information Processing Systems, 35:38629–38642, 2022.
- Adaptive concentration inequalities for sequential decision problems. In Advances in Neural Information Processing Systems, volume 29, 2016.
- Harit Vishwakarma (15 papers)
- Heguang Lin (4 papers)
- Ramya Korlakai Vinayak (13 papers)