Fast and reliable uncertainty quantification with neural network ensembles for industrial image classification (2403.10182v4)
Abstract: Image classification with neural networks (NNs) is widely used in industrial processes, where the model is likely to encounter unknown objects during deployment, i.e., out-of-distribution (OOD) data. Worryingly, NNs tend to make confident yet incorrect predictions when confronted with OOD data. To increase the models' reliability, they should quantify the uncertainty in their own predictions, communicating when the output should (not) be trusted. Deep ensembles, composed of multiple independent NNs, have been shown to perform strongly but are computationally expensive. Recent research has proposed more efficient NN ensembles, namely the snapshot, batch, and multi-input multi-output ensembles. This study investigates the predictive and uncertainty performance of efficient NN ensembles in the context of image classification for industrial processes. It is the first to provide a comprehensive comparison, and it proposes a novel Diversity Quality metric that summarizes the ensembles' performance on the in-distribution and OOD sets in a single number. The results highlight the batch ensemble as a cost-effective and competitive alternative to the deep ensemble: it matches the deep ensemble in both uncertainty and accuracy while exhibiting considerable savings in training time, test time, and memory storage.
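As a rough illustration of how an ensemble can communicate when its output should not be trusted, the sketch below averages the class probabilities of several members and rejects the prediction when the entropy of that average is high. This is a minimal sketch under stated assumptions, not the paper's procedure: the abstract does not specify the uncertainty measure or the Diversity Quality metric, so predictive entropy is used here as a common stand-in, and the function names and the threshold value are hypothetical.

```python
import math


def ensemble_mean(member_probs):
    """Average the per-member class probabilities for a single input."""
    n_members = len(member_probs)
    n_classes = len(member_probs[0])
    return [sum(p[c] for p in member_probs) / n_members for c in range(n_classes)]


def entropy(probs):
    """Shannon entropy in nats; terms with zero probability contribute 0."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def predict_or_reject(member_probs, threshold):
    """Return (label, entropy); label is None when the entropy exceeds
    the (hypothetical) rejection threshold."""
    mean = ensemble_mean(member_probs)
    h = entropy(mean)
    label = max(range(len(mean)), key=mean.__getitem__)
    return (None if h > threshold else label), h


# Members agree: low entropy, the prediction is kept.
agree = [[0.9, 0.05, 0.05]] * 3
# Members disagree (as may happen on OOD inputs): high entropy, rejected.
disagree = [[0.9, 0.05, 0.05], [0.05, 0.9, 0.05], [0.05, 0.05, 0.9]]

THRESHOLD = 0.7  # assumed value, chosen only for this illustration
label_agree, h_agree = predict_or_reject(agree, THRESHOLD)
label_disagree, h_disagree = predict_or_reject(disagree, THRESHOLD)
```

When the members disagree, the averaged distribution is close to uniform and its entropy approaches log(3) for three classes, so the input is flagged rather than classified. In practice the threshold would be tuned on held-out data.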
Authors: Arthur Thuy, Dries F. Benoit