Examining the Effect of Implementation Factors on Deep Learning Reproducibility
Abstract: Reproducing published deep learning papers to validate their conclusions can be difficult due to sources of irreproducibility. We investigate the impact that implementation factors have on the results and how they affect reproducibility of deep learning studies. Three deep learning experiments were ran five times each on 13 different hardware environments and four different software environments. The analysis of the 780 combined results showed that there was a greater than 6% accuracy range on the same deterministic examples introduced from hardware or software environment variations alone. To account for these implementation factors, researchers should run their experiments multiple times in different hardware and software environments to verify their conclusions are not affected.
- M. Hutson, “Artificial intelligence faces reproducibility crisis,” 2018.
- E. Raff, “A step toward quantifying independently reproducible machine learning research,” Advances in Neural Information Processing Systems, vol. 32, 2019.
- O. E. Gundersen, K. Coakley, and C. Kirkpatrick, “Sources of irreproducibility in machine learning: A review,” arXiv preprint arXiv:2204.07610, 2022.
- O. E. Gundersen, “The fundamental principles of reproducibility,” Philosophical Transactions of the Royal Society A, vol. 379, no. 2197, p. 20200210, 2021.
- R. Pordes, D. Petravick, B. Kramer, D. Olson, M. Livny, A. Roy, P. Avery, K. Blackburn, T. Wenaus, F. Würthwein et al., “The open science grid,” in Journal of Physics: Conference Series, vol. 78. IOP Publishing, 2007, p. 012057.
- I. Sfiligoi, D. C. Bradley, B. Holzman, P. Mhashilkar, S. Padhi, and F. Wurthwein, “The pilot way to grid resources using glideinwms,” in 2009 WRI World congress on computer science and information engineering, vol. 2. IEEE, 2009, pp. 428–432.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.