Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
120 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Repeatability, Reproducibility, Replicability, Reusability (4R) in Journals' Policies and Software/Data Management in Scientific Publications: A Survey, Discussion, and Perspectives (2312.11028v1)

Published 18 Dec 2023 in cs.SE

Abstract: With the recognized crisis of credibility in scientific research, there is a growth of reproducibility studies in computer science, and although existing surveys have reviewed reproducibility from various perspectives, especially very specific technological issues, they do not address the author-publisher relationship in the publication of reproducible computational scientific articles. This aspect requires significant attention because it is the basis for reliable research. We have found a large gap between the reproducibility-oriented practices, journal policies, recommendations, publisher artifact Description/Evaluation guidelines, submission guides, technological reproducibility evolution, and its effective adoption to contribute to tackling the crisis. We conducted a narrative survey, a comprehensive overview and discussion identifying the mutual efforts required from Authors, Journals, and Technological actors to achieve reproducibility research. The relationship between authors and scientific journals in their mutual efforts to jointly improve the reproducibility of scientific results is analyzed. Eventually, we propose recommendations for the journal policies, as well as a unified and standardized Reproducibility Guide for the submission of scientific articles for authors. The main objective of this work is to analyze the implementation and experiences of reproducibility policies, techniques and technologies, standards, methodologies, software, and data management tools required for scientific reproducible publications. Also, the benefits and drawbacks of such an adoption, as well as open challenges and promising trends, to propose possible strategies and efforts to mitigate the identified gaps. To this purpose, we analyzed 200 scientific articles, surveyed 16 Computer Science journals, and systematically classified them according to reproducibility strategies, technologies, policies, code citation, and editorial business. We conclude there is still a reproducibility gap in scientific publications, although at the same time also the opportunity to reduce this gap with the joint effort of authors, publishers, and technological providers.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (206)
  1. Reproducibility in Scientific Computing. ACM Computing Surveys, 51(3):63:1–63:36, July 2018.
  2. T.P. Hughes. History of Technology. In International Encyclopedia of the Social & Behavioral Sciences, pages 6852–6857. Elsevier, 2001.
  3. Evolution of DNA replication origin specification and gene silencing mechanisms. Nature Communications, 11(1):5175, October 2020.
  4. The readability of scientific texts is decreasing over time. eLife, 6:e27725, September 2017.
  5. Odd Erik Gundersen. The Reproducibility Crisis Is Real. AI Magazine, 41(3):103–106, September 2020.
  6. Matthew Hutson. Artificial intelligence faces reproducibility crisis. Science, 359(6377):725–726, February 2018.
  7. Francis Dodds. The future of academic publishing: Revolution or evolution revisited. Learned Publishing, 32(4):345–354, October 2019.
  8. Reflections on the future of research curation and research reproducibility [point of view]. 106(5):779–783.
  9. The future of academic publishing. Nature Human Behaviour, 7(7):1021–1026, July 2023.
  10. Meta-analysis for families of experiments in software engineering: a systematic review and reproducibility and validity assessment. Empirical Software Engineering, 25(1):353–401, January 2020.
  11. Resources for reproducibility of experiments in empirical software engineering: Topics derived from a secondary study. 8:8992–9004.
  12. Charlotte Stoddart. Is there a reproducibility crisis in science? Nature, pages d41586–019–00067–3, May 2016.
  13. Leakage and the Reproducibility Crisis in ML-based Science. 2022.
  14. Reproducibility as a Mechanism for Teaching Fairness, Accountability, Confidentiality, and Transparency in Artificial Intelligence. Proceedings of the AAAI Conference on Artificial Intelligence, 36(11):12792–12800, June 2022.
  15. Measuring Reproduciblity of Machine Learning Methods for Medical Diagnosis. In 2022 Fourth International Conference on Transdisciplinary AI (TransAI), pages 9–16, Laguna Hills, CA, USA, September 2022. IEEE.
  16. Reproducible Model Sharing for AI Practitioners. In Proceedings of the Fifth Workshop on Distributed Infrastructures for Deep Learning (DIDL) 2021, pages 1–6, Virtual Event Canada, December 2021. ACM.
  17. Transparency and reproducibility in artificial intelligence. Nature, 586(7829):E14–E16, October 2020.
  18. Reply to: Transparency and reproducibility in artificial intelligence. Nature, 586(7829):E17–E18, October 2020.
  19. Reproducible Artificial Intelligence Research Requires Open Communication of Complete Source Code. Radiology: Artificial Intelligence, 2(4):e200060, July 2020.
  20. Elizabeth Gibney. This AI researcher is trying to ward off a reproducibility crisis. Nature, 577(7788):14–14, January 2020.
  21. Interpretability and Reproducability in Production Machine Learning Applications. In 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), pages 658–664, Orlando, FL, December 2018. IEEE.
  22. Explainability in Graph Data Science: Interpretability, replicability, and reproducibility of community detection. IEEE Signal Processing Magazine, 39(4):25–39, July 2022.
  23. Reproducibility: Evaluating the Evaluations. In Bertrand Kerautret, Miguel Colom, Adrien Krähenbühl, Daniel Lopresti, Pascal Monasse, and Hugues Talbot, editors, Reproducible Research in Pattern Recognition, volume 12636, pages 12–23. Springer International Publishing, Cham, 2021.
  24. Edward Raff. Research Reproducibility as a Survival Analysis. arXiv:2012.09932 [cs, stat], December 2020. arXiv: 2012.09932.
  25. Reproducibility and research integrity: the role of scientists and institutions. 14(1):451.
  26. Hidden technical debt in machine learning systems. In C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 28. Curran Associates, Inc., 2015.
  27. The practice of reproducible research: case studies and lessons from the data-intensive sciences. University of California Press, Oakland, California, 2018.
  28. Workshop Report on Basic Research Needs for Scientific Machine Learning: Core Technologies for Artificial Intelligence. Technical Report 1478744, February 2019.
  29. Research Reproducibility. Computer, 55(8):16–18, August 2022.
  30. Reproducible research. CORE Issues in Professional and Research Ethics, 1, 01 2012.
  31. Reproducibility in Computing Research: An Empirical Study. IEEE Access, 10:29207–29223, 2022.
  32. Odd Erik Gundersen. The Fundamental Principles of Reproducibility. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 379(2197):20200210, May 2021. arXiv:2011.10098 [cs].
  33. Reproducibility: A Researcher-Centered Definition. Multimodal Technologies and Interaction, 6(2):17, February 2022.
  34. Malcolm Macleod and the University of Edinburgh Research Strategy Group. Improving the reproducibility and integrity of research: what can different stakeholders contribute? BMC Research Notes, 15(1):146, April 2022.
  35. Reproducibility and Replicability in Science. National Academies Press, Washington, D.C., September 2019.
  36. Hans E. Plesser. Reproducibility vs. Replicability: A Brief History of a Confused Terminology. Frontiers in Neuroinformatics, 11:76, January 2018.
  37. A taxonomy for reproducible and replicable research in environmental modelling. Environmental Modelling & Software, 134:104753, December 2020.
  38. Toward a Compatible Reproducibility Taxonomy for Computational and Computing Sciences. Technical Report SAND2018-11186, 1481626, October 2018.
  39. Reproducibility is a Process, Not an Achievement: The Replicability of IR Reproducibility Experiments. In Joemon M. Jose, Emine Yilmaz, João Magalhães, Pablo Castells, Nicola Ferro, Mário J. Silva, and Flávio Martins, editors, Advances in Information Retrieval, volume 12036, pages 43–49. Springer International Publishing, Cham, 2020.
  40. State of the Art: Reproducibility in Artificial Intelligence. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1), April 2018.
  41. Reproducibility in learning. In Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, pages 818–831, Rome Italy, June 2022. ACM.
  42. Toward Long-Term and Archivable Reproducibility. Computing in Science & Engineering, 23(3):82–91, May 2021.
  43. Monya Baker. Why scientists must share their research code.
  44. Escaping dependency hell: finding build dependency errors with the unified dependency graph. In Proceedings of the 29th ACM SIGSOFT International Symposium on Software Testing and Analysis, ISSTA 2020, pages 463–474, New York, NY, USA, July 2020. Association for Computing Machinery.
  45. Chris A. Mack. How to write a good scientific paper. SPIE Press, Bellingham, Washington, 2018. OCLC: on1019885580.
  46. David H Bailey. Reproducibility and variable precision computing. The International Journal of High Performance Computing Applications, 34(5):483–490, September 2020.
  47. Improving Numerical Reproducibility of Scientific Software in Parallel Systems. In 2020 IEEE International Conference on Electro Information Technology (EIT), pages 066–074, Chicago, IL, USA, July 2020. IEEE.
  48. First steps towards more numerical reproducibility. ESAIM: Proceedings and Surveys, 45:229, September 2014.
  49. NixOS: A purely functional Linux distribution. Journal of Functional Programming, 20(5-6):577–615, November 2010.
  50. Toward practical transparent verifiable and long-term reproducible research using Guix. Scientific Data, 9(1):597, October 2022.
  51. Promoting and Enabling Reproducible Data Science Through a Reproducibility Challenge. Harvard Data Science Review, July 2022.
  52. Valentina Ghimpau. Incentives, rewards, and recognition - what really motivates a researcher? In Judging Research. MDPI.
  53. Hugh Desmond. Incentivizing replication is insufficient to safeguard default trust. Philosophy of Science, 88, 03 2020.
  54. Citations, Citation Indicators, and Research Quality: An Overview of Basic Concepts and Theories. SAGE Open, 9(1):215824401982957, January 2019.
  55. Scientist impact factor (sif): a new metric for improving scientists’ evaluation? Annals of Translational Medicine, 5(15), 2017.
  56. Edward Raff. Does the Market of Citations Reward Reproducible Work? arXiv:2204.03829 [cs], April 2022. arXiv: 2204.03829.
  57. A Badging System for Reproducibility and Replicability in Remote Sensing Research. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 13:4988–4995, 2020.
  58. Beyond the badge: reproducibility engineering as a lifetime skill. In Proceedings of the 4th International Workshop on Software Engineering Education for the Next Generation, pages 1–4, Pittsburgh Pennsylvania, May 2022. ACM.
  59. Academic Research in the 21st Century: Maintaining Scientific Integrity in a Climate of Perverse Incentives and Hypercompetition. Environmental Engineering Science, 34(1):51–61, January 2017.
  60. Checklists to detect potential predatory biomedical journals: a systematic review. BMC Medicine, 18(1):104, December 2020.
  61. Fabien C. Y. Benureau and Nicolas P. Rougier. Re-run, repeat, reproduce, reuse, replicate: Transforming code into scientific contributions. 11:69.
  62. Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy. Proceedings of the VLDB Endowment, 16(11):3178–3191, July 2023.
  63. Edward Raff. A Step Toward Quantifying Independently Reproducible Machine Learning Research. 2019.
  64. A literature review of methods for assessment of reproducibility in science. preprint, In Review, November 2022.
  65. Christian S. Collberg. Measuring reproducibility in computer systems research. 2014.
  66. On challenges in machine learning model management. IEEE Data Engineering Bulletin, 2015.
  67. Computational reproducibility: state-of-the-art, challenges, and database research opportunities. In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pages 593–596, Scottsdale Arizona USA, May 2012. ACM.
  68. Current approaches for executing big data science projects—a systematic literature review. PeerJ Computer Science, 8:e862, February 2022.
  69. Jiating Chen Gonzalo Rivero. Best coding practices to ensure reproducibility. 2020.
  70. Reproducible research for scientific computing: Tools and strategies for changing the culture. Computing in Science & Engineering, 14(4):13–17, 2012.
  71. Partha Pratim Ray. A review on TinyML: State-of-the-art and prospects. Journal of King Saud University - Computer and Information Sciences, 34(4):1595–1623, April 2022.
  72. Computational reproducibility of scientific workflows at extreme scales. The International Journal of High Performance Computing Applications, 33(5):763–776, September 2019.
  73. Lorena A. Barba. Defining the Role of Open Source Software in Research Reproducibility. Computer, 55(8):40–48, August 2022.
  74. Cloud-native repositories for big scientific data. Computing in Science & Engineering, 23(2):26–35, 2021.
  75. Revisiting the reproducibility of empirical software engineering studies based on data retrieved from development repositories. Information and Software Technology, 164:107318, December 2023.
  76. How to Open Science: Promoting Principles and Reproducibility Practices Within the Artificial Intelligence in Education Community. In Ning Wang, Genaro Rebolledo-Mendez, Vania Dimitrova, Noboru Matsuda, and Olga C. Santos, editors, Artificial Intelligence in Education. Posters and Late Breaking Results, Workshops and Tutorials, Industry and Innovation Tracks, Practitioners, Doctoral Consortium and Blue Sky, volume 1831, pages 74–78. Springer Nature Switzerland, Cham, 2023.
  77. How to Open Science: A Principle and Reproducibility Review of the Learning Analytics and Knowledge Conference. In LAK23: 13th International Learning Analytics and Knowledge Conference, pages 156–164, Arlington TX USA, March 2023. ACM.
  78. Victoria Stodden. Beyond Open Data: A Model for Linking Digital Artifacts to Enable Reproducibility of Scientific Claims. In Proceedings of the 3rd International Workshop on Practical Reproducible Evaluation of Computer Systems, pages 9–14, Stockholm Sweden, June 2020. ACM.
  79. An empirical analysis of journal policy effectiveness for computational reproducibility. Proceedings of the National Academy of Sciences, 115(11):2584–2589, March 2018.
  80. Supporting FAIR data: categorization of research data as a tool in data management. Informaatiotutkimus, 37(4), December 2018.
  81. Reproducibility of Machine Learning: Terminology, Recommendations and Open Issues, February 2023. arXiv:2302.12691 [cs].
  82. Open science in machine learning. Implementing Reproducible Research, 02 2014.
  83. Towards an Accountable and Reproducible Federated Learning: A FactSheets Approach. 2022.
  84. Towards a Metadata Management System for Provenance, Reproducibility and Accountability in Federated Machine Learning. In Christian Zirpins, Guadalupe Ortiz, Zoltan Nochta, Oliver Waldhorst, Jacopo Soldani, Massimo Villari, and Damian Tamburri, editors, Advances in Service-Oriented and Cloud Computing, volume 1617, pages 5–18. Springer Nature Switzerland, Cham, 2022.
  85. Repeatability, reproducibility, and rigor in systems research. In Proceedings of the ninth ACM international conference on Embedded software, pages 33–38, Taipei Taiwan, October 2011. ACM.
  86. Collective mind: Towards practical and collaborative auto-tuning. Sci. Program., 22:309–329, 2014.
  87. Grigori Fursin. Invited Talk Abstract: Introducing ReQuEST: An Open Platform for Reproducible and Quality-Efficient Systems-ML Tournaments. In 2018 1st Workshop on Energy Efficient Machine Learning and Cognitive Computing for Embedded Applications (EMC2), pages 3–3, Williamsburg, VA, March 2018. IEEE.
  88. The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs). 2023.
  89. Adopting Microservices and DevOps in the Cyber-Physical Systems Domain: A Rapid Review and Case Study. Software: Practice and Experience, 53(3):790–810, March 2023. arXiv:2210.06858 [cs].
  90. Cloud Programming Simplified: A Berkeley View on Serverless Computing. 2019.
  91. A Large-Scale Study About Quality and Reproducibility of Jupyter Notebooks. In 2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR), pages 507–517, May 2019. ISSN: 2574-3864.
  92. Computational reproducibility of jupyter notebooks from biomedical publications, 2023.
  93. Understanding and improving the quality and reproducibility of Jupyter notebooks. Empirical Software Engineering, 26(4):65, July 2021.
  94. ReproduceMeGit: A Visualization Tool for Analyzing Reproducibility of Jupyter Notebooks. In Boris Glavic, Vanessa Braganholo, and David Koop, editors, Provenance and Annotation of Data and Processes, volume 12839, pages 201–206. Springer International Publishing, Cham, 2021.
  95. Restoring reproducibility of Jupyter notebooks. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering: Companion Proceedings, pages 288–289, Seoul South Korea, June 2020. ACM.
  96. Reproducible experiments on dynamic resource allocation in cloud data centers. Information Systems, 59:98–101, July 2016.
  97. Faical Congo. Building a Cloud Service for Reproducible Simulation Management. pages 187–193, Austin, Texas, 2015.
  98. Bill Howe. Virtual appliances, cloud computing, and reproducible research. Computing in Science & Engineering, 14(4):36–41, 2012.
  99. Toward Reusable Science with Readable Code and Reproducibility. arXiv:2109.10387 [cs], September 2021. arXiv: 2109.10387.
  100. R. Shane Canon. The Role of Containers in Reproducibility. In 2020 2nd International Workshop on Containers and New Orchestration Paradigms for Isolated Environments in HPC (CANOPIE-HPC), pages 19–25, Atlanta, GA, USA, November 2020. IEEE.
  101. Reproducibility of Computational Experiments on Kubernetes-Managed Container Clouds with HyperFlow. In Valeria V. Krzhizhanovskaya, Gábor Závodszky, Michael H. Lees, Jack J. Dongarra, Peter M. A. Sloot, Sérgio Brissos, and João Teixeira, editors, Computational Science – ICCS 2020, volume 12137, pages 220–233. Springer International Publishing, Cham, 2020.
  102. Using Computing Containers and Continuous Integration to Improve Numerical Research Reproducibility. International Journal of Computer (IJC), 30(1):27–33, July 2018.
  103. Reproducible Research Environments with Repo2Docker. June 2018.
  104. Building a Reproducible Machine Learning Pipeline. arXiv:1810.04570 [cs, stat], October 2018. arXiv: 1810.04570.
  105. The pipeline for the continuous development of artificial intelligence models—Current state of research and practice. Journal of Systems and Software, 199:111615, May 2023.
  106. Watchdog 2.0: New developments for reusability, reproducibility, and workflow execution. GigaScience, 9(6):giaa068, June 2020.
  107. Machine Learning Pipelines: Provenance, Reproducibility and FAIR Data Principles. In Boris Glavic, Vanessa Braganholo, and David Koop, editors, Provenance and Annotation of Data and Processes, volume 12839, pages 226–230. Springer International Publishing, Cham, 2021.
  108. A Community Roadmap for Scientific Workflows Research and Development. In 2021 IEEE Workshop on Workflows in Support of Large-Scale Science (WORKS), pages 81–90, St. Louis, MO, USA, November 2021. IEEE.
  109. A Model-Driven Approach for Systematic Reproducibility and Replicability of Data Science Projects. In Xavier Franch, Geert Poels, Frederik Gailly, and Monique Snoeck, editors, Advanced Information Systems Engineering, volume 13295, pages 147–163. Springer International Publishing, Cham, 2022.
  110. Marcelle: Composing Interactive Machine Learning Workflows and Interfaces. In The 34th Annual ACM Symposium on User Interface Software and Technology, pages 39–53, Virtual Event USA, October 2021. ACM.
  111. Reproducible Workflow. 2020.
  112. Experiences with Reproducibility: Case Studies from Scientific Workflows. In Proceedings of the 4th International Workshop on Practical Reproducible Evaluation of Computer Systems, pages 3–8, Virtual Event Sweden, June 2020. ACM.
  113. KheOps: Cost-effective Repeatability, Reproducibility, and Replicability of Edge-to-Cloud Experiments. In Proceedings of the 2023 ACM Conference on Reproducibility and Replicability, pages 62–73, Santa Cruz CA USA, June 2023. ACM.
  114. Scientific workflows for computational reproducibility in the life sciences: Status, challenges and opportunities. Future Generation Computer Systems, 75:284–298, October 2017.
  115. Reproducibility Practice in High-Performance Computing: Community Survey Results. Computing in Science & Engineering, 23(5):55–60, September 2021.
  116. Towards Reproducibility in Scientific Workflows: An Infrastructure-Based Approach. Scientific Programming, 2015:e243180, February 2015.
  117. The ml test score: A rubric for ml production readiness and technical debt reduction. In 2017 IEEE International Conference on Big Data (Big Data), pages 1123–1132, 2017.
  118. Practical MLOps: operationalizing machine learning models. O’Reilly Media Inc, Sebastopol, CA, first edition edition, 2021. OCLC: on1249501065.
  119. Software engineering for machine learning: A case study. In 2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP), pages 291–300, 2019.
  120. Do machine learning platforms provide out-of-the-box reproducibility? Future Generation Computer Systems, 126:34–47, January 2022.
  121. Management of Machine Learning Lifecycle Artifacts: A Survey. 2022.
  122. Preprint. Ml reproducibility systems: Status and research agenda. 2021.
  123. Grigori Fursin. Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs. 2020.
  124. YAWL: An open source Business Process Management System from science for science. SoftwareX, 12:100576, July 2020.
  125. Shiwa workflow interoperability solutions for neuroimaging data analysis. Studies in health technology and informatics, 175:109–10, 09 2012.
  126. Experimental Research Reproducibility and Experiment Workflow Management. In 2023 15th International Conference on COMmunication Systems & NETworkS (COMSNETS), pages 835–840, Bangalore, India, January 2023. IEEE.
  127. Richard W. Bowman. Improving instrument reproducibility with open source hardware. 3(1):27.
  128. https://octoverse.github.com/. State of the octoverse 2022. 2023.
  129. Pedro Mestre Daniel Adorno Gomes and Carlos Serodio. Infrastructure-as-code for scientific computing environments. CENTRIC 2019.
  130. pulumi.com. Delivering cloud native infrastructure as code - pulumi. 2019.
  131. Provenance-and machine learning-based recommendation of parameter values in scientific workflows. PeerJ Computer Science, 7:e606, July 2021.
  132. ProvStore: A Public Provenance Repository. In Bertram Ludäscher and Beth Plale, editors, Provenance and Annotation of Data and Processes, volume 8628, pages 275–277. Springer International Publishing, Cham, 2015.
  133. Reprozip: Using provenance to support computational reproducibility. 01 2013.
  134. Integrated Reproducibility with Self-describing Machine Learning Models. In Proceedings of the 2023 ACM Conference on Reproducibility and Replicability, pages 1–14, Santa Cruz CA USA, June 2023. ACM.
  135. A collaborative semantic-based provenance management platform for reproducibility. PeerJ Computer Science, 8:e921, March 2022.
  136. End-to-End provenance representation for the understandability and reproducibility of scientific experiments using a semantic approach. Journal of Biomedical Semantics, 13(1):1, December 2022.
  137. A Blockchain-Based Approach to Provenance and Reproducibility in Research Workflows. In 2021 IEEE International Conference on Blockchain and Cryptocurrency (ICBC), pages 1–6, Sydney, Australia, May 2021. IEEE.
  138. AI pedigree verification platform using blockchain. In 2020 2nd Conference on Blockchain Research & Applications for Innovative Networks and Services (BRAINS), pages 204–205, Paris, France, September 2020. IEEE.
  139. Joseph Wonsil. Reproducibility as a service.
  140. Reproducibility as a Technical Specification. 2015.
  141. IPOL: a new journal for fully reproducible research; analysis of four years development. July 2015.
  142. Towards Evidence-Based Software Quality Practices for Reproducibility: Preliminary Results and Research Directions. In Proceedings of the 2023 ACM Conference on Reproducibility and Replicability, pages 85–88, Santa Cruz CA USA, June 2023. ACM.
  143. Best practices for computational science: Software infrastructure and environments for reproducible and extensible research. 2(1):e21.
  144. Illustrations from The Turing Way: Shared under CC-BY 4.0 for reuse, May 2022.
  145. Towards computational reproducibility: researcher perspectives on the use and sharing of software. PeerJ Computer Science, 4:e163, September 2018.
  146. A survey study of success factors in data science projects. In 2021 IEEE International Conference on Big Data (Big Data), pages 2313–2318, Orlando, FL, USA, December 2021. IEEE.
  147. A systematic literature review on applying crisp-dm process model. Procedia Computer Science, 181:526–534, 01 2021.
  148. Reproducibility Starts from You Today. Patterns, 1(6):100099, September 2020.
  149. Editorial: Method and data sharing and reproducibility of scientific results. 60(12):5868–5869.
  150. Understanding experiments and research practices for reproducibility: an exploratory study. PeerJ, 9:e11140, April 2021.
  151. A better approach for dealing with reproducibility and replicability in science. 118(7):e2100769118.
  152. Remco Heesen. Communism and the incentive to share in science. Philosophy of Science, 84(4):698–716, 2017.
  153. Fabio Bonsignorio. A New Kind of Article for Reproducible Research in Intelligent Robotics [From the Field]. IEEE Robotics & Automation Magazine, 24(3):178–182, September 2017.
  154. Enhancing Reproducibility of Research Papers in SISC, JSC and JCP. Journal of Scientific Computing, 95(3):77, s10915–023–02193–7, June 2023.
  155. Advancing Computational Reproducibility in the Dataverse Data Repository Platform. In Proceedings of the 3rd International Workshop on Practical Reproducible Evaluation of Computer Systems, pages 15–20, Stockholm Sweden, June 2020. ACM.
  156. Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program). arXiv:2003.12206 [cs, stat], December 2020. arXiv: 2003.12206.
  157. Toward enabling reproducibility for data-intensive research using the whole tale platform. In Ian Foster, Gerhard R. Joubert, Luděk Kučera, Wolfgang E. Nagel, and Frans Peters, editors, Advances in Parallel Computing. IOS Press.
  158. Computing environments for reproducibility: Capturing the “Whole Tale”. Future Generation Computer Systems, 94:854–867, May 2019.
  159. Implementing Computational Reproducibility in the Whole Tale Environment. In Proceedings of the 2nd International Workshop on Practical Reproducible Evaluation of Computer Systems, pages 17–22, Phoenix AZ USA, June 2019. ACM.
  160. ReproducedPapers.org: Openly teaching and structuring machine learning reproducibility. arXiv:2012.01172 [cs], 12636:3–11, 2021. arXiv: 2012.01172.
  161. ReScience C: A Journal for Reproducible Replications in Computational Science. In Bertrand Kerautret, Miguel Colom, Daniel Lopresti, Pascal Monasse, and Hugues Talbot, editors, Reproducible Research in Pattern Recognition, volume 11455, pages 150–156. Springer International Publishing, Cham, 2019.
  162. Chameleon: A Scalable Production Testbed for Computer Science Research, pages 123–148. 05 2019.
  163. A Study of Computational Reproducibility using URLs Linking to Open Access Datasets and Software. In Companion Proceedings of the Web Conference 2022, pages 784–788, Virtual Event, Lyon France, April 2022. ACM.
  164. Documenting the Creation, Manipulation and Evaluation of Links for Reuse and Reproducibility. In Oscar Corcho, Laura Hollink, Oliver Kutz, Nicolas Troquard, and Fajar J. Ekaputra, editors, Knowledge Engineering and Knowledge Management, volume 13514, pages 81–96. Springer International Publishing, Cham, 2022.
  165. A Siren Song of Open Source Reproducibility. arXiv:2204.04372 [cs], April 2022. arXiv: 2204.04372.
  166. Why don’t we share data and code? Perceived barriers and benefits to public archiving practices. Proceedings of the Royal Society B: Biological Sciences, 289(1987):20221113, November 2022.
  167. Supporting computational reproducibility through code review. Nature Human Behaviour, 5(8):965–966, August 2021.
  168. Artifacts Availability & Reproducibility (VLDB 2021 Round Table). ACM SIGMOD Record, 51(2):74–77, July 2022.
  169. Verifiable Badging System for scientific data reproducibility. Blockchain: Research and Applications, 2(2):100015, June 2021.
  170. Grigori Fursin. Enabling reproducible ML and Systems research: the good, the bad, and the ugly. August 2020.
  171. Reproducibility Requires Consolidated Artifacts. In 2023 IEEE/ACM 2nd International Conference on AI Engineering – Software Engineering for AI (CAIN), pages 100–101, Melbourne, Australia, May 2023. IEEE.
  172. Tanu Malik. Artifact Description/Artifact Evaluation: A Reproducibility Bane or a Boon. In Proceedings of the 4th International Workshop on Practical Reproducible Evaluation of Computer Systems, pages 1–1, Virtual Event Sweden, June 2020. ACM.
  173. Certify reproducibility with confidential data. Science, 365(6449):127–128, July 2019.
  174. A data citation roadmap for scientific publishers. Scientific Data, 5(1):180259, November 2018.
  175. A computational reproducibility study of PLOS ONE articles featuring longitudinal data analyses. 16(6):e0251194.
  176. Reproducibility efforts as a teaching tool: A pilot study. PLOS Computational Biology, 18(11):e1010615, November 2022.
  177. Expanding the Scope of Reproducibility Research Through Data Analysis Replications. In Companion Proceedings of the Web Conference 2020, pages 567–571, Taipei Taiwan, April 2020. ACM.
  178. Fraida Fund. We Need More Reproducibility Content Across the Computer Science Curriculum. In Proceedings of the 2023 ACM Conference on Reproducibility and Replicability, pages 97–101, Santa Cruz CA USA, June 2023. ACM.
  179. Jimmy Lin. Building a Culture of Reproducibility in Academic Research. 2022.
  180. The History and Future of Data Citation in Practice. Data Science Journal, 18:52, November 2019.
  181. Mikhail G. Dozmorov. GitHub Statistics as a Measure of the Impact of Open-Source Bioinformatics Software. Frontiers in Bioengineering and Biotechnology, 6:198, December 2018.
  182. Toward Reproducible Computational Research: An Empirical Analysis of Data and Code Policy Adoption by Journals. PLoS ONE, 8(6):e67111, June 2013.
  183. Thu-Mai Lewis. From policy to practice: How journal-based data policies encourage scientists’ adoption of reproducible research practices. PhD thesis, The University of North Carolina at Chapel Hill University Libraries.
  184. Zhaokun Stodden Victoria, Guo Peixuan. How journals are adopting open data and code policies.
  185. OA Diamond Journals Study. Part 1: Findings. Technical report, Zenodo, March 2021.
  186. Reproducible and reusable research: are journal data sharing policies meeting the mark? PeerJ, 5:e3208, April 2017.
  187. Publishing computational research - a review of infrastructures for reproducible and transparent scholarly communication. Research Integrity and Peer Review, 5(1):10, December 2020.
  188. Trust but Verify: How to Leverage Policies, Workflows, and Infrastructure to Ensure Computational Reproducibility in Publication. Harvard Data Science Review, 2(4), December 2020.
  189. A Survey on Data Reproducibility and the Effect of Publication Process on the Ethical Reporting of Laboratory Research. Clinical Cancer Research, 24(14):3447–3455, July 2018.
  190. State of AI Ethics Report (Volume 6, February 2022), February 2022. arXiv:2202.07435 [cs].
  191. Containers for computational reproducibility. Nature Reviews Methods Primers, 3(1):1–16, July 2023.
  192. Grigori Fursin. The Collective Knowledge project: making ML models more portable and reproducible with open APIs, reusable best practices and MLOps. 2020.
  193. Russell A. Poldrack. The Costs of Reproducibility. Neuron, 101(1):11–14, January 2019.
  194. Armbrust. Above the clouds: A berkeley view of cloud computing. 01 2009.
  195. Christopher Gandrud. Reproducible research with R and RStudio. The R series. CRC Press, Boca Raton, FL, third edition edition, 2020.
  196. The galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update. Nucleic acids research, 46(W1):W537–W544, 2018.
  197. Toward Trustworthy AI Development: Mechanisms for Supporting Verifiable Claims. 2020.
  198. The race to the vulnerable: Measuring the log4j shell incident. arXiv preprint arXiv:2205.02544, 2022.
  199. A network-based normalized impact measure reveals successful periods of scientific discovery across disciplines. Proceedings of the National Academy of Sciences, 120(48):e2309378120, November 2023.
  200. Software citation principles. PeerJ Computer Science, 2:e86, September 2016.
  201. 204.4 Identifiers for Digital Objects: The case of software source code preservation. August 2022.
  202. Instilling purpose and value in the implementation of digital badges in higher education. International Journal of Educational Technology in Higher Education, 16, 12 2019.
  203. MLDev: Data Science Experiment Automation and Reproducibility Software. arXiv:2107.12322 [cs], July 2021. arXiv: 2107.12322.
  204. Unifying package managers, workflow engines, and containers: Computational reproducibility with BioNix. 9(11):giaa121.
  205. reproducibilidad de las estrategias de búsqueda en revisiones sistemáticas publicadas en revistas españolas de Biblioteconomía y Documentación. Ibersid: revista de sistemas de información y documentación, 17(1):129–137, June 2023.
  206. The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. BMJ, page n71, March 2021.
Citations (2)

Summary

We haven't generated a summary for this paper yet.