Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Reproducibility in Machine Learning-Driven Research (2307.10320v1)

Published 19 Jul 2023 in cs.LG, cs.CY, and stat.ME

Abstract: Research is facing a reproducibility crisis, in which the results and findings of many studies are difficult or even impossible to reproduce. This is also the case in ML and AI research. Often, this is the case due to unpublished data and/or source-code, and due to sensitivity to ML training conditions. Although different solutions to address this issue are discussed in the research community such as using ML platforms, the level of reproducibility in ML-driven research is not increasing substantially. Therefore, in this mini survey, we review the literature on reproducibility in ML-driven research with three main aims: (i) reflect on the current situation of ML reproducibility in various research fields, (ii) identify reproducibility issues and barriers that exist in these research fields applying ML, and (iii) identify potential drivers such as tools, practices, and interventions that support ML reproducibility. With this, we hope to contribute to decisions on the viability of different solutions for supporting ML reproducibility.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Harald Semmelrock (2 papers)
  2. Simone Kopeinik (10 papers)
  3. Dieter Theiler (7 papers)
  4. Tony Ross-Hellauer (2 papers)
  5. Dominik Kowald (58 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.