Can citations tell us about a paper's reproducibility? A case study of machine learning papers (2405.03977v1)

Published 7 May 2024 in cs.DL, cs.AI, and cs.LG

Abstract: The iterative character of work in ML and AI and reliance on comparisons against benchmark datasets emphasize the importance of reproducibility in that literature. Yet, resource constraints and inadequate documentation can make running replications particularly challenging. Our work explores the potential of using downstream citation contexts as a signal of reproducibility. We introduce a sentiment analysis framework applied to citation contexts from papers involved in Machine Learning Reproducibility Challenges in order to interpret the positive or negative outcomes of reproduction attempts. Our contributions include training classifiers for reproducibility-related contexts and sentiment analysis, and exploring correlations between citation context sentiment and reproducibility scores. Study data, software, and an artifact appendix are publicly available at https://github.com/lamps-lab/ccair-ai-reproducibility .

References (29)

Authors (3)

Rochana R. Obadage (2 papers)
Sarah M. Rajtmajer (5 papers)
Jian Wu (315 papers)

Citations (2)
