Can citations tell us about a paper's reproducibility? A case study of machine learning papers (2405.03977v1)
Abstract: The iterative character of work in ML and AI and reliance on comparisons against benchmark datasets emphasize the importance of reproducibility in that literature. Yet, resource constraints and inadequate documentation can make running replications particularly challenging. Our work explores the potential of using downstream citation contexts as a signal of reproducibility. We introduce a sentiment analysis framework applied to citation contexts from papers involved in Machine Learning Reproducibility Challenges in order to interpret the positive or negative outcomes of reproduction attempts. Our contributions include training classifiers for reproducibility-related contexts and sentiment analysis, and exploring correlations between citation context sentiment and reproducibility scores. Study data, software, and an artifact appendix are publicly available at https://github.com/lamps-lab/ccair-ai-reproducibility .
- Edward Raff. A step toward quantifying independently reproducible machine learning research. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.
- A systematic review of reproducibility research in natural language processing. In Paola Merlo, Jorg Tiedemann, and Reut Tsarfaty, editors, Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 381–393, Online, April 2021. Association for Computational Linguistics.
- Predicting the reproducibility of social and behavioral science papers using supervised learning models, 2021.
- Estimating the deep replicability of scientific findings using human and artificial intelligence. Proceedings of the National Academy of Sciences, 117:10762–10768, 2020.
- Inline citation classification using peripheral context and time-evolving augmentation. In Hisashi Kashima, Tsuyoshi Ide, and Wen-Chih Peng, editors, Advances in Knowledge Discovery and Data Mining, pages 3–14, Cham, 2023. Springer Nature Switzerland.
- Citation function, polarity and influence classification. Natural Language Engineering, 23(4):561–588, 2017.
- A meta-analysis of semantic classification of citations. Quantitative Science Studies, 2(4):1170–1215, 12 2021.
- Citation context classification: Critical vs non-critical. In Arman Cohan, Guy Feigenblat, Dayne Freitag, Tirthankar Ghosal, Drahomira Herrmannova, Petr Knoth, Kyle Lo, Philipp Mayr, Michal Shmueli-Scheuer, Anita de Waard, and Lucy Lu Wang, editors, Proceedings of the Third Workshop on Scholarly Document Processing, pages 49–53, Gyeongju, Republic of Korea, October 2022. Association for Computational Linguistics.
- Reproducibility and replicability in science. 2019.
- ACM. , 2020. Accessed March 28, 2024.
- Issues and challenges of aspect-based sentiment analysis: A comprehensive survey. IEEE Transactions on Affective Computing, 13(2):845–863, 2022.
- State of the art: Reproducibility in artificial intelligence. Proceedings of the AAAI Conference on Artificial Intelligence, 32(1), Apr. 2018.
- Laying foundations to quantify the “effort of reproducibility”. In 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL), pages 56–60, Los Alamitos, CA, USA, jun 2023. IEEE Computer Society.
- Multi-task learning model based on recurrent convolutional neural networks for citation sentiment and purpose classification. Neurocomputing, 335:195–205, 2019.
- Hugging Face Team. Models - Hugging Face — huggingface.co. https://huggingface.co/models, 2024. [Accessed 05-01-2024].
- Structural scaffolds for citation intent classification in scientific publications. In Jill Burstein, Christy Doran, and Thamar Solorio, editors, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 3586–3596, Minneapolis, Minnesota, June 2019. Association for Computational Linguistics.
- Understanding the meanings of citations using sentiment, role, and citation function classifications. Scientometrics, 128(1):735–759, Jan 2023.
- A study on reproducibility and replicability of table structure recognition methods. In Gernot A. Fink, Rajiv Jain, Koichi Kise, and Richard Zanibbi, editors, Document Analysis and Recognition - ICDAR 2023, pages 3–19, Cham, 2023. Springer Nature Switzerland.
- The semantic scholar open data platform. ArXiv, abs/2301.10140, 2023.
- Koustuv Sinha. ML Reproducibility Challenge 2023 | MLRC2023 — reproml.org. https://reproml.org/, 2023. [Accessed 03-02-2024].
- RESCIENCE C. ReScience C — rescience.github.io. https://rescience.github.io/read/, 2023. [Accessed 10-02-2024].
- Distilbert, a distilled version of BERT: smaller, faster, cheaper and lighter. CoRR, abs/1910.01108, 2019.
- BERT: Pre-training of deep bidirectional transformers for language understanding. In Jill Burstein, Christy Doran, and Thamar Solorio, editors, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota, June 2019. Association for Computational Linguistics.
- A study of computational reproducibility using urls linking to open access datasets and software. In Companion Proceedings of the Web Conference 2022, WWW ’22, page 784–788, New York, NY, USA, 2022. Association for Computing Machinery.
- The power of brand selfies. Journal of Marketing Research, 2021.
- Seethal. Seethal/sentiment-analysis-generic-dataset · Hugging Face — huggingface.co. https://huggingface.co/Seethal/sentiment_analysis_generic_dataset, 2022. [Accessed 10-02-2024].
- pysentimiento: A python toolkit for sentiment analysis and socialnlp tasks, 2021.
- Souvick Das. Souvikcmsa/BERT-sentiment-analysis Hugging Face — huggingface.co. https://huggingface.co/Souvikcmsa/BERT_sentiment_analysis, 2022. [Accessed 10-02-2024].
- BI team. sbcBI/sentiment-analysis-model · Hugging Face — huggingface.co. https://huggingface.co/sbcBI/sentiment_analysis_model, 2022. [Accessed 10-02-2024].
- Rochana R. Obadage (2 papers)
- Sarah M. Rajtmajer (5 papers)
- Jian Wu (315 papers)