An Empirical Study of Challenges in Machine Learning Asset Management (2402.15990v2)
Abstract: In ML, efficient asset management, including ML models, datasets, algorithms, and tools, is vital for resource optimization, consistent performance, and a streamlined development lifecycle. This enables quicker iterations, adaptability, reduced development-to-deployment time, and reliable outputs. Despite existing research, a significant knowledge gap remains in operational challenges like model versioning, data traceability, and collaboration, which are crucial for the success of ML projects. Our study aims to address this gap by analyzing 15,065 posts from developer forums and platforms, employing a mixed-method approach to classify inquiries, extract challenges using BERTopic, and identify solutions through open card sorting and BERTopic clustering. We uncover 133 topics related to asset management challenges, grouped into 16 macro-topics, with software dependency, model deployment, and model training being the most discussed. We also find 79 solution topics, categorized under 18 macro-topics, highlighting software dependency, feature development, and file management as key solutions. This research underscores the need for further exploration of identified pain points and the importance of collaborative efforts across academia, industry, and the research community.
- URL https://github.com/zhimin-z/Asset-Management-Topic-Modeling. https://github.com/zhimin-z/MSR-Asset-Management, https://github.com/zhimin-z/QA-Asset-Management
- URL https://github.com/topics/mlops
- URL https://github.com/topics/machine-learning-engineering
- URL https://github.com/topics/artifact-management
- URL https://github.com/topics/data-management
- URL https://github.com/topics/experiments
- URL https://github.com/topics/experiment-management
- URL https://github.com/awesome-mlops/awesome-ml-experiment-management
- URL https://github.com/topics/experiment-tracking
- URL https://github.com/topics/lifecycle-management
- URL https://github.com/topics/project-management
- URL https://github.com/topics/workflow-management
- ACM Transactions on Storage (TOS) 3(3), 9–es (2007)
- In: Proceedings of the Annual Conference on Innovative Data Systems Research (CIDR), 2021. CIDR (2021)
- Proceedings of the 12th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement (2018)
- In: 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), pp. 423–428. IEEE (2018)
- In: 2019 IEEE/ACM 41st International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP), pp. 291–300. IEEE (2019)
- In: Proceedings of the 2019 27th ACM joint meeting on european software engineering conference and symposium on the foundations of software engineering, pp. 432–442 (2019)
- arXiv preprint arXiv:1511.06435 (2015)
- In: ECIS, vol. 1 (2019)
- In: 2017 International Conference on Intelligent Computing and Control Systems (ICICCS), pp. 745–750. IEEE (2017)
- In: 2021 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), pp. 422–433. IEEE (2021)
- Electronic Notes in theoretical computer science 182, 17–32 (2007)
- Knowledge-Based Systems 232, 107489 (2021)
- Journal of the Royal statistical society: series B (Methodological) 57(1), 289–300 (1995)
- In: 2019 USENIX Conference on Operational Machine Learning (OpML 19), pp. 59–61 (2019)
- arXiv preprint arXiv:2108.07258 (2021)
- Journal of Systems and Software 146, 112–129 (2018)
- Expert Systems with Applications 202, 117232 (2022)
- Sociological methods & research 42(3), 294–320 (2013)
- In: 2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp. 283–292. IEEE (2019)
- In: Proceedings of the fourth international workshop on data management for end-to-end machine learning, pp. 1–4 (2020)
- Empirical Software Engineering (2023)
- In: Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, pp. 750–762 (2020)
- arXiv preprint arXiv:2305.15038 (2023)
- In: Proceedings of the 2017 11th Joint meeting on foundations of software engineering, pp. 186–196 (2017)
- Princeton university press (1999)
- In: 2023 IEEE/ACM 20th International Conference on Mining Software Repositories (MSR), pp. 218–222. IEEE (2023)
- Dunn, O.J.: Multiple comparisons among means. Journal of the American statistical association 56(293), 52–64 (1961)
- IEEE Security & Privacy 20(2), 96–100 (2022)
- Queue 16(4), 44–65 (2018)
- URL https://github.com/EthicalML/awesome-production-machine-learning
- SoftwareX 12, 100551 (2020)
- In: The 34th Annual ACM Symposium on User Interface Software and Technology, pp. 39–53 (2021)
- In: KDD CMI Workshop, vol. 114, pp. 1–4 (2018)
- In: Proceedings of the 3rd International Workshop on Data Management for End-to-End Machine Learning, pp. 1–4 (2019)
- arXiv preprint arXiv:2303.15056 (2023)
- Giray, G.: A software engineering perspective on engineering machine learning systems: State of the art and challenges. Journal of Systems and Software 180, 111031 (2021)
- Cloud Native Architecture and Design: A Handbook for Modern Day Architecture and Design with Enterprise-Grade Examples pp. 661–676 (2022)
- International Journal of Computer Vision 129, 1789–1819 (2021)
- arXiv preprint arXiv:2402.00838 (2024)
- Grootendorst, M.: Bertopic: Neural topic modeling with a class-based tf-idf procedure. arXiv preprint arXiv:2203.05794 (2022)
- World Scientific (2003)
- In: 2023 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER), pp. 627–638. IEEE (2023)
- Patterns 1(5) (2020)
- Springer (2009)
- arXiv preprint arXiv:2202.10169 (2022)
- In: 2019 IEEE International Conference on Cloud Engineering (IC2E), pp. 113–120. IEEE (2019)
- ACM Comput. Surv. (2022). DOI 10.1145/3543847. URL https://doi.org/10.1145/3543847. Just Accepted
- In: 2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), pp. 48–55. IEEE (2022)
- IEEE Access 7, 154300–154316 (2019)
- In: 2017 ACM/IEEE 20th International Conference on Model Driven Engineering Languages and Systems (MODELS), pp. 292–302. IEEE (2017)
- In: Proceedings of the ACM-IEEE international symposium on Empirical software engineering and measurement, pp. 29–38 (2012)
- arXiv preprint arXiv:2310.06825 (2023)
- arXiv preprint arXiv:2303.02552 (2023)
- Kelvins: awesome-mlops: A curated list of awesome mlops tools. URL https://github.com/kelvins/awesome-mlops
- In: Open Source Software: Quality Verification: 9th IFIP WG 2.13 International Conference, OSS 2013, Koper-Capodistria, Slovenia, June 25-28, 2013. Proceedings 9, pp. 61–79. Springer (2013)
- Journal of Software Maintenance: Research and Practice 11(6), 365–389 (1999)
- arXiv preprint arXiv:2007.06299 (2020)
- Proceedings of the IEEE 103(1), 14–76 (2014)
- In: Proceedings of the 2017 ACM International Conference on Management of Data, pp. 1717–1722 (2017)
- Packt Publishing Ltd (2018)
- Le, V.D.: Veml: An end-to-end machine learning lifecycle for large-scale and high-dimensional data. arXiv preprint arXiv:2304.13037 (2023)
- arXiv preprint arXiv:2401.08565 (2024)
- arXiv preprint arXiv:2303.16634 (2023)
- ” O’Reilly Media, Inc.” (2012)
- In: 11th USENIX Conference on File and Storage Technologies (FAST 13), pp. 31–44 (2013)
- Journal of network and computer applications 41, 424–440 (2014)
- McHugh, M.L.: Interrater reliability: the kappa statistic. Biochemia medica 22(3), 276–282 (2012)
- J. Open Source Softw. 2(11), 205 (2017)
- Python for high performance and scientific computing 14(9), 1–9 (2011)
- Melin, P.D.: Tackling version management and reproducibility in mlops (2023)
- In: 7th International Seminar Series on Advanced Techniques & Tools for Software Evolution (SATToSE), pp. 79–82 (2014)
- In: Proceedings of the 2nd Workshop on Human-in-the-Loop Data Analytics, pp. 1–6 (2017)
- In: 2017 IEEE 33rd International Conference on Data Engineering (ICDE), pp. 1393–1394. IEEE (2017)
- In: 2017 IEEE 33rd International Conference on Data Engineering (ICDE), pp. 571–582. IEEE (2017)
- Briefings in bioinformatics 19(6), 1236–1246 (2018)
- International Journal of Semantic Computing 14(02), 295–309 (2020)
- arXiv preprint arXiv:2304.03254 (2023)
- Journal of Systems and Software 191, 111359 (2022)
- Computers & Electrical Engineering 47, 186–203 (2015)
- Pervasive and Mobile Computing 50, 148–163 (2018)
- arXiv preprint arXiv:2001.01861 (2020)
- Artificial Intelligence Review 52, 77–124 (2019)
- In: Proceedings of the 36th International Conference on Software Maintenance and Evolution (ICSME), pp. 104–114 (2020)
- ACM Computing Surveys 55(6), 1–29 (2022)
- Empirical Software Engineering 27(2), 40 (2022)
- Ph.D. thesis, Université Paris-Saclay, FRA. (2022)
- Pearson, K.: X. on the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 50(302), 157–175 (1900)
- In: 2018 IEEE 3rd International Conference on Cloud Computing and Big Data Analysis (ICCCBDA), pp. 552–557. IEEE (2018)
- ACM SIGMOD Record 47(2), 17–28 (2018)
- In: 2022 48th Euromicro Conference on Software Engineering and Advanced Applications (SEAA), pp. 84–91. IEEE (2022)
- ACM Transactions on Software Engineering and Methodology, Submission number TOSEM-2009-0087 p. 33 (2009)
- Rochkind, M.J.: The source code control system. IEEE transactions on Software Engineering (4), 364–370 (1975)
- Empirical Software Engineering 21, 1192–1223 (2016)
- Applied Sciences 11(19), 8861 (2021)
- In: ACM/IEEE 46th International Conference on Software Engineering. ACM/IEEE (2024)
- arXiv preprint arXiv:2009.07118 (2020)
- ACM SIGMOD Record 51(4), 18–35 (2023)
- Advances in neural information processing systems 28 (2015)
- International journal of information management 36(2), 215–225 (2016)
- In: 2008 IEEE computer society conference on computer vision and pattern recognition workshops, pp. 1–8. IEEE (2008)
- Squire, M.: ”should we move to stack overflow?” measuring the utility of social media for developer support. In: 2015 IEEE/ACM 37th IEEE International Conference on Software Engineering, vol. 2, pp. 219–228. IEEE (2015)
- Storey, J.D.: A direct approach to false discovery rates. Journal of the Royal Statistical Society Series B: Statistical Methodology 64(3), 479–498 (2002)
- In: EDBT, vol. 20, pp. 474–485 (2020)
- arXiv preprint arXiv:1712.05902 (2017)
- In: 2017 IEEE International conference on data science and advanced analytics (DSAA), pp. 165–174. IEEE (2017)
- In: 2022 IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC), pp. 0453–0460. IEEE (2022)
- In: 2023 IEEE/ACM 45th International Conference on Software Engineering: Software Engineering in Practice (ICSE-SEIP), pp. 196–198. IEEE (2023)
- Tensorchord: awesome-llmops: An awesome curated list of best llmops tools for developers. URL https://github.com/tensorchord/Awesome-LLMOps
- arXiv preprint arXiv:2307.09288 (2023)
- In: Proceedings of the 33rd international conference on software engineering, pp. 804–807 (2011)
- In: Conference on systems and machine learning (sysML) (2018)
- In: 2020 IEEE International Conference on Software Maintenance and Evolution (ICSME), pp. 312–323. IEEE (2020)
- IEEE Data Eng. Bull. 41(4), 16–25 (2018)
- In: 2013 International Conference on Social Computing, pp. 188–195. IEEE (2013)
- In: 2016 IEEE International Conference on Web Services (ICWS), pp. 131–138. IEEE (2016)
- Archives of computational methods in engineering pp. 1–24 (2019)
- Information Management & Computer Security 17(1), 4–19 (2009)
- Journal of Usability Studies 4(1), 1–6 (2008)
- BMC bioinformatics 19(18), 59–69 (2018)
- IEEE Communications Surveys & Tutorials 17(1), 27–51 (2014)
- In: Proceedings of the 2021 International Conference on Management of Data, pp. 2639–2652 (2021)
- IEEE Software 38(1), 114–122 (2020)
- Machine Learning 110, 2993–3013 (2021)
- Journal of Computer Science and Technology 31, 910–924 (2016)
- arXiv preprint arXiv:2312.02003 (2023)
- IEEE Data Eng. Bull. 41(4), 39–45 (2018)
- arXiv preprint arXiv:2308.10792 (2023)