Model Share AI: An Integrated Toolkit for Collaborative Machine Learning Model Development, Provenance Tracking, and Deployment in Python (2309.15719v1)
Abstract: Machine learning (ML) has the potential to revolutionize a wide range of research areas and industries, but many ML projects never progress past the proof-of-concept stage. To address this issue, we introduce Model Share AI (AIMS), an easy-to-use MLOps platform designed to streamline collaborative model development, model provenance tracking, and model deployment, as well as a host of other functions aiming to maximize the real-world impact of ML research. AIMS features collaborative project spaces and a standardized model evaluation process that ranks model submissions based on their performance on unseen evaluation data, enabling collaborative model development and crowd-sourcing. Model performance and various model metadata are automatically captured to facilitate provenance tracking and allow users to learn from and build on previous submissions. Additionally, AIMS allows users to deploy ML models built in Scikit-Learn, TensorFlow Keras, PyTorch, and ONNX into live REST APIs and automatically generated web apps with minimal code. The ability to deploy models with minimal effort and to make them accessible to non-technical end-users through web apps has the potential to make ML research more applicable to real-world challenges.
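The standardized evaluation step described in the abstract, in which submissions are scored on unseen evaluation data and ranked alongside their captured metadata, can be illustrated with a minimal sketch. This is not the AIMS API: the `Submission` class, the `rank_submissions` function, the toy data, and the choice of accuracy as the metric are all hypothetical and serve only to show the idea of a leaderboard kept together with model metadata.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass
class Submission:
    """Hypothetical leaderboard entry: submitter, captured metadata, and a predictor."""
    author: str
    metadata: Dict[str, str]
    predict: Callable[[Sequence[float]], int]


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of evaluation labels predicted correctly."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def rank_submissions(
    submissions: List[Submission],
    X_eval: Sequence[Sequence[float]],
    y_eval: Sequence[int],
) -> List[dict]:
    """Score every submission on the same held-out data and sort best-first."""
    rows = []
    for sub in submissions:
        preds = [sub.predict(x) for x in X_eval]
        # Keep the metadata next to the score so later users can see what was tried.
        rows.append({**sub.metadata, "author": sub.author,
                     "accuracy": accuracy(y_eval, preds)})
    return sorted(rows, key=lambda r: r["accuracy"], reverse=True)


# Toy held-out evaluation data (assumed; submitters never see y_eval).
X_eval = [[0.1], [0.4], [0.6], [0.9]]
y_eval = [0, 0, 1, 1]

submissions = [
    Submission("carol", {"model_type": "always_one"}, lambda x: 1),
    Submission("alice", {"model_type": "threshold@0.5"}, lambda x: int(x[0] > 0.5)),
    Submission("bob", {"model_type": "threshold@0.3"}, lambda x: int(x[0] > 0.3)),
]

leaderboard = rank_submissions(submissions, X_eval, y_eval)
for row in leaderboard:
    print(row)
```

Keeping the evaluation labels on the platform side and scoring every submission with the same function is what makes the ranking comparable across contributors; storing the metadata with each score is one simple way to support the provenance tracking the abstract mentions.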