Model Share AI: An Integrated Toolkit for Collaborative Machine Learning Model Development, Provenance Tracking, and Deployment in Python (2309.15719v1)

Published 27 Sep 2023 in cs.SE, cs.AI, and cs.LG

Abstract: Machine learning (ML) has the potential to revolutionize a wide range of research areas and industries, but many ML projects never progress past the proof-of-concept stage. To address this issue, we introduce Model Share AI (AIMS), an easy-to-use MLOps platform designed to streamline collaborative model development, model provenance tracking, and model deployment, as well as a host of other functions aiming to maximize the real-world impact of ML research. AIMS features collaborative project spaces and a standardized model evaluation process that ranks model submissions based on their performance on unseen evaluation data, enabling collaborative model development and crowd-sourcing. Model performance and various model metadata are automatically captured to facilitate provenance tracking and allow users to learn from and build on previous submissions. Additionally, AIMS allows users to deploy ML models built in Scikit-Learn, TensorFlow Keras, PyTorch, and ONNX into live REST APIs and automatically generated web apps with minimal code. The ability to deploy models with minimal effort and to make them accessible to non-technical end-users through web apps has the potential to make ML research more applicable to real-world challenges.
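
As a rough illustration of the standardized evaluation idea described in the abstract, the sketch below ranks model submissions by accuracy on held-out evaluation data and carries each submission's metadata along for provenance. It is a minimal sketch in plain Python and does not use the AIMS library; the names `Submission`, `accuracy`, and `build_leaderboard`, the toy data, and the choice of accuracy as the ranking metric are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Sequence


@dataclass
class Submission:
    """Hypothetical container for one submitted model and its metadata."""
    name: str
    predict: Callable[[Sequence[float]], int]  # maps one feature row to a label
    metadata: Dict[str, str] = field(default_factory=dict)


def accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Fraction of predictions that match the true labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def build_leaderboard(
    submissions: List[Submission],
    X_eval: Sequence[Sequence[float]],
    y_eval: Sequence[int],
) -> List[dict]:
    """Score every submission on the same unseen data, best first."""
    rows = []
    for sub in submissions:
        preds = [sub.predict(x) for x in X_eval]
        # Keep metadata next to the score so later users can see what was tried.
        rows.append({"name": sub.name, "accuracy": accuracy(y_eval, preds), **sub.metadata})
    return sorted(rows, key=lambda row: row["accuracy"], reverse=True)


# Toy evaluation set that submitters never see (assumed data).
X_eval = [[0.1], [0.4], [0.6], [0.9]]
y_eval = [0, 0, 1, 1]

submissions = [
    Submission("always_one", lambda x: 1, {"framework": "none"}),
    Submission("threshold_0.5", lambda x: int(x[0] > 0.5), {"framework": "none"}),
]

leaderboard = build_leaderboard(submissions, X_eval, y_eval)
for rank, row in enumerate(leaderboard, start=1):
    print(rank, row["name"], row["accuracy"])
```

Scoring every submission against one shared, hidden evaluation set is what makes the ranks comparable across contributors; a real platform would also need to version the evaluation data and store the submitted model artifacts.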
