Challenges in Deploying Machine Learning: a Survey of Case Studies (2011.09926v3)

Published 18 Nov 2020 in cs.LG

Abstract: In recent years, machine learning has transitioned from a field of academic research interest to a field capable of solving real-world business problems. However, the deployment of machine learning models in production systems can present a number of issues and concerns. This survey reviews published reports of deploying machine learning solutions in a variety of use cases, industries and applications and extracts practical considerations corresponding to stages of the machine learning deployment workflow. By mapping found challenges to the steps of the machine learning deployment workflow we show that practitioners face issues at each stage of the deployment process. The goal of this paper is to lay out a research agenda to explore approaches addressing these challenges.

Citations (336)

View on Semantic Scholar

Summary

The paper identifies key deployment challenges across data management, model learning, verification, and production integration.
It details obstacles including resource constraints, inadequate testing, and regulatory compliance that affect model performance and trust.
The study underscores the need for scalable, interdisciplinary approaches to overcome ML deployment hurdles in practical applications.

Challenges in Deploying Machine Learning: A Comprehensive Survey

Machine learning (ML) has become a pivotal tool across a variety of industries, moving beyond academic curiosity to solve real-world business problems. However, implementing ML models in production environments remains fraught with challenges. The paper "Challenges in Deploying Machine Learning: a Survey of Case Studies" offers a detailed examination of these challenges, focusing on the deployment workflow and common obstacles practitioners face.

Overview of Machine Learning Deployment Challenges

The deployment of ML models encompasses several stages, each with unique challenges. This paper categorizes these challenges into four primary stages: data management, model learning, model verification, and model deployment. It also considers cross-cutting aspects that impact all stages, including ethics, law, end users' trust, and security.

Data Management: Data is a crucial component of any ML-based solution. Data discovery and collection are often challenged by dispersion and inconsistencies across multiple sources. Furthermore, preprocessing steps like data cleaning and augmentation are resource-intensive and often hindered by a lack of labeled data and access to domain experts. Robust data profiling tools are essential to address biases and distribution shifts.

Model Learning: This stage is fraught with issues like selecting the optimal model in resource-constrained environments, managing computational costs and environmental impacts during training, and negotiating privacy concerns. Hyper-parameter selection is an ongoing struggle due to the expansive search space and hardware constraints.

Model Verification: Ensuring that ML models meet defined functional and business-driven requirements is complex. Formal verification is rare, but regulatory frameworks offer guidelines for model evaluation, especially in sectors like finance. Test-based verification often employs simulations, which can fall short of replicating real-world conditions, underscoring the importance of robust testing suites.

Model Deployment: Integration into existing infrastructures poses software and organizational challenges. Models require constant monitoring to detect concept drift and maintain relevance. Updating models introduces risks of backward incompatibility, thus necessitating a seamless continuous delivery pipeline.

Cross-Cutting Aspects

Several aspects influence all phases of ML deployment:

Ethics: Issues such as bias, fairness, and accountability are crucial to address. Ethical considerations must inform all stages to avoid reinforcing existing societal biases.
Law: Regulatory compliance is essential, especially in sensitive areas like healthcare. Existing laws are often inadequate for the rapid advancements in ML, necessitating adaptive policies.
End-Users' Trust: Engagement and transparency are key to fostering trust among end users. Explainability and user experience design are critical success factors in ML adoption.
Security: ML systems are prone to unique adversarial attacks like data poisoning, model inversion, and model stealing, highlighting the need for robust security measures.

Implications and Future Directions

Addressing these challenges requires both technical and strategic approaches. From an academic perspective, there's an opportunity to develop tools, services, and holistic frameworks tailored to the intricacies of ML deployment. For practitioners, expanding the dialogue with other disciplines, such as software engineering and human-computer interaction, may yield valuable insights.

Given this landscape, the academic community is encouraged to focus on scalable solutions that make ML deployment accessible to diverse industries. This entails fostering cross-disciplinary collaborations and creating comprehensive toolsets and guidelines that streamline the ML deployment process.

Overall, while machine learning offers substantial potential for innovation across different sectors, realizing this potential depends on effectively navigating the deployment challenges outlined in this comprehensive survey.

PDF Markdown