Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

ML Health: Fitness Tracking for Production Models (1902.02808v1)

Published 7 Feb 2019 in cs.LG and stat.ML

Abstract: Deployment of ML algorithms in production for extended periods of time has uncovered new challenges such as monitoring and management of real-time prediction quality of a model in the absence of labels. However, such tracking is imperative to prevent catastrophic business outcomes resulting from incorrect predictions. The scale of these deployments makes manual monitoring prohibitive, making automated techniques to track and raise alerts imperative. We present a framework, ML Health, for tracking potential drops in the predictive performance of ML models in the absence of labels. The framework employs diagnostic methods to generate alerts for further investigation. We develop one such method to monitor potential problems when production data patterns do not match training data distributions. We demonstrate that our method performs better than standard "distance metrics", such as RMSE, KL-Divergence, and Wasserstein at detecting issues with mismatched data sets. Finally, we present a working system that incorporates the ML Health approach to monitor and manage ML deployments within a realistic full production ML lifecycle.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Sindhu Ghanta (3 papers)
  2. Sriram Subramanian (7 papers)
  3. Lior Khermosh (2 papers)
  4. Swaminathan Sundararaman (6 papers)
  5. Harshil Shah (10 papers)
  6. Yakov Goldberg (2 papers)
  7. Drew Roselli (2 papers)
  8. Nisha Talagala (2 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.