Dice Question Streamline Icon: https://streamlinehq.com

Consistency and stability of AIOps model interpretations

Determine rigorous criteria and practical methodologies to evaluate and ensure the internal consistency, external consistency, and temporal stability of explanations produced by interpretable AIOps models for incident management, specifically under (i) arrival of new data distributions, (ii) across different models that yield similar predictive performance but differing explanations, and (iii) across successive model updates over time.

Information Square Streamline Icon: https://streamlinehq.com

Background

The paper emphasizes interpretability as a core desideratum for AIOps solutions and formalizes three dimensions of explanation consistency: internal consistency across repeated trainings of the same model setup, external consistency across different models with similar performance, and time consistency across different time periods. These dimensions are introduced in the desiderata section to guide trustworthy adoption of AIOps in incident management.

In the concluding discussion, the authors explicitly note unresolved questions regarding how to achieve and assess these forms of consistency and stability in practice. Addressing these questions is crucial to build practitioner trust, integrate human-in-the-loop workflows, and ensure that explanations remain reliable as models face new data, architectures, and periodic retraining.

References

However, several critical questions persist in this regard. It remains unclear how these interpretations remain internally consistent when the model encounters new data, or how they compare externally when different models produce the same results but with differing interpretations. Additionally, the stability of model interpretations over time, particularly with updates and improvements, raises important questions.

AIOps Solutions for Incident Management: Technical Guidelines and A Comprehensive Literature Review (2404.01363 - Remil et al., 1 Apr 2024) in Section 7: Conclusion and Open Challenges