Predictive Business Process Monitoring with LSTM Neural Networks (1612.02130v2)

Published 7 Dec 2016 in stat.AP, cs.DB, cs.LG, cs.NE, and stat.ML

Abstract: Predictive business process monitoring methods exploit logs of completed cases of a process in order to make predictions about running cases thereof. Existing methods in this space are tailor-made for specific prediction tasks. Moreover, their relative accuracy is highly sensitive to the dataset at hand, thus requiring users to engage in trial-and-error and tuning when applying them in a specific setting. This paper investigates Long Short-Term Memory (LSTM) neural networks as an approach to build consistently accurate models for a wide range of predictive process monitoring tasks. First, we show that LSTMs outperform existing techniques to predict the next event of a running case and its timestamp. Next, we show how to use models for predicting the next task in order to predict the full continuation of a running case. Finally, we apply the same approach to predict the remaining time, and show that this approach outperforms existing tailor-made methods.

Authors (4)

Niek Tax (27 papers)
Ilya Verenich (3 papers)
Marcello La Rosa (28 papers)
Marlon Dumas (61 papers)

Citations (420)

View on Semantic Scholar

Summary

Predictive Business Process Monitoring with LSTM Neural Networks

The paper "Predictive Business Process Monitoring with LSTM Neural Networks" explores the application of Long Short-Term Memory (LSTM) neural networks in the context of predictive monitoring of business processes. The primary advantage of using LSTM models, as demonstrated in the paper, is their capability to consistently outperform existing methods tailored for specific predictive tasks across various datasets. This advantage emerges from their function as a general learning framework for sequence prediction tasks in business processes.

The authors highlight the core tasks of predictive business process monitoring, such as predicting the next activity and timestamp in an ongoing case, forecasting the continuation of a case, and estimating the remaining cycle time. Traditional approaches to these tasks typically rely on custom methods for each unique task and are highly dependent on the specifics of the dataset being used. This paper argues that LSTM neural networks can address these tasks with a single, unified model, alleviating the need for complex customizations.

Technical Approach

The paper details several architectures that utilize LSTMs to predict upcoming events in a business process. The proposed approach constructs feature vectors from the historical event logs, including both categorical and temporal attributes. These feature vectors are used as inputs for the LSTMs, allowing the model to learn the sequential dependencies inherent in business process logs.

The LSTM models are compared against baseline approaches in two main evaluations. First, they predict next activities and their corresponding timestamps. Secondly, they extend those predictions to infer entire future sequences of activities (suffix prediction) and estimate the remaining cycle time of cases.

Numerical Evaluation

The paper reports comprehensive experimental results using multiple datasets, such as helpdesk management logs and BPI challenge logs. The findings demonstrate that the LSTM models exhibit higher accuracy in predicting next activities and timestamps compared to baseline methods. Notably, prediction accuracy is reported at approximately 71% for the helpdesk log’s next activity.

For suffix prediction and remaining time estimation, the LSTM models also surpass traditional methods, achieving higher Damerau-Levenshtein Similarity for suffix prediction across different datasets. The paper further identifies edge cases, such as sequences with repeating events, where LSTMs tend to predict longer suffixes due to the repetition patterns present in the logs.

Implications and Future Work

The implications of this work are significant for the field of predictive business process monitoring. By employing LSTMs, organizations can potentially streamline their process prediction efforts without resorting to labor-intensive, task-specific model developments. This versatility could lead to more agile process monitoring systems capable of adapting to varying conditions and demands.

The paper suggests future research avenues including extending the prediction task to outcome predictions and incorporating more detailed case attributes into the feature set. Additionally, addressing the identified limitations—specifically, LSTM's difficulty with repetitive event sequences—could further enhance model performance and broaden its applicability.

In summary, this paper presents an important step in advancing predictive business process monitoring by showcasing the efficacy of LSTM neural networks in handling various sequence prediction tasks with consistent accuracy. This work lays the foundation for further investigation into generalized sequence modeling approaches and their applications in business process environments.

PDF Markdown

Related Papers

Find Related Papers