A User Evaluation of Automated Process Discovery Algorithms (1806.03150v1)

Published 8 Jun 2018 in cs.SE

Abstract: Process mining methods allow analysts to use logs of historical executions of business processes in order to gain knowledge about the actual behavior of these processes. One of the most widely studied process mining operations is automated process discovery. An event log is taken as input by an automated process discovery method and produces a business process model as output that captures the control-flow relations between tasks that are described by the event log. In this setting, this paper provides a systematic comparative evaluation of existing implementations of automated process discovery methods with domain experts by using a real-life event log extracted from an international software engineering company and four quality metrics. The evaluation results highlight gaps and unexplored trade-offs in the field and allow researchers to improve the lacks in the automated process discovery methods in terms of usability of process discovery techniques in industry.

Authors (5)

Fabrizio Maria Maggi (30 papers)
Andrea Marrella (9 papers)
Fredrik Milani (8 papers)
Allar Soo (2 papers)
Silva Kasela (1 paper)

Citations (1)

View on Semantic Scholar

Summary

A User Evaluation of Automated Process Discovery Algorithms

The paper "A User Evaluation of Automated Process Discovery Algorithms" conducts a systematic and insightful empirical evaluation of process discovery methods, a fundamental component of process mining. It extends the understanding of how existing automated process discovery algorithms can be enhanced from both usability and domain-specific applicability perspectives. Using a real-world event log from an international software engineering company, the paper benchmarks and evaluates several algorithms by involving domain experts from the business world.

Problem Context and Objective

In the digital age, organizations face an imperative to optimize their business processes, shifting from manual analysis to data-driven process mining techniques. Automated process discovery plays a crucial role within this paradigm, converting event logs into process models with the aim of capturing intricate control-flow relations. However, existing methodologies often generate overly complex models that may lack the balance needed in four key dimensions: fitness, generalization, precision, and simplicity. This complex nature of models often renders them impractical for business stakeholders.

The paper primarily addresses the question of identifying which discovery algorithm is favored by domain experts when applied in a real-world context and evaluated on several quality metrics. This investigation aids in bridging the gap between technical development and practical, cross-domain applicability of these models in industry settings.

Methodology and Implementation

The research employs a Systematic Literature Review (SLR) to identify and classify significant contributions in process discovery, scoping the search on studies post-2012 following prior evaluations. The selected algorithms were then applied to a log dataset from a software company’s issue tracking system, processed to derive actionable insights.

The analyzed methods were selected based on their accessibility and compatibility with BPMN modeling—allowing evaluation by non-technical business stakeholders. These included advancements such as the Evolutionary Tree Miner, Structured Miner, and Inductive Miner, each evaluated for behavioral soundness, discovery efficiency, and compliance with industry standards.

Key Findings and Results

Quantitative evaluation was centered around critical complexity metrics and the algorithmic efficiency in handling real-life chains of events. The reported analysis shows substantial variability among discovery methods in terms of their generated model comprehensibility and utility for process evaluation tasks.

Model Complexity: Models varied significantly in node count, CNC, and density, with smaller models being generally perceived as clearer and easier to interpret by domain experts.
Domain Expert Feedback: Involving industry participants provided qualitative insights into model usability, emphasizing requirements for simpler, annotated models that accurately represent business operations.

Interestingly, while measures of correctness and precision were pivotal in assessing perceived generalization, feedback stressed the models' augmentation with frequency and performance data to enhance analytical utility.

Implications and Future Directions

Conclusively, the paper illuminates the prevailing disconnect between technical advancements in process discovery and their alignment with business usability standards. This disparity earmarks a frontier for research directed at enhancing model interpretability and integrating meaningful business data overlays. Future developments might explore algorithmic refinements that prioritize behavioral correctness while automating process decomposition into more digestible subprocesses.

This research sets a precedent for continual improvements tailored to industry requirements, moving beyond mere academia-centric developments to encompass comprehensive enterprise applicability and optimization capabilities.

PDF Markdown