
Discovering High-Quality Process Models Despite Data Scarcity (2310.11332v1)

Published 17 Oct 2023 in cs.DB

Abstract: Process discovery algorithms learn process models from executed activity sequences, describing concurrency, causality, and conflict. Concurrent activities require observing multiple permutations, increasing data requirements, especially for processes with concurrent subprocesses such as hierarchical, composite, or distributed processes. While process discovery algorithms traditionally use sequences of activities as input, recently introduced object-centric process discovery algorithms can use graphs of activities as input, encoding partial orders between activities. As such, they contain the concurrency information of many sequences in a single graph. In this paper, we address the research question of reducing process discovery data requirements when using object-centric event logs for process discovery. We classify different real-life processes according to the control-flow complexity within and between subprocesses and introduce an evaluation framework to assess the quality of traditional and object-centric process discovery algorithms as a function of sample size. We complement this with a large-scale production process case study. Our results show reduced data requirements, enabling the discovery of large, concurrent processes such as manufacturing with little data, which was previously infeasible with traditional process discovery. Our findings suggest that object-centric process mining could revolutionize process discovery in various sectors, including manufacturing and supply chains.
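
The abstract's point that concurrency inflates data requirements can be illustrated with a small sketch. It is not the paper's method or evaluation framework: the activity names, the two-subprocess structure, and the helper `linear_extensions` are hypothetical, chosen only to show how many distinct sequences a single partial order stands for.

```python
from itertools import permutations
from math import comb


def linear_extensions(activities, precedes):
    """Return every ordering of `activities` that respects `precedes`.

    `precedes` is a list of (a, b) pairs meaning a must occur before b.
    Brute force over all permutations; only suitable for tiny examples.
    """
    result = []
    for perm in permutations(activities):
        position = {act: i for i, act in enumerate(perm)}
        if all(position[a] < position[b] for a, b in precedes):
            result.append(perm)
    return result


# Hypothetical process: two concurrent subprocesses, each a chain of three
# activities. Only the order inside each chain is constrained.
activities = ["a1", "a2", "a3", "b1", "b2", "b3"]
precedes = [("a1", "a2"), ("a2", "a3"), ("b1", "b2"), ("b2", "b3")]

sequences = linear_extensions(activities, precedes)

# The interleavings of two chains of length 3 number C(6, 3).
print(len(sequences), comb(6, 3))
```

Under these assumptions, one graph with four ordering constraints corresponds to 20 distinct activity sequences, and a sequence-based log would need to contain enough of them to reveal the concurrency. The count grows combinatorially as the concurrent chains get longer.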

Authors (9)
  1. Jan Niklas Adams (8 papers)
  2. Jari Peeperkorn (8 papers)
  3. Tobias Brockhoff (2 papers)
  4. Isabelle Terrier (1 paper)
  5. Heiko Göhner (1 paper)
  6. Merih Seran Uysal (12 papers)
  7. Seppe vanden Broucke (6 papers)
  8. Jochen De Weerdt (22 papers)
  9. Wil M. P. van der Aalst (116 papers)
Citations (1)
