Bridging Domain Knowledge and Process Discovery Using Large Language Models

Published 30 Aug 2024 in cs.AI and cs.CL | (2408.17316v1)

Abstract: Discovering good process models is essential for different process analysis tasks such as conformance checking and process improvements. Automated process discovery methods often overlook valuable domain knowledge. This knowledge, including insights from domain experts and detailed process documentation, remains largely untapped during process discovery. This paper leverages LLMs to integrate such knowledge directly into process discovery. We use rules derived from LLMs to guide model construction, ensuring alignment with both domain knowledge and actual process executions. By integrating LLMs, we create a bridge between process knowledge expressed in natural language and the discovery of robust process models, advancing process discovery methodologies significantly. To showcase the usability of our framework, we conducted a case study with the UWV employee insurance agency, demonstrating its practical benefits and effectiveness.

Abstract PDF HTML Upgrade to Chat

Citations (1)

View on Semantic Scholar

Summary

The paper introduces a domain-aware process discovery framework that translates textual process knowledge into formal constraints using LLMs.
It employs prompt engineering to extract declarative rules from natural language, refining process models under the Inductive Miner Recursive framework.
A case study at UWV demonstrates that integrating domain insights improves model fidelity and conformance checking compared to traditional methods.

Bridging Domain Knowledge and Process Discovery Using LLMs

This paper presents a framework that addresses the integration of domain knowledge into process discovery by leveraging LLMs. The authors identify the challenge of existing automated process discovery methods, which often neglect the incorporation of valuable domain expertise and documentation that are typically expressed in natural language. By utilizing LLMs, this research aims to bridge the gap between textual process knowledge and the discovery of process models, proposing a methodology that ensures alignment between domain insights and process execution data.

Summary of the Paper

The core innovation of the paper lies in the utilization of LLMs to translate textual domain knowledge into declarative constraints, which can guide the process discovery phase. The framework operates under the Inductive Miner Recursive (IMr) framework, which is designed to incrementally construct process models from event logs while pruning suboptimal structures based on the imposed rules derived from domain knowledge. The authors illustrate the feasibility of this approach through a case study with the UWV employee insurance agency.

Key Contributions

Domain-Aware Process Discovery Framework: The framework enables the encoding of process knowledge into declarative constraints that inform the model discovery process. This approach enhances traditional process discovery methods, which generally depend solely on event log data.
Rule Extraction via LLMs: Through role-promoting and prompt engineering techniques, LLMs are employed to parse and interpret process-related natural language texts into a formal set of rules. These rules can then be used to refine process models.
Case Study Implementation: The framework’s applicability is demonstrated in a real-life scenario at the UWV agency, showcasing improvements in process model fidelity compared to models generated without domain knowledge incorporation.

Implications for Process Mining

Theoretical and practical implications are evident. Theoretically, this research advances the process mining field by integrating natural language processing capabilities, enhancing the encoding of process knowledge into a machine-interpretable format. Practically, the framework offers a pathway for organizations to harness existing domain expertise effectively in improving process model discovery, leading to more accurate conformance checking and process improvement activities.

Future Directions

The study opens several avenues for further research. There is potential to expand the range of declarative constraints supported by the IMr framework, as well as to refine LLM prompting strategies to improve the accuracy of rule extraction. Additionally, future work may explore the integration of real-time feedback loops from domain experts, enhancing the interactivity and adaptability of process model discovery through continuous learning paradigms.

Conclusion

This research highlights a novel approach to process discovery by synchronizing domain knowledge with event log data, using LLMs as a bridge. By incorporating structured domain input into process models, organizations can achieve a more robust alignment of discovered models with real-world processes, offering benefits in both process accuracy and alignment with organizational objectives. This innovation marks a significant step toward more intelligent and integrated process management solutions, combining the strengths of human expertise with advanced machine learning capabilities.

Markdown

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Paper Prompts

Top Community Prompts

Explain it Like I'm 14

off on

Knowledge Gaps

off on

Practical Applications

off on

Glossary

off on

Conceptual Simplification

off on

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Generate Now

Continue Learning

We haven't generated follow-up questions for this paper yet.

Generate Now

Authors (4)

Collections

Tweets

YouTube

Show All Videos

Bridging Domain Knowledge and Process Discovery Using Large Language Models

Summary

Bridging Domain Knowledge and Process Discovery Using LLMs

Summary of the Paper

Key Contributions

Implications for Process Mining

Future Directions

Conclusion

Paper to Video (Beta)

Whiteboard

Paper Prompts

Top Community Prompts

Open Problems

Continue Learning

Related Papers

Authors (4)

Collections

Tweets

YouTube