Process discovery from event data: Relating models and logs through abstractions

Cited by: 34
Authors
van der Aalst, Wil M. P. [1]
Affiliation
[1] Rhein Westfal TH Aachen, Proc & Data Sci PADS, Aachen, Germany
Keywords
business process management; data science; process discovery; process mining; process modeling; MINING PROCESS MODELS; OF-THE-ART;
DOI
10.1002/widm.1244
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Event data are collected in logistics, manufacturing, finance, health care, customer relationship management, e-learning, e-government, and many other domains. The events found in these domains typically refer to activities executed by resources at particular times and for a particular case (i.e., a process instance). Process mining techniques are able to exploit such data. In this article, we focus on process discovery. However, process mining also includes conformance checking, performance analysis, decision mining, organizational mining, predictions, recommendations, and so on. These techniques help to diagnose problems and improve processes. All process mining techniques involve both event data and process models. Therefore, a typical first step is to automatically learn a control-flow model from the event data. This is very challenging, but in recent years, many powerful discovery techniques have been developed. It is not easy to compare these techniques since they use different representations and make different assumptions. Users often need to resort to trying different algorithms in an ad-hoc manner. Developers of new techniques are often trying to solve specific instances of a more general problem. Therefore, we aim to unify existing approaches by focusing on log and model abstractions. These abstractions link observed and modeled behavior: Concrete behaviors recorded in event logs are related to possible behaviors represented by process models. Hence, such behavioral abstractions provide an interface between both of them. We discuss four discovery approaches involving three abstractions and different types of process models (Petri nets, block-structured models, and declarative models). The goal is to provide a comprehensive understanding of process discovery and show how to develop new techniques. Examples illustrate the different approaches and pointers to software are given.
The discussion of abstractions and process representations also reflects on the gap between the process mining literature and commercial process mining tools. This helps users select an appropriate process discovery technique. Moreover, structuring the role of internal abstractions and representations helps broaden the view and facilitates the creation of new discovery approaches. This article is categorized under: Algorithmic Development > Spatial and Temporal Data Mining; Application Areas > Business and Industry; Technologies > Machine Learning; Application Areas > Data Mining Software Tools
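As a rough illustration of what a log abstraction can look like, the sketch below computes a directly-follows relation from a small event log. The abstract does not name its three abstractions, so choosing the directly-follows relation here is an assumption; the function name `directly_follows` and the toy log are hypothetical and are not taken from the article.

```python
from collections import Counter


def directly_follows(log):
    """Count how often activity a is immediately followed by activity b.

    `log` is a list of traces; each trace is the ordered list of
    activities observed for one case (process instance).
    """
    counts = Counter()
    for trace in log:
        # Pair every activity with its direct successor in the same trace.
        for a, b in zip(trace, trace[1:]):
            counts[(a, b)] += 1
    return counts


# Hypothetical toy log with three cases (illustration only).
log = [
    ["a", "b", "c", "d"],
    ["a", "c", "b", "d"],
    ["a", "e", "d"],
]

dfg = directly_follows(log)
for (a, b), n in sorted(dfg.items()):
    print(f"{a} -> {b}: {n}")
```

The same relation can, in principle, be derived from the behavior a process model allows, which is the sense in which such an abstraction can act as an interface between observed and modeled behavior.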
Pages: 21
Related papers
50 items in total
  • [31] Lucent Process Models and Translucent Event Logs
    van der Aalst, Wil M. P.
    FUNDAMENTA INFORMATICAE, 2019, 169 (1-2) : 151 - 177
  • [32] Process scenario discovery from event logs based on activity and timing information
    Zhang, Zhenyu
    Johnson, Caleb
    Venkatasubramanian, Nalini
    Ren, Shangping
    JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 125
  • [33] Mining process models from event logs in distributed bioinformatics workflows
    Xing, Jianchuan
    Li, Zhishu
    Cheng, Yanhong
    Yin, Feng
    Li, Baolin
    Chen, Li
    PROCEEDINGS OF THE FIRST INTERNATIONAL SYMPOSIUM ON DATA, PRIVACY, AND E-COMMERCE, 2007, : 8 - +
  • [34] Data is Moody: Discovering Data Modification Rules from Process Event Logs
    Schuster, Marco Bjarne
    Wiegand, Boris
    Vreeken, Jilles
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT II, ECML PKDD 2024, 2024, 14942 : 285 - 302
  • [35] Process Activity Ontology Learning From Event Logs Through Gamification
    Sadeghianasl, Sareh
    Ter Hofstede, Arthur H. M.
    Wynn, Moe Thandar
    Turkay, Selen
    Myers, Trina
    IEEE ACCESS, 2021, 9 : 165865 - 165880
  • [36] The impact of biased sampling of event logs on the performance of process discovery
    Mohammadreza Fani Sani
    Sebastiaan J. van Zelst
    Wil M. P. van der Aalst
    Computing, 2021, 103 : 1085 - 1104
  • [37] The impact of biased sampling of event logs on the performance of process discovery
    Fani Sani, Mohammadreza
    van Zelst, Sebastiaan J.
    van der Aalst, Wil M. P.
    COMPUTING, 2021, 103 (06) : 1085 - 1104
  • [38] Extracting Event Logs for Process Mining from Data Stored on the Blockchain
    Muehlberger, Roman
    Bachhofner, Stefan
    Di Ciccio, Claudio
    Garcia-Banuelos, Luciano
    Lopez-Pintado, Orlenys
    BUSINESS PROCESS MANAGEMENT WORKSHOPS (BPM 2019), 2019, 362 : 690 - 703
  • [39] Cross-Department Collaborative Healthcare Process Model Discovery From Event Logs
    Liu, Cong
    Li, Huiling
    Zhang, Shuaipeng
    Cheng, Long
    Zeng, Qingtian
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 2115 - 2125
  • [40] Discovery of clinical pathway patterns from event logs using probabilistic topic models
    Huang, Zhengxing
    Dong, Wei
    Ji, Lei
    Gan, Chenxi
    Lu, Xudong
    Duan, Huilong
    JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 47 : 39 - 57