Validation set sampling strategies for predictive process monitoring

被引:2
|
作者
Peeperkorn, Jari [1 ]
vanden Broucke, Seppe [1 ,2 ]
De Weerdt, Jochen [1 ]
机构
[1] Katholieke Univ Leuven, Res Ctr Informat Syst Engn LIRIS, Leuven, Belgium
[2] Univ Ghent, Dept Business Informat & Operat Management, Ghent, Belgium
基金
欧盟地平线“2020”;
关键词
Process mining; Predictive process monitoring; LSTM; Generalization; Validation set; Log completeness;
D O I
10.1016/j.is.2023.102330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous studies investigating the efficacy of long short-term memory (LSTM) recurrent neural networks in predictive process monitoring and their ability to capture the underlying process structure have raised concerns about their limited ability to generalize to unseen behavior. Event logs often fail to capture the full spectrum of behavior permitted by the underlying processes. To overcome these challenges, this study introduces innovative validation set sampling strategies based on control-flow variant-based resampling. These strategies have undergone extensive evaluation to assess their impact on hyperparameter selection and early stopping, resulting in notable enhancements to the generalization capabilities of trained LSTM models. In addition, this study expands the experimental framework to enable accurate interpretation of underlying process models and provide valuable insights. By conducting experiments with event logs representing process models of varying complexities, this research elucidates the effectiveness of the proposed validation strategies. Furthermore, the extended framework facilitates investigations into the influence of event log completeness on the learning quality of predictive process models. The novel validation set sampling strategies proposed in this study facilitate the development of more effective and reliable predictive process models, ultimately bolstering generalization capabilities and improving the understanding of underlying process dynamics.
引用
收藏
页数:23
相关论文
共 50 条
  • [21] Monitoring the process location by using new ranked set sampling-based memory control charts
    Nawaz, Tahir
    Han, Dong
    QUALITY TECHNOLOGY AND QUANTITATIVE MANAGEMENT, 2020, 17 (03): : 255 - 284
  • [22] EFFICIENT SEQUENTIAL SAMPLING STRATEGIES FOR ENVIRONMENTAL MONITORING
    MUKHOPADHYAY, N
    BENDEL, RB
    NIKOLAIDIS, NP
    CHATTOPADHYAY, S
    WATER RESOURCES RESEARCH, 1992, 28 (09) : 2245 - 2256
  • [23] DEVELOPMENT OF TEMPORAL SAMPLING STRATEGIES FOR MONITORING NOISE
    DEVOR, RE
    SCHOMER, PD
    KLINE, WA
    NEATHAMER, RD
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 66 (03): : 763 - 771
  • [24] Comparing sampling strategies in forest monitoring programs
    Swiss Fed. Inst. Forest, Snow L., 8903-Birmensdorf, Switzerland
    Forest Ecol Manage, 1-3 (231-238):
  • [25] Channel sampling strategies for monitoring wireless networks
    Deshpande, Udayan
    Henderson, Tristan
    Kotz, David
    2006 4TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC AND WIRELESS NETWORKS, VOLS 1 AND 2, 2006, : 423 - +
  • [26] SAMPLING STRATEGIES FOR MONITORING NOISE IN THE VICINITY OF AIRPORTS
    SCHOMER, PD
    DEVOR, RE
    KLINE, WA
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1983, 73 (06): : 2041 - 2050
  • [27] Comparing sampling strategies in forest monitoring programs
    Ghosh, S
    Innes, JL
    FOREST ECOLOGY AND MANAGEMENT, 1996, 82 (1-3) : 231 - 238
  • [28] Sampling strategies for monitoring lameness in dairy cattle
    Main, D. C. J.
    Barker, Z. E.
    Leach, K. A.
    Bell, N. J.
    Whay, H. R.
    Browne, W. J.
    JOURNAL OF DAIRY SCIENCE, 2010, 93 (05) : 1970 - 1978
  • [29] Roundup on bioprocess validation issues - Strategies for process validation
    Calcott, P
    GENETIC ENGINEERING NEWS, 2000, 20 (01): : 36 - +
  • [30] Temporal stability in predictive process monitoring
    Teinemaa, Irene
    Dumas, Marlon
    Leontjeva, Anna
    Maggi, Fabrizio Maria
    DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 32 (05) : 1306 - 1338