A Scalable Analytical Framework for Complex Event Episode Mining With Various Domains Applications

被引:0
|
作者
Tseng, Jerry C. C. [1 ]
Hsieh, Sun-Yuan [1 ]
Tseng, Vincent S. [2 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 701, Taiwan
[2] Natl Yang Ming Chiao Tung Univ, Dept Comp Sci, Hsinchu 300, Taiwan
关键词
Complex event sequence; data stream; episode pattern mining; incremental mining; lambda architecture; SEQUENTIAL PATTERNS; FREQUENT EPISODES; LARGE DATABASES; DATA STREAM; PREFIXSPAN; DISCOVERY; MODEL;
D O I
10.1109/ACCESS.2022.3228962
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the ubiquity of sensor networks and smart devices that continuously collect data, we face the challenge of analyzing the growing stream of data in real time. In recent years, there has been a huge need to gain useful knowledge by incrementally analyzing event sequence data. Although episode pattern mining techniques have existed for years, people have recently become more aware of their practical value in solving real-life domain problems such as manufacturing records, stock markets, and weather forecasts. The effective and efficient application of episode pattern mining techniques to analyze complex event data is becoming increasingly important for solving real-life problems in wide domains. However, few studies have focused on developing a scalable framework based on episode pattern mining of complex event sequences for applications in various domains. In this work, we propose a novel framework named SAAF (Scalable Analytical Application Framework) based on complex event episode mining techniques, including batch episode mining, delta episode mining, incremental episode mining, and pattern merging, to consider both efficiency and accuracy. Moreover, to enhance scalability, we adopt the lambda architecture with Apache Spark and Apache Spark Streaming as the system development framework. Finally, the experimental results on three real datasets of different domains and two benchmark datasets showed that the proposed SAAF framework exhibits excellent performance in terms of efficiency, accuracy, and scalability.
引用
收藏
页码:130672 / 130685
页数:14
相关论文
共 14 条
  • [1] A Scalable Complex Event Analytical System with Incremental Episode Mining over Data Streams
    Tseng, Jerry C. C.
    Gu, Jia-Yuan
    Tseng, Vincent S.
    Wang, P. F.
    Chen, Ching-Yu
    Li, Chu-Feng
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 648 - 655
  • [2] An analytical framework for event mining in video data
    Maryam Koohzadi
    Mohammad Reza Keyvanpour
    Artificial Intelligence Review, 2014, 41 : 401 - 413
  • [3] An analytical framework for event mining in video data
    Koohzadi, Maryam
    Keyvanpour, Mohammad Reza
    ARTIFICIAL INTELLIGENCE REVIEW, 2014, 41 (03) : 401 - 413
  • [4] A Scalable Complex Pattern Mining Framework for Global Settlement Mapping
    Vatsavai, Ranga Raju
    2015 IEEE INTERNATIONAL CONGRESS ON BIG DATA - BIGDATA CONGRESS 2015, 2015, : 514 - 521
  • [5] A Review Of Text Mining Techniques: Trends, and Applications In Various Domains
    Aleqabie H.J.
    Sfoq M.S.
    Albeer R.A.
    Abd E.H.
    Iraqi Journal for Computer Science and Mathematics, 2024, 5 (01): : 125 - 141
  • [6] Large-Scale Frequent Episode Mining from Complex Event Sequences with Hierarchies
    Ao, Xiang
    Shi, Haoran
    Wang, Jin
    Zuo, Luo
    Li, Hongwei
    He, Qing
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2019, 10 (04)
  • [7] A Novel Complex-Events Analytical System Using Episode Pattern Mining Techniques
    Tseng, Jerry C. C.
    Gu, Jia-Yuan
    Wang, P. F.
    Chen, Ching-Yu
    Tseng, Vincent S.
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING TECHNIQUES, ISCIDE 2015, PT II, 2015, 9243 : 487 - 498
  • [8] Mining complex clinical data for patient safety research: a framework for event discovery
    Hripcsak, G
    Bakken, S
    Stetson, PD
    Patel, VL
    JOURNAL OF BIOMEDICAL INFORMATICS, 2003, 36 (1-2) : 120 - 130
  • [9] A Framework for Mining Signatures from Event Sequences and Its Applications in Healthcare Data
    Wang, Fei
    Lee, Noah
    Hu, Jianying
    Sun, Jimeng
    Ebadollahi, Shahram
    Laine, Andrew F.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (02) : 272 - 285
  • [10] A framework of mining semantic-based probabilistic event relations for complex activity recognition
    Liu, Li
    Wang, Shu
    Su, Guoxin
    Hu, Bin
    Peng, Yuxin
    Xiong, Qingyu
    Wen, Junhao
    INFORMATION SCIENCES, 2017, 418 : 13 - 33