CCSpan: Mining closed contiguous sequential patterns

Cited by: 47
Authors
Zhang, Jingsong [1]
Wang, Yinglin [2]
Yang, Dingyu [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept CSE, Shanghai 200030, Peoples R China
[2] Shanghai Univ Finance & Econ, Dept CST, Shanghai, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Data mining; Sequential pattern mining; Closed sequential pattern; Contiguous constraint; Closed contiguous sequential pattern; FREQUENT PATTERNS; EFFICIENT APPROACH; ALGORITHM;
DOI
10.1016/j.knosys.2015.06.014
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Closed sequential pattern mining generates a more compact yet complete result set than general sequential pattern mining. However, conventional closed sequential pattern mining algorithms still spawn a large number of inefficient and redundant patterns, especially with low support thresholds or pattern-enriched databases. Driven by the wide application of sequential patterns with a contiguous constraint, we propose CCSpan (Closed Contiguous Sequential pattern mining), an efficient algorithm for mining closed contiguous sequential patterns, which yields a much more compact pattern set while carrying the same information as closed sequential patterns. Moreover, because the patterns are shorter, closed contiguous sequential patterns are preferred for feature selection and sequence classification based on the Minimum Description Length principle. CCSpan adopts a novel snippet-growth paradigm to generate a series of snippets as candidates, each of which is attached with a set of item(s) that precisely record the pattern's occurrences in the database, and it leverages three pruning techniques to improve computational efficiency significantly. Our experiments on both sparse and dense datasets demonstrate that CCSpan is efficient and scalable in terms of both database size and support threshold. (C) 2015 Elsevier B.V. All rights reserved.
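To make the notion of a closed contiguous sequential pattern concrete, the following is a minimal brute-force sketch in Python. It is not the CCSpan algorithm: it does not use snippet-growth or the three pruning techniques mentioned in the abstract, and it assumes each sequence is a plain list of single items. The function names `contiguous_support` and `closed_contiguous_patterns` are hypothetical, chosen only for this illustration. A pattern is taken to be frequent when at least `min_sup` sequences contain it as a contiguous run, and closed when no contiguous super-pattern has the same support.

```python
from collections import defaultdict


def contiguous_support(database):
    """Map each contiguous subsequence (as a tuple) to the ids of sequences containing it."""
    occurrences = defaultdict(set)
    for sid, seq in enumerate(database):
        n = len(seq)
        for start in range(n):
            for end in range(start + 1, n + 1):
                occurrences[tuple(seq[start:end])].add(sid)
    return occurrences


def closed_contiguous_patterns(database, min_sup):
    """Return {pattern: support} for all closed contiguous patterns (brute force)."""
    support = {p: len(sids) for p, sids in contiguous_support(database).items()}
    frequent = {p: s for p, s in support.items() if s >= min_sup}

    # A pattern is non-closed if a contiguous super-pattern has equal support.
    # Checking super-patterns that are one item longer is enough, because any
    # longer super-pattern implies an intermediate one with the same support.
    non_closed = set()
    for pattern, sup in frequent.items():
        if len(pattern) < 2:
            continue
        for sub in (pattern[:-1], pattern[1:]):  # drop last item / drop first item
            if frequent.get(sub) == sup:
                non_closed.add(sub)

    return {p: s for p, s in frequent.items() if p not in non_closed}


if __name__ == "__main__":
    # Toy database, invented for this illustration only.
    db = [list("abcd"), list("abc"), list("bcd")]
    for pattern, sup in sorted(closed_contiguous_patterns(db, min_sup=2).items()):
        print("".join(pattern), sup)
```

On this toy database with `min_sup=2`, nine contiguous patterns are frequent but only three are closed (`abc`, `bc`, `bcd`), which illustrates the compactness the abstract refers to. The sketch enumerates every contiguous subsequence, so it is only suitable for small inputs.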
Pages: 1-13
Number of pages: 13
Related papers
50 in total
  • [1] Mining and visual exploration of closed contiguous sequential patterns in trajectories
    Yang, Can
    Gidofalvi, Gyozo
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2018, 32 (07) : 1282 - 1303
  • [2] Targeted mining of contiguous sequential patterns
    Hu, Kaixia
    Gan, Wensheng
    Huang, Shan
    Peng, Hao
    Fournier-Viger, Philippe
    INFORMATION SCIENCES, 2024, 653
  • [3] Fast mining of closed sequential patterns
    Department of Computer Science and Information Engineering, Tamkang University, 151 Ying-Chuan Road, Tamsui, Taipei, Taiwan
    WSEAS Trans. Comput., 2008, 3 (133-139):
  • [4] Mining Closed Sequential Patterns in Progressive Databases
    Subramanyam, R. B. V.
    Rao, A. Suresh
    Karnati, Ramesh
    Suvvari, Somaraju
    Somayajulu, D. V. L. N.
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2013, 12 (03)
  • [5] Mining Closed Sequential Patterns - A Novel Approach
    Rahaman, Sophia Banu
    Shashi, M.
    2012 6TH INTERNATIONAL CONFERENCE ON NEW TRENDS IN INFORMATION SCIENCE, SERVICE SCIENCE AND DATA MINING (ISSDM2012), 2012, : 649 - 653
  • [6] An Approach for Mining Weighted Closed Sequential Patterns
    Raju, V. Purushothama
    Varma, G. P. Saradhi
    2014 FIRST INTERNATIONAL CONFERENCE ON NETWORKS & SOFT COMPUTING (ICNSC), 2014, : 158 - 161
  • [7] Mining closed sequential patterns with time constraints
    Lin, Ming-Yen
    Hsueh, Sue-Chen
    Chang, Chia-Wen
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2008, 24 (01) : 33 - 46
  • [8] IMCS: Incremental mining of closed sequential patterns
    Chang, Lei
    Yang, Dongqing
    Wang, Tengjiao
    Tang, Shiwei
    ADVANCES IN DATA AND WEB MANAGEMENT, PROCEEDINGS, 2007, 4505 : 50 - +
  • [9] LCCspm: l-Length Closed Contiguous Sequential Patterns Mining Algorithm to Find Frequent Athlete Movement Patterns from GPS
    Adeyemo, Victor Elijah
    Palczewska, Anna
    Jones, Ben
    20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 455 - 460
  • [10] Mining Interesting and Contiguous Maximal Sequential Patterns on High Dimensional Sequences
    Ding, Jian
    Han, Meng
    2013 FIFTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2013), 2013, : 691 - 694