Memory-Efficient Sequential Pattern Mining with Hybrid Tries

被引:0
|
作者
Hosseininasab, Amin [1 ]
van Hoeve, Willem-Jan [2 ]
Cire, Andre A. [3 ]
机构
[1] Univ Florida, Warrington Coll Business, Gainesville, FL 32611 USA
[2] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA USA
[3] Univ Toronto, Rotman Sch Management, Toronto, ON, Canada
关键词
Sequential pattern mining; Memory efficiency; Large-scale pattern mining; Trie data set models; GENERATION;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper develops a memory-efficient approach for Sequential Pattern Mining (SPM), a fundamental topic in knowledge discovery that faces a well-known memory bottleneck for large data sets. Our methodology involves a novel hybrid trie data structure that exploits recurring patterns to compactly store the data set in memory; and a corresponding mining algorithm designed to effectively extract patterns from this compact representation. Numerical results on small to medium-sized real-life test instances show an average improvement of 85% in memory consumption and 49% in computation time compared to the state of the art. For large data sets, our algorithm stands out as the only capable SPM approach within 256GB of system memory, potentially saving 1.7TB in memory consumption.
引用
收藏
页数:29
相关论文
共 50 条
  • [1] Fast and Memory-Efficient Significant Pattern Mining via Permutation Testing
    Llinares-Lopez, Felipe
    Sugiyama, Mahito
    Papaxanthos, Laetitia
    Borgwardt, Karsten M.
    KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, : 725 - 734
  • [2] A memory-efficient scheme for address lookup using compact prefix tries
    Sarda, A
    Sen, A
    GLOBECOM'03: IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, VOLS 1-7, 2003, : 3943 - 3947
  • [3] Efficient weighted sequential pattern mining
    Chen, Shaotao
    Chen, Jiahui
    Wan, Shicheng
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 243
  • [4] An Efficient Approach for Mining Sequential Pattern
    Pant, Nidhi
    Kant, Surya
    Pant, Bhaskar
    Sharma, Shashi Kumar
    PROCEEDINGS OF FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2015), VOL 2, 2016, 437 : 587 - 596
  • [5] Efficient sequential pattern mining algorithms
    Ivancsy, Renata
    Vajk, Istvan
    WSEAS Transactions on Computers, 2005, 4 (02): : 96 - 101
  • [6] Hybrid memory-efficient multimatch packet classification for NIDS
    Lee, KyuHee
    Yun, SangKyun
    MICROPROCESSORS AND MICROSYSTEMS, 2015, 39 (02) : 113 - 121
  • [7] A flexible and efficient sequential pattern mining algorithm
    Lin, Jie-Ru
    Hsieh, Chia-Ying
    Yang, Don-Lin
    Wu, Jungpin
    Huang, Ming-Chuan
    International Journal of Intelligent Information and Database Systems, 2009, 3 (03) : 291 - 310
  • [8] Piranha: Fast and memory-efficient pattern matching for intrusion detection
    Antonatos, S
    Polychronakis, M
    Akritidis, P
    Anagnostakis, KG
    Markatos, EP
    SECURITY AND PRIVACY IN THE AGE OF UBIQUITOUS COMPUTING, 2005, 181 : 393 - 408
  • [9] Memory-Efficient Adaptive Test Pattern Reordering for Accurate Diagnosis
    Fang, Chenlei
    Huang, Qicheng
    Blanton, R. D.
    2021 IEEE 39TH VLSI TEST SYMPOSIUM (VTS), 2021,
  • [10] Piranha: Fast and memory-efficient pattern matching for intrusion detection
    et al; International Communication Foundation; OTSUKA CORPORATION OTSUKA CORPORATION; Support Cent. Adv. Telecommun. Technol. Res.; Systems Development Laboratory,Hitachi Ltd; The Telecommunication Advancement Foundation (Springer Science and Business Media, LLC):