Memory-Efficient Sequential Pattern Mining with Hybrid Tries

Cited by: 0
Authors
Hosseininasab, Amin [1]
van Hoeve, Willem-Jan [2]
Cire, Andre A. [3]
Affiliations
[1] Univ Florida, Warrington Coll Business, Gainesville, FL 32611 USA
[2] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA USA
[3] Univ Toronto, Rotman Sch Management, Toronto, ON, Canada
Keywords
Sequential pattern mining; Memory efficiency; Large-scale pattern mining; Trie data set models; GENERATION;
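DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract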
This paper develops a memory-efficient approach for Sequential Pattern Mining (SPM), a fundamental topic in knowledge discovery that faces a well-known memory bottleneck for large data sets. Our methodology involves a novel hybrid trie data structure that exploits recurring patterns to compactly store the data set in memory, and a corresponding mining algorithm designed to effectively extract patterns from this compact representation. Numerical results on small to medium-sized real-life test instances show an average improvement of 85% in memory consumption and 49% in computation time compared to the state of the art. For large data sets, our algorithm stands out as the only capable SPM approach within 256 GB of system memory, potentially saving 1.7 TB in memory consumption.
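As a rough illustration of why a trie can store a sequence data set compactly, the sketch below builds a plain prefix trie over a made-up toy data set and compares the number of trie nodes with the number of items in a flat listing. It is not the hybrid trie or the mining algorithm of the paper, whose details the abstract does not give; the names `TrieNode`, `build_trie`, `count_nodes` and the toy `database` are assumptions introduced only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class TrieNode:
    """One node of a plain prefix trie (illustrative, not the paper's hybrid trie)."""
    children: dict = field(default_factory=dict)
    support: int = 0  # number of sequences whose prefix passes through this node


def build_trie(sequences):
    """Insert every sequence so that shared prefixes are stored only once."""
    root = TrieNode()
    for seq in sequences:
        node = root
        for item in seq:
            node = node.children.setdefault(item, TrieNode())
            node.support += 1
    return root


def count_nodes(node):
    """Count the item-carrying nodes below `node` (the root itself holds no item)."""
    return sum(1 + count_nodes(child) for child in node.children.values())


# Toy data set, made up for illustration: four sequences of items.
database = [
    ["a", "b", "c"],
    ["a", "b", "d"],
    ["a", "b", "c", "e"],
    ["b", "c"],
]

trie = build_trie(database)
stored_items = sum(len(seq) for seq in database)  # items in a flat listing
trie_nodes = count_nodes(trie)                    # items left after prefix sharing
print(stored_items, trie_nodes)  # prints "12 7"
```

In this toy case the three sequences starting with `a, b` share two nodes, so 12 stored items shrink to 7 trie nodes. Any saving on real data depends on how often prefixes recur.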
Pages: 29