Memory-Efficient Sequential Pattern Mining with Hybrid Tries

Authors
Hosseininasab, Amin [1]
van Hoeve, Willem-Jan [2]
Cire, Andre A. [3]
Affiliations
[1] Univ Florida, Warrington Coll Business, Gainesville, FL 32611 USA
[2] Carnegie Mellon Univ, Tepper Sch Business, Pittsburgh, PA USA
[3] Univ Toronto, Rotman Sch Management, Toronto, ON, Canada
Keywords
Sequential pattern mining; Memory efficiency; Large-scale pattern mining; Trie data set models; GENERATION
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Subject classification code
0812;
Abstract
This paper develops a memory-efficient approach for Sequential Pattern Mining (SPM), a fundamental topic in knowledge discovery that faces a well-known memory bottleneck for large data sets. Our methodology involves a novel hybrid trie data structure that exploits recurring patterns to store the data set compactly in memory, and a corresponding mining algorithm designed to extract patterns effectively from this compact representation. Numerical results on small to medium-sized real-life test instances show an average improvement of 85% in memory consumption and 49% in computation time compared to the state of the art. For large data sets, our algorithm stands out as the only SPM approach capable of operating within 256 GB of system memory, potentially saving 1.7 TB in memory consumption.
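As a rough illustration of why a trie can store a sequence data set compactly, the following minimal sketch builds a plain prefix trie in which sequences that share a prefix share nodes. This is not the hybrid trie or the mining algorithm of the paper, whose details are not given in this record. The names `TrieNode`, `build_trie`, `count_nodes`, and `prefix_support`, as well as the toy data, are assumptions made for illustration only.

```python
from dataclasses import dataclass, field


@dataclass
class TrieNode:
    # Hypothetical node layout: child nodes keyed by item, plus a counter of
    # how many input sequences pass through this node.
    children: dict = field(default_factory=dict)
    count: int = 0


def build_trie(sequences):
    """Insert every sequence into a prefix trie; shared prefixes share nodes."""
    root = TrieNode()
    for seq in sequences:
        node = root
        for item in seq:
            node = node.children.setdefault(item, TrieNode())
            node.count += 1
    return root


def count_nodes(node):
    """Number of nodes below `node` (the root itself is not counted)."""
    return sum(1 + count_nodes(child) for child in node.children.values())


def prefix_support(root, prefix):
    """Number of sequences that start with the non-empty `prefix`.

    Note: this is prefix support only, not the subsequence support used
    in sequential pattern mining.
    """
    node = root
    for item in prefix:
        node = node.children.get(item)
        if node is None:
            return 0
    return node.count


# Toy data set (illustrative only).
sequences = [["a", "b", "c"], ["a", "b", "d"], ["a", "c"], ["b", "c"]]
root = build_trie(sequences)
stored_items = sum(len(s) for s in sequences)  # items in the flat data set
trie_nodes = count_nodes(root)                 # nodes needed by the trie
```

On this toy input the flat representation holds 10 items while the trie needs 7 nodes, because the prefix `a, b` is stored once. The saving grows with the amount of prefix overlap in the data.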
Pages: 29