Parallel Frequent Pattern Discovery:Challenges and Methodology

被引:1
|
作者
张宇宙
王建勇
周立柱
机构
[1] Department of Computer Science and Technology Tsinghua University
[2] Beijing 100084 China
[3] Department of Computer Science and Technology Tsinghua University
关键词
frequent pattern mining; parallel computing; dynamic load balancing;
D O I
暂无
中图分类号
TP311.52 [];
学科分类号
081202 ; 0835 ;
摘要
Parallel frequent pattern discovery algorithms exploit parallel and distributed computing resources to relieve the sequential bottlenecks of current frequent pattern mining (FPM) algorithms. Thus, parallel FPM algorithms achieve better scalability and performance, so they are attracting much attention in the data min- ing research community. This paper presents a comprehensive survey of the state-of-the-art parallel and distributed frequent pattern mining algorithms with more emphasis on pattern discovery from complex data (e.g., sequences and graphs) on various platforms. A review of typical parallel FPM algorithms uncovers the major challenges, methodologies, and research problems in the field of parallel frequent pattern discovery, such as work-load balancing, finding good data layouts, and data decomposition. This survey also indicates a dramatic shift of the research interest in the field from the simple parallel frequent itemset mining on tradi- tional parallel and distributed platforms to parallel pattern mining of more complex data on emerging archi- tectures, such as multi-core systems and the increasingly mature grid infrastructure.
引用
收藏
页码:719 / 728
页数:10
相关论文
共 50 条
  • [1] Parallel Frequent Pattern Discovery: Challenges and Methodology
    Zhang, Yuzhou
    Wang, Jianyong
    Zhou, Lizhu
    Tsinghua Science and Technology, 2007, 12 (06) : 719 - 728
  • [2] Research on Parallel Frequent Pattern Discovery Based on Ontology and Rules
    Gu, Yuhan
    Sun, Ming
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 611 - 615
  • [3] Non-iteration Parallel Algorithm for Frequent Pattern Discovery
    Liu, Chun
    Li, Yuqiang
    PROCEEDINGS OF THIRTEENTH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND APPLICATIONS TO BUSINESS, ENGINEERING AND SCIENCE, (DCABES 2014), 2014, : 127 - 132
  • [4] Scrutinizing frequent pattern discovery performance
    Zaïane, OR
    El-Hajj, M
    Li, Y
    Luk, S
    ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 1109 - 1110
  • [5] Frequent pattern discovery in online environment
    Dvorak, J
    Proceedings of the IASTED International Conference on Artificial Intelligence and Applications, 2006, : 434 - 438
  • [6] A parallel, distributed algorithm for relational frequent pattern discovery from very large data sets
    Appice, Annalisa
    Ceci, Michelangelo
    Turi, Antonio
    Malerba, Donato
    INTELLIGENT DATA ANALYSIS, 2011, 15 (01) : 69 - 88
  • [7] A frequent pattern discovery, method for Outlier detection
    He, ZY
    Xu, XF
    Huang, JZX
    Deng, SC
    ADVANCES IN WEB-AGE INFORMATION MANAGEMENT: PROCEEDINGS, 2004, 3129 : 726 - 732
  • [8] Approximate Frequent Pattern Discovery in Compressed Space
    Fukunaga, Shouhei
    Takabatake, Yoshimasa
    Tomohiro, I
    Sakamoto, Hiroshi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2018, E101D (03): : 593 - 601
  • [9] A frequent pattern discovery method for outlier detection
    He, Zengyou
    Xu, Xiaofei
    Huang, Joshua Zhexue
    Deng, Shengchun
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2004, 3129 : 726 - 732
  • [10] Abstract Representations and Generalized Frequent Pattern Discovery
    Ozkural, Eray
    ARTIFICIAL GENERAL INTELLIGENCE: 10TH INTERNATIONAL CONFERENCE, AGI 2017, 2017, 10414 : 67 - 76