Parallel Frequent Pattern Discovery:Challenges and Methodology

被引:1
|
作者
张宇宙
王建勇
周立柱
机构
[1] Department of Computer Science and Technology Tsinghua University
[2] Beijing 100084 China
[3] Department of Computer Science and Technology Tsinghua University
关键词
frequent pattern mining; parallel computing; dynamic load balancing;
D O I
暂无
中图分类号
TP311.52 [];
学科分类号
081202 ; 0835 ;
摘要
Parallel frequent pattern discovery algorithms exploit parallel and distributed computing resources to relieve the sequential bottlenecks of current frequent pattern mining (FPM) algorithms. Thus, parallel FPM algorithms achieve better scalability and performance, so they are attracting much attention in the data min- ing research community. This paper presents a comprehensive survey of the state-of-the-art parallel and distributed frequent pattern mining algorithms with more emphasis on pattern discovery from complex data (e.g., sequences and graphs) on various platforms. A review of typical parallel FPM algorithms uncovers the major challenges, methodologies, and research problems in the field of parallel frequent pattern discovery, such as work-load balancing, finding good data layouts, and data decomposition. This survey also indicates a dramatic shift of the research interest in the field from the simple parallel frequent itemset mining on tradi- tional parallel and distributed platforms to parallel pattern mining of more complex data on emerging archi- tectures, such as multi-core systems and the increasingly mature grid infrastructure.
引用
收藏
页码:719 / 728
页数:10
相关论文
共 50 条
  • [21] Parallel and Distributed Frequent Pattern Mining in Large Databases
    Tanbeer, Syed Khairuzzaman
    Ahmed, Chowdhury Farhan
    Jeong, Byeong-Soo
    HPCC: 2009 11TH IEEE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS, 2009, : 407 - 414
  • [22] Pattern discovery and detection: A unified statistical methodology
    Hand, DJ
    Bolton, RJ
    JOURNAL OF APPLIED STATISTICS, 2004, 31 (08) : 885 - 924
  • [23] Automata Theory Approach for Solving Frequent Pattern Discovery Problems
    Ivancsy, Renata
    Vajk, Istvan
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 8, 2005, 8 : 203 - 208
  • [24] Frequent pattern discovery from OWL DLP knowledge bases
    Jozefowska, Joanna
    Lawrynowicz, Agnieszka
    Lukaszewski, Tomasz
    MANAGING KNOWLEDGE IN A WORLD OF NETWORKS, PROCEEDINGS, 2006, 4248 : 287 - 302
  • [25] Pattern Discovery in Conceptual Models Using Frequent Itemset Mining
    Fumagalli, Mattia
    Sales, Tiago Prince
    Guizzardi, Giancarlo
    CONCEPTUAL MODELING (ER 2022), 2022, 13607 : 52 - 62
  • [26] Frequent Pattern Discovery from a Single Graph with Quantitative Itemsets
    Miyoshi, Yuuki
    Ozaki, Tomonobu
    Ohkawa, Takenao
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 527 - +
  • [27] Frequent Pattern Discovery in Multiple Biological Networks: Patterns and Algorithms
    Li W.
    Hu H.
    Huang Y.
    Li H.
    Mehan M.R.
    Nunez-Iglesias J.
    Xu M.
    Yan X.
    Zhou X.J.
    Statistics in Biosciences, 2012, 4 (1) : 157 - 176
  • [28] A parallel algorithm for pattern discovery in biological sequences
    Mauri, G
    Pavesi, G
    FUTURE GENERATION COMPUTER SYSTEMS, 2002, 18 (06) : 849 - 854
  • [29] On Pattern-Based Programming towards the Discovery of Frequent Patterns
    Kerdprasop, Kittisak
    Kerdprasop, Nittaya
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 26, PARTS 1 AND 2, DECEMBER 2007, 2007, 26 : 472 - +
  • [30] Frequent pattern discovery without binarization: Mining attribute profiles
    Gyenesei, Attila
    Schlapbach, Ralph
    Stolte, Etzard
    Wagner, Ulrich
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2006, PROCEEDINGS, 2006, 4213 : 528 - 535