MaPle:: A fast algorithm for maximal pattern-based clustering

被引:0
|
作者
Pei, J [1 ]
Zhang, XL [1 ]
Cho, MJ [1 ]
Wang, HX [1 ]
Yu, PS [1 ]
机构
[1] SUNY Buffalo, Buffalo, NY 14260 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pattern-based clustering is important in many applications, such as DNA micro-array data analysis, automatic recommendation systems and target marketing systems. However pattern-based clustering in large databases is challenging. On the one hand, there can be a huge number of clusters and many of them can be redundant and thus make the pattern-based clustering ineffective. On the other hand, the previous proposed methods may not be efficient or scalable in mining large databases. In this paper, we study the problem of maximal pattern-based clustering. Redundant clusters are avoided completely by mining only the maximal pattern-based clusters. MaPle, an efficient and scalable mining algorithm is developed. It conducts a depth-first, divide-and-conquer search and prunes unnecessary branches smartly. Our extensive performance study on both synthetic data sets and real data sets shows that maximal pattern-based clustering is effective. It reduces the number of clusters substantially. Moreover MaPle is more efficient and scalable than the previously proposed pattern-based clustering methods in mining large databases.
引用
收藏
页码:259 / 266
页数:8
相关论文
共 50 条
  • [41] Improving the Accuracy of the Annotation Algorithm in Pattern-Based Tennis Game Video
    Bastanfard, Azam
    Amirkhani, Dariush
    2021 29TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2021, : 493 - 497
  • [42] Fast density-based clustering algorithm
    Zhou, Shuigeng
    Zhou, Aoying
    Cao, Jing
    Hu, Yunfa
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2000, 37 (11): : 1287 - 1292
  • [43] Fast Correntropy-Based Clustering Algorithm
    Li Z.
    Yang B.
    Zhang J.
    Liu Y.
    Zhang X.
    Wang F.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2021, 55 (06): : 121 - 130
  • [44] Pattern-based Algorithm for Part-of-Speech Tagging Arabic Text
    Alqrainy, Shihadeh
    Alserhan, Hasan Muaidi
    Ayesh, Aladdin
    ICCES: 2008 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS, 2007, : 119 - +
  • [45] Semi-Supervised Pattern-Based Algorithm for Arabic Relation Extraction
    Sarhan, Injy
    El-Sonbaty, Yasser
    Abou El-Nasr, Mohamed
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 177 - 183
  • [46] An effective maximal subspace clustering algorithm based on enumeration tree
    Yin, Jian
    Huang, Zhilan
    Chen, Jian
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2007, : 572 - +
  • [47] Pattern-Based Mapping Refinement
    Hamdi, Faycal
    Reynaud, Chantal
    Safar, Brigitte
    KNOWLEDGE ENGINEERING AND MANAGEMENT BY THE MASSES, EKAW 2010, 2010, 6317 : 1 - 15
  • [48] Pattern-based texture metamorphosis
    Liu, ZQ
    Liu, C
    Shum, HY
    Yul, YZ
    10TH PACIFIC CONFERENCE ON COMPUTER GRAPHICS AND APPLICATIONS, PROCEEDINGS, 2002, : 184 - 191
  • [49] Pattern-based clustering of daily weigh-in trajectories using dynamic time warping
    Bothwell, Samantha
    Kaizer, Alex
    Peterson, Ryan
    Ostendorf, Danielle
    Catenacci, Victoria
    Wrobel, Julia
    BIOMETRICS, 2023, 79 (03) : 2719 - 2731
  • [50] Pattern-based verification for trees
    Ceska, Milan
    Erlebach, Pavel
    Vojnar, Tomas
    COMPUTER AIDED SYSTEMS THEORY- EUROCAST 2007, 2007, 4739 : 488 - 496