An Efficient Algorithm for Mining Large Item Sets

被引:1
|
作者
Zheng, Hong-Zhen [1 ]
Chu, Dian-Hui [1 ]
Zhan, De-Chen [1 ]
Xu, Xiao-Fei [1 ]
机构
[1] Harbin Inst Technol, Coll Comp Sci & Technol, Weihai 264209, Peoples R China
关键词
Large item sets; Data mining; Association rules;
D O I
10.1109/FSKD.2008.679
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
It propose Online Mining Algorithm (OMA) which online discover large item sets. Without presetting a default threshold, the OMA algorithm achieves its efficiency and threshold-flexibility by calculating item-sets' counts. It is unnecessary and independent of the default threshold and can flexibly adapt to any user's input threshold. In addition, we propose Cluster-Based Association Rule Algorithm (CARA) creates cluster tables to aid discovery of large item sets. It only requires a single scan of the database, followed by contrasts with the partial cluster tables. It not only prunes considerable amounts of data reducing the time needed to perform data scans and requiring less contrast, but also ensures the correctness of the mined results. By using the CARA algorithm to create cluster tables in advance, each CPU can be utilized to process a cluster table; thus large item sets can be immediately mined even when the database is very large.
引用
收藏
页码:561 / 564
页数:4
相关论文
共 50 条
  • [1] An efficient algorithm for mining large item sets
    Zheng, Hong-Zhen
    Chu, Dian-Hui
    Zhan, De-Chen
    3RD INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS, AND APPLICAT/4TH INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 2, 2006, : 151 - +
  • [2] Efficient Mining of Frequent Item Sets on Large Uncertain Databases
    Wang, Liang
    Cheung, David Wai-Lok
    Cheng, Reynold
    Lee, Sau Dan
    Yang, Xuan S.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (12) : 2170 - 2183
  • [3] A vertical format algorithm for mining frequent item sets
    Guo Yi-ming
    Wang Zhi-jun
    2ND IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER CONTROL (ICACC 2010), VOL. 4, 2010, : 11 - 13
  • [4] A new mining algorithm based on frequent item sets
    Wen Yun
    FIRST INTERNATIONAL WORKSHOP ON KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2007, : 410 - 413
  • [5] An efficient method for discovery of large item sets
    Deshpande, Deepa S.
    INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2016, 8 (04) : 303 - 314
  • [6] An efficient algorithm for mining high utility itemsets with negative item values in large databases
    Chu, Chun-Jung
    Tseng, Vincent S.
    Liang, Tyne
    APPLIED MATHEMATICS AND COMPUTATION, 2009, 215 (02) : 767 - 778
  • [7] Algorithm of Frequent Item Sets Mining Based on Index Table
    Zhang Lin
    Yao Nanzhen
    Zhang Jianli
    MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 1076 - +
  • [8] Design and Implementation of Improved Algorithm for Frequent Item Sets Mining
    Zhang Lin
    Zhang Jianli
    PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1696 - 1698
  • [9] Parallel algorithm for mining frequent item sets based on Spark
    Mao Y.
    Wu B.
    Xu C.
    Zhang M.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (04): : 1267 - 1283
  • [10] Frequent Item Sets and Association Rules Mining Algorithm Based on Floyd Algorithm
    Zhang Lin
    Zhang Jianli
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2015, 12 (09) : 2574 - 2578