Parametric algorithms for mining share frequent itemsets

被引:8
|
作者
Barber, B [1 ]
Hamilton, HJ [1 ]
机构
[1] Univ Regina, Dept Comp Sci, Regina, SK S4S 0A2, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
knowledge discovery; data mining; itemsets; association rule mining; share based measures;
D O I
10.1023/A:1011276003319
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Itemset share, the fraction of some numerical total contributed by items when they occur in itemsets, has been proposed as a measure of the importance of itemsets in association rule mining. The IAB and CAC algorithms are able to find share frequent itemsets that have infrequent subsets. These algorithms perform well, but they do not always find all possible share frequent itemsets. In this paper, we describe the incorporation of a threshold factor into these algorithms. The threshold factor can be used to increase the number of frequent itemsets found at a cost of an increase in the number of infrequent itemsets examined. The modified algorithms are tested on a large commercial database. Their behavior is examined using principles of classifier evaluation from machine learning.
引用
收藏
页码:277 / 293
页数:17
相关论文
共 50 条
  • [21] Extracting share frequent itemsets with infrequent subsets
    Barber, B
    Hamilton, HJ
    DATA MINING AND KNOWLEDGE DISCOVERY, 2003, 7 (02) : 153 - 185
  • [22] Efficient algorithms for deriving complete frequent itemsets from frequent closed itemsets
    Wu, Cheng-Wei
    Huang, JianTao
    Lin, Yun-Wei
    Chuang, Chien-Yu
    Tseng, Yu-Chee
    APPLIED INTELLIGENCE, 2022, 52 (06) : 7002 - 7023
  • [23] Mining frequent itemsets from streaming transaction data using genetic algorithms
    Sikha Bagui
    Patrick Stanley
    Journal of Big Data, 7
  • [24] Efficient improvement of FT-tree based frequent itemsets mining algorithms
    Qiu, Yong
    Lan, Yong-Jie
    ICICIC 2006: FIRST INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING, INFORMATION AND CONTROL, VOL 3, PROCEEDINGS, 2006, : 374 - +
  • [25] Mining frequent itemsets from streaming transaction data using genetic algorithms
    Bagui, Sikha
    Stanley, Patrick
    JOURNAL OF BIG DATA, 2020, 7 (01)
  • [26] GeneticMax: An Efficient Approach to Mining Maximal Frequent Itemsets Based on Genetic Algorithms
    Kabir, Mir Md. Jahangir
    Xu, Shuxiang
    Kang, Byeong Ho
    Zhao, Zongyuan
    INFORMATION TECHNOLOGY IN INDUSTRY, 2015, 3 (03): : 64 - 73
  • [27] Mining frequent closed itemsets with the frequent pattern list
    Tseng, FC
    Hsu, CC
    Chen, H
    2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2001, : 653 - 654
  • [28] Distributed Frequent Closed Itemsets Mining
    Liu, Chun
    Zheng, Zheng
    Cai, Kai-Yuan
    Zhang, Shichao
    SITIS 2007: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGIES & INTERNET BASED SYSTEMS, 2008, : 43 - 50
  • [29] Summary queries for frequent itemsets mining
    Zhang, Shichao
    Jin, Zhi
    Lu, Jingli
    JOURNAL OF SYSTEMS AND SOFTWARE, 2010, 83 (03) : 405 - 411
  • [30] Incremental Frequent Itemsets Mining with MapReduce
    Kandalov, Kirill
    Gudes, Ehud
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2017, 2017, 10509 : 247 - 261