Comprehensive mining of frequent itemsets for a combination of certain and uncertain databases

被引:0
|
作者
Wazir S. [1 ]
Beg M.M.S. [2 ]
Ahmad T. [1 ]
机构
[1] Department of Computer Engineering, Jamia Millia Islamia, New Delhi
[2] Department of Computer Engineering, Aligarh Muslim University, Aligarh
关键词
Approximate Frequent Items; Certain and Uncertain Transactional Database; Expected Support; Frequent Itemset Mining; Normal Distribution; Poisson Distribution;
D O I
10.1007/s41870-019-00310-0
中图分类号
学科分类号
摘要
The mechanism of Frequent Itemset Mining can be performed by using sequential algorithms like Apriori on a standalone system, or it can be applied using parallel algorithms like Count Distribution on a distributed system. Due to communication overhead in parallel algorithms and exponential candidate generation, many algorithms were developed for calculating frequent items either over the certain or uncertain database. Yet not a single algorithm is developed so far which can cover the requirement of generating frequent itemset by combining both the databases. We had proposed earlier MasterApriori algorithm which is used to calculate Approximate Frequent Items for a combination of certain and uncertain databases with the support of Apriori for Certain and Expected support based UApriori for the uncertain database. In this paper, the researcher would like to extend the former work by using Poisson and Normal Distribution based UApriori for the uncertain database. In proposed algorithms, there is only one-time communication between sites where data is distributed, which reduce the communication overhead. Scalability and efficiency of proposed algorithms are then checked by using standard, and synthetic databases. The performances were then measured by comparing time taken and a number of frequent items generated by each algorithm. © 2019, Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:1205 / 1216
页数:11
相关论文
共 50 条
  • [31] Mining maximal frequent itemsets for large scale transaction databases
    Xia, R
    Yuan, W
    Ding, SC
    Liu, J
    Zhou, HB
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1480 - 1485
  • [32] Mining frequent itemsets in large databases: The hierarchical partitioning approach
    Tseng, Fan-Chen
    EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (05) : 1654 - 1661
  • [33] Probabilistic Frequent Itemset Mining in Uncertain Databases
    Bernecker, Thomas
    Kriegel, Hans-Peter
    Renz, Matthias
    Verhein, Florian
    Zuefle, Andreas
    KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 119 - 127
  • [34] Mining weighted frequent sequences in uncertain databases
    Rahman, Md Mahmudur
    Ahmed, Chowdhury Farhan
    Leung, Carson Kai-Sang
    INFORMATION SCIENCES, 2019, 479 : 76 - 100
  • [35] A decremental approach for mining frequent itemsets from uncertain data
    Chui, Chun-Kit
    Kao, Ben
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 64 - 75
  • [36] An approximation algorithm of mining frequent itemsets from uncertain dataset
    Sun, Xiaoying
    Lim, Liming
    Wang, Shui
    International Journal of Advancements in Computing Technology, 2012, 4 (03) : 42 - 49
  • [37] Uncertain Frequent Itemsets Mining Algorithm on Data Streams with Constraints
    Yu, Qun
    Tang, Ke-Ming
    Tang, Shi-Xi
    Lv, Xin
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2016, 2016, 9937 : 192 - 201
  • [38] Efficient mining algorithm of frequent itemsets for uncertain data streams
    Wang Qianqian
    Liu Fang-ai
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 443 - 446
  • [39] Mining of Probabilistic Frequent Itemsets over Uncertain Data Streams
    Liu Lixin
    Zhang Xiaolin
    Zhang Huanxiang
    2014 11TH WEB INFORMATION SYSTEM AND APPLICATION CONFERENCE (WISA), 2014, : 231 - 237
  • [40] Mining constrained frequent itemsets from distributed uncertain data
    Cuzzocrea, Alfredo
    Leung, Carson Kai-Sang
    MacKinnon, Richard Kyle
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 37 : 117 - 126