Comprehensive mining of frequent itemsets for a combination of certain and uncertain databases

被引:0
|
作者
Wazir S. [1 ]
Beg M.M.S. [2 ]
Ahmad T. [1 ]
机构
[1] Department of Computer Engineering, Jamia Millia Islamia, New Delhi
[2] Department of Computer Engineering, Aligarh Muslim University, Aligarh
关键词
Approximate Frequent Items; Certain and Uncertain Transactional Database; Expected Support; Frequent Itemset Mining; Normal Distribution; Poisson Distribution;
D O I
10.1007/s41870-019-00310-0
中图分类号
学科分类号
摘要
The mechanism of Frequent Itemset Mining can be performed by using sequential algorithms like Apriori on a standalone system, or it can be applied using parallel algorithms like Count Distribution on a distributed system. Due to communication overhead in parallel algorithms and exponential candidate generation, many algorithms were developed for calculating frequent items either over the certain or uncertain database. Yet not a single algorithm is developed so far which can cover the requirement of generating frequent itemset by combining both the databases. We had proposed earlier MasterApriori algorithm which is used to calculate Approximate Frequent Items for a combination of certain and uncertain databases with the support of Apriori for Certain and Expected support based UApriori for the uncertain database. In this paper, the researcher would like to extend the former work by using Poisson and Normal Distribution based UApriori for the uncertain database. In proposed algorithms, there is only one-time communication between sites where data is distributed, which reduce the communication overhead. Scalability and efficiency of proposed algorithms are then checked by using standard, and synthetic databases. The performances were then measured by comparing time taken and a number of frequent items generated by each algorithm. © 2019, Bharati Vidyapeeth's Institute of Computer Applications and Management.
引用
收藏
页码:1205 / 1216
页数:11
相关论文
共 50 条
  • [41] A Hybrid Solution of Mining Frequent Itemsets from Uncertain Database
    Yu, Xiaomei
    Wang, Hong
    Zheng, Xiangwei
    INTELLIGENT COMPUTING METHODOLOGIES, 2014, 8589 : 581 - 590
  • [42] A Novel Parallel Algorithm for Frequent Itemsets Mining in Large Transactional Databases
    Huan Phan
    Bac Le
    ADVANCES IN DATA MINING: APPLICATIONS AND THEORETICAL ASPECTS (ICDM 2018), 2018, 10933 : 272 - 287
  • [43] Efficient mining of maximal frequent itemsets from databases on a cluster of workstations
    Soon M. Chung
    Congnan Luo
    Knowledge and Information Systems, 2008, 16 : 359 - 391
  • [44] Efficient mining of maximal frequent itemsets from databases on a cluster of workstations
    Chung, Soon M.
    Luo, Congnan
    KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (03) : 359 - 391
  • [45] Incremental mining of weighted maximal frequent itemsets from dynamic databases
    Yun, Unil
    Lee, Gangin
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 54 : 304 - 327
  • [46] Distributed mining of maximal frequent itemsets from Databases on a cluster of workstations
    Chung, SM
    Luo, CN
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID - CCGRID 2004, 2004, : 499 - 507
  • [47] Mining Recent High Expected Weighted Itemsets from Uncertain Databases
    Gan, Wensheng
    Lin, Jerry Chun-Wei
    Fournier-Viger, Philippe
    Chao, Han-Chieh
    WEB TECHNOLOGIES AND APPLICATIONS, PT I, 2016, 9931 : 581 - 593
  • [48] Frequent Sequence Mining with Weight Constraints in Uncertain Databases
    Rahman, Md Mahmudur
    Ahmed, Chowdhury F.
    Leung, Carson K.
    Pazdor, Adam G. M.
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2018), 2018,
  • [49] Weighted frequent itemset mining over uncertain databases
    Jerry Chun-Wei Lin
    Wensheng Gan
    Philippe Fournier-Viger
    Tzung-Pei Hong
    Vincent S. Tseng
    Applied Intelligence, 2016, 44 : 232 - 250
  • [50] Interactive Mining of Probabilistic Frequent Patterns in Uncertain Databases
    Lin, Ming-Yen
    Fu, Cheng-Tai
    Hsueh, Sue-Chen
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2022, 30 (02) : 263 - 283