An evolutionary algorithm for mining rare association rules: a Big Data approach

被引:0
|
作者
Padillo, F. [1 ]
Luna, J. M. [2 ]
Ventura, S. [1 ]
机构
[1] Univ Cordoba, Dept Comp Sci & Numer Anal, Rabanales Campus, Cordoba, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen, Spain
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Association rule mining is one of the most well-known techniques to discover interesting relations between items in data. To date, this task has been mainly focused on the discovery of frequent relationships. However, it is often interesting to focus on those that do not occur frequently. Rare association rule mining is an alluring field aiming at describing rare cases or unexpected behavior. This field is really useful over Big Data where abnormal endeavor are more curious than common behavior. In this sense, our aim is to propose a new evolutionary algorithm based on grammars to obtain rare association rules on Big Data. The novelty of our work is that it is eminently designed to be parallel, enabling its use over emerging technologies as Spark and Flink. Furthermore, while other algorithms focus on maximizing a couple of quality measure ignoring the rest, our fitness function has been precisely designed to obtain a trade-off while maximizing a set of well-known quality measures. The experimental study includes more than 70 datasets revealing alluring results in efficiency when more than 300 million of instances and file sizes up to 250 GBytes are considered, and proving that it is able to run efficiently in huge volumes of data.
引用
收藏
页码:2007 / 2014
页数:8
相关论文
共 50 条
  • [31] A Data Mining Algorithm for Association Rules with Chronic Disease Constraints
    Liu, YanRong
    Wang, LiJun
    Miao, Rong
    Ren, HengNi
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [32] A Web Data Mining Algorithm based on Weighted Association Rules
    Lv, Xiao
    Li, Yongjie
    Lu, Xu
    MATERIALS, MECHATRONICS AND AUTOMATION, PTS 1-3, 2011, 467-469 : 1386 - +
  • [33] Improvement and simulation of data mining algorithm based on association rules
    Tian, Li (lit@zucc.edu.cn), 1600, Universitas Ahmad Dahlan (14):
  • [34] Research on Data Mining Technology based on Association Rules Algorithm
    Zhang, Guihong
    Liu, Caiming
    Men, Tao
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 526 - 530
  • [35] The Role of Apriori Algorithm for Finding the Association Rules in Data Mining
    Dongre, Lugendra
    Prajapati, Gend Lal
    Tokekar, S. V.
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON ISSUES AND CHALLENGES IN INTELLIGENT COMPUTING TECHNIQUES (ICICT), 2014, : 657 - 660
  • [36] A Study on the Mining Algorithm of Fast Association Rules for the XML Data
    Wu Gongxing
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 204 - 207
  • [37] Neutrosophic Association Rule Mining Algorithm for Big Data Analysis
    Abdel-Basset, Mohamed
    Mohamed, Mai
    Smarandache, Florentin
    Chang, Victor
    SYMMETRY-BASEL, 2018, 10 (04):
  • [38] Foreword: Evolutionary data mining for big data
    Ding, Weiping
    Yen, Gary G.
    Cai, Xinye
    Cao, Zehong
    SWARM AND EVOLUTIONARY COMPUTATION, 2020, 57
  • [39] Mining association rules on significant rare data using relative support
    Yun, HY
    Ha, DS
    Hwang, BY
    Ryu, KH
    JOURNAL OF SYSTEMS AND SOFTWARE, 2003, 67 (03) : 181 - 191
  • [40] Distributed mining of Censored Production Rules in data streams: An evolutionary approach
    Saroj
    Bharadwaj, K. K.
    ADVANCES ON ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING AND DATA BASES, PROCEEDINGS, 2008, : 500 - +