An evolutionary algorithm for mining rare association rules: a Big Data approach

被引:0
|
作者
Padillo, F. [1 ]
Luna, J. M. [2 ]
Ventura, S. [1 ]
机构
[1] Univ Cordoba, Dept Comp Sci & Numer Anal, Rabanales Campus, Cordoba, Spain
[2] Univ Jaen, Dept Comp Sci, Jaen, Spain
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Association rule mining is one of the most well-known techniques to discover interesting relations between items in data. To date, this task has been mainly focused on the discovery of frequent relationships. However, it is often interesting to focus on those that do not occur frequently. Rare association rule mining is an alluring field aiming at describing rare cases or unexpected behavior. This field is really useful over Big Data where abnormal endeavor are more curious than common behavior. In this sense, our aim is to propose a new evolutionary algorithm based on grammars to obtain rare association rules on Big Data. The novelty of our work is that it is eminently designed to be parallel, enabling its use over emerging technologies as Spark and Flink. Furthermore, while other algorithms focus on maximizing a couple of quality measure ignoring the rest, our fitness function has been precisely designed to obtain a trade-off while maximizing a set of well-known quality measures. The experimental study includes more than 70 datasets revealing alluring results in efficiency when more than 300 million of instances and file sizes up to 250 GBytes are considered, and proving that it is able to run efficiently in huge volumes of data.
引用
收藏
页码:2007 / 2014
页数:8
相关论文
共 50 条
  • [41] Rare Itemsets Selector with Association Rules for Revenue Analysis by Association Rare Itemset Rule Mining Approach
    Selvarani S.
    Jeyakarthic M.
    Recent Advances in Computer Science and Communications, 2021, 14 (07) : 2335 - 2344
  • [42] An evolutionary algorithm for the discovery of rare class association rules in learning management systems
    Luna, J. M.
    Romero, C.
    Romero, J. R.
    Ventura, S.
    APPLIED INTELLIGENCE, 2015, 42 (03) : 501 - 513
  • [43] An evolutionary algorithm for the discovery of rare class association rules in learning management systems
    J. M. Luna
    C. Romero
    J. R. Romero
    S. Ventura
    Applied Intelligence, 2015, 42 : 501 - 513
  • [44] Mining comprehensible clustering rules with an evolutionary algorithm
    Sarafis, L
    Trinder, P
    Zalzala, A
    GENETIC AND EVOLUTIONARY COMPUTATION - GECCO 2003, PT II, PROCEEDINGS, 2003, 2724 : 2301 - 2312
  • [45] Mining diversified association rules in big datasets: A cluster/GPU/genetic approach
    Djenouri, Youcef
    Belhadi, Asma
    Fournier-Viger, Philippe
    Fujita, Hamido
    INFORMATION SCIENCES, 2018, 459 : 117 - 134
  • [46] Application of Association Rules Mining in School Recruitment in the Background of Big Data Era
    Li Zuojun
    PROCEEDINGS OF THE 2018 8TH INTERNATIONAL CONFERENCE ON EDUCATION AND MANAGEMENT (ICEM 2018), 2018, 77 : 32 - 36
  • [47] Rare Association Rules Mining of Diabetic Complications Based on Improved Rarity Algorithm
    Pan, Qiao
    Xiang, Lan
    Jin, Yanhong
    PROCEEDINGS OF 2019 IEEE 7TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND COMPUTATIONAL BIOLOGY (ICBCB 2019), 2019, : 115 - 119
  • [48] ERAPN, an Algorithm for Extraction Positive and Negative Association Rules in Big Data
    Bemarisika, Parfait
    Totohasina, Andre
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2018), 2018, 11031 : 329 - 344
  • [49] Association feature mining algorithm of web accessing data in big data environment
    Gong, Jing
    JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2018, 21 (02): : 333 - 337
  • [50] A Recursive Algorithm for Mining Association Rules
    Mokkadem A.
    Pelletier M.
    Raimbault L.
    SN Computer Science, 3 (5)