Sampling Based on Genetic Algorithm for Data Mining

被引:0
|
作者
Wang Jianyong [1 ]
Huang Yu [1 ]
Hu Bin [1 ]
Wei Xiaomei [1 ]
机构
[1] Huazhong Agr Univ, Coll Sci, Wuhan 430070, Hubei Province, Peoples R China
关键词
Genetic algorithm; Association rules; Accuracy;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Collecting a large initial sample set from a huge data set, and then distilling a smaller sample set from the initial set in the same accuracy can greatly enhance the speeds of data mining algorithms. As the distilling process is proved as a NP-hard problem, the two-phase sampling algorithm FAST adopts a kind of geed method. Adopting genetic algorithm in sample distilling, a sampling algorithm SGA is presented in this paper, which performs better than popular sampling algorithms including FAST in the experiment.
引用
收藏
页码:3667 / 3672
页数:6
相关论文
共 50 条
  • [21] Mining hidden danger data association rules of coal mining face based on genetic algorithm
    Ning, Guifeng
    Gao, Long
    Liu, Liping
    Journal of Mining and Strata Control Engineering, 2024, 6 (02)
  • [22] A new meterage-data mining algorithm based on fuzzy neural network and genetic algorithm
    Huang, JC
    Zhang, WM
    Zhao, X
    Liu, Z
    ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 1770 - 1773
  • [23] Validation of genetic algorithm-based optimal sampling for ocean data assimilation
    Kevin D. Heaney
    Pierre F. J. Lermusiaux
    Timothy F. Duda
    Patrick J. Haley
    Ocean Dynamics, 2016, 66 : 1209 - 1229
  • [24] Validation of genetic algorithm-based optimal sampling for ocean data assimilation
    Heaney, Kevin D.
    Lermusiaux, Pierre F. J.
    Duda, Timothy F.
    Haley, Patrick J., Jr.
    OCEAN DYNAMICS, 2016, 66 (10) : 1209 - 1229
  • [25] Data Mining Research in Wireless Sensor Network Based on Genetic BP Algorithm
    Wang Mengmeng
    Xiu Debin
    Wang Rongxin
    Du Fang
    Shi Yunbo
    PROCEEDINGS OF 2013 2ND INTERNATIONAL CONFERENCE ON MEASUREMENT, INFORMATION AND CONTROL (ICMIC 2013), VOLS 1 & 2, 2013, : 243 - 247
  • [26] Deep Mining of Redundant Data in Wireless Sensor Network Based on Genetic Algorithm
    Diao H.
    Automatic Control and Computer Sciences, 2018, 52 (4) : 291 - 296
  • [27] Research on data mining system based on artificial intelligence and improved genetic algorithm
    Shi Ruifeng
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (04) : 6731 - 6742
  • [28] Stratified Sampling Design Based on Data Mining
    Kim, Yeonkook J.
    Oh, Yoonhwan
    Park, Sunghoon
    Cho, Sungzoon
    Park, Hayoung
    HEALTHCARE INFORMATICS RESEARCH, 2013, 19 (03) : 186 - 195
  • [29] Data mining technology based on rough set and genetic algorithm under large data environment
    Wang, Liping
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON MATERIALS ENGINEERING AND INFORMATION TECHNOLOGY APPLICATIONS (MEITA 2016), 2017, 107 : 561 - 565
  • [30] Data mining for data classification based on the KNN-fuzzy method supported by genetic algorithm
    Rosa, JLA
    Ebecken, NFF
    HIGH PERFORMANCE COMPUTING FOR COMPUTATIONAL SCIENCE - VECPAR 2002, 2003, 2565 : 126 - 133