Sampling Based on Genetic Algorithm for Data Mining

被引:0
|
作者
Wang Jianyong [1 ]
Huang Yu [1 ]
Hu Bin [1 ]
Wei Xiaomei [1 ]
机构
[1] Huazhong Agr Univ, Coll Sci, Wuhan 430070, Hubei Province, Peoples R China
关键词
Genetic algorithm; Association rules; Accuracy;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Collecting a large initial sample set from a huge data set, and then distilling a smaller sample set from the initial set in the same accuracy can greatly enhance the speeds of data mining algorithms. As the distilling process is proved as a NP-hard problem, the two-phase sampling algorithm FAST adopts a kind of geed method. Adopting genetic algorithm in sample distilling, a sampling algorithm SGA is presented in this paper, which performs better than popular sampling algorithms including FAST in the experiment.
引用
收藏
页码:3667 / 3672
页数:6
相关论文
共 50 条
  • [11] Mining Frequent Itemsets in Data Streams Based on Genetic Algorithm
    Han, Chong
    Sun, Lijuan
    Guo, Jian
    Chen, Xiaodong
    2013 15TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT), 2013, : 748 - 753
  • [12] Genetic Algorithm in Data Capturing and Mining
    Thanuja, M. K.
    Mala, C.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2011), VOL 1, 2012, 130 : 589 - 599
  • [13] Application of Genetic Algorithm in Data Mining
    Tan Jun-shan
    He Wei
    Qing Yan
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 353 - +
  • [14] Novel data mining method based on genetic algorithm and its application
    Chen, Yancai
    Journal of Computational Information Systems, 2007, 3 (04): : 1531 - 1538
  • [15] Distributed Multi-Relational Data Mining Based on Genetic Algorithm
    Dou, Wenxiang
    Hu, Jinglu
    Hirasawa, Kotaro
    Wu, Gengfeng
    2008 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-8, 2008, : 744 - +
  • [16] Framework for Efficient Letter Selection in Genetic Algorithm Based Data Mining
    Chen, Xiaoyan
    Zheng, Shijue
    Tao, Tao
    DCABES 2008 PROCEEDINGS, VOLS I AND II, 2008, : 334 - +
  • [17] Genetic algorithm rule based categorization method for textual data mining
    Afif, Mohammed H.
    Ghareb, Abdullah Saeed
    Saif, Abdulgbar
    Abu Bakar, Azuraliza
    Bazighifan, Omer
    DECISION SCIENCE LETTERS, 2020, 9 (01) : 37 - 50
  • [18] Framework for efficient feature selection in genetic algorithm based data mining
    Sikora, Riyaz
    Piramuthu, Selwyn
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2007, 180 (02) : 723 - 737
  • [19] Genetic algorithm with a structure-based representation for genetic-fuzzy data mining
    Ting, Chuan-Kang
    Wang, Ting-Chen
    Liaw, Rung-Tzuo
    Hong, Tzung-Pei
    SOFT COMPUTING, 2017, 21 (11) : 2871 - 2882
  • [20] Genetic algorithm with a structure-based representation for genetic-fuzzy data mining
    Chuan-Kang Ting
    Ting-Chen Wang
    Rung-Tzuo Liaw
    Tzung-Pei Hong
    Soft Computing, 2017, 21 : 2871 - 2882