Sampling Based on Genetic Algorithm for Data Mining

Cited by: 0
Authors
Wang Jianyong [1]
Huang Yu [1]
Hu Bin [1]
Wei Xiaomei [1]
Affiliation
[1] Huazhong Agr Univ, Coll Sci, Wuhan 430070, Hubei Province, Peoples R China
Keywords
Genetic algorithm; Association rules; Accuracy
DOI
Not available
Chinese Library Classification number
T [Industrial Technology]
Subject classification code
08
Abstract
Collecting a large initial sample from a huge data set, and then distilling a smaller sample from that initial sample at the same accuracy, can greatly speed up data mining algorithms. Because the distilling process has been proved to be an NP-hard problem, the two-phase sampling algorithm FAST adopts a greedy method. This paper presents SGA, a sampling algorithm that applies a genetic algorithm to sample distilling; in the experiment it performs better than popular sampling algorithms, including FAST.
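The abstract does not give SGA's encoding, fitness function, or operators, so the following is only an illustrative sketch of the general idea of distilling a sample with a genetic algorithm. It assumes that each individual is a fixed-size set of transaction indices and that fitness is the L1 distance between single-item supports in the subset and in the initial sample. The names `item_freqs`, `distance`, and `distill`, and all parameter values, are hypothetical and are not taken from the paper.

```python
import random
from collections import Counter


def item_freqs(transactions):
    """Relative frequency (support) of each single item."""
    counts = Counter(item for t in transactions for item in set(t))
    n = len(transactions)
    return {item: c / n for item, c in counts.items()}


def distance(subset_idx, initial, target):
    """L1 distance between item supports in the subset and the target supports."""
    freqs = item_freqs([initial[i] for i in subset_idx])
    return sum(abs(freqs.get(item, 0.0) - s) for item, s in target.items())


def distill(initial, size, generations=50, pop_size=30, mutation_rate=0.2, seed=0):
    """Pick `size` transactions from `initial` with a simple elitist GA.

    Assumed design (not from the paper): individuals are tuples of distinct
    indices; the better half survives each generation; crossover draws a child
    from the union of two parents; mutation swaps in one unused index.
    """
    rng = random.Random(seed)
    target = item_freqs(initial)
    indices = range(len(initial))

    def cost(individual):
        return distance(individual, initial, target)

    population = [tuple(rng.sample(indices, size)) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=cost)
        survivors = population[: pop_size // 2]
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            pool = list(set(a) | set(b))
            child = rng.sample(pool, size)
            if rng.random() < mutation_rate:
                outside = [i for i in indices if i not in child]
                if outside:
                    child[rng.randrange(size)] = rng.choice(outside)
            children.append(tuple(child))
        population = survivors + children

    best = min(population, key=cost)
    return sorted(best), cost(best)


if __name__ == "__main__":
    # Toy initial sample: each transaction is a list of items.
    sample = [["a", "b"], ["a"], ["b", "c"], ["a", "c"],
              ["a", "b", "c"], ["c"], ["a", "b"], ["b"]]
    chosen, err = distill(sample, size=4)
    print(chosen, err)
```

Because the better half of the population always survives, the best distance found never gets worse from one generation to the next. A greedy method such as the one the abstract attributes to FAST would instead build or trim the subset one transaction at a time.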
Pages: 3667-3672
Number of pages: 6