DB-HReduction: A data preprocessing algorithm for data mining applications

被引:27
|
作者
Hu, XH [1 ]
机构
[1] Drexel Univ, Coll Informat Sci & Techol, Philadelphia, PA 19104 USA
关键词
data mining; data preprocessing; data reduction; horizontal reduction;
D O I
10.1016/S0893-9659(03)90013-9
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Data preprocessing is an important and critical step in the data mining process and it has a huge impact on the success of. a data mining project. In this paper, we present an algorithm DBHReduction, which discretizes or eliminates numeric attributes and generalizes or eliminates symbolic attributes very efficiently and effectively. This algorithm greatly decreases the number of attributes and tuples of the data set and improves the accuracy and decreases the running time of the data mining algorithms in the later stage. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:889 / 895
页数:7
相关论文
共 50 条
  • [41] Advanced data preprocessing for intersites web usage mining
    Tanasa, D
    Trousse, B
    IEEE INTELLIGENT SYSTEMS, 2004, 19 (02) : 59 - 65
  • [42] Interdependence of Text Mining Quality and the Input Data Preprocessing
    Darena, Frantisek
    Zizka, Jan
    ARTIFICIAL INTELLIGENCE PERSPECTIVES AND APPLICATIONS (CSOC2015), 2015, 347 : 141 - 150
  • [43] A Data Preprocessing Framework of Geoscience Data Sharing Portal for User Behavior Mining
    Wang, Mo
    Wang, Juanle
    2015 23RD INTERNATIONAL CONFERENCE ON GEOINFORMATICS, 2015,
  • [44] TOWARDS A UNIFIED STRATEGY FOR THE PREPROCESSING STEP IN DATA MINING
    Bratu, Camelia Vidrighin
    Potolea, Rodica
    ICEIS 2009 : PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL AIDSS, 2009, : 230 - 235
  • [45] Preprocessing and mining web log data for web personalization
    Baglioni, M
    Ferrara, U
    Romei, A
    Ruggieri, S
    Turini, F
    AI(ASTERISK)IA 2003: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, 2829 : 237 - 249
  • [46] Using ontologies for preprocessing and mining spectra data on the Grid
    Cannataro, M.
    Guzzi, P. H.
    Mazza, T.
    Tradigo, G.
    Veltri, P.
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2007, 23 (01): : 55 - 60
  • [47] Study on the Data preprocessing of the Questionnaire Based on the Combined Classification Data Mining Model
    Li Shuangcheng
    Wan Ping
    IEEE: 2009 INTERNATIONAL CONFERENCE ON E-LEARNING, E-BUSINESS, ENTERPRISE INFORMATION SYSTEMS AND E-GOVERNMENT, 2009, : 217 - 220
  • [48] A survey on data preprocessing for data stream mining: Current status and future directions
    Ramirez-Gallego, Sergio
    Krawczyk, Bartosz
    Garcia, Salvador
    Wozniak, Michal
    Herrera, Francisco
    NEUROCOMPUTING, 2017, 239 : 39 - 57
  • [49] A novel evolutionary data mining algorithm with applications to churn prediction
    Au, WH
    Chan, KCC
    Yao, X
    IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2003, 7 (06) : 532 - 545
  • [50] Deep Learning Algorithm and Applications in Location Big Data Mining
    Gao, Fa-Qin
    Xia, Hai-Xia
    FUZZY SYSTEM AND DATA MINING, 2016, 281 : 169 - 174