DB-HReduction: A data preprocessing algorithm for data mining applications

Cited by: 27
Author
Hu, XH [1]
Affiliation
[1] Drexel Univ, Coll Informat Sci & Technol, Philadelphia, PA 19104 USA
Keywords
data mining; data preprocessing; data reduction; horizontal reduction;
DOI
10.1016/S0893-9659(03)90013-9
Chinese Library Classification (CLC) number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Data preprocessing is a critical step in the data mining process and has a large impact on the success of a data mining project. In this paper, we present DB-HReduction, an algorithm that discretizes or eliminates numeric attributes and generalizes or eliminates symbolic attributes efficiently and effectively. The algorithm greatly reduces the number of attributes and tuples in the data set, which improves the accuracy and decreases the running time of the data mining algorithms applied in later stages. © 2003 Elsevier Science Ltd. All rights reserved.
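The abstract names the operations (discretizing numeric attributes, generalizing symbolic attributes, reducing the number of tuples) but gives no details. The Python sketch below illustrates the general idea only and is not the paper's DB-HReduction algorithm. Equal-width binning, the CITY_TO_REGION concept hierarchy, the sample rows, and all function names are assumptions made for illustration. It covers only the merging of tuples, not the elimination of attributes.

```python
from collections import Counter

# Hypothetical concept hierarchy: maps a specific symbolic value to a broader one.
CITY_TO_REGION = {
    "Philadelphia": "East",
    "Boston": "East",
    "Seattle": "West",
    "Portland": "West",
}


def equal_width_bin(value, low, high, bins):
    """Return the 0-based index of the equal-width interval that holds value."""
    if high <= low or bins < 1:
        raise ValueError("need high > low and bins >= 1")
    width = (high - low) / bins
    index = int((value - low) / width)
    # Clamp so that out-of-range values and value == high land in a valid bin.
    return min(max(index, 0), bins - 1)


def reduce_rows(rows, low, high, bins, hierarchy):
    """Discretize the numeric field, generalize the symbolic field,
    then merge tuples that became identical, keeping a count for each."""
    counts = Counter()
    for number, symbol in rows:
        key = (equal_width_bin(number, low, high, bins),
               hierarchy.get(symbol, "ANY"))  # unknown values generalize to "ANY"
        counts[key] += 1
    return dict(counts)


# Made-up sample data: (age, city).
rows = [
    (23, "Philadelphia"),
    (27, "Boston"),
    (45, "Seattle"),
    (48, "Portland"),
    (29, "Boston"),
]

reduced = reduce_rows(rows, low=20, high=60, bins=2, hierarchy=CITY_TO_REGION)
print(reduced)
```

In this toy input, five tuples collapse into two generalized tuples with counts, which is the kind of horizontal reduction the keywords refer to. A real implementation would have to choose bin boundaries and hierarchy levels with care, since overly coarse generalization can discard information the later mining step needs.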
Pages: 889-895
Page count: 7