DB-HReduction: A data preprocessing algorithm for data mining applications

被引:27
|
作者
Hu, XH [1 ]
机构
[1] Drexel Univ, Coll Informat Sci & Techol, Philadelphia, PA 19104 USA
关键词
data mining; data preprocessing; data reduction; horizontal reduction;
D O I
10.1016/S0893-9659(03)90013-9
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Data preprocessing is an important and critical step in the data mining process and it has a huge impact on the success of. a data mining project. In this paper, we present an algorithm DBHReduction, which discretizes or eliminates numeric attributes and generalizes or eliminates symbolic attributes very efficiently and effectively. This algorithm greatly decreases the number of attributes and tuples of the data set and improves the accuracy and decreases the running time of the data mining algorithms in the later stage. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:889 / 895
页数:7
相关论文
共 50 条
  • [21] Data Preprocessing Method on Data Mining of Web Log File
    Li, Jia
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL AND INFORMATION SCIENCES (ICCIS 2014), 2014, : 712 - 717
  • [22] MS-Analyzer: preprocessing and data mining services for proteomics applications on the Grid
    Cannataro, Mario
    Veltri, Pierangelo
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2007, 19 (15): : 2047 - 2066
  • [23] Data preprocessing by sequential pattern mining for LZW
    Vergara-Villegas, OO
    García-Hernández, RA
    Carrasco-Ochoa, JA
    Elías, RP
    Martínez-Trinidad, JF
    Sixth Mexican International Conference on Computer Science, Proceedings, 2005, : 82 - 87
  • [24] Smart Preprocessing Improves Data Stream Mining
    Hu, Hanqing
    Kantardzic, Mehmed
    PROCEEDINGS OF THE 49TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES (HICSS 2016), 2016, : 1749 - 1757
  • [25] Data squashing as preprocessing in association rule mining
    Fister, Iztok
    Fister, Iztok, Jr.
    Novak, Damijan
    Verber, Domen
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 1720 - 1725
  • [26] Study on Data Preprocessing Process in Web Mining
    Peng, Sumian
    Zhou, Xingmei
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON INFORMATION, ELECTRONIC AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 19 - 22
  • [27] Discretization and grouping: Preprocessing steps for data mining
    Berka, P
    Bruha, I
    PRINCIPLES OF DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 1510 : 239 - 245
  • [28] Data Preprocessing with GPU for DBSCAN Algorithm
    Cal, Piotr
    Wozniak, Michal
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2013, 2013, 226 : 793 - 801
  • [29] Applications of Harmony Search Algorithm in Data Mining: A Survey
    Assad, Assif
    Deep, Kusum
    PROCEEDINGS OF FIFTH INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2015), VOL 2, 2016, 437 : 863 - 874
  • [30] A data preprocessing framework for students' outcome prediction by data mining techniques
    Danubianu, Mirela
    2015 19TH INTERNATIONAL CONFERENCE ON SYSTEM THEORY, CONTROL AND COMPUTING (ICSTCC), 2015, : 836 - 841