Unsupervised Discretization Method based on Adjustable Intervals

被引:3
|
作者
Bennasar, Mohamed [1 ]
Setchi, Rossitza [1 ]
Hicks, Yulia [1 ]
机构
[1] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, S Glam, Wales
关键词
unsupervised discretization; supervised discretization; classification accuracy;
D O I
10.3233/978-1-61499-105-2-79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discretization is a process applied to transform continuous data into data with discrete attributes. It makes the learning step of many classification algorithms more accurate and faster. Although many efficient supervised discretization methods have been proposed, unsupervised methods such as Equal Width Discretization (EWD) and Equal Frequency Discretization (EFD) are still in use especially with datasets when classification is not available. Each of these algorithms has its drawbacks. To improve the classification accuracy of EWD, a new method based on adjustable intervals is proposed in this paper. The new method is tested using benchmarking datasets from the UCI repository of machine learning databases; the C4.5 classification algorithm is then used to test the classification accuracy. The experimental results show that the method improves the classification accuracy by about 5% compared to the conventional EWD and EFD methods, and is as good as the supervised Entropy Minimization Discretization (EMD) method.
引用
收藏
页码:79 / 87
页数:9
相关论文
共 50 条
  • [21] Boundary element method based on preliminary discretization
    Poblet-Puig J.
    Valyaev V.Y.
    Shanin A.V.
    Mathematical Models and Computer Simulations, 2014, 6 (2) : 172 - 182
  • [22] A global discretization method based on rough sets
    Shi, H
    Fu, JZ
    PROCEEDINGS OF 2005 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-9, 2005, : 3053 - 3057
  • [23] Improvement of the Accuracy of Prediction Using Unsupervised Discretization Method: Educational Data Set Case Study
    Dimic, Gabrijela
    Rancic, Dejan
    Milentijevic, Ivan
    Spalevic, Petar
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2018, 25 (02): : 407 - 414
  • [24] The generalized method of lines based on the discretization technique of the pseudospectral method
    RuShan, C
    Yung, EKN
    Wu, K
    Han, YF
    MICROWAVE AND OPTICAL TECHNOLOGY LETTERS, 1999, 20 (05) : 339 - 342
  • [25] Analysis and improvements of the adaptive discretization intervals knowledge representation
    Bacardit, J
    Garrell, JM
    GENETIC AND EVOLUTIONARY COMPUTATION GECCO 2004 , PT 2, PROCEEDINGS, 2004, 3103 : 726 - 738
  • [26] Genetic fuzzy discretization with adaptive intervals for classification problems
    Choi, Yoon-Seok
    Moon, Byung-Ro
    Seo, Sang Yong
    GECCO 2005: GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, VOLS 1 AND 2, 2005, : 2037 - 2043
  • [27] Latent representation discretization for unsupervised text style generation
    Gao, Yang
    Liu, Qianhui
    Yang, Yizhe
    Wang, Ke
    INFORMATION PROCESSING & MANAGEMENT, 2024, 61 (03)
  • [28] Unsupervised interaction-preserving discretization of multivariate data
    Hoang-Vu Nguyen
    Emmanuel Müller
    Jilles Vreeken
    Klemens Böhm
    Data Mining and Knowledge Discovery, 2014, 28 : 1366 - 1397
  • [29] A Bayesian Hybrid Approach to Unsupervised Time Series Discretization
    Kameya, Yoshitaka
    Synnaeve, Gabriel
    Doncescu, Andrei
    Inoue, Katsumi
    Sato, Taisuke
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 342 - 349
  • [30] Unsupervised interaction-preserving discretization of multivariate data
    Hoang-Vu Nguyen
    Muller, Emmanuel
    Vreeken, Jilles
    Boehm, Klemens
    DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 28 (5-6) : 1366 - 1397