Unsupervised Discretization Method based on Adjustable Intervals

被引:3
|
作者
Bennasar, Mohamed [1 ]
Setchi, Rossitza [1 ]
Hicks, Yulia [1 ]
机构
[1] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, S Glam, Wales
关键词
unsupervised discretization; supervised discretization; classification accuracy;
D O I
10.3233/978-1-61499-105-2-79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discretization is a process applied to transform continuous data into data with discrete attributes. It makes the learning step of many classification algorithms more accurate and faster. Although many efficient supervised discretization methods have been proposed, unsupervised methods such as Equal Width Discretization (EWD) and Equal Frequency Discretization (EFD) are still in use especially with datasets when classification is not available. Each of these algorithms has its drawbacks. To improve the classification accuracy of EWD, a new method based on adjustable intervals is proposed in this paper. The new method is tested using benchmarking datasets from the UCI repository of machine learning databases; the C4.5 classification algorithm is then used to test the classification accuracy. The experimental results show that the method improves the classification accuracy by about 5% compared to the conventional EWD and EFD methods, and is as good as the supervised Entropy Minimization Discretization (EMD) method.
引用
收藏
页码:79 / 87
页数:9
相关论文
共 50 条
  • [31] Discretization Method of Continuous Attributes Based on Decision Attributes
    Sun, Yingjuan
    Ren, Zengqiang
    Zhou, Tong
    Zhai, Yandong
    Pu, Dongbing
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, AICI 2010, PT II, 2010, 6320 : 367 - 373
  • [32] A Density-based Discretization Method With Inconsistency Evaluation
    Zhao, Rong
    Qu, Yanpeng
    Deng, Ansheng
    Zwiggelaar, Reyer
    PROCEEDINGS OF 2018 TENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2018, : 758 - 763
  • [33] A discretization method of Continuous attributes based on rough set
    Tang Xiaokang
    Zhang Xuezhi
    Zouqiong
    Wei Youguo
    Cao Chengjun
    MECHATRONICS ENGINEERING, COMPUTING AND INFORMATION TECHNOLOGY, 2014, 556-562 : 3711 - +
  • [34] Evolution of multi-adaptive discretization intervals for a rule-based genetic learning system
    Bacardit, J
    Garrell, JM
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA 2002, PROCEEDINGS, 2002, 2527 : 350 - 360
  • [35] A new method for discretization of continuous attributes based on VPRS
    Wei, Jin-Mao
    Wang, Guo-Ying
    Kong, Xiang-Ming
    Li, Shu-Jie
    Wang, Shu-Qin
    Liu, Da-You
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2006, 4259 : 183 - 190
  • [36] Convergence theorem for the Haar wavelet based discretization method
    Majak, J.
    Shvartsman, B. S.
    Kirs, M.
    Pohlak, M.
    Herranen, H.
    COMPOSITE STRUCTURES, 2015, 126 : 227 - 232
  • [37] A UMDA-Based Discretization Method for Continuous Attributes
    Zhao Jing
    Han ChongZhao
    Wei Bin
    Han DeQiang
    MEMS, NANO AND SMART SYSTEMS, PTS 1-6, 2012, 403-408 : 1834 - +
  • [38] A GLOBAL DISCRETIZATION METHOD BASED ON CLUSTERING AND ROUGH SET
    Luo, Hairui
    Yan, Jianzhuo
    Fang, Liying
    Wang, Hui
    Shi, Xinqing
    DECISION MAKING AND SOFT COMPUTING, 2014, 9 : 400 - 405
  • [39] New defuzzification method based on weighted intervals
    Poleshuk, O. M.
    Komarov, E. G.
    2008 ANNUAL MEETING OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY, VOLS 1 AND 2, 2008, : 121 - 123
  • [40] Pattern-based unsupervised parsing method
    Santamaria, Jesus
    Araujo, Lourdes
    NATURAL LANGUAGE ENGINEERING, 2016, 22 (03) : 397 - 422