Unsupervised Discretization Method based on Adjustable Intervals

被引:3
|
作者
Bennasar, Mohamed [1 ]
Setchi, Rossitza [1 ]
Hicks, Yulia [1 ]
机构
[1] Cardiff Univ, Sch Engn, Cardiff CF24 3AA, S Glam, Wales
关键词
unsupervised discretization; supervised discretization; classification accuracy;
D O I
10.3233/978-1-61499-105-2-79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discretization is a process applied to transform continuous data into data with discrete attributes. It makes the learning step of many classification algorithms more accurate and faster. Although many efficient supervised discretization methods have been proposed, unsupervised methods such as Equal Width Discretization (EWD) and Equal Frequency Discretization (EFD) are still in use especially with datasets when classification is not available. Each of these algorithms has its drawbacks. To improve the classification accuracy of EWD, a new method based on adjustable intervals is proposed in this paper. The new method is tested using benchmarking datasets from the UCI repository of machine learning databases; the C4.5 classification algorithm is then used to test the classification accuracy. The experimental results show that the method improves the classification accuracy by about 5% compared to the conventional EWD and EFD methods, and is as good as the supervised Entropy Minimization Discretization (EMD) method.
引用
收藏
页码:79 / 87
页数:9
相关论文
共 50 条
  • [1] An Automated Unsupervised Discretization Method: A Novel Approach
    Drias, Habiba
    Moulai, Hadjer
    Drias, Yassine
    VIETNAM JOURNAL OF COMPUTER SCIENCE, 2020, 7 (03) : 301 - 322
  • [2] IFIT: an unsupervised discretization method based on the Ramer-Douglas-Peucker algorithm
    Mutlu, Alev
    Goz, Furkan
    Akbulut, Orhan
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (03) : 2344 - 2360
  • [3] Merging of Numerical Intervals in Entropy-Based Discretization
    Grzymala-Busse, Jerzy W.
    Mroczek, Teresa
    ENTROPY, 2018, 20 (11)
  • [4] Combining the Unsupervised Discretization Method and the Statistical Machine Learning on the Students' Performance
    Yamasari, Yuni
    Qoiriah, Anita
    Rochmawati, Naim
    Yustanti, Wiyli
    Tjahyaningtijas, Hapsari P. A.
    Rusimamto, Puput W.
    2020 THIRD INTERNATIONAL CONFERENCE ON VOCATIONAL EDUCATION AND ELECTRICAL ENGINEERING (ICVEE): STRENGTHENING THE FRAMEWORK OF SOCIETY 5.0 THROUGH INNOVATIONS IN EDUCATION, ELECTRICAL, ENGINEERING AND INFORMATICS ENGINEERING, 2020,
  • [5] Unsupervised discretization using tree-based density estimation
    Schmidberger, G
    Frank, E
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2005, 2005, 3721 : 240 - 251
  • [6] EF_Unique: An Improved Version of Unsupervised Equal Frequency Discretization Method
    Hacibeyoglu, Mehmet
    Ibrahim, Mohammed H.
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2018, 43 (12) : 7695 - 7704
  • [7] EF_Unique: An Improved Version of Unsupervised Equal Frequency Discretization Method
    Mehmet Hacibeyoglu
    Mohammed H. Ibrahim
    Arabian Journal for Science and Engineering, 2018, 43 : 7695 - 7704
  • [8] Unsupervised discretization by two-dimensional MDL-based histogram
    Lincen Yang
    Mitra Baratchi
    Matthijs van Leeuwen
    Machine Learning, 2023, 112 : 2397 - 2431
  • [9] Unsupervised discretization by two-dimensional MDL-based histogram
    Yang, Lincen
    Baratchi, Mitra
    van Leeuwen, Matthijs
    MACHINE LEARNING, 2023, 112 (07) : 2397 - 2431
  • [10] An unsupervised learning method to identify reference intervals from a clinical database
    Poole, Sarah
    Schroeder, Lee Frederick
    Shah, Nigam
    JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 59 : 276 - 284