ENSEMBLE CLASSIFIER WITH RANDOM FOREST ALGORITHM TO DEAL WITH IMBALANCED HEALTHCARE DATA

被引:0
|
作者
Anbarasi, M. S. [1 ]
Janani, V. [1 ]
机构
[1] Pondicherry Engn Coll, Dept Informat & Technol, Pondicherry 605014, India
关键词
Ensemble Classifier; Random Forest Algorithm; Data pre-processing; Anomaly Detection Technique; Clustering technique;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In day today life, data is generated in massive amount with rapid growth in health care environment. The medical industries have large amount of data sets, for diagnosis purpose and maintain patient's records. The medical researches come with new treatments and medicine every day. But availability of medical datasets is often not balanced in their class labels. The performance of some existing method is poor on imbalanced dataset. So the prediction of disease from imbalanced data becomes difficult to handle. In this proposal Classifier ensemble method (Random Forest algorithm) can be used to overcome existing classifier techniques. Multiple classifier system is more accurate and robust than an existing classifier technique. The ensemble method proves to be very efficient in classification of records from available imbalanced healthcare patient data, as it involves the process of considering opinion from multiple base classifiers, as opposed to the single classifier method. This method gives a very accurate and precise inference, as unrelated data's are removed because of multiple base classifiers. The problems of healthcare dataset especially with some uncertainty can be predicted.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Hybrid Classifier Ensemble for Imbalanced Data
    Yang, Kaixiang
    Yu, Zhiwen
    Wen, Xin
    Cao, Wenming
    Chen, C. L. Philip
    Wong, Hau-San
    You, Jane
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (04) : 1387 - 1400
  • [2] A Review on Random Forest: An Ensemble Classifier
    Parmar, Aakash
    Katariya, Rakesh
    Patel, Vatsal
    INTERNATIONAL CONFERENCE ON INTELLIGENT DATA COMMUNICATION TECHNOLOGIES AND INTERNET OF THINGS, ICICI 2018, 2019, 26 : 758 - 763
  • [3] Progressive Hybrid Classifier Ensemble for Imbalanced Data
    Yang, Kaixiang
    Yu, Zhiwen
    Chen, C. L. Philip
    Cao, Wenming
    Wong, Hau-San
    You, Jane
    Han, Guoqiang
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (04): : 2464 - 2478
  • [4] Multicriteria Classifier Ensemble Learning for Imbalanced Data
    Wegier, Weronika
    Koziarski, Michal
    Wozniak, Micha
    Wegier, Weronika
    IEEE Access, 2022, 10 : 16807 - 16818
  • [5] Multicriteria Classifier Ensemble Learning for Imbalanced Data
    Wegier, Weronika
    Koziarski, Michal
    Wozniak, Micha
    IEEE ACCESS, 2022, 10 : 16807 - 16818
  • [6] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Shi, Peibei
    Wang, Zhong
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (06) : 2250 - 2266
  • [7] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    SHI Peibei
    WANG Zhong
    Journal of Systems Science & Complexity, 2021, 34 (06) : 2250 - 2266
  • [8] An ensemble classifier framework for mining imbalanced data streams
    Ouyang, Zhen-Zheng
    Luo, Jian-Shu
    Hu, Dong-Min
    Wu, Quan-Yuan
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2010, 38 (01): : 184 - 189
  • [9] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Peibei Shi
    Zhong Wang
    Journal of Systems Science and Complexity, 2021, 34 : 2250 - 2266
  • [10] Research of Medical High-dimensional Imbalanced Data Classification-Ensemble Feature Selection Algorithm with Random Forest
    Zhu, Min
    Su, Bo
    Ning, Gangmin
    2017 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2017, : 273 - 277