ENSEMBLE CLASSIFIER WITH RANDOM FOREST ALGORITHM TO DEAL WITH IMBALANCED HEALTHCARE DATA

被引:0
|
作者
Anbarasi, M. S. [1 ]
Janani, V. [1 ]
机构
[1] Pondicherry Engn Coll, Dept Informat & Technol, Pondicherry 605014, India
关键词
Ensemble Classifier; Random Forest Algorithm; Data pre-processing; Anomaly Detection Technique; Clustering technique;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In day today life, data is generated in massive amount with rapid growth in health care environment. The medical industries have large amount of data sets, for diagnosis purpose and maintain patient's records. The medical researches come with new treatments and medicine every day. But availability of medical datasets is often not balanced in their class labels. The performance of some existing method is poor on imbalanced dataset. So the prediction of disease from imbalanced data becomes difficult to handle. In this proposal Classifier ensemble method (Random Forest algorithm) can be used to overcome existing classifier techniques. Multiple classifier system is more accurate and robust than an existing classifier technique. The ensemble method proves to be very efficient in classification of records from available imbalanced healthcare patient data, as it involves the process of considering opinion from multiple base classifiers, as opposed to the single classifier method. This method gives a very accurate and precise inference, as unrelated data's are removed because of multiple base classifiers. The problems of healthcare dataset especially with some uncertainty can be predicted.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] An Ensemble Random Forest Algorithm for Privacy Preserving Distributed Medical Data Mining
    Hassan, Musavir
    Butt, Muheet Ahmed
    Zaman, Majid
    INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS, 2021, 12 (06)
  • [32] Preprocessed dynamic classifier ensemble selection for highly imbalanced drifted data streams
    Zyblewski, Pawel
    Sabourin, Robert
    Wozniak, Michal
    INFORMATION FUSION, 2021, 66 : 138 - 154
  • [33] Hybrid-ensemble-based interpretable TSK fuzzy classifier for imbalanced data
    Bian, Zekang
    Zhang, Jin
    Nojima, Yusuke
    Chung, Fu-lai
    Wang, Shitong
    INFORMATION FUSION, 2023, 98
  • [34] FORESTEXTER: An efficient random forest algorithm for imbalanced text categorization
    Wu, Qingyao
    Ye, Yunming
    Zhang, Haijun
    Ng, Michael K.
    Ho, Shen-Shyang
    KNOWLEDGE-BASED SYSTEMS, 2014, 67 : 105 - 116
  • [35] ENSEMBLE CLASSIFIER AND RESAMPLING FOR IMBALANCED MULTICLASS LEARNING
    Sainin, Mohd Shamrie
    Ahmad, Faudziah
    Alfred, Rayner
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON COMPUTING & INFORMATICS, 2015, : 751 - 756
  • [36] Ensemble MultiBoost Based on RIPPER Classifier for Prediction of Imbalanced Software Defect Data
    He, Haitao
    Zhang, Xu
    Wang, Qian
    Ren, Jiadong
    Liu, Jiaxin
    Zhao, Xiaolin
    Cheng, Yongqiang
    IEEE ACCESS, 2019, 7 : 110333 - 110343
  • [37] An Improved Random Forest Algorithm for classification in an imbalanced dataset.
    Jose, Christy
    Gopakumar, G.
    2019 URSI ASIA-PACIFIC RADIO SCIENCE CONFERENCE (AP-RASC), 2019,
  • [38] A new classifier for imbalanced data with iterative learning process and ensemble operating process
    Pan, Tingting
    Pedrycz, Witold
    Yang, Jie
    Wu, Wei
    Zhang, Yulin
    KNOWLEDGE-BASED SYSTEMS, 2022, 249
  • [39] A Direct Ensemble Classifier for Imbalanced Multiclass Learning
    Sainin, Mohd Shamrie
    Alfred, Rayner
    2012 4TH CONFERENCE ON DATA MINING AND OPTIMIZATION (DMO), 2012, : 59 - 66
  • [40] ECC plus plus : An algorithm family based on ensemble of classifier chains for classifying imbalanced multi-label data
    Duan, Jicong
    Gu, Yan
    Yu, Hualong
    Yang, Xibei
    Gao, Shang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 236