ENSEMBLE CLASSIFIER WITH RANDOM FOREST ALGORITHM TO DEAL WITH IMBALANCED HEALTHCARE DATA

Cited by: 0
Authors
Anbarasi, M. S. [1]
Janani, V. [1]
Affiliations
[1] Pondicherry Engn Coll, Dept Informat & Technol, Pondicherry 605014, India
Keywords
Ensemble Classifier; Random Forest Algorithm; Data pre-processing; Anomaly Detection Technique; Clustering technique;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In day-to-day life, data in the healthcare environment is generated in massive amounts and grows rapidly. The medical industry holds large datasets for diagnosis and for maintaining patient records, and medical research produces new treatments and medicines every day. However, the available medical datasets are often imbalanced in their class labels, and some existing methods perform poorly on imbalanced datasets, which makes predicting disease from such data difficult. This proposal uses a classifier ensemble method (the Random Forest algorithm) to overcome the limitations of existing single-classifier techniques. A multiple-classifier system is more accurate and robust than a single classifier. The ensemble method proves very efficient at classifying records from the available imbalanced healthcare patient data because it considers the opinions of multiple base classifiers rather than relying on a single classifier. It gives accurate and precise inferences, since the multiple base classifiers filter out unrelated data. Problems in healthcare datasets, especially those involving some uncertainty, can thus be predicted.
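The idea of pooling the opinions of multiple base classifiers on imbalanced data can be illustrated with a minimal sketch. This is not the authors' implementation: it is a simplified stand-in that uses balanced bootstrap samples and one-feature threshold classifiers (stumps) in place of full Random Forest trees, assumes exactly two classes, and runs on synthetic data. All function names (`balanced_bootstrap`, `fit_stump`, `fit_ensemble`, `predict`) are hypothetical.

```python
import random
from collections import Counter


def balanced_bootstrap(X, y, rng):
    """Draw the same number of rows (with replacement) from every class."""
    by_class = {}
    for row, label in zip(X, y):
        by_class.setdefault(label, []).append(row)
    n = min(len(rows) for rows in by_class.values())  # size of the rarest class
    return [(rng.choice(rows), label)
            for label, rows in by_class.items() for _ in range(n)]


def fit_stump(sample, rng):
    """Fit a threshold classifier on one randomly chosen feature.

    Assumption: exactly two classes. The threshold is the midpoint
    between the two class means on the chosen feature.
    """
    f = rng.randrange(len(sample[0][0]))
    means = {}
    for label in {lab for _, lab in sample}:
        vals = [row[f] for row, lab in sample if lab == label]
        means[label] = sum(vals) / len(vals)
    (lo_label, lo_mean), (hi_label, hi_mean) = sorted(
        means.items(), key=lambda kv: kv[1])
    thr = (lo_mean + hi_mean) / 2
    return lambda row: hi_label if row[f] > thr else lo_label


def fit_ensemble(X, y, n_estimators=25, seed=0):
    """Train each base classifier on its own balanced bootstrap sample."""
    rng = random.Random(seed)
    return [fit_stump(balanced_bootstrap(X, y, rng), rng)
            for _ in range(n_estimators)]


def predict(ensemble, row):
    """Majority vote over the base classifiers."""
    votes = Counter(clf(row) for clf in ensemble)
    return votes.most_common(1)[0][0]


# Synthetic imbalanced data (an assumption for illustration only):
# 95 rows of class 0 near (0, 0) and 5 rows of class 1 near (3, 3).
data_rng = random.Random(42)
X = [(data_rng.gauss(0, 0.5), data_rng.gauss(0, 0.5)) for _ in range(95)]
X += [(data_rng.gauss(3, 0.5), data_rng.gauss(3, 0.5)) for _ in range(5)]
y = [0] * 95 + [1] * 5

ensemble = fit_ensemble(X, y)
print(predict(ensemble, (3.0, 3.0)), predict(ensemble, (0.0, 0.0)))
```

Because every base classifier sees both classes equally often, the minority class is not drowned out during training, and the vote across classifiers smooths over the noise of any single small sample. A production setup would more likely rely on an established Random Forest implementation with class weighting or balanced sampling.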
Pages: 6
Related papers
50 in total
  • [11] An Ensemble Random Forest Algorithm for Insurance Big Data Analysis
    Lin, Weiwei
    Wu, Ziming
    Lin, Longxin
    Wen, Angzhan
    Li, Jin
    IEEE ACCESS, 2017, 5 : 16568 - 16575
  • [12] An Ensemble Random Forest Algorithm for Insurance Big Data Analysis
    Wu, Ziming
    Lin, Weiwei
    Zhang, Zilong
    Wen, Angzhan
    Lin, Longxin
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND ENGINEERING (CSE) AND IEEE/IFIP INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (EUC), VOL 1, 2017, : 531 - 536
  • [13] A random forests quantile classifier for class imbalanced data
    O'Brien, Robert
    Ishwaran, Hemant
    PATTERN RECOGNITION, 2019, 90 : 232 - 249
  • [14] GAAE: a novel genetic algorithm based on autoencoder with ensemble classifiers for imbalanced healthcare data
    Ram, Pintu Kumar
    Kuila, Pratyay
    JOURNAL OF SUPERCOMPUTING, 2023, 79 (01): : 541 - 572
  • [15] Ensemble Classifier for Imbalanced Streaming Data Using Partial Labeling
    Arabmakki, Elaheh
    Kantardzic, Mehmed
    Sethi, Tegjyot Singh
    PROCEEDINGS OF 2016 IEEE 17TH INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IEEE IRI), 2016, : 257 - 260
  • [16] GAAE: a novel genetic algorithm based on autoencoder with ensemble classifiers for imbalanced healthcare data
    Pintu Kumar Ram
    Pratyay Kuila
    The Journal of Supercomputing, 2023, 79 : 541 - 572
  • [17] Evidential reasoning based ensemble classifier for uncertain imbalanced data
    Fu, Chao
    Zhan, Qianshan
    Liu, Weiyong
    INFORMATION SCIENCES, 2021, 578 : 378 - 399
  • [18] Classifier Ensemble Design for Imbalanced Data Classification: A Hybrid Approach
    Salunkhe, Uma R.
    Mali, Suresh N.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL MODELLING AND SECURITY (CMS 2016), 2016, 85 : 725 - 732
  • [19] Class Weights Random Forest Algorithm for Processing Class Imbalanced Medical Data
    Zhu, Min
    Xia, Jing
    Jin, Xiaoqing
    Yan, Molei
    Cai, Guolong
    Yan, Jing
    Ning, Gangmin
    IEEE ACCESS, 2018, 6 : 4641 - 4652
  • [20] Balanced random forest for imbalanced data streams
    Yagci, A. Murat
    Aytekin, Tevfik
    Gurgen, Fikret S.
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1065 - 1068