ENSEMBLE CLASSIFIER WITH RANDOM FOREST ALGORITHM TO DEAL WITH IMBALANCED HEALTHCARE DATA

Authors
Anbarasi, M. S. [1]
Janani, V. [1]
Affiliation
[1] Pondicherry Engn Coll, Dept Informat & Technol, Pondicherry 605014, India
Keywords
Ensemble Classifier; Random Forest Algorithm; Data pre-processing; Anomaly Detection Technique; Clustering technique
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In day-to-day life, health care environments generate massive and rapidly growing amounts of data. The medical industry holds large data sets for diagnostic purposes and for maintaining patient records, and medical research produces new treatments and medicines every day. However, the available medical datasets are often imbalanced in their class labels, and some existing methods perform poorly on imbalanced datasets, which makes predicting disease from such data difficult. This proposal uses a classifier ensemble method (the Random Forest algorithm) to overcome the limitations of existing classifier techniques. A multiple classifier system is more accurate and robust than a single classifier. The ensemble method proves very efficient at classifying records from the available imbalanced healthcare patient data because it considers the opinions of multiple base classifiers rather than relying on a single classifier. The method gives accurate and precise inferences because the multiple base classifiers remove unrelated data. In this way, problems in healthcare datasets, especially those involving some uncertainty, can be predicted.
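The idea of combining the opinions of multiple base classifiers on imbalanced data can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no algorithmic details, so everything here is an assumption made for illustration. The base classifiers are one-feature threshold rules ("stumps") rather than full decision trees, each is trained on a class-balanced bootstrap sample (a common way to handle imbalance, in the spirit of a balanced random forest), and the toy dataset and all function names are hypothetical.

```python
import random
from collections import Counter


def balanced_bootstrap(rows, labels, rng):
    """Draw the same number of samples (with replacement) from each class."""
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    # Use the size of the smallest class so no class dominates the sample.
    n = min(len(members) for members in by_class.values())
    sample = []
    for label, members in by_class.items():
        sample += [(rng.choice(members), label) for _ in range(n)]
    return sample


def train_stump(sample, rng):
    """Fit a threshold rule on one randomly chosen feature (labels are 0/1)."""
    feature = rng.randrange(len(sample[0][0]))
    mean = {}
    for cls in (0, 1):
        values = [row[feature] for row, label in sample if label == cls]
        mean[cls] = sum(values) / len(values)
    threshold = (mean[0] + mean[1]) / 2
    positive_above = mean[1] > mean[0]

    def predict(row):
        return int((row[feature] > threshold) == positive_above)

    return predict


def train_ensemble(rows, labels, n_estimators=25, seed=0):
    """Train several stumps, each on its own balanced bootstrap sample."""
    rng = random.Random(seed)
    return [train_stump(balanced_bootstrap(rows, labels, rng), rng)
            for _ in range(n_estimators)]


def predict_majority(ensemble, row):
    """Return the label that most base classifiers vote for."""
    votes = Counter(stump(row) for stump in ensemble)
    return votes.most_common(1)[0][0]


# Hypothetical imbalanced toy data: 18 majority-class rows, 2 minority-class rows.
majority = [(i % 5, (2 * i) % 5) for i in range(18)]  # feature values 0..4
minority = [(8, 9), (9, 8)]
rows = majority + minority
labels = [0] * len(majority) + [1] * len(minority)

ensemble = train_ensemble(rows, labels)
print(predict_majority(ensemble, (9, 9)))  # minority-like record
print(predict_majority(ensemble, (1, 1)))  # majority-like record
```

Because every base classifier sees both classes in equal proportion, the rare class is not drowned out during training, and the final label is decided by a vote rather than by any single rule. A real study would more likely use full decision trees from an established library and evaluate with imbalance-aware metrics such as recall or F1 on the minority class.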
Pages: 6