ENSEMBLE CLASSIFIER WITH RANDOM FOREST ALGORITHM TO DEAL WITH IMBALANCED HEALTHCARE DATA

被引:0
|
作者
Anbarasi, M. S. [1 ]
Janani, V. [1 ]
机构
[1] Pondicherry Engn Coll, Dept Informat & Technol, Pondicherry 605014, India
关键词
Ensemble Classifier; Random Forest Algorithm; Data pre-processing; Anomaly Detection Technique; Clustering technique;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In day today life, data is generated in massive amount with rapid growth in health care environment. The medical industries have large amount of data sets, for diagnosis purpose and maintain patient's records. The medical researches come with new treatments and medicine every day. But availability of medical datasets is often not balanced in their class labels. The performance of some existing method is poor on imbalanced dataset. So the prediction of disease from imbalanced data becomes difficult to handle. In this proposal Classifier ensemble method (Random Forest algorithm) can be used to overcome existing classifier techniques. Multiple classifier system is more accurate and robust than an existing classifier technique. The ensemble method proves to be very efficient in classification of records from available imbalanced healthcare patient data, as it involves the process of considering opinion from multiple base classifiers, as opposed to the single classifier method. This method gives a very accurate and precise inference, as unrelated data's are removed because of multiple base classifiers. The problems of healthcare dataset especially with some uncertainty can be predicted.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Improving the Accuracy of Ensemble Classifier Prediction model based on FLAME Clustering with Random Forest Algorithm
    Augusty, Seena Mary
    Izudheen, Sminu
    2013 THIRD INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING AND COMMUNICATIONS (ICACC 2013), 2013, : 269 - 273
  • [22] A classifier ensemble algorithm based on local random subspace
    School of Computer Science and Technology, Nanjing Normal University, Nanjing 210046, China
    不详
    Moshi Shibie yu Rengong Zhineng, 2012, 4 (595-603):
  • [23] An Adaptive Sampling Ensemble Classifier for Learning from Imbalanced Data Sets
    Geiler, Ordonez Jon
    Hong, Li
    Yue-Jian, Guo
    INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS (IMECS 2010), VOLS I-III, 2010, : 513 - 517
  • [24] Classifier Selection for Highly Imbalanced Data Streams with Minority Driven Ensemble
    Zyblewski, Pawel
    Ksieniewicz, Pawel
    Wozniak, Michal
    ARTIFICIAL INTELLIGENCEAND SOFT COMPUTING, PT I, 2019, 11508 : 626 - 635
  • [25] An Ensemble classifier approach for Disease Diagnosis using Random Forest
    Pachange, Sarika
    Joglekar, Bela
    Kulkarni, Parag
    2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [26] Rotation forest and random oracles: Two classifier ensemble methods
    Rodriguez, Juan J.
    Twentieth IEEE International Symposium on Computer-Based Medical Systems, Proceedings, 2007, : 3 - 3
  • [27] Research on the Classification of High Dimensional Imbalanced Data based on the Optimization of Random Forest Algorithm
    Ma Xiaojuan
    PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON BIG DATA ENGINEERING AND TECHNOLOGY (BDET 2018), 2018, : 60 - 67
  • [28] Enhanced SMOTE Algorithm for Classification of Imbalanced Big-Data using Random Forest
    Bhagat, Reshma C.
    Patil, Sachin S.
    2015 IEEE INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC), 2015, : 403 - 408
  • [29] Research on the Classification of High Dimensional Imbalanced Data Based on the Optimizational Random Forest Algorithm
    Bo, Su
    PROCEEDINGS OF 2017 9TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA), 2017, : 228 - 231
  • [30] Online ensemble learning algorithm for imbalanced data stream
    Hongle, Du
    Yan, Zhang
    Gang, Ke
    Lin, Zhang
    Chen, Yeh-Cheng
    APPLIED SOFT COMPUTING, 2021, 107