An efficient approach for imputation and classification of medical data values using class-based clustering of medical records

Cited by: 33
Authors
Yelipe, UshaRani [1]
Porika, Sammulal [3]
Golla, Madhu [2]
Affiliations
[1] VNR Vignana Jyothi Inst Engn & Technol, Hyderabad, Andhra Prades, India
[2] VNR Vignana Jyothi Inst Engn & Technol, Dept Informat Technol, Hyderabad, Andhra Prades, India
[3] JNTUH Coll Engn, Karimnagar, India
Keywords
Imputation; Medical record; Clustering; Classifiers; Missing values; Prediction; Missing value estimation
DOI
10.1016/j.compeleceng.2017.11.030
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Medical data are rarely free of missing values, even when they are collected and sampled through clinical trials. Existing imputation techniques do not address the problem of high dimensionality, and the distance functions they apply also suffer from the curse of dimensionality. Innovative approaches and methods are therefore needed for accurate and efficient analysis of medical records. This research proposes an improved imputation approach called IM-CBC (imputation based on class-based clustering) and a classifier termed the Class-Based-Clustering Classifier (CBCC-IM). Experiments are performed on nine benchmark datasets. The results recorded with the IM-CBC imputation approach are compared with ten imputation approaches using the KNN, SVM, and C4.5 classifiers, and with the CBCC classifier using Euclidean distance and fuzzy Gaussian similarity functions. The results show that classifier performance is improved or at least close to that of the existing approaches. The CBCC-IM classifier records the highest accuracy among all classifiers compared on benchmark datasets such as Cleveland, Ecoli, Iris, Pima, Wine, and Wisconsin. © 2017 Elsevier Ltd. All rights reserved.
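The abstract does not spell out the IM-CBC algorithm, so the following is only a generic sketch of the underlying idea of class-aware imputation, not the authors' method. It assumes a numeric feature matrix with `NaN` marking missing values and a known class label per record. Each missing attribute is filled with the mean of that attribute within the record's own class. The function name `class_centroid_impute` and the toy data are hypothetical.

```python
import numpy as np

def class_centroid_impute(X, y):
    """Fill NaNs in X with per-class attribute means (illustrative sketch only)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y)
    filled = X.copy()
    for label in np.unique(y):
        rows = np.where(y == label)[0]
        # Centroid of this class, computed over observed (non-NaN) entries only.
        # Assumes every attribute is observed at least once within each class.
        centroid = np.nanmean(X[rows], axis=0)
        for r in rows:
            missing = np.isnan(X[r])
            filled[r, missing] = centroid[missing]
    return filled

# Toy data: two classes, one missing value in each.
X = [[1.0, 2.0],
     [3.0, np.nan],
     [10.0, 20.0],
     [np.nan, 40.0]]
y = [0, 0, 1, 1]
print(class_centroid_impute(X, y))
```

Restricting the mean to records of the same class keeps an imputed value from being pulled toward the other classes. A fuller treatment would also need to handle records whose class is unknown and attributes that are missing for an entire class.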
Pages: 487-504
Page count: 18