An Effective Feature Generation and Selection Approach for Lymph Disease Recognition

被引:1
|
作者
Jha, Sunil Kr. [1 ]
Ahmad, Zulfiqar [2 ]
机构
[1] Nanjing Univ Informat Sci & Technol, Sch Comp & Software, Nanjing 210044, Peoples R China
[2] Chinese Acad Sci, Inst Hydrobiol, Wuhan 430072, Peoples R China
来源
关键词
Disease data mining; feature selection; classification; lymph; diagnosis; COMPUTER-AIDED DIAGNOSIS; CLASSIFICATION; CANCER;
D O I
10.32604/cmes.2021.016817
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Health care data mining is noteworthy in disease diagnosis and recognition procedures. There exist several potentials to further improve the performance of machine learning based-classification methods in healthcare data analysis. The selection of a substantial subset of features is one of the feasible approaches to achieve improved recognition results of classification methods in disease diagnosis prediction. In the present study, a novel combined approach of feature generation using latent semantic analysis (LSA) and selection using ranker search (RAS) has been proposed to improve the performance of classification methods in lymph disease diagnosis prediction. The performance of the proposed combined approach (LSA-RAS) for feature generation and selection is validated using three function-based and two tree-based classification methods. The performance of the LSA-RAS selected features is compared with the original attributes and other subsets of attributes and features chosen by nine different attributes and features selection approaches in the analysis of a most widely used benchmark and open access lymph disease dataset. The LSA-RAS selected features improve the recognition accuracy of the classification methods significantly in the diagnosis prediction of the lymph disease. The tree-based classification methods have better recognition accuracy than the function-based classification methods. The best performance (recognition accuracy of 93.91%) is achieved for the logistic model tree (LMT) classification method using the feature subset generated by the proposed combined approach (LSA-RAS).
引用
收藏
页码:567 / 594
页数:28
相关论文
共 50 条
  • [31] Discriminative common vector approach based feature selection in face recognition
    Koc, Mehmet
    Barkana, Atalay
    COMPUTERS & ELECTRICAL ENGINEERING, 2014, 40 (08) : 37 - 50
  • [32] Study on an approach of feature selection for similar handwritten Chinese characters recognition
    Feng, J
    Piao, CH
    Wang, YF
    ICIA 2004: Proceedings of 2004 International Conference on Information Acquisition, 2004, : 380 - 383
  • [33] A New Approach to Feature Selection in Handwritten Farsi/Arabic Character Recognition
    Shayegan, Mohammad Amin
    Chan, Chee Seng
    2012 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE APPLICATIONS AND TECHNOLOGIES (ACSAT), 2012, : 506 - 511
  • [34] Feature selection with effective distance
    Liu, Mingxia
    Zhang, Daoqiang
    NEUROCOMPUTING, 2016, 215 : 100 - 109
  • [35] A GA based hierarchical feature selection approach for handwritten word recognition
    Samir Malakar
    Manosij Ghosh
    Showmik Bhowmik
    Ram Sarkar
    Mita Nasipuri
    Neural Computing and Applications, 2020, 32 : 2533 - 2552
  • [36] Feature selection and effective classifiers
    Deogun, JS
    Choubey, SK
    Raghavan, VV
    Sever, H
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1998, 49 (05): : 423 - 434
  • [37] A fuzzy rule based effective feature selection approach for augmented reality
    Thilahar, C. Rajendra
    Sivaramakrishnan, R.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (04) : 5045 - 5054
  • [38] A Simple and Effective Approach Based on a Multi-Level Feature Selection for Automated Parkinson's Disease Detection
    Demir, Fatih
    Siddique, Kamran
    Alswaitti, Mohammed
    Demir, Kursat
    Sengur, Abdulkadir
    JOURNAL OF PERSONALIZED MEDICINE, 2022, 12 (01):
  • [39] Extreme learning machines with feature selection using GA for effective prediction of fetal heart disease: A novel approach
    Panda D.
    Panda D.
    Dash S.R.
    Parida S.
    Informatica (Slovenia), 2021, 45 (03): : 381 - 392
  • [40] Extreme Learning Machines with Feature Selection Using GA for Effective Prediction of Fetal Heart Disease: A Novel Approach
    Panda, Debjani
    Panda, Divyajyoti
    Dash, Satya Ranjan
    Parida, Shantipriya
    INFORMATICA-AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS, 2021, 45 (03): : 381 - 392