Feature Extraction and Analysis for Lung Nodule Classification using Random Forest

被引:7
|
作者
El-Askary, Nada S. [1 ]
Salem, Mohammed A-M [1 ,2 ]
Roushdy, Mohamed, I [1 ]
机构
[1] Ain Shams Univ, Fac Comp & Informat Sci, Cairo, Egypt
[2] German Univ Cairo, Fac Media Engn & Technol, Cairo, Egypt
关键词
Random Forest; Classification; Computed tomography; Machine Learning; Feature Extraction; Lung Nodule; Medical Images; Wavelet; IMAGE DATABASE CONSORTIUM; PULMONARY NODULES; SEGMENTATION;
D O I
10.1145/3328833.3328872
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Early detection of lung nodule decreases the risk of advanced stages in lung cancer disease. Random forest (RF), a machine learning classifier, is used to detect the lung nodules and classify soft-tissues into nodules and non-nodules. A lung nodule classification approach is proposed to improve early detection for nodules. A five stages model has been built and tested using 165 cases from the LIDC database. Stage 1 is image acquisition and preprocessing. Stage 2 is extracting 119 features from the CT image. Stage 3 is refining feature vectors by removing all duplicate instances and undersampling the non-nodule class. Stage 4 is tuning the RF parameters. Stage 5 is examining different collections from the extracted feature sets to select those scores best for classification. The accuracy achieved by RF is the highest compared to other machine learning classifiers such as KNN, SVM, and DT. The proposed method aimed to analyze and select features that maximize classification results. Pixel based feature set and wavelet-based set scored best for higher accuracy. RF was tuned with 170 trees and 0.007 for in-bag fraction. Best results were achieved by the proposed model are 90.67%, 90.8% and 90.73% for sensitivity, specificity, and accuracy respectively.
引用
收藏
页码:248 / 252
页数:5
相关论文
共 50 条
  • [41] Evolving Deep Forest with Automatic Feature Extraction for Image Classification Using Genetic Programming
    Bi, Ying
    Xue, Bing
    Zhang, Mengjie
    PARALLEL PROBLEM SOLVING FROM NATURE - PPSN XVI, PT I, 2020, 12269 : 3 - 18
  • [42] Random Subset Feature Selection and Classification of Lung Sound
    Don, S.
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 313 - 322
  • [43] A Benchmarking: Feature Extraction and Classification of Agricultural Textures Using LBP, GLCM, RBO, Neural Networks, k-NN, and Random Forest
    Aygun, Sercan
    Gunes, Ece Olcay
    2017 6TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS, 2017, : 11 - 14
  • [44] Stellar spectral classification and feature evaluation based on a random forest
    Xiang-Ru Li
    Yang-Tao Lin
    Kai-Bin Qiu
    ResearchinAstronomyandAstrophysics, 2019, 19 (08) : 56 - 62
  • [45] Microgrid fault classification based on random forest feature selection
    Wang, Changhong
    Gao, Yanjie
    Tang, Min
    REVIEWS OF ADHESION AND ADHESIVES, 2023, 11 (02): : 220 - 237
  • [46] Stellar spectral classification and feature evaluation based on a random forest
    Li, Xiang-Ru
    Lin, Yang-Tao
    Qiu, Kai-Bin
    RESEARCH IN ASTRONOMY AND ASTROPHYSICS, 2019, 19 (08)
  • [47] Computerized Lung Nodule Detection Using 3D Feature Extraction and Learning Based Algorithms
    Ozekes, Serhat
    Osman, Onur
    JOURNAL OF MEDICAL SYSTEMS, 2010, 34 (02) : 185 - 194
  • [48] Computerized Lung Nodule Detection Using 3D Feature Extraction and Learning Based Algorithms
    Serhat Ozekes
    Onur Osman
    Journal of Medical Systems, 2010, 34 : 185 - 194
  • [49] Identifying feature relevance using a random forest
    Rogers, Jeremy
    Gunn, Steve
    SUBSPACE, LATENT STRUCTURE AND FEATURE SELECTION, 2006, 3940 : 173 - 184
  • [50] Feature Extraction and Classification of Respiratory Sound and Lung Diseases
    Latifi, Seyed Amir
    Ghassemian, Hassan
    Imani, Maryam
    2023 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION AND IMAGE ANALYSIS, IPRIA, 2023,