Feature Extraction and Analysis for Lung Nodule Classification using Random Forest

Cited by: 7
Authors
El-Askary, Nada S. [1]
Salem, Mohammed A-M [1, 2]
Roushdy, Mohamed I. [1]
Affiliations
[1] Ain Shams Univ, Fac Comp & Informat Sci, Cairo, Egypt
[2] German Univ Cairo, Fac Media Engn & Technol, Cairo, Egypt
Keywords
Random Forest; Classification; Computed Tomography; Machine Learning; Feature Extraction; Lung Nodule; Medical Images; Wavelet; Image Database Consortium; Pulmonary Nodules; Segmentation
DOI
10.1145/3328833.3328872
Chinese Library Classification number
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Early detection of lung nodules reduces the risk of lung cancer reaching an advanced stage. Random forest (RF), a machine learning classifier, is used to detect lung nodules by classifying soft tissue as nodule or non-nodule. A lung nodule classification approach is proposed to improve early nodule detection. A five-stage model was built and tested using 165 cases from the LIDC database. Stage 1 is image acquisition and preprocessing. Stage 2 extracts 119 features from the CT image. Stage 3 refines the feature vectors by removing all duplicate instances and undersampling the non-nodule class. Stage 4 tunes the RF parameters. Stage 5 examines different combinations of the extracted feature sets to select those that score best for classification. RF achieves the highest accuracy among the machine learning classifiers compared, which include k-nearest neighbors (KNN), support vector machine (SVM), and decision tree (DT). The proposed method aims to analyze and select the features that maximize classification results. The pixel-based and wavelet-based feature sets yielded the highest accuracy. RF was tuned to 170 trees and an in-bag fraction of 0.007. The best results achieved by the proposed model are 90.67%, 90.8%, and 90.73% for sensitivity, specificity, and accuracy, respectively.
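The refinement step (Stage 3) and the three reported metrics can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the toy two-value feature vectors, and the 1:1 balancing ratio are assumptions made for illustration, since the abstract does not state how far the non-nodule class was undersampled. Labels are assumed to be 1 for nodule and 0 for non-nodule.

```python
import random


def deduplicate(rows):
    """Drop exact duplicate (features, label) instances, keeping the first occurrence."""
    seen = set()
    unique = []
    for features, label in rows:
        key = (tuple(features), label)
        if key not in seen:
            seen.add(key)
            unique.append((features, label))
    return unique


def undersample(rows, majority_label, seed=0):
    """Randomly drop majority-class rows until both classes have equal size.

    The 1:1 target ratio is an assumption; the abstract gives no ratio.
    """
    majority = [r for r in rows if r[1] == majority_label]
    minority = [r for r in rows if r[1] != majority_label]
    if len(majority) <= len(minority):
        return list(rows)
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    return minority + rng.sample(majority, len(minority))


def sensitivity_specificity_accuracy(y_true, y_pred, positive=1):
    """Compute the three metrics from binary ground-truth and predicted labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return sensitivity, specificity, accuracy


# Toy data: (feature vector, label) with 1 = nodule, 0 = non-nodule.
toy_rows = [
    ([0.1, 0.2], 1),
    ([0.1, 0.2], 1),  # exact duplicate of the first instance
    ([0.3, 0.4], 0),
    ([0.5, 0.6], 0),
    ([0.7, 0.8], 0),
    ([0.9, 1.0], 1),
]
refined = undersample(deduplicate(toy_rows), majority_label=0)
print(len(refined), "instances after refinement")
print(sensitivity_specificity_accuracy([1, 1, 0, 0], [1, 0, 0, 0]))
```

Sensitivity is the fraction of true nodules that are detected, specificity is the fraction of non-nodules correctly rejected, and accuracy is the fraction of all instances classified correctly. Balancing the classes before training is one common way to keep the first two figures close to each other.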
Pages: 248-252
Number of pages: 5