Features processing for random forest optimization in lung nodule localization

Cited by: 11
Authors
El-Askary, Nada S. [1]
Salem, Mohammed A.-M. [2, 3]
Roushdy, Mohamed I. [4]
Affiliations
[1] Ain Shams Univ, Comp Sci Dept, Fac Comp & Informat Sci, Cairo, Egypt
[2] Ain Shams Univ, Fac Comp & Informat Sci, Sci Comp Dept, Cairo, Egypt
[3] German Univ Cairo, Fac Media Engn & Technol, Cairo, Egypt
[4] Future Univ Egypt, Fac Comp & Informat Technol, New Cairo, Egypt
Keywords
Lung nodule localization; Computed Tomography; Automatic detection; Random forest; Lung features; Feature processing; IMAGE DATABASE CONSORTIUM; PULMONARY NODULES; CLASSIFICATION
DOI
10.1016/j.eswa.2021.116489
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Lung nodules can cause lung cancer, so researchers work to detect them in their early stages. Machine learning algorithms are used to detect lung nodules quickly and with high accuracy. Random Forest (RF) is a remarkable ensemble machine learning algorithm that can be used to classify medical images, recognize different pathologies, and detect deficiencies based on selected input features. This paper proposes a model for early detection and localization of lung nodules in CT images, proposes an RF optimization, and analyzes the effect of feature groups on classification accuracy. Processing was applied to the features extracted from CT images to optimize the RF output. In previous work, local features such as Haar features gave better results than region-based features. In the proposed model, after a novel ANDing technique is applied in the preprocessing step, the region-based features give better results and the model accuracy improves. Combining global and local features greatly improves the model's classification results and accuracy. Experiments used 214 cases with a total of 2124 CT slices downloaded from the publicly available LIDC database. After preprocessing with the novel technique, 119 features are calculated and extracted for each pixel in the CT image. Post-processing is applied to the extracted features to refine the learner's input data. Feature dimensionality reduction was performed by dividing the features into 5 different feature sets and selecting the best-scoring results. Finally, compared with previous work, the RF is optimized: the true positive rate increases by 8.66% and the false positive rate decreases by 4.4%, which leads to better localization and a 5.47% increase in accuracy. The best results achieved were 96.41%, 95.98%, and 96.20% for sensitivity, specificity, and accuracy, respectively, when tuning RF with 80 trees and an in-bag fraction of 0.04. Results from RF were compared with other methodologies such as KNN, SVM, CNN, and deep learning, and RF gave the best accuracy, as mentioned in the discussion section.
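The abstract reports sensitivity, specificity, and accuracy, together with changes in the true and false positive rates. As a minimal sketch of how these quantities relate, assuming per-pixel binary labels (1 = nodule, 0 = background) and the standard confusion-matrix definitions, the Python below computes them from label lists. The names `ConfusionCounts` and `count_confusion` and the example labels are hypothetical illustrations, not taken from the paper, and the numbers are not the paper's data.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class ConfusionCounts:
    """Confusion counts for a binary nodule / background classifier (hypothetical helper)."""
    tp: int  # nodule pixels predicted as nodule
    fn: int  # nodule pixels predicted as background
    tn: int  # background pixels predicted as background
    fp: int  # background pixels predicted as nodule

    @property
    def sensitivity(self) -> float:
        # True positive rate: fraction of nodule pixels that were found.
        return self.tp / (self.tp + self.fn)

    @property
    def specificity(self) -> float:
        # True negative rate: fraction of background pixels kept as background.
        return self.tn / (self.tn + self.fp)

    @property
    def false_positive_rate(self) -> float:
        # Complement of specificity.
        return self.fp / (self.fp + self.tn)

    @property
    def accuracy(self) -> float:
        # Fraction of all pixels classified correctly.
        return (self.tp + self.tn) / (self.tp + self.fn + self.tn + self.fp)


def count_confusion(y_true: Sequence[int], y_pred: Sequence[int]) -> ConfusionCounts:
    """Tally confusion counts from ground-truth and predicted labels (1 = nodule)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    return ConfusionCounts(tp=tp, fn=fn, tn=tn, fp=fp)


# Made-up labels for illustration only.
example = count_confusion([1, 1, 0, 0, 1, 0], [1, 0, 0, 1, 1, 0])
```

Because the false positive rate is the complement of specificity, a drop in false positives shows up directly as a gain in specificity, while a rise in the true positive rate is a gain in sensitivity.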
Pages: 9