Detection of colon cancer based on microarray dataset using machine learning as a feature selection and classification techniques

被引:22
|
作者
Shafi, A. S. M. [1 ,2 ]
Molla, M. M. Imran [2 ]
Jui, Julakha Jahan [3 ]
Rahman, Mohammad Motiur [1 ]
机构
[1] Mawlana Bhashani Sci & Technol Univ, Dept Comp Sci & Engn, Tangail 1902, Bangladesh
[2] Khwaja Yunus Ali Univ, Fac Comp Sci & Engn, Sirajgonj 6751, Bangladesh
[3] Univ Malaysia Pahang, Fac Elect & Elect Engn, Pekan 26600, Pahang, Malaysia
来源
SN APPLIED SCIENCES | 2020年 / 2卷 / 07期
关键词
Colon cancer; Microarray data; Feature selection; Machine learning; Random forest; Cross validation; PARTICLE SWARM OPTIMIZATION; GENE; PREDICTION;
D O I
10.1007/s42452-020-3051-2
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Microarray data is an increasingly important tool for providing information on gene expression for analysis and interpretation. Researchers attempt to utilize the smallest possible set of relevant gene expression profiles in most gene expression studies to enhance tumor identification accuracy. This research aims to analyze and predicts colon cancer data employing a machine learning approach and feature selection technique based on a random forest classifier. More particularly, our proposed method can reduce the burden of high dimensional data and allow faster calculations by combining the "Mean Decrease Accuracy" and "Mean Decrease Gini" as feature selection methods into a renowned classifier namely Random Forest, with the aim of increasing the prediction model's accuracy level. In addition, we have also shown a comparative model analysis with selection of features and model without selection of features. The extensive experimental results have demonstrated that the proposed model with feature selection is favorable and effective which triumphs the best performance of accuracy.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Detection of colon cancer based on microarray dataset using machine learning as a feature selection and classification techniques
    A. S. M. Shafi
    M. M. Imran Molla
    Julakha Jahan Jui
    Mohammad Motiur Rahman
    SN Applied Sciences, 2020, 2
  • [2] Osteoporosis Detection Using Machine Learning Techniques and Feature Selection
    Iliou, Theodoros
    Anagnostopoulos, Christos-Nikolaos
    Anastassopoulos, George
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2014, 23 (05)
  • [3] REVIEW ON FEATURE SELECTION TECHNIQUES AND ITS IMPACT FOR EFFECTIVE DATA CLASSIFICATION USING UCI MACHINE LEARNING REPOSITORY DATASET
    Amarnath, B.
    Balamurugan, S. Appavu Alias
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2016, 11 (11) : 1639 - 1646
  • [4] Review on intrusion detection using feature selection with machine learning techniques
    Kalimuthan, C.
    Renjit, J. Arokia
    MATERIALS TODAY-PROCEEDINGS, 2020, 33 : 3794 - 3802
  • [5] Machine learning approaches for classification of colorectal cancer with and without feature selection method on microarray data
    Nazari, Elham
    Aghemiri, Mehran
    Avan, Amir
    Mehrabian, Amin
    Tabesh, Hamed
    GENE REPORTS, 2021, 25
  • [6] FEATURE SELECTION AND MACHINE LEARNING CLASSIFICATION FOR MALWARE DETECTION
    Khammas, Ban Mohammed
    Monemi, Alireza
    Bassi, Joseph Stephen
    Ismail, Ismahani
    Nor, Sulaiman Mohd
    Marsono, Muhammad Nadzir
    JURNAL TEKNOLOGI, 2015, 77 (01):
  • [7] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data
    Wang, Zixuan
    Zhou, Yi
    Takagi, Tatsuya
    Song, Jiangning
    Tian, Yu-Shi
    Shibuya, Tetsuo
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [8] Genetic algorithm-based feature selection with manifold learning for cancer classification using microarray data
    Zixuan Wang
    Yi Zhou
    Tatsuya Takagi
    Jiangning Song
    Yu-Shi Tian
    Tetsuo Shibuya
    BMC Bioinformatics, 24
  • [9] Microarray Lung Cancer Data Classification Using Similarity based Feature Selection
    Amrane, Meriem
    Oukid, Saliha
    Ensari, Tolga
    Benblidia, Nadjia
    Orman, Zeynep
    2019 SCIENTIFIC MEETING ON ELECTRICAL-ELECTRONICS & BIOMEDICAL ENGINEERING AND COMPUTER SCIENCE (EBBT), 2019,
  • [10] Bystander Detection: Automatic Labeling Techniques using Feature Selection and Machine Learning
    Gupta, Anamika
    Thakkar, Khushboo
    Bhasin, Veenu
    Tiwari, Aman
    Mathur, Vibhor
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 1135 - 1143