Breast Cancer Classification Based on DNA Microarray Analysis

被引:0
|
作者
El-Rahman, Sahar A. [1 ]
Alluhaidan, Ala Saleh D. [2 ]
Marzouk, Radwa [2 ]
机构
[1] Benha Univ, Fac Engn Shoubra, Elect Engn Dept, Cairo 13511, Egypt
[2] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11671, Saudi Arabia
来源
IEEE ACCESS | 2023年 / 11卷
关键词
Genetic sequences; big data analysis; machine learning algorithms breast cancer classification; breast cancer prediction; BIG DATA; ANALYTICS;
D O I
10.1109/ACCESS.2023.3334678
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: Predicting the ability of a breast cancer patient to survive was a difficult research problem for many scholars. Since the early dates of the relevant research, significant progress has been recorded in many related areas. For example, with pioneering biomedical technologies, credits to low-cost computer hardware and software, high-quality data is gathered and stored automatically, and lastly, with better analytical methods, that massive data is processed efficiently and effectively. Therefore, the objective of this document is to submit a report on a research project in which we have benefited from the technological developments available to develop predictive models of breast cancer and whether it exists or not. Methods and materials: artificial neural network, support vector machine, decision trees, naive bayes, and random forest algorithms are used along with the most common statistical method (logistic regression) to build prediction models using a large data set. We also used the Holdout method. To avoid the unbalanced nature of the classes, the parameters of the performance evaluation are predefined. Results: The results show that the Decision Tree (DT) is the top predictor with 89.1% accuracy on the holdout sample, surpassing all prediction accuracy reported in the literature; Artificial Neural Networks (ANN) came out to be the second with 88.9% accuracy; Naive Bayes (NB) came out to be the third with 83.3% accuracy, Support Vector Machines (SVM) came out to be the fourth with 83.2% accuracy, and the Random Forest (RF) models came out to be the lowest of the five with 71.2% accuracy. Conclusion: A comparative study of multiple predictive models for breast cancer survival using a large set of data and 5-fold cross-validation gave us an insight into the relative ability to predict different data extraction methods. After analyzing the data, we have reached this conclusion: the model is able to help those who need it by predicting whether they have breast cancer or not. Furthermore, the proposed framework is valuable tool in cancer research and clinical practice.
引用
收藏
页码:138748 / 138758
页数:11
相关论文
共 50 条
  • [21] Breast Cancer Microarray and RNASeq Data Integration Applied to Classification
    Castillo, Daniel
    Manuel Galvez, Juan
    Javier Herrera, Luis
    Rojas, Ignacio
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2017, PT I, 2017, 10305 : 123 - 131
  • [22] Predicting breast cancer behavior by microarray analysis
    M van de Vijver
    Breast Cancer Research, 5 (Suppl 1)
  • [23] Cortactin in Breast Cancer: Analysis with Tissue Microarray
    Sheen-Chen, Shyr-Ming
    Huang, Chun-Ying
    Liu, Yu-Yin
    Huang, Chao-Cheng
    Tang, Rei-Ping
    ANTICANCER RESEARCH, 2011, 31 (01) : 293 - 297
  • [24] AN INTEGRATED BREAST CANCER MICROARRAY ANALYSIS APPROACH
    Lixandru-Petre, Irina-Oana
    Buiu, Catalin
    UPB Scientific Bulletin, Series C: Electrical Engineering and Computer Science, 2022, 84 (02): : 79 - 90
  • [25] AN INTEGRATED BREAST CANCER MICROARRAY ANALYSIS APPROACH
    Lixandru-Petre, Irina-Oana
    Buiu, Catalin
    UNIVERSITY POLITEHNICA OF BUCHAREST SCIENTIFIC BULLETIN SERIES C-ELECTRICAL ENGINEERING AND COMPUTER SCIENCE, 2022, 84 (02): : 79 - 90
  • [26] Breast Cancer Classification With Microarray Gene Expression Data Based on Improved Whale Optimization Algorithm
    Devi, S. Sathiya
    Prithiviraj, K.
    INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH, 2023, 14 (01)
  • [27] Cancer Classification From DNA Microarray Using Genetic Algorithms and Case-Based Reasoning
    Machacha, Lilybert
    Bhattacharya, Prabir
    INTERNATIONAL JOURNAL OF SOFTWARE SCIENCE AND COMPUTATIONAL INTELLIGENCE-IJSSCI, 2021, 13 (01): : 17 - 37
  • [28] A Cancer Recognition Method Based on DNA Microarray
    Su Qian
    An Dong
    Zhai Yafeng
    Wang Ku
    Wang Shoujue
    CHINESE JOURNAL OF ELECTRONICS, 2009, 18 (03): : 491 - 493
  • [29] Cancer identification based on DNA microarray data
    Liu, Yihui
    EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 153 - +
  • [30] Microarray-based analysis of microRNA expression in breast cancer stem cells
    Sun, Jian-guo
    Liao, Rong-xia
    Qiu, Jun
    Jin, Jun-yu
    Wang, Xin-xin
    Duan, Yu-zhong
    Chen, Fang-lin
    Hao, Ping
    Xie, Qi-chao
    Wang, Zhi-xin
    Li, De-zhi
    Chen, Zheng-tang
    Zhang, Shao-xiang
    JOURNAL OF EXPERIMENTAL & CLINICAL CANCER RESEARCH, 2010, 29