Gene Selection for Microarray Cancer Classification based on Manta Rays Foraging Optimization and Support Vector Machines

Cited by: 0
Authors
Essam H. Houssein
Hager N. Hassan
Mustafa M. Al-Sayed
Emad Nabil
Affiliations
[1] Minia University, Faculty of Computers and Information
[2] Cairo University, Faculty of Computers and Artificial Intelligence
[3] Islamic University of Madinah, Faculty of Computer Science and Information Systems
Keywords
Microarray; Gene expression; Gene selection; Cancer classification; Feature selection; Manta Ray Foraging Optimization algorithm; Support vector machines; Minimum Redundancy Maximum Relevance
DOI
Not available
Abstract
In DNA microarray applications, many techniques have been proposed for cancer classification, either to distinguish normal from cancerous samples or to classify different types of cancer. Gene selection is usually required as a preliminary step; it aims to select the most informative genes from among a very large number of genes, which is an important issue. Although many studies have addressed this issue, they fall short of obtaining the smallest set of the most informative genes with the highest accuracy and little effort, given the high dimensionality of microarray datasets. The manta ray foraging optimization (MRFO) algorithm is a recent meta-heuristic that mimics the foraging behavior of manta rays. MRFO has achieved promising results in other fields, such as solar generating units. Because of its high accuracy, the support vector machine (SVM) is the most commonly used classification algorithm in cancer studies, especially with microarray data. To exploit the strengths of both algorithms (i.e., MRFO and SVM), this paper proposes a hybrid algorithm that selects the most predictive and informative genes for cancer classification. Binary-class microarray datasets (colon and leukemia1) and multi-class microarray datasets (SRBCT, lymphoma, and leukemia2) are used to evaluate the accuracy of the proposed technique. Like other optimization techniques, MRFO suffers from problems related to the high dimensionality and complexity of microarray data. To address these problems and improve performance, the minimum redundancy maximum relevance (mRMR) method is used as a preprocessing stage. The proposed technique has been compared with the most common cancer classification algorithms. The experimental results show that it achieves the highest accuracy with the fewest informative genes and little effort.
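The mRMR preprocessing stage mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes expression values have already been discretized, it uses the difference form of the mRMR criterion (relevance minus mean redundancy), and the function names `mutual_information` and `mrmr_select` as well as the toy data are hypothetical. The MRFO search and SVM evaluation stages are not shown.

```python
from collections import Counter
from math import log


def mutual_information(xs, ys):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * log(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def mrmr_select(features, labels, k):
    """Greedy mRMR, difference criterion (an assumed variant).

    features: list of columns, one discrete-valued list per gene.
    Returns the indices of up to k genes in the order they were picked.
    """
    relevance = [mutual_information(col, labels) for col in features]
    selected = []
    remaining = set(range(len(features)))
    while remaining and len(selected) < k:

        def score(j):
            # Relevance to the class labels minus mean redundancy
            # with the genes that were already selected.
            if not selected:
                return relevance[j]
            redundancy = sum(
                mutual_information(features[j], features[s]) for s in selected
            ) / len(selected)
            return relevance[j] - redundancy

        best = max(sorted(remaining), key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy data (hypothetical): 8 samples, 4 discretized genes.
labels = [0, 0, 0, 0, 1, 1, 1, 1]
features = [
    [0, 0, 0, 1, 1, 1, 1, 1],  # gene 0: informative, one mismatch
    [0, 0, 0, 1, 1, 1, 1, 1],  # gene 1: exact duplicate of gene 0
    [0, 0, 0, 0, 0, 1, 1, 1],  # gene 2: informative, different mismatch
    [0, 1, 0, 1, 0, 1, 0, 1],  # gene 3: unrelated to the labels
]
selected = mrmr_select(features, labels, k=2)
print(selected)
```

On this toy data the duplicate gene is penalized for its redundancy, so the two picks are genes 0 and 2. In a pipeline like the one the abstract describes, a reduced gene set of this kind would then be handed to the wrapper stage, where candidate subsets are scored by classifier accuracy.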
Pages: 2555-2572
Page count: 17
Related papers
50 in total
  • [1] Gene Selection for Microarray Cancer Classification based on Manta Rays Foraging Optimization and Support Vector Machines
    Houssein, Essam H.
    Hassan, Hager N.
    Al-Sayed, Mustafa M.
    Nabil, Emad
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (02) : 2555 - 2572
  • [2] Hybrid huberized support vector machines for microarray classification and gene selection
    Wang, Li
    Zhu, Ji
    Zou, Hui
    BIOINFORMATICS, 2008, 24 (03) : 412 - 419
  • [3] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [4] Gene Selection for Cancer Classification using Support Vector Machines
    Isabelle Guyon
    Jason Weston
    Stephen Barnhill
    Vladimir Vapnik
    Machine Learning, 2002, 46 : 389 - 422
  • [5] A Hybrid Barnacles Mating Optimizer Algorithm With Support Vector Machines for Gene Selection of Microarray Cancer Classification
    Houssein, Essam H.
    Abdelminaam, Diaa Salama
    Hassan, Hager N.
    Al-Sayed, Mustafa M.
    Nabil, Emad
    IEEE ACCESS, 2021, 9 : 64895 - 64905
  • [6] Manta Ray Foraging Optimization with Vector Quantization Based Microarray Image Compression Technique
    Alkhaldi, Nora A.
    Alsedais, Rawabi Abdulaziz Abdullah
    Halawani, Hanan T.
    Aboutaleb, Sayed M. Abdelkhalek M.
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [7] Applications of support vector machines to cancer classification with microarray data
    Chu, F
    Wang, LP
    INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2005, 15 (06) : 475 - 484
  • [8] Transductive Support Vector Machines for classification of microarray gene expression data
    Semolini, R
    Von Zuben, FJ
    PROCEEDINGS OF THE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS 2003, VOLS 1-4, 2003, : 2946 - 2951
  • [9] Feature selection and classification of breast cancer diagnosis based on support vector machines
    Purnami, Santi Wulan
    Rahayu, S. P.
    Embong, Abdullah
    INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 500 - 505
  • [10] Gene selection for cancer classification using bootstrapped genetic algorithms and support vector machines
    Chen, XW
    PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 504 - 505