A Hybrid Multiple Indefinite Kernel Learning Framework for Disease Classification from Gene Expression Data

被引:0
|
作者
Swetha, S. [1 ]
Srinivasan, G. N. [1 ]
Dayananda, P. [2 ]
机构
[1] RV Coll Engn, Dept Informat Sci & Engn, Bengaluru 560059, Karnataka, India
[2] Manipal Acad Higher Educ, Manipal Inst Technol Bengaluru, Dept Informat Technol, Manipal, India
关键词
Gene expression; optimized kernel principle component analysis; multiple indefinite kernel learning; flow direction algorithm based support vector machine; arithmetic optimization algorithm;
D O I
10.14569/IJACSA.2023.0140690
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, Machine Learning (ML) techniques have been used by several researchers to classify diseases using gene expression data. Disease categorization using heterogeneous gene expression data is often used for defining critical problems such as cancer analysis. A variety of evaluated factors known as genes are used to characterize the gene expression data gathered from DNA microarrays. Accurate classification of genetic data is essential to provide accurate treatments to sick people. A large number of genes can be viewed simultaneously from the collected data. However, processing this data has some limitations due to noises, redundant data, frequent errors, increased complexity, smaller samples with high dimensionality, difficult interpretation, etc. A model must be able to distinguish the features in such heterogeneous data with high accuracy to make accurate predictions. So this paper presents an innovative model to overcome these issues. The proposed model includes an effective multiple indefinite kernel learning based model for analyze the gene expression microarray data, then an optimized kernel principal component analysis (OKPCA) to select best features , hybrid flow-directed arithmetic support vector machine (SVM)-based multiple infinite kernel learning (FDASVM-MIKL) model for classification. Flow direction and arithmetic optimization algorithms are combined with SVM to increase classification accuracy. The proposed technique has an accuracy of 99.95%, 99.63%, 99.60%, 99.51% , 99.79% using the datasets including colon, Isolet, ALLAML, Lung_cancer, and Snp2 graph.
引用
收藏
页码:844 / 855
页数:12
相关论文
共 50 条
  • [41] Meta-learning approach to gene expression data classification
    de Souza, Bruno Feres
    Soares, Carlos
    de Carvalho, Andre C. P. L. F.
    INTERNATIONAL JOURNAL OF INTELLIGENT COMPUTING AND CYBERNETICS, 2009, 2 (02) : 285 - 303
  • [42] Learning misclassification costs for imbalanced classification on gene expression data
    Lu, Huijuan
    Xu, Yige
    Ye, Minchao
    Yan, Ke
    Gao, Zhigang
    Jin, Qun
    BMC BIOINFORMATICS, 2019, 20 (01)
  • [43] Comparative Study of Disease Classification Using Multiple Machine Learning Models Based on Landmark and Non-Landmark Gene Expression Data
    Huang, Xiaoqin
    Sun, Jian
    Srinivasan, Satish Mahadevan
    Sangwan, Raghvinder S.
    BIG DATA, IOT, AND AI FOR A SMARTER FUTURE, 2021, 185 : 264 - 273
  • [44] Optimized gene selection and classification of cancer from microarray gene expression data using deep learning
    Shah, Shamveel Hussain
    Iqbal, Muhammad Javed
    Ahmad, Iftikhar
    Khan, Suleman
    Rodrigues, Joel J. P. C.
    NEURAL COMPUTING & APPLICATIONS, 2020,
  • [45] Statistical characterization and classification of colon microarray gene expression data using multiple machine learning paradigms
    Maniruzzaman, Md
    Rahman, Md Jahanur
    Ahammed, Benojir
    Abedin, Md Menhazul
    Suri, Harman S.
    Biswas, Mainak
    El-Baz, Ayman
    Bangeas, Petros
    Tsoulfas, Georgios
    Suri, Jasjit S.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2019, 176 : 173 - 193
  • [46] DISCRIMINATING MULTIPLE KERNEL LEARNING FOR JOINT CLASSIFICATION OF OPTICAL AND LIDAR DATA IN URBAN AREA
    Wang Qingwang
    Liu Huan
    Gu Yanfeng
    2015 7TH WORKSHOP ON HYPERSPECTRAL IMAGE AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2015,
  • [47] The hybrid of semisupervised manifold learning and spectrum kernel for classification
    Shen, Liang
    Xu, Qingsong
    Cao, Dongsheng
    Liang, Yizeng
    Dai, Hongshuai
    JOURNAL OF CHEMOMETRICS, 2018, 32 (02)
  • [48] Boosting Kernel Discriminant Analysis and Its Application to Tissue Classification of Gene Expression Data
    Dai, Guang
    Yeung, Dit-Yan
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 744 - 749
  • [49] A framework for density weighted kernel fuzzy c-Means on gene expression data
    Wang, Yu
    Angelova, Maia
    Zhang, Yang
    Advances in Intelligent Systems and Computing, 2013, 212 : 453 - 461
  • [50] Learning microarray gene expression data by hybrid discriminant analysis
    Lu, Yijuan
    Tian, Qi
    Sanchez, Maribel
    Neary, Jennifer
    Liu, Feng
    Wang, Yufeng
    IEEE MULTIMEDIA, 2007, 14 (04) : 22 - 31