A Hybrid Multiple Indefinite Kernel Learning Framework for Disease Classification from Gene Expression Data

被引:0
|
作者
Swetha, S. [1 ]
Srinivasan, G. N. [1 ]
Dayananda, P. [2 ]
机构
[1] RV Coll Engn, Dept Informat Sci & Engn, Bengaluru 560059, Karnataka, India
[2] Manipal Acad Higher Educ, Manipal Inst Technol Bengaluru, Dept Informat Technol, Manipal, India
关键词
Gene expression; optimized kernel principle component analysis; multiple indefinite kernel learning; flow direction algorithm based support vector machine; arithmetic optimization algorithm;
D O I
10.14569/IJACSA.2023.0140690
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In recent years, Machine Learning (ML) techniques have been used by several researchers to classify diseases using gene expression data. Disease categorization using heterogeneous gene expression data is often used for defining critical problems such as cancer analysis. A variety of evaluated factors known as genes are used to characterize the gene expression data gathered from DNA microarrays. Accurate classification of genetic data is essential to provide accurate treatments to sick people. A large number of genes can be viewed simultaneously from the collected data. However, processing this data has some limitations due to noises, redundant data, frequent errors, increased complexity, smaller samples with high dimensionality, difficult interpretation, etc. A model must be able to distinguish the features in such heterogeneous data with high accuracy to make accurate predictions. So this paper presents an innovative model to overcome these issues. The proposed model includes an effective multiple indefinite kernel learning based model for analyze the gene expression microarray data, then an optimized kernel principal component analysis (OKPCA) to select best features , hybrid flow-directed arithmetic support vector machine (SVM)-based multiple infinite kernel learning (FDASVM-MIKL) model for classification. Flow direction and arithmetic optimization algorithms are combined with SVM to increase classification accuracy. The proposed technique has an accuracy of 99.95%, 99.63%, 99.60%, 99.51% , 99.79% using the datasets including colon, Isolet, ALLAML, Lung_cancer, and Snp2 graph.
引用
收藏
页码:844 / 855
页数:12
相关论文
共 50 条
  • [31] A Hybrid Data Gravitation based Classification Algorithm Applied to Gene Expression Data
    Yeh, Wei-Chang
    Lee, Yen-Chin
    Lai, Chyh-Ming
    Shih, Yan-Chih
    Huang, Hsin-Ping
    Jiang, Yunzhi
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [32] A Hybrid Machine Learning Approach to Screen Optimal Predictors for the Classification of Primary Breast Tumors from Gene Expression Microarray Data
    Alromema, Nashwan
    Syed, Asif Hassan
    Khan, Tabrej
    DIAGNOSTICS, 2023, 13 (04)
  • [33] Joint learning of multiple gene networks from single-cell gene expression data
    Wu, Nuosi
    Yin, Fu
    Ou-Yang, Le
    Zhu, Zexuan
    Xie, Weixin
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2020, 18 : 2583 - 2595
  • [34] Kernel hierarchical gene clustering from microarray expression data
    Qin, J
    Lewis, DP
    Noble, WS
    BIOINFORMATICS, 2003, 19 (16) : 2097 - 2104
  • [35] A Hybrid Learning Framework for Imbalanced Classification
    Jiang, Eric P.
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2022, 18 (01)
  • [36] Gene selection and classification from microarray data using kernel machine
    Cho, JH
    Lee, D
    Park, JH
    Lee, IB
    FEBS LETTERS, 2004, 571 (1-3) : 93 - 98
  • [37] Distribution Adaptation and Classification Framework Based on Multiple Kernel Learning for Motor Imagery BCI Illiteracy
    Tao, Lin
    Cao, Tianao
    Wang, Qisong
    Liu, Dan
    Sun, Jinwei
    SENSORS, 2022, 22 (17)
  • [38] A Kernel-Based Framework for Learning Graded Relations From Data
    Waegeman, Willem
    Pahikkala, Tapio
    Airola, Antti
    Salakoski, Tapio
    Stock, Michiel
    De Baets, Bernard
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2012, 20 (06) : 1090 - 1101
  • [39] An epicurean learning approach to gene-expression data classification
    Albrecht, A
    Vinterbo, SA
    Ohno-Machado, L
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2003, 28 (01) : 75 - 87
  • [40] Learning misclassification costs for imbalanced classification on gene expression data
    Huijuan Lu
    Yige Xu
    Minchao Ye
    Ke Yan
    Zhigang Gao
    Qun Jin
    BMC Bioinformatics, 20