Pan-cancer classification by regularized multi-task learning

被引:6
|
作者
Hossain, Sk Md Mosaddek [1 ]
Khatun, Lutfunnesa [2 ]
Ray, Sumanta [1 ]
Mukhopadhyay, Anirban [2 ]
机构
[1] Aliah Univ, Comp Sci & Engn, Kolkata 700160, India
[2] Univ Kalyani, Comp Sci & Engn, Kalyani 741235, W Bengal, India
关键词
INFORMATION; PROGNOSIS; MODEL;
D O I
10.1038/s41598-021-03554-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Classifying pan-cancer samples using gene expression patterns is a crucial challenge for the accurate diagnosis and treatment of cancer patients. Machine learning algorithms have been considered proven tools to perform downstream analysis and capture the deviations in gene expression patterns across diversified diseases. In our present work, we have developed PC-RMTL, a pan-cancer classification model using regularized multi-task learning (RMTL) for classifying 21 cancer types and adjacent normal samples using RNASeq data obtained from TCGA. PC-RMTL is observed to outperform when compared with five state-of-the-art classification algorithms, viz. SVM with the linear kernel (SVM-Lin), SVM with radial basis function kernel (SVM-RBF), random forest (RF), k-nearest neighbours (kNN), and decision trees (DT). The PC-RMTL achieves 96.07% accuracy and 95.80% MCC score for a completely unknown independent test set. The only method that appears as the real competitor is SVM-Lin, which nearly equalizes the accuracy in prediction of PC-RMTL but only when complete feature sets are provided for training; otherwise, PC-RMTL outperformed all other classification models. To the best of our knowledge, this is a significant improvement over all the existing works in pan-cancer classification as they have failed to classify many cancer types from one another reliably. We have also compared gene expression patterns of the top discriminating genes across the cancers and performed their functional enrichment analysis that uncovers several interesting facts in distinguishing pan-cancer samples.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] A New Multi-Task Learning Technique to Predict Classification of Leukemia and Prostate Cancer
    Chen, Austin H.
    Huang, Zone-Wei
    MEDICAL BIOMETRICS, PROCEEDINGS, 2010, 6165 : 11 - +
  • [22] A multi-task learning based approach for efficient breast cancer detection and classification
    Mishra, Arnab Kumar
    Roy, Pinki
    Bandyopadhyay, Sivaji
    Das, Sujit Kumar
    EXPERT SYSTEMS, 2022, 39 (09)
  • [23] Multi-task learning for object keypoints detection and classification
    Xu, Jie
    Zhao, Lin
    Zhang, Shanshan
    Gong, Chen
    Yang, Jian
    PATTERN RECOGNITION LETTERS, 2020, 130 : 182 - 188
  • [24] Curriculum Learning for Multi-Task Classification of Visual Attributes
    Sarafianos, Nikolaos
    Giannakopoulos, Theodore
    Nikou, Christophoros
    Kakadiaris, Ioannis A.
    2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 2608 - 2615
  • [25] Multi-task gradient descent for multi-task learning
    Lu Bai
    Yew-Soon Ong
    Tiantian He
    Abhishek Gupta
    Memetic Computing, 2020, 12 : 355 - 369
  • [26] Multi-task gradient descent for multi-task learning
    Bai, Lu
    Ong, Yew-Soon
    He, Tiantian
    Gupta, Abhishek
    MEMETIC COMPUTING, 2020, 12 (04) : 355 - 369
  • [27] Hierarchical Multi-Task Learning for Healthy Drink Classification
    Park, Homin
    Bharadhwaj, Homanga
    Lim, Brian Y.
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [28] Multi-Task Diffusion Learning for Time Series Classification
    Zheng, Shaoqiu
    Liu, Zhen
    Tian, Long
    Ye, Ling
    Zheng, Shixin
    Peng, Peng
    Chu, Wei
    ELECTRONICS, 2024, 13 (20)
  • [29] Deep multi-task learning for malware image classification
    Bensaoud, Ahmed
    Kalita, Jugal
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2022, 64
  • [30] Multi-task learning for classification with Dirichlet process priors
    Department of Electrical and Computer Engineering, Duke University, Durham, NC 27708, United States
    不详
    J. Mach. Learn. Res., 2007, (35-63):