Cluster Tree based Multi-Label Classification for Protein Function Prediction

被引:0
|
作者
Wu, Qingyao [1 ,2 ]
Ye, Yunming [1 ,2 ]
Zhang, Xiaofeng [1 ,2 ]
Ho, Shen-Shyang [3 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Dept Comp Sci, Shenzhen, Peoples R China
[2] Shenzhen Key Lab Internet Informat Collaboration, Shenzhen, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
Data mining; Multi-label data; Multi-label classification; Protein function prediction;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automatically assigning functions for unknown proteins is a key task in computational biology. Proteins in nature have multiple classes according to the functions they perform. Many efforts have been made to cast the protein function prediction into a multi-label learning problem. This paper proposes a novel Cluster Tree based Multi-label Learning algorithm (CTML) for protein function prediction. The main idea is to compute a set of predictive labels associated at each node for multi-label prediction by using the k-means clustering techniques and the predictive functions via the learning data at the nodes. With the propagation of the predictive labels from the root node to the leaf node, the correlations between labels can be preserved. Experimental results on benchmark data (genbase and yeast datasets) show that the proposed CTML algorithm is effective in predicting protein functions. Moreover, the classification performance of the CTML algorithm is competitive against the other baseline multi-label learning algorithms.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Reduction strategies for hierarchical multi-label classification in protein function prediction
    Ricardo Cerri
    Rodrigo C. Barros
    André C. P. L. F. de Carvalho
    Yaochu Jin
    BMC Bioinformatics, 17
  • [2] Reduction strategies for hierarchical multi-label classification in protein function prediction
    Cerri, Ricardo
    Barros, Rodrigo C.
    de Carvalho, Andre C. P. L. F.
    Jin, Yaochu
    BMC BIOINFORMATICS, 2016, 17
  • [3] Exploiting PubMed for Protein Molecular Function Prediction via NMF based Multi-Label Classification
    Fodeh, Samah
    Tiwari, Aditya
    Yu, Hong
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2017), 2017, : 446 - 451
  • [4] A hierarchical multi-label classification ant colony algorithm for protein function prediction
    Otero F.E.B.
    Freitas A.A.
    Johnson C.G.
    Memetic Computing, 2010, 2 (3) : 165 - 181
  • [5] Multi-label Feature Selection Techniques for Hierarchical Multi-label Protein Function Prediction
    Cerri, Ricardo
    Mantovani, Rafael G.
    Basgalupp, Marcio P.
    de Carvalho, Andre C. P. L. F.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [6] Multi-label Classification under Uncertainty: A Tree-based Conformal Prediction Approach
    Tyagi, Chhavi
    Guo, Wenge
    CONFORMAL AND PROBABILISTIC PREDICTION WITH APPLICATIONS, VOL 204, 2023, 204 : 488 - 512
  • [7] Multi-Label Hierarchical Classification using a Competitive Neural Network for Protein Function Prediction
    Borges, Helyane Bronoski
    Nievola, Julio Cesar
    2012 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2012,
  • [8] Hierarchical Multi-label Associative Classification for Protein Function Prediction Using Gene Ontology
    Sangsuriyun, Sawinee
    Rakthanmanon, Thanawin
    Waiyamai, Kitsana
    CHIANG MAI JOURNAL OF SCIENCE, 2019, 46 (01): : 165 - 179
  • [9] A Neural Network-Based Multi-Label Classifier for Protein Function Prediction
    Tahzeeb, Shahab
    Hasan, Shehzad
    ENGINEERING TECHNOLOGY & APPLIED SCIENCE RESEARCH, 2022, 12 (01) : 7974 - 7981
  • [10] ML-FOREST: A Multi-Label Tree Ensemble Method for Multi-Label Classification
    Wu, Qingyao
    Tan, Mingkui
    Song, Hengjie
    Chen, Jian
    Ng, Michael K.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (10) : 2665 - 2680