Cluster Tree based Multi-Label Classification for Protein Function Prediction

被引:0
|
作者
Wu, Qingyao [1 ,2 ]
Ye, Yunming [1 ,2 ]
Zhang, Xiaofeng [1 ,2 ]
Ho, Shen-Shyang [3 ]
机构
[1] Harbin Inst Technol, Shenzhen Grad Sch, Dept Comp Sci, Shenzhen, Peoples R China
[2] Shenzhen Key Lab Internet Informat Collaboration, Shenzhen, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
关键词
Data mining; Multi-label data; Multi-label classification; Protein function prediction;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Automatically assigning functions for unknown proteins is a key task in computational biology. Proteins in nature have multiple classes according to the functions they perform. Many efforts have been made to cast the protein function prediction into a multi-label learning problem. This paper proposes a novel Cluster Tree based Multi-label Learning algorithm (CTML) for protein function prediction. The main idea is to compute a set of predictive labels associated at each node for multi-label prediction by using the k-means clustering techniques and the predictive functions via the learning data at the nodes. With the propagation of the predictive labels from the root node to the leaf node, the correlations between labels can be preserved. Experimental results on benchmark data (genbase and yeast datasets) show that the proposed CTML algorithm is effective in predicting protein functions. Moreover, the classification performance of the CTML algorithm is competitive against the other baseline multi-label learning algorithms.
引用
收藏
页数:4
相关论文
共 50 条
  • [41] Multi-label Correlated Semi-supervised Learning for Protein Function Prediction
    Jiang, Jonathan Q.
    BIOINFORMATICS RESEARCH AND APPLICATIONS, 2011, 6674 : 368 - 379
  • [42] Multi-label Classification for Tree and Directed Acyclic Graphs Hierarchies
    Ramirez-Corona, Mallinali
    Enrique Sucar, L.
    Morales, Eduardo F.
    PROBABILISTIC GRAPHICAL MODELS, 2014, 8754 : 409 - 425
  • [43] Design and development of an efficient hierarchical approach for multi-label protein function prediction
    Prabha, Mohana G.
    Chitra, S.
    BIOMEDICAL RESEARCH-INDIA, 2017, 28
  • [44] Multi-label classification of gene function using MLPs
    Skabar, Andrew
    Wollersheim, Dennis
    Whitfort, Tim
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 2234 - +
  • [45] Boosting-based Multi-label Classification
    Kajdanowicz, Tomasz
    Kazienko, Przemyslaw
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2013, 19 (04) : 502 - 520
  • [46] Multi-label Anomaly Classification Based on Electrocardiogram
    Li, Chenyang
    Sun, Le
    HEALTH INFORMATION SCIENCE, HIS 2021, 2021, 13079 : 171 - 178
  • [47] Multi-label, Classification-based Prediction of Breast Cancer Metastasis Directions
    Wang, Tingting
    Fan, Qi
    Tan, Liang
    Zhang, Beier
    IAENG International Journal of Computer Science, 2025, 52 (01) : 1 - 10
  • [48] Biclustering-based multi-label classification
    Schmitke, Luiz Rafael
    Paraiso, Emerson Cabrera
    Nievola, Julio Cesar
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (08) : 4861 - 4898
  • [49] Topic Model Based Multi-Label Classification
    Padmanabhan, Divya
    Bhat, Satyanath
    Shevade, Shirish
    Narahari, Y.
    2016 IEEE 28TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2016), 2016, : 996 - 1003
  • [50] Multi-label classification with label clusters
    Gatto, Elaine Cecilia
    Ferrandin, Mauri
    Cerri, Ricardo
    KNOWLEDGE AND INFORMATION SYSTEMS, 2025, 67 (02) : 1741 - 1785