Cluster Tree based Multi-Label Classification for Protein Function Prediction

Citations: 0
Authors
Wu, Qingyao [1 ,2 ]
Ye, Yunming [1 ,2 ]
Zhang, Xiaofeng [1 ,2 ]
Ho, Shen-Shyang [3 ]
Affiliations
[1] Harbin Inst Technol, Shenzhen Grad Sch, Dept Comp Sci, Shenzhen, Peoples R China
[2] Shenzhen Key Lab Internet Informat Collaboration, Shenzhen, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Engn, Singapore, Singapore
Keywords
Data mining; Multi-label data; Multi-label classification; Protein function prediction
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Automatically assigning functions to unknown proteins is a key task in computational biology. Proteins in nature often belong to multiple classes according to the functions they perform, so many efforts have been made to cast protein function prediction as a multi-label learning problem. This paper proposes a novel Cluster Tree based Multi-label Learning algorithm (CTML) for protein function prediction. The main idea is to compute, at each node of the tree, a set of predictive labels for multi-label prediction, using k-means clustering and predictive functions learned from the training data at that node. Because the predictive labels are propagated from the root node to the leaf nodes, the correlations between labels can be preserved. Experimental results on benchmark data (the genbase and yeast datasets) show that the proposed CTML algorithm is effective in predicting protein functions. Moreover, its classification performance is competitive with that of the baseline multi-label learning algorithms.
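
The abstract gives only the outline of the method, so the following Python sketch is an illustration of the general idea of a cluster tree for multi-label prediction, not the authors' CTML algorithm. Everything beyond "k-means splits" and "a label set per node" is an assumption made for the example: the names `build_tree` and `predict`, the frequency threshold used to pick a node's labels, the stopping rule, and the toy data. It also simply returns the label set of the leaf a query reaches and does not attempt to reproduce the paper's label propagation scheme.

```python
import random

def sq_dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(p, centroids):
    """Index of the centroid closest to point p."""
    return min(range(len(centroids)), key=lambda c: sq_dist(p, centroids[c]))

def kmeans(points, k, iters=20, seed=0):
    """Plain Lloyd's k-means; returns (centroids, cluster index per point)."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        assign = [nearest(p, centroids) for p in points]
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:  # keep the old centroid if a cluster empties
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, [nearest(p, centroids) for p in points]

def node_labels(Y, threshold):
    """Assumed rule: keep labels carried by at least `threshold` of the node's instances."""
    counts = {}
    for labels in Y:
        for label in labels:
            counts[label] = counts.get(label, 0) + 1
    return {label for label, n in counts.items() if n / len(Y) >= threshold}

def build_tree(X, Y, k=2, min_size=5, threshold=0.5, depth=0, max_depth=5):
    """Recursively split the data with k-means, storing a label set at every node."""
    node = {"labels": node_labels(Y, threshold), "children": []}
    if len(X) < max(min_size, k) or depth >= max_depth:
        return node  # assumed stopping rule: node too small or tree too deep
    centroids, assign = kmeans(X, k)
    groups = [[i for i, a in enumerate(assign) if a == c] for c in range(k)]
    if min(len(g) for g in groups) == 0:
        return node  # degenerate split, treat this node as a leaf
    for c, g in enumerate(groups):
        child = build_tree([X[i] for i in g], [Y[i] for i in g],
                           k, min_size, threshold, depth + 1, max_depth)
        node["children"].append((centroids[c], child))
    return node

def predict(node, x):
    """Route x to the nearest child at each level and return the leaf's label set."""
    while node["children"]:
        _, node = min(node["children"], key=lambda cc: sq_dist(x, cc[0]))
    return node["labels"]

# Toy data (hypothetical): two well-separated groups with different label sets.
X = [[0, 0], [0, 1], [1, 0], [1, 1], [10, 10], [10, 11], [11, 10], [11, 11]]
Y = [{"a", "b"}] * 4 + [{"c"}] * 4
tree = build_tree(X, Y)
print(sorted(predict(tree, [0.5, 0.5])))
print(sorted(predict(tree, [10.2, 10.7])))
```

In this toy setting the labels "a" and "b" always occur together, so a query routed to their cluster receives both at once, which gives a rough intuition for how grouping instances can keep co-occurring labels together.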
Pages: 4