Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction

被引:7
|
作者
Xu, Yonghui [1 ]
Min, Huaqing [2 ]
Wu, Qingyao [2 ,3 ]
Song, Hengjie [2 ]
Ye, Bicui [4 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] South China Univ Technol, Sch Software Engn, Guangzhou 510006, Guangdong, Peoples R China
[3] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Jiangsu, Peoples R China
[4] Wuzhou Red Cross Hosp, Wuzhou 543002, Peoples R China
来源
SCIENTIFIC REPORTS | 2017年 / 7卷
关键词
DOMAIN; SYSTEM;
D O I
10.1038/srep41831
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multi-Instance (MI) learning has been proven to be effective for the genome-wide protein function prediction problems where each training example is associated with multiple instances. Many studies in this literature attempted to find an appropriate Multi-Instance Learning (MIL) method for genome-wide protein function prediction under a usual assumption, the underlying distribution from testing data (target domain, i.e., TD) is the same as that from training data (source domain, i.e., SD). However, this assumption may be violated in real practice. To tackle this problem, in this paper, we propose a Multi-Instance Metric Transfer Learning (MIMTL) approach for genome-wide protein function prediction. In MIMTL, we first transfer the source domain distribution to the target domain distribution by utilizing the bag weights. Then, we construct a distance metric learning method with the reweighted bags. At last, we develop an alternative optimization scheme for MIMTL. Comprehensive experimental evidence on seven real-world organisms verifies the effectiveness and efficiency of the proposed MIMTL approach over several state-of-the-art methods.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Multi-Instance Metric Transfer Learning for Genome-Wide Protein Function Prediction
    Yonghui Xu
    Huaqing Min
    Qingyao Wu
    Hengjie Song
    Bicui Ye
    Scientific Reports, 7
  • [2] Multi-instance multi-label distance metric learning for genome-wide protein function prediction
    Xu, Yonghui
    Min, Huaqing
    Song, Hengjie
    Wu, Qingyao
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2016, 63 : 30 - 40
  • [3] Genome-Wide Protein Function Prediction through Multi-Instance Multi-Label Learning
    Wu, Jian-Sheng
    Huang, Sheng-Jun
    Zhou, Zhi-Hua
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (05) : 891 - 902
  • [4] Online Multi-Instance Multi-Label Learning for Protein Function Prediction
    Wu, Feng
    Liu, Qiong
    Hao, Tianyong
    Chen, Xiaojun
    Wu, Qingyao
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 780 - 785
  • [5] Multi-Instance Learning for Bankruptcy Prediction
    Kotsiantis, Sotiris
    Kanellopoulos, Dimitris
    THIRD 2008 INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, VOL 1, PROCEEDINGS, 2008, : 1007 - +
  • [6] Multi-instance clustering with applications to multi-instance prediction
    Min-Ling Zhang
    Zhi-Hua Zhou
    Applied Intelligence, 2009, 31 : 47 - 68
  • [7] Multi-instance clustering with applications to multi-instance prediction
    Zhang, Min-Ling
    Zhou, Zhi-Hua
    APPLIED INTELLIGENCE, 2009, 31 (01) : 47 - 68
  • [8] Metric learning for multi-instance classification with collapsed bags
    Li, Dewei
    Xu, Dongkuan
    Tang, Jingjing
    Tian, Yingjie
    2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2017, : 372 - 379
  • [9] Domain transfer multi-instance dictionary learning
    Wang, Ke
    Liu, Jiayong
    Gonzalez, Daniel
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 : S983 - S992
  • [10] Domain transfer multi-instance dictionary learning
    Ke Wang
    Jiayong Liu
    Daniel González
    Neural Computing and Applications, 2017, 28 : 983 - 992