Predicting gene function using similarity learning

被引:7
|
作者
Tu Minh Phuong [1 ]
Ngo Phuong Nhung [2 ]
机构
[1] Posts & Telecommun Inst Technol, Dept Comp Sci, Hanoi, Vietnam
[2] Free Univ Bolzano, KRDB Res Ctr, Bolzano, Italy
来源
BMC GENOMICS | 2013年 / 14卷
关键词
NETWORK; ASSOCIATION; DISCOVERY; ALGORITHM;
D O I
10.1186/1471-2164-14-S4-S4
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Computational methods that make use of heterogeneous biological datasets to predict gene function provide a cost-effective and rapid way for annotating genomes. A common framework shared by many such methods is to construct a combined functional association network from multiple networks representing different sources of data, and use this combined network as input to network-based or kernel-based learning algorithms. In these methods, a key factor contributing to the prediction accuracy is the network quality, which is the ability of the network to reflect the functional relatedness of gene pairs. To improve the network quality, a large effort has been spent on developing methods for network integration. These methods, however, produce networks, which then remain unchanged, and nearly no effort has been made to optimize the networks after their construction. Results: Here, we propose an alternative method to improve the network quality. The proposed method takes as input a combined network produced by an existing network integration algorithm, and reconstructs this network to better represent the co-functionality relationships between gene pairs. At the core of the method is a learning algorithm that can learn a measure of functional similarity between genes, which we then use to reconstruct the input network. In experiments with yeast and human, the proposed method produced improved networks and achieved more accurate results than two other leading gene function prediction approaches. Conclusions: The results show that it is possible to improve the accuracy of network-based gene function prediction methods by optimizing combined networks with appropriate similarity measures learned from data. The proposed learning procedure can handle noisy training data and scales well to large genomes.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] DISCRIMINATION-LEARNING AS A FUNCTION OF THE SIMILARITY OF THE STIMULUS NAMES
    GERJUOY, IR
    CHILD DEVELOPMENT, 1964, 35 (03) : 677 - 684
  • [42] GrAPFI: predicting enzymatic function of proteins from domain similarity graphs
    Bishnu Sarker
    David W. Ritchie
    Sabeur Aridhi
    BMC Bioinformatics, 21
  • [43] GrAPFI: predicting enzymatic function of proteins from domain similarity graphs
    Sarker, Bishnu
    Ritchie, David W.
    Aridhi, Sabeur
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [44] Predicting activity approach based on new atoms similarity kernel function
    Abu El-Atta, Ahmed H.
    Moussa, M. I.
    Hassanien, Aboul Ella
    JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 2015, 60 : 55 - 62
  • [45] reg2gene: predicting enhancer-gene associations using ensemble learning approaches
    Patarcic, I.
    Franke, V.
    Akalin, A.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 1157 - 1158
  • [46] Integrating phenotype and gene expression data for predicting gene function
    Brandon M Malone
    Andy D Perkins
    Susan M Bridges
    BMC Bioinformatics, 10
  • [47] Integrating phenotype and gene expression data for predicting gene function
    Malone, Brandon M.
    Perkins, Andy D.
    Bridges, Susan M.
    BMC BIOINFORMATICS, 2009, 10 : S20
  • [48] Predicting Hydrologic Function With Aquatic Gene Fragments
    Good, S. P.
    URycki, D. R.
    Crump, B. C.
    WATER RESOURCES RESEARCH, 2018, 54 (03) : 2424 - 2435
  • [49] Predicting Gene Function with Positive and Unlabeled Examples
    Chen, Yiming
    Li, Zhoujun
    Hu, Xiaohua
    Diao, Hongxiang
    Liu, Junwan
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 89 - +
  • [50] Predicting gene function from patterns of annotation
    King, OD
    Foulger, RE
    Dwight, SS
    White, JV
    Roth, FP
    GENOME RESEARCH, 2003, 13 (05) : 896 - 904