A Protein Interaction Information-based Generative Model for Enhancing Gene Clustering

被引:0
|
作者
Pratik Dutta
Sriparna Saha
Sanket Pai
Aviral Kumar
机构
[1] Indian Institute of Technology Patna,Department of Computer Science and Engineering
[2] Indian Institute of Technology Patna,Department of Chemical Science and Technology
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
In the field of computational bioinformatics, identifying a set of genes which are responsible for a particular cellular mechanism, is very much essential for tasks such as medical diagnosis or disease gene identification. Accurately grouping (clustering) the genes is one of the important tasks in understanding the functionalities of the disease genes. In this regard, ensemble clustering becomes a promising approach to combine different clustering solutions to generate almost accurate gene partitioning. Recently, researchers have used generative model as a smart ensemble method to produce the right consensus solution. In the current paper, we develop a protein-protein interaction-based generative model that can efficiently perform a gene clustering. Utilizing protein interaction information as the generative model’s latent variable enables enhance the generative model’s efficiency in inferring final probabilistic labels. The proposed generative model utilizes different weak supervision sources rather utilizing any ground truth information. For weak supervision sources, we use a multi-objective optimization based clustering technique together with the world’s largest gene ontology based knowledge-base named Gene Ontology Consortium(GOC). These weakly supervised labels are supplied to a generative model that eventually assigns all genes to probabilistic labels. The comparative study with respect to silhouette score, Biological Homogeneity Index (BHI) and Biological Stability Index (BSI) proves that the proposed generative model outperforms than other state-of-the-art techniques.
引用
收藏
相关论文
共 50 条
  • [1] A Protein Interaction Information-based Generative Model for Enhancing Gene Clustering
    Dutta, Pratik
    Saha, Sriparna
    Pai, Sanket
    Kumar, Aviral
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [2] Information-based clustering
    Slonim, N
    Atwal, GS
    Tkacik, G
    Bialek, W
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (51) : 18297 - 18302
  • [3] Information-Based Clustering and Filtering for Field Reconstruction
    Chen, Jia
    Malhotra, Akshay
    Schizas, Ioannis D.
    2015 49TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2015, : 576 - 580
  • [4] A statistical information-based clustering approach in distance space
    Yue Shi-hong
    Li Ping
    Guo Ji-dong
    Zhou Shui-geng
    Journal of Zhejiang University-SCIENCE A, 2005, 6 (1): : 71 - 78
  • [5] A statistical information-based clustering approach in distance space
    岳士弘
    李平
    郭继东
    周水庚
    Journal of Zhejiang University Science A(Science in Engineering), 2005, (01) : 72 - 79
  • [6] Distributed information-based clustering of heterogeneous sensor data
    Chen, Jia
    Schizas, Ioannis D.
    SIGNAL PROCESSING, 2016, 126 : 35 - 51
  • [7] An information-based clustering approach for fMRI activation detection
    Bai, Lijun
    Qin, Wei
    Liang, Jimin
    Tian, Jie
    2008 IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: FROM NANO TO MACRO, VOLS 1-4, 2008, : 588 - +
  • [8] State information-based ant colony clustering algorithm
    Shen Jie
    He Kun
    Wei Liu-Hua
    Bi Lei
    Sun Rong-Shuang
    Xu Fa-Yan
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 630 - +
  • [9] Local information-based fast approximate spectral clustering
    Cao, Jiangzhong
    Chen, Pei
    Dai, Qingyun
    Ling, Wing-Kuen
    PATTERN RECOGNITION LETTERS, 2014, 38 : 63 - 69
  • [10] An information-based network approach for protein classification
    Wan, Xiaogeng
    Zhao, Xin
    Yau, Stephen S. T.
    PLOS ONE, 2017, 12 (03):