Comparative Analysis of Unsupervised Protein Similarity Prediction Based on Graph Embedding

被引:1
|
作者
Zhang, Yuanyuan [1 ,2 ]
Wang, Ziqi [1 ]
Wang, Shudong [2 ]
Shang, Junliang [3 ]
机构
[1] Qingdao Univ Technol, Sch Informat & Control Engn, Qingdao, Peoples R China
[2] China Univ Petr East China, Coll Comp Sci & Technol, Qingdao, Peoples R China
[3] Qufu Normal Univ, Sch Informat Sci & Engn, Rizhao, Peoples R China
基金
中国国家自然科学基金;
关键词
protein similarity; graph embedding; gene ontology; link prediction; DTW algorithm; SEMANTIC SIMILARITY; GENE ONTOLOGY;
D O I
10.3389/fgene.2021.744334
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The study of protein-protein interaction and the determination of protein functions are important parts of proteomics. Computational methods are used to study the similarity between proteins based on Gene Ontology (GO) to explore their functions and possible interactions. GO is a series of standardized terms that describe gene products from molecular functions, biological processes, and cell components. Previous studies on assessing the similarity of GO terms were primarily based on Information Content (IC) between GO terms to measure the similarity of proteins. However, these methods tend to ignore the structural information between GO terms. Therefore, considering the structural information of GO terms, we systematically analyze the performance of the GO graph and GO Annotation (GOA) graph in calculating the similarity of proteins using different graph embedding methods. When applied to the actual Human and Yeast datasets, the feature vectors of GO terms and proteins are learned based on different graph embedding methods. To measure the similarity of the proteins annotated by different GO numbers, we used Dynamic Time Warping (DTW) and cosine to calculate protein similarity in GO graph and GOA graph, respectively. Link prediction experiments were then performed to evaluate the reliability of protein similarity networks constructed by different methods. It is shown that graph embedding methods have obvious advantages over the traditional IC-based methods. We found that random walk graph embedding methods, in particular, showed excellent performance in calculating the similarity of proteins. By comparing link prediction experiment results from GO(DTW) and GOA(cosine) methods, it is shown that GO(DTW) features provide highly effective information for analyzing the similarity among proteins.</p>
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Knowledge Graph Embedding for Link Prediction: A Comparative Analysis
    Rossi, Andrea
    Barbosa, Denilson
    Firmani, Donatella
    Matinata, Antonio
    Merialdo, Paolo
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (02)
  • [2] Graph-based prediction of Protein-protein interactions with attributed signed graph embedding
    Fang Yang
    Kunjie Fan
    Dandan Song
    Huakang Lin
    BMC Bioinformatics, 21
  • [3] Graph-based prediction of Protein-protein interactions with attributed signed graph embedding
    Yang, Fang
    Fan, Kunjie
    Song, Dandan
    Lin, Huakang
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [4] Graph Embedding based Familial Analysis of Android Malware using Unsupervised Learning
    Fan, Ming
    Luo, Xiapu
    Liu, Jun
    Wang, Meng
    Nong, Chunyin
    Zheng, Qinghua
    Liu, Ting
    2019 IEEE/ACM 41ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2019), 2019, : 771 - 782
  • [5] Unsupervised Large Graph Embedding
    Nie, Feiping
    Zhu, Wei
    Li, Xuelong
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2422 - 2428
  • [6] Similarity Analysis of Knowledge Graph-based Company Embedding for Stocks Portfolio
    Zhang, Boyao
    Li, Zhongrui
    Yang, Chao
    Wang, Zongguo
    Zhao, Yonghua
    Sun, Jingqi
    Wang, Lihua
    2021 IEEE 6TH INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD 2021), 2021, : 84 - 89
  • [7] Exploring Similarity-Based Graph Compression for Efficient Network Analysis and Embedding
    Akin, Hamdi Selim
    Aktas, Mehmet Emin
    Islam, Muhammed Ifte
    Hossain, Tanvir
    Akbas, Esra
    2024 33RD INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATIONS AND NETWORKS, ICCCN 2024, 2024,
  • [8] Unsupervised graph anomaly detection with discriminative embedding similarity for viscoelastic sandwich cylindrical structures
    Hou, Rujie
    Zhang, Zhousuo
    Chen, Jinglong
    Yang, Wenzhan
    Liu, Feng
    ISA TRANSACTIONS, 2024, 147 : 36 - 54
  • [9] A parameterised model for link prediction using node centrality and similarity measure based on graph embedding
    Lu, Haohui
    Uddin, Shahadat
    NEUROCOMPUTING, 2024, 593
  • [10] DTiGEMS+: drug–target interaction prediction using graph embedding, graph mining, and similarity-based techniques
    Maha A. Thafar
    Rawan S. Olayan
    Haitham Ashoor
    Somayah Albaradei
    Vladimir B. Bajic
    Xin Gao
    Takashi Gojobori
    Magbubah Essack
    Journal of Cheminformatics, 12