A representation learning model based on variational inference and graph autoencoder for predicting lncRNA-disease associations

被引:54
|
作者
Shi, Zhuangwei [1 ]
Zhang, Han [1 ]
Jin, Chen [2 ]
Quan, Xiongwen [1 ]
Yin, Yanbin [3 ]
机构
[1] Nankai Univ, Coll Artificial Intelligence, Tongyan Rd, Tianjin 300350, Peoples R China
[2] Nankai Univ, Coll Comp Sci, Tongyan Rd, Tianjin 300350, Peoples R China
[3] Univ Nebraska, Nebraska Food Hlth Ctr, Dept Food Sci & Technol, 1400 R St, Lincoln, NE 68588 USA
基金
中国国家自然科学基金;
关键词
Variational inference; Graph autoencoder; lncRNA-disease association; Representation learning; PROMOTES CELL-PROLIFERATION; NONCODING RNA; BREAST-CANCER; NEURAL-NETWORKS; POOR-PROGNOSIS; EXPRESSION; SIMILARITY;
D O I
10.1186/s12859-021-04073-z
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Numerous studies have demonstrated that long non-coding RNAs are related to plenty of human diseases. Therefore, it is crucial to predict potential lncRNA-disease associations for disease prognosis, diagnosis and therapy. Dozens of machine learning and deep learning algorithms have been adopted to this problem, yet it is still challenging to learn efficient low-dimensional representations from high-dimensional features of lncRNAs and diseases to predict unknown lncRNA-disease associations accurately. Results: We proposed an end-to-end model, VGAELDA, which integrates variational inference and graph autoencoders for lncRNA-disease associations prediction. VGAELDA contains two kinds of graph autoencoders. Variational graph autoencoders (VGAE) infer representations from features of lncRNAs and diseases respectively, while graph autoencoders propagate labels via known lncRNA-disease associations. These two kinds of autoencoders are trained alternately by adopting variational expectation maximization algorithm. The integration of both the VGAE for graph representation learning, and the alternate training via variational inference, strengthens the capability of VGAELDA to capture efficient low-dimensional representations from high-dimensional features, and hence promotes the robustness and preciseness for predicting unknown lncRNA-disease associations. Further analysis illuminates that the designed co-training framework of lncRNA and disease for VGAELDA solves a geometric matrix completion problem for capturing efficient low-dimensional representations via a deep learning approach. Conclusion: Cross validations and numerical experiments illustrate that VGAELDA outperforms the current state-of-the-art methods in lncRNA-disease association prediction. Case studies indicate that VGAELDA is capable of detecting potential lncRNA-disease associations. The source code and data are available at https://github.com/zhanglabNKU/VGAELDA.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Predicting lncRNA-disease associations and constructing lncRNA functional similarity network based on the information of miRNA
    Xing Chen
    Scientific Reports, 5
  • [32] GANLDA: Graph attention network for lncRNA-disease associations prediction
    Lan, Wei
    Wu, Ximin
    Chen, Qingfeng
    Peng, Wei
    Wang, Jianxin
    Chen, Yiping Phoebe
    NEUROCOMPUTING, 2022, 469 : 384 - 393
  • [33] Weighted matrix factorization based data fusion for predicting lncRNA-disease associations
    Yu, Guoxian
    Wang, Yuehui
    Wang, Jun
    Fu, Guangyuan
    Guo, Maozu
    Domeniconi, Carlotta
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 572 - 577
  • [34] Finding potential lncRNA-disease associations using a boosting-based ensemble learning model
    Zhou, Liqian
    Peng, Xinhuai
    Zeng, Lijun
    Peng, Lihong
    FRONTIERS IN GENETICS, 2024, 15
  • [35] A new method on lncRNA-disease-miRNA tripartite graph to predict lncRNA-disease associations
    Van Tinh Nguyen
    Thi Tu Kien Le
    Dang Hung Tran
    2020 12TH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (IEEE KSE 2020), 2020, : 287 - 293
  • [36] WGRCMF: A Weighted Graph Regularized Collaborative Matrix Factorization Method for Predicting Novel LncRNA-Disease Associations
    Liu, Jin-Xing
    Cui, Zhen
    Gao, Ying-Lian
    Kong, Xiang-Zhen
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (01) : 257 - 265
  • [37] MAGCNSE: predicting lncRNA-disease associations using multi-view attention graph convolutional network and stacking ensemble model
    Liang, Ying
    Zhang, Ze-Qun
    Liu, Nian-Nian
    Wu, Ya-Nan
    Gu, Chang-Long
    Wang, Ying-Long
    BMC BIOINFORMATICS, 2022, 23 (01)
  • [38] Prediction of lncRNA-disease Associations Based on Robust Multi-label Learning
    Zhang, Jiaxin
    Sun, Quanmeng
    Liang, Cheng
    CURRENT BIOINFORMATICS, 2021, 16 (09) : 1179 - 1189
  • [39] DHNLDA: A Novel Deep Hierarchical Network Based Method for Predicting lncRNA-Disease Associations
    Xie, Fansen
    Yang, Ziqi
    Song, Jinmiao
    Dai, Qiguo
    Duan, Xiaodong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (06) : 3395 - 3403
  • [40] ALSBMF: Predicting lncRNA-Disease Associations by Alternating Least Squares Based on Matrix Factorization
    Zhu, Wen
    Huang, Kaimei
    Xiao, Xiaofang
    Liao, Bo
    Yao, Yuhua
    Wu, Fang-Xiang
    IEEE ACCESS, 2020, 8 (08): : 26190 - 26198