Multimodal fused deep learning for drug property prediction: Integrating chemical language and molecular graph

Cited by: 5
Authors
Lu, Xiaohua [1]
Xie, Liangxu [1]
Xu, Lei [1]
Mao, Rongzhi [1]
Xu, Xiaojun [1]
Chang, Shan [1]
Affiliations
[1] Jiangsu Univ Technol, Inst Bioinformat & Med Engn, Changzhou 213001, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Multimodal learning; Deep learning; Drug discovery; Transformer; Graph; NETWORKS; FUSION; MODELS;
DOI
10.1016/j.csbj.2024.04.030
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Accurately predicting molecular properties is a challenging but essential task in drug discovery. Recently, many mono-modal deep learning methods have been successfully applied to molecular property prediction. However, mono-modal learning is inherently limited because it relies on a single modality of molecular representation, which prevents a comprehensive understanding of drug molecules. To overcome this limitation, we propose a multimodal fused deep learning (MMFDL) model that leverages information from different molecular representations. Specifically, we construct a triple-modal learning model that employs a Transformer-Encoder, a Bidirectional Gated Recurrent Unit (BiGRU), and a graph convolutional network (GCN) to process three modalities of information from chemical language and molecular graphs: SMILES-encoded vectors, ECFP fingerprints, and molecular graphs, respectively. We evaluate the proposed triple-modal model with five fusion approaches on six molecular datasets: Delaney, Llinas2020, Lipophilicity, SAMPL, BACE, and pKa from DataWarrior. The results show that the MMFDL model achieves the highest Pearson coefficients, with a stable distribution of Pearson coefficients in the random splitting test, outperforming mono-modal models in both accuracy and reliability. Furthermore, we validate the generalization ability of our model by predicting binding constants for protein-ligand complexes, and we assess its resilience to noise. By analyzing feature distributions in chemical space and the contribution assigned to each modal model, we demonstrate that the MMFDL model can acquire complementary information when proper models and suitable fusion approaches are used. By leveraging diverse sources of bioinformatics information, multimodal deep learning models hold potential for successful drug discovery.
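The abstract describes fusing predictions from three modal models and scoring them with the Pearson coefficient. The sketch below is illustrative only: the abstract does not say which five fusion approaches were used, so the weighted-average late fusion shown here is an assumption and may not be one of them. The function names `pearson` and `late_fusion` and all numeric values are hypothetical and are not taken from the paper.

```python
import math
from typing import List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (sx * sy)


def late_fusion(preds: Sequence[Sequence[float]],
                weights: Sequence[float]) -> List[float]:
    """Weighted average of per-modality predictions (assumed fusion rule)."""
    if len(preds) != len(weights) or not preds:
        raise ValueError("need one weight per modality")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    n = len(preds[0])
    if any(len(p) != n for p in preds):
        raise ValueError("all modalities must predict the same molecules")
    return [sum(w * p[i] for w, p in zip(weights, preds)) / total
            for i in range(n)]


if __name__ == "__main__":
    # Made-up predictions for four molecules from three hypothetical
    # modal models (SMILES, ECFP, graph); not data from the paper.
    y_true = [0.5, 1.2, 2.0, 3.1]
    smiles_pred = [0.7, 1.0, 2.3, 2.8]
    ecfp_pred = [0.4, 1.5, 1.8, 3.3]
    graph_pred = [0.6, 1.1, 2.1, 3.0]

    fused = late_fusion([smiles_pred, ecfp_pred, graph_pred], [1.0, 1.0, 1.0])
    print("fused predictions:", fused)
    print("Pearson (fused vs. true):", pearson(fused, y_true))
```

In this sketch the weights are fixed by hand; a learned fusion would fit them on validation data instead, which is one way the contribution of each modal model could be quantified.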
Pages: 1666-1679
Number of pages: 14
Related papers
50 items in total
  • [1] MolPROP: Molecular Property prediction with multimodal language and graph fusion
    Rollins, Zachary A.
    Cheng, Alan C.
    Metwally, Essam
    JOURNAL OF CHEMINFORMATICS, 2024, 16 (01):
  • [2] GMPP-NN: a deep learning architecture for graph molecular property prediction
    Abbassi, Outhman
    Ziti, Soumia
    Belhiah, Meryam
    Lagmiri, Souad Najoua
    Seghroucheni, Yassine Zaoui
    DISCOVER APPLIED SCIENCES, 2024, 6 (07)
  • [3] Metapath-fused heterogeneous graph network for molecular property prediction
    Ji, Ying
    Wan, Guojia
    Zhan, Yibing
    Du, Bo
    INFORMATION SCIENCES, 2023, 629 : 155 - 168
  • [4] Molecular property prediction based on graph structure learning
    Zhao, Bangyi
    Xu, Weixia
    Guan, Jihong
    Zhou, Shuigeng
    BIOINFORMATICS, 2024, 40 (05)
  • [5] MDFCL: Multimodal data fusion-based graph contrastive learning framework for molecular property prediction
    Gong, Xu
    Liu, Maotao
    Liu, Qun
    Guo, Yike
    Wang, Guoyin
    PATTERN RECOGNITION, 2025, 163
  • [6] Integrating concept of pharmacophore with graph neural networks for chemical property prediction and interpretation
    Yue Kong
    Xiaoman Zhao
    Ruizi Liu
    Zhenwu Yang
    Hongyan Yin
    Bowen Zhao
    Jinling Wang
    Bingjie Qin
    Aixia Yan
    Journal of Cheminformatics, 14
  • [7] Integrating concept of pharmacophore with graph neural networks for chemical property prediction and interpretation
    Kong, Yue
    Zhao, Xiaoman
    Liu, Ruizi
    Yang, Zhenwu
    Yin, Hongyan
    Zhao, Bowen
    Wang, Jinling
    Qin, Bingjie
    Yan, Aixia
    JOURNAL OF CHEMINFORMATICS, 2022, 14 (01)
  • [8] A Novel Descriptor and Molecular Graph-Based Bimodal Contrastive Learning Framework for Drug Molecular Property Prediction
    He, Zhengda
    Chen, Linjie
    Lv, Hao
    Zhou, Rui-ning
    Xu, Jiaying
    Chen, Yadong
    Hu, Jianhua
    Gao, Yang
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT III, 2023, 14088 : 700 - 715
  • [9] MultiGML: Multimodal graph machine learning for prediction of adverse drug events
    Krix, Sophia
    Delong, Lauren Nicole
    Madan, Sumit
    Domingo-Fernandez, Daniel
    Ahmad, Ashar
    Gul, Sheraz
    Zaliani, Andrea
    Froehlich, Holger
    HELIYON, 2023, 9 (09)
  • [10] GeomGCL: Geometric Graph Contrastive Learning for Molecular Property Prediction
    Li, Shuangli
    Zhou, Jingbo
    Xu, Tong
    Dou, Dejing
    Xiong, Hui
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 4541 - 4549