Enhancing foundation models for scientific discovery via multimodal knowledge graph representations

被引:0
|
作者
Lopez, Vanessa [1 ]
Hoang, Lam [1 ]
Martinez-Galindo, Marcos [1 ]
Fernandez-Diaz, Raul [1 ]
Sbodio, Marco Luca [1 ]
Ordonez-Hurtado, Rodrigo [1 ]
Zayats, Mykhaylo [1 ]
Mulligan, Natasha [1 ]
Bettencourt-Silva, Joao [1 ]
机构
[1] IBM Res Europe, Dublin, Ireland
来源
JOURNAL OF WEB SEMANTICS | 2025年 / 84卷
关键词
Multimodal graph learning; Multimodal knowledge graphs; Knowledge-enhanced drug discovery;
D O I
10.1016/j.websem.2024.100845
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Foundation Models (FMs) hold transformative potential to accelerate scientific discovery, yet reaching their full capacity in complex, highly multimodal domains such as genomics, drug discovery, and materials science requires a deeper consideration of the contextual nature of the scientific knowledge. We revisit the synergy between FMs and Multimodal Knowledge Graph (MKG) representation and learning, exploring their potential to enhance predictive and generative tasks in biomedical contexts like drug discovery. We seek to exploit MKGs to improve generative AI models' ability to capture intricate domain-specific relations and facilitate multimodal fusion. This integration promises to accelerate discovery workflows by providing more meaningful multimodal knowledge-enhanced representations and contextual evidence. Despite this potential, challenges and opportunities remain, including fusing multiple sequential, structural and knowledge modalities and models leveraging the strengths of each; developing scalable architectures for multi-task multi-dataset learning; creating end-to-end workflows to enhance the trustworthiness of biomedical FMs using knowledge from heterogeneous datasets and scientific literature; the domain data bottleneck and the lack of a unified representation between natural language and chemical representations; and benchmarking, specifically the transfer learning to tasks with limited data (e.g., unseen molecules and proteins, rear diseases). Finally, fostering openness and collaboration is key to accelerate scientific breakthroughs.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] Knowledge Graph-Enhanced Large Language Models via Path Selection
    Liu, Haochen
    Wang, Song
    Zhu, Yaochen
    Dong, Yushun
    Li, Jundong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 6311 - 6321
  • [32] Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles
    Ye, Shuquan
    Xie, Yujia
    Chen, Dongdong
    Xu, Yichong
    Yuan, Lu
    Zhu, Chenguang
    Liao, Jing
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2634 - 2645
  • [33] OpenPSG: Open-Set Panoptic Scene Graph Generation via Large Multimodal Models
    Zhou, Zijian
    Zhu, Zheng
    Caesar, Holger
    Shi, Miaojing
    COMPUTER VISION - ECCV 2024, PT X, 2025, 15068 : 199 - 215
  • [34] A Novel Approach to Analyzing Defects: Enhancing Knowledge Graph Embedding Models for Main Electrical Equipment
    Chen, Yanyu
    Huang, Jianye
    Qian, Jian
    Yi, Longqiang
    Li, Jinhu
    Huang, Jiangsheng
    Zhang, Zhihong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 715 - 725
  • [35] Enhancing Spatiotemporal Disease Progression Models via Latent Diffusion and Prior Knowledge
    Puglisi, Lemuel
    Alexander, Daniel C.
    Ravi, Daniele
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2024, PT II, 2024, 15002 : 173 - 183
  • [36] VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering
    Wang, Yanan
    Yasunaga, Michihiro
    Ren, Hongyu
    Wada, Shinya
    Leskovec, Jure
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 21525 - 21535
  • [37] Enhancing immunotherapy response prediction via multimodal integration of radiology and pathology deep learning models
    Ligero, M.
    El Nahhas, O. S. M.
    Prior Palomares, O.
    Navarro, V.
    Sansano, I.
    Serna, G.
    Mauchanski, S.
    Toledo, R. D. A.
    Dienstmann, R.
    Ramon y Cajal, S.
    Garralda, E.
    Nuciforo, P. G.
    Kather, J. N.
    Perez Lopez, R.
    ANNALS OF ONCOLOGY, 2023, 34 : S251 - S252
  • [38] Enhancing Graph-Based Semisupervised Learning via Knowledge-Aware Data Embedding
    Ienco, Dino
    Pensa, Ruggero G.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) : 5014 - 5020
  • [39] Enhancing Transformer-based language models with commonsense representations for knowledge-driven machine comprehension
    Li, Ronghan
    Jiang, Zejun
    Wang, Lifang
    Lu, Xinyu
    Zhao, Meng
    Chen, Daqing
    KNOWLEDGE-BASED SYSTEMS, 2021, 220
  • [40] Underlying scientific evidence discovery for FDA orphan drug designations from the GARD integrative knowledge graph: Towards drug discovery for rare diseases
    Zhu, Qian
    Dac-Trung Nguyen
    Southall, Noel
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2019, 258