MERGE: A Modal Equilibrium Relational Graph Framework for Multi-Modal Knowledge Graph Completion

被引:0
|
作者
Shang, Yuying [1 ,2 ,3 ,4 ]
Fu, Kun [1 ,2 ,3 ]
Zhang, Zequn [1 ,2 ]
Jin, Li [1 ,2 ]
Liu, Zinan [1 ,3 ,4 ]
Wang, Shensi [1 ,2 ,3 ,4 ]
Li, Shuchao [1 ,2 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-modal knowledge graph; knowledge graph representation; graph attention network; information integration;
D O I
10.3390/s24237605
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The multi-modal knowledge graph completion (MMKGC) task aims to automatically mine the missing factual knowledge from the existing multi-modal knowledge graphs (MMKGs), which is crucial in advancing cross-modal learning and reasoning. However, few methods consider the adverse effects caused by different missing modal information in the model learning process. To address the above challenges, we innovatively propose a Modal Equilibrium Relational Graph framEwork, called MERGE. By constructing three modal-specific directed relational graph attention networks, MERGE can implicitly represent missing modal information for entities by aggregating the modal embeddings from neighboring nodes. Subsequently, a fusion approach based on low-rank tensor decomposition is adopted to align multiple modal features in both the explicit structural level and the implicit semantic level, utilizing the structural information inherent in the original knowledge graphs, which enhances the interpretability of the fused features. Furthermore, we introduce a novel interpolation re-ranking strategy to adjust the importance of modalities during inference while preserving the semantic integrity of each modality. The proposed framework has been validated on four publicly available datasets, and the experimental results have demonstrated the effectiveness and robustness of our method in the MMKGC task.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] Self-Supervised Multi-Modal Knowledge Graph Contrastive Hashing for Cross-Modal Search
    Liang, Meiyu
    Du, Junping
    Liang, Zhengyang
    Xing, Yongwang
    Huang, Wei
    Xue, Zhe
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13744 - 13753
  • [42] Multi-relational graph attention networks for knowledge graph completion
    Li, Zhifei
    Zhao, Yue
    Zhang, Yan
    Zhang, Zhaoli
    KNOWLEDGE-BASED SYSTEMS, 2022, 251
  • [43] Triplet-aware graph neural networks for factorized multi-modal knowledge graph entity alignment
    Li, Qian
    Li, Jianxin
    Wu, Jia
    Peng, Xutan
    Ji, Cheng
    Peng, Hao
    Wang, Lihong
    Yu, Philip S.
    NEURAL NETWORKS, 2024, 179
  • [44] Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference
    Ma, Xuan
    Yang, Xiaoshan
    Xu, Changsheng
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [45] MKGCN: Multi-Modal Knowledge Graph Convolutional Network for Music Recommender Systems
    Cui, Xiaohui
    Qu, Xiaolong
    Li, Dongmei
    Yang, Yu
    Li, Yuxun
    Zhang, Xiaoping
    ELECTRONICS, 2023, 12 (12)
  • [46] An Enhanced Multi-Modal Recommendation Based on Alternate Training With Knowledge Graph Representation
    Wang, Yuequn
    Dong, Liyan
    Zhang, Hao
    Ma, Xintao
    Li, Yongli
    Sun, Minghui
    IEEE ACCESS, 2020, 8 : 213012 - 213026
  • [47] Modality-Aware Negative Sampling for Multi-modal Knowledge Graph Embedding
    Zhang, Yichi
    Chen, Mingyang
    Zhang, Wen
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [48] Boosting Entity-Aware Image Captioning With Multi-Modal Knowledge Graph
    Zhao, Wentian
    Wu, Xinxiao
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 2659 - 2670
  • [49] Multi-modal knowledge graph inference via media convergence and logic rule
    Lin, Feng
    Li, Dongmei
    Zhang, Wenbin
    Shi, Dongsheng
    Jiao, Yuanzhou
    Chen, Qianzhong
    Lin, Yiying
    Zhu, Wentao
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2024, 9 (01) : 211 - 221
  • [50] MMKRL: A robust embedding approach for multi-modal knowledge graph representation learning
    Lu, Xinyu
    Wang, Lifang
    Jiang, Zejun
    He, Shichang
    Liu, Shizhong
    APPLIED INTELLIGENCE, 2022, 52 (07) : 7480 - 7497