M2KGRL: A semantic-matching based framework for multimodal knowledge graph representation learning

Cited by: 0
Authors
Chen, Tao [1]
Wang, Tiexin [1]
Zhang, Huihui [2, 3]
Xu, Jianqiu [1]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
[2] Qilu Univ Technol, Shandong Acad Sci, Jinan 250353, Peoples R China
[3] Weifang Univ, Weifang 261061, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multimodal knowledge graph; Representation learning; Semantic matching
DOI
10.1016/j.eswa.2025.126388
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Effective representation learning models are critical for knowledge computation and the practical application of knowledge graphs. However, most existing knowledge graph representation learning models focus primarily on structured triples, neglecting or underutilizing additional multimodal information such as entity types, images, and texts. To address this issue, we propose a novel framework, Multi-Modal Knowledge Graph Representation Learning (M2KGRL), which integrates multimodal features derived from structured triples, images, and textual data to enhance knowledge graph representations. M2KGRL leverages three adapted technologies (i.e., VGG16, BERT, and SimplE) to extract diverse features from these modalities. Additionally, it employs a specially designed autoencoder for feature fusion and a similarity-based scoring function to guide the representation learning process. The proposed framework is evaluated through extensive experiments on two widely used datasets (FB15K and WN18) against ten representative baseline methods (e.g., ComplEx, TransAE). Experimental results demonstrate that M2KGRL achieves superior performance in most scenarios. For instance, M2KGRL outperforms TransAE with a 1.8% improvement in Hit@10, showcasing its ability to predict more accurate links by incorporating visual and textual information. These findings highlight the potential of M2KGRL in advancing multimodal knowledge graph representation learning.
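The abstract reports link prediction quality as Hit@10. As a minimal sketch of how such a metric is commonly computed (not the authors' evaluation code), the snippet below ranks the true entity among candidate scores and then measures the fraction of test queries whose true entity lands in the top k. The function names `rank_of_true` and `hits_at_k`, the "higher score is better" convention, and the pessimistic handling of ties are all assumptions made for illustration.

```python
from typing import Sequence


def rank_of_true(scores: Sequence[float], true_index: int) -> int:
    """Return the 1-based rank of the true candidate.

    Assumption: a higher score means a more plausible triple.
    Ties are counted against the true candidate (pessimistic rank).
    """
    true_score = scores[true_index]
    better_or_equal = sum(
        1 for i, s in enumerate(scores) if i != true_index and s >= true_score
    )
    return 1 + better_or_equal


def hits_at_k(ranks: Sequence[int], k: int = 10) -> float:
    """Fraction of queries whose true entity is ranked within the top k."""
    if not ranks:
        raise ValueError("ranks must not be empty")
    return sum(1 for r in ranks if r <= k) / len(ranks)


if __name__ == "__main__":
    # Hypothetical scores for three candidate tail entities; index 2 is the true one.
    print(rank_of_true([0.1, 0.9, 0.5], true_index=2))
    # Hypothetical ranks collected over four test triples.
    print(hits_at_k([1, 3, 12, 50], k=10))
```

Under this definition, an improvement in Hit@10 means that a larger share of test triples have their correct entity among the ten highest-scored candidates.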
Pages: 12