Representation and Fusion Based on Knowledge Graph in Multi-Modal Semantic Communication

Cited by: 2
Authors
Xing, Chenlin [1]
Lv, Jie [1]
Luo, Tao [1]
Zhang, Zhilong [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Semantics; Correlation; Feature extraction; Knowledge graphs; Cognition; Head; Data mining; Semantic communication; multi-modal fusion; knowledge graph
DOI
10.1109/LWC.2024.3369864
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Existing research on multi-modal semantic communication overlooks the reasoning correlations among multi-modal data. Motivated by this, this letter proposes a multi-modal semantic representation and fusion model based on a knowledge graph (KG-MSF). In KG-MSF, direct and reasoning correlation semantic information is extracted and mapped into a two-layer semantic architecture to fully represent the semantics of each modality. A knowledge graph, chosen for its structural advantages, is then used to fuse the multi-modal semantic information, which is transmitted under different channel conditions. To assess the efficacy of the semantic representation and fusion of KG-MSF in a multi-modal semantic communication system, we conduct comprehensive experiments on the visual question answering (VQA) task, using answer accuracy as the metric. The results demonstrate its superiority over existing models in multi-modal semantic representation, fusion, transmission efficiency, and channel robustness.
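The fusion idea in the abstract can be illustrated with a toy sketch. This is not the KG-MSF model: the triples, the `fuse` and `reachable` helpers, and the rule of merging entities by identical name are all assumptions made for illustration. It only shows how triples from two modalities, each split into a "direct" and a "reasoning" layer, might be combined into one graph so that a question entity can reach an image entity through shared nodes.

```python
from collections import defaultdict

# Hypothetical (head, relation, tail) triples; none of these come from the paper.
IMAGE_TRIPLES = {
    "direct": [("man", "holds", "umbrella")],
    "reasoning": [("umbrella", "suggests", "rain")],
}
QUESTION_TRIPLES = {
    "direct": [("question", "asks_about", "weather")],
    "reasoning": [("weather", "includes", "rain")],
}


def fuse(*modalities):
    """Merge per-modality triples into one graph keyed by head entity.

    Assumption: entities with the same name are the same node, so edges
    from different modalities attach to shared nodes.
    """
    graph = defaultdict(set)
    for name, layers in modalities:
        for layer, triples in layers.items():
            for head, relation, tail in triples:
                # Keep the source modality and layer on each edge.
                graph[head].add((relation, tail, name, layer))
    return dict(graph)


def reachable(graph, start):
    """Return every entity reachable from `start` by following edges."""
    seen, stack = set(), [start]
    while stack:
        node = stack.pop()
        for _, tail, _, _ in graph.get(node, ()):
            if tail not in seen:
                seen.add(tail)
                stack.append(tail)
    return seen


fused = fuse(("image", IMAGE_TRIPLES), ("text", QUESTION_TRIPLES))
```

In this toy graph, `reachable(fused, "question")` and `reachable(fused, "man")` both contain `"rain"`, which is the kind of cross-modal link a fused graph makes explicit.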
Pages: 1344-1348
Number of pages: 5