G2SAM: Graph-Based Global Semantic Awareness Method for Multimodal Sarcasm Detection

被引:0
|
作者
Wei, Yiwei [1 ,5 ]
Yuan, Shaozu [2 ]
Zhou, Hengyang [5 ]
Wang, Longbiao [1 ,4 ]
Yan, Zhiling [2 ]
Yang, Ruosong [2 ]
Chen, Meng [3 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin Key Lab Cognit Comp & Applicat, Tianjin, Peoples R China
[2] JD AI Res, Beijing, Peoples R China
[3] Yep AI, Melbourne, Vic, Australia
[4] Huiyan Technol Tianjin Co Ltd, Tianjin, Peoples R China
[5] China Univ Petr Beijing Karamay, Karamay, Peoples R China
基金
中国国家自然科学基金;
关键词
CLASSIFICATION; NETWORK;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal sarcasm detection, aiming to detect the ironic sentiment within multimodal social data, has gained substantial popularity in both the natural language processing and computer vision communities. Recently, graph-based studies by drawing sentimental relations to detect multimodal sarcasm have made notable advancements. However, they have neglected exploiting graph-based global semantic congruity from existing instances to facilitate the prediction, which ultimately hinders the model's performance. In this paper, we introduce a new inference paradigm that leverages global graph-based semantic awareness to handle this task. Firstly, we construct fine-grained multimodal graphs for each instance and integrate them into semantic space to draw graph-based relations. During inference, we leverage global semantic congruity to retrieve k-nearest neighbor instances in semantic space as references for voting on the final prediction. To enhance the semantic correlation of representation in semantic space, we also introduce label-aware graph contrastive learning to further improve the performance. Experimental results demonstrate that our model achieves state-of-the-art (SOTA) performance in multimodal sarcasm detection. The code will be available at https://github.com/upccpu/G2SAM.
引用
收藏
页码:9151 / 9159
页数:9
相关论文
共 50 条
  • [1] Multi-source Semantic Graph-based Multimodal Sarcasm Explanation Generation
    Jing, Liqiang
    Song, Xuemeng
    Ouyang, Kun
    Jia, Mengzhao
    Nie, Liqiang
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 11349 - 11361
  • [2] HSVDetector: a heterogeneous semantic graph-based method for smart contract vulnerability detection
    Zhu, Heming
    Li, Hao
    Lu, Gehao
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (04):
  • [3] A multimodal fusion method for sarcasm detection based on late fusion
    Ding, Ning
    Tian, Sheng-wei
    Yu, Long
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (06) : 8597 - 8616
  • [4] A multimodal fusion method for sarcasm detection based on late fusion
    Ning Ding
    Sheng-wei Tian
    Long Yu
    Multimedia Tools and Applications, 2022, 81 : 8597 - 8616
  • [5] Role of Semantic Links in Performance of Information Retrieval on Graph-based Multimodal Collections
    Sabetghadam, Serwah
    Lupu, Mihai
    Rauber, Andreas
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1574 - 1579
  • [6] SGFNet: A semantic graph-based multimodal network for financial invoice information extraction
    Luo, Shun
    Yu, Juan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 258
  • [7] A Semantic Graph-Based Approach for Radicalisation Detection on Social Media
    Saif, Hassan
    Dickinson, Thomas
    Kastler, Leon
    Fernandez, Miriam
    Alani, Harith
    SEMANTIC WEB ( ESWC 2017), PT I, 2017, 10249 : 571 - 587
  • [8] A graph-based pivotal semantic mining framework for rumor detection
    Yan, Yeqing
    Wang, Yongjun
    Zheng, Peng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 118
  • [9] Variable weight semantic graph-based ontology mapping method
    Yang, Feng
    EXPERT SYSTEMS, 2019, 36 (05)
  • [10] Multimodal sarcasm detection based on sentiment-clue inconsistency global detection fusion network
    Xi, Zhonghao
    Yu, Bengong
    Wang, Haoyu
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 275