Modeling Dense Cross-Modal Interactions for Joint Entity-Relation Extraction

Times cited: 0
Authors
Zhao, Shan [1]
Hu, Minghao [2]
Cai, Zhiping [1]
Liu, Fang [3]
Affiliations
[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China
[2] PLA Acad Mil Sci, Beijing, Peoples R China
[3] Hunan Univ, Sch Design, Changsha, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Joint extraction of entities and their relations benefits from the close interaction between named entities and their relation information. Therefore, how to effectively model such cross-modal interactions is critical for the final performance. Previous works have used simple methods such as label-feature concatenation to perform coarse-grained semantic fusion among cross-modal instances, but they fail to capture fine-grained correlations over token and label spaces, resulting in insufficient interactions. In this paper, we propose a deep Cross-Modal Attention Network (CMAN) for joint entity and relation extraction. The network is carefully constructed by stacking multiple attention units in depth to fully model dense interactions over token-label spaces, in which two basic attention units are proposed to explicitly capture fine-grained correlations across different modalities (e.g., token-to-token and label-to-token). Experimental results on the CoNLL04 dataset show that our model obtains state-of-the-art results, achieving 90.62% F1 on entity recognition and 72.97% F1 on relation classification. On the ADE dataset, our model surpasses existing approaches by more than 1.9% F1 on relation classification. Extensive analyses further confirm the effectiveness of our approach.
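The abstract does not specify the internals of the two attention units, so the following is only a minimal sketch of the general idea of stacking token-to-token and label-to-token attention in depth. It uses generic scaled dot-product attention with residual connections, which is an assumption and not the authors' CMAN. The function names (`attend`, `stacked_interaction`), the direction of the label-to-token unit (tokens querying label embeddings), the `depth` parameter, and the toy vectors are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, memory):
    """Scaled dot-product attention.

    Each query vector is replaced by a weighted average of the memory
    vectors, weighted by softmax of the scaled dot products.
    """
    dim = len(memory[0])
    output = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, m)) / math.sqrt(dim) for m in memory]
        weights = softmax(scores)
        output.append(
            [sum(w * m[j] for w, m in zip(weights, memory)) for j in range(dim)]
        )
    return output


def add_residual(x, y):
    """Element-wise sum of two equally shaped lists of vectors."""
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(x, y)]


def stacked_interaction(tokens, labels, depth=2):
    """Alternate two attention units `depth` times (assumed layout)."""
    for _ in range(depth):
        # Token-to-token unit: tokens attend over the other tokens.
        tokens = add_residual(tokens, attend(tokens, tokens))
        # Label-to-token unit (assumed direction): tokens attend over labels.
        tokens = add_residual(tokens, attend(tokens, labels))
    return tokens


# Toy inputs: three 2-dimensional token vectors, two label embeddings.
toy_tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
toy_labels = [[0.5, -0.5], [-0.5, 0.5]]
fused = stacked_interaction(toy_tokens, toy_labels, depth=2)
print(len(fused), len(fused[0]))
```

Each pass keeps one vector per token, so the fused representations could feed an entity tagger or relation classifier; how CMAN actually does this is described in the paper itself, not here.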
Pages: 4032-4038
Number of pages: 7