Modeling Dense Cross-Modal Interactions for Joint Entity-Relation Extraction

被引：0

作者：

Zhao, Shan ^{[1
]}

Hu, Minghao ^{[2
]}

Cai, Zhiping ^{[1
]}

Liu, Fang ^{[3
]}

机构：

[1] Natl Univ Def Technol, Coll Comp, Changsha, Peoples R China

[2] PLA Acad Mil Sci, Beijing, Peoples R China

[3] Hunan Univ, Sch Design, Changsha, Peoples R China

来源：

PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2020年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Joint extraction of entities and their relations benefits from the close interaction between named entities and their relation information. Therefore, how to effectively model such cross-modal interactions is critical for the final performance. Previous works have used simple methods such as label-feature concatenation to perform coarse-grained semantic fusion among cross-modal instances, but fail to capture fine-grained correlations over token and label spaces, resulting in insufficient interactions. In this paper, we propose a deep Cross-Modal Attention Network (CMAN) for joint entity and relation extraction. The network is carefully constructed by stacking multiple attention units in depth to fully model dense interactions over token-label spaces, in which two basic attention units are proposed to explicitly capture fine-grained correlations across different modalities (e.g., token-to-token and label-to-token). Experiment results on CoNLL04 dataset show that our model obtains state-of-the-art results by achieving 90.62% F1 on entity recognition and 72.97% F1 on relation classification. In ADE dataset, our model surpasses existing approaches by more than 1.9% F1 on relation classification. Extensive analyses further confirm the effectiveness of our approach.

引用

页码：4032 / 4038

页数：7

共 50 条

[31] CAG: A Consistency-Adaptive Text-Image Alignment Generation for Joint Multimodal Entity-Relation Extraction
Yang, Xinjie
Gong, Xiaocheng
Tang, Binghao
Lei, Yang
Deng, Yayue
Ouyang, Huan
Zhao, Gang
Luo, Lei
Feng, Yunling
Duan, Bin
Li, Si
Xu, Yajing
PROCEEDINGS OF THE 33RD ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2024, 2024, : 4183 - 4187
[32] Joint Multimodal Entity-Relation Extraction Based on Edge-Enhanced Graph Alignment Network andWord-Pair Relation Tagging
Yuan, Li
Cai, Yi
Wang, Jin
Li, Qing
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 11051 - 11059
[33] GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction
Fu, Tsu-Jui
Li, Peng-Hsuan
Ma, Wei-Yun
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 1409 - 1418
[34] Joint entity-relation knowledge embedding via cost-sensitive learning
Yu, Sheng-kang
Zhao, Xue-yi
Li, Xi
Zhang, Zhong-fei
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2017, 18 (11) : 1867 - 1873
[35] Cross-Modal Joint Embedding with Diverse Semantics
Xie, Zhongwei
Liu, Ling
Wu, Yanzhao
Li, Lin
Zhong, Luo
2020 IEEE SECOND INTERNATIONAL CONFERENCE ON COGNITIVE MACHINE INTELLIGENCE (COGMI 2020), 2020, : 157 - 166
[36] Deep Relation Embedding for Cross-Modal Retrieval
Zhang, Yifan
Zhou, Wengang
Wang, Min
Tian, Qi
Li, Houqiang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 617 - 627
[37] Towards Bridged Vision and Language: Learning Cross-Modal Knowledge Representation for Relation Extraction
Feng, Junhao
Wang, Guohua
Zheng, Changmeng
Cai, Yi
Fu, Ze
Wang, Yaowei
Wei, Xiao-Yong
Li, Qing
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (01) : 561 - 575
[38] Cross-modal interactions between olfaction and touch
Demattè, ML
Sanabria, D
Sugarman, R
Spence, C
CHEMICAL SENSES, 2006, 31 (04) : 291 - 300
[39] Cross-modal interactions in auditory and visual discrimination
Marks, LE
Ben-Artzi, E
Lakatos, S
INTERNATIONAL JOURNAL OF PSYCHOPHYSIOLOGY, 2003, 50 (1-2) : 125 - 145
[40] Cross-modal event extraction via Visual Event Grounding and Semantic Relation Filling
Liu, Maofu
Zhou, Bingying
Hu, Huijun
Qiu, Chen
Zhang, Xiaokang
INFORMATION PROCESSING & MANAGEMENT, 2025, 62 (03)

← 1 2 3 4 5 →