Enhancing relation extraction using multi-task learning with SDP evidence

被引：0

作者：

Wang, Hailin ^{[1
,2
]}

Zhang, Dan ^{[1
,2
]}

Liu, Guisong ^{[1
,2
]}

Huang, Li ^{[1
,2
]}

Qin, Ke ^{[3
]}

机构：

[1] Southwestern Univ Finance & Econ, Sch Comp & Artificial Intelligence, Complex Lab New Finance & Econ, Chengdu 611130, Peoples R China

[2] Kash Inst Elect & Informat Ind, Kashgar, Xinjiang, Peoples R China

[3] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Sichuan, Peoples R China

来源：

INFORMATION SCIENCES | 2024年 / 670卷

基金：

中国国家自然科学基金;

关键词：

Relation extraction; Multi-task learning; Shortest dependency path; Evidence; ATTENTION; MODEL;

D O I：

10.1016/j.ins.2024.120610

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Relation extraction (RE) is a crucial subtask of information extraction, which involves recognizing the relation between entity pairs in a sentence. Previous studies have extensively employed syntactic information, notably the shortest dependency path (SDP), to collect word evidence, termed SDP evidence, which gives clues about the given entity pair, thus improving RE. Nevertheless, prevalent transformer -based techniques lack syntactic information and cannot effectively model essential syntactic clues to support relations. This study exerts multi -task learning to address these issues by imbibing an SDP token position prediction task into the RE task. To this end, we introduce SGA, an SDP evidence guiding approach that transfers the SDP evidence into two novel supervisory signal labels: SDP tokens label and SDP matrix label. The former guides the attention modules to assign high attention weights to SDP token positions, emphasizing relational clues. In the meantime, the latter supervises SGA to predict a parameterized asymmetric product matrix among the SDP tokens for RE. Experimental outcomes demonstrate the model's enhanced ability to leverage SDP information, thereby directing attention modules and predicted matrix labels to focus on SDP evidence. Consequently, our proposed approach surpasses existing publicly available optimal baselines across four RE datasets: SemEval2010-Task8, KBP37, NYT, and WebNLG. 1

引用

页数：15

共 50 条

[41] Boosted multi-task learning
Olivier Chapelle
Pannagadatta Shivaswamy
Srinivas Vadrevu
Kilian Weinberger
Ya Zhang
Belle Tseng
Machine Learning, 2011, 85 : 149 - 173
[42] An overview of multi-task learning
Zhang, Yu
Yang, Qiang
NATIONAL SCIENCE REVIEW, 2018, 5 (01) : 30 - 43
[43] On Partial Multi-Task Learning
He, Yi
Wu, Baijun
Wu, Di
Wu, Xindong
ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1174 - 1181
[44] Pareto Multi-Task Learning
Lin, Xi
Zhen, Hui-Ling
Li, Zhenhua
Zhang, Qingfu
Kwong, Sam
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[45] Federated Multi-Task Learning
Smith, Virginia
Chiang, Chao-Kai
Sanjabi, Maziar
Talwalkar, Ameet
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[46] Asynchronous Multi-Task Learning
Baytas, Inci M.
Yan, Ming
Jain, Anil K.
Zhou, Jiayu
2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 11 - 20
[47] Calibrated Multi-Task Learning
Nie, Feiping
Hu, Zhanxuan
Li, Xuelong
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, : 2012 - 2021
[48] An overview of multi-task learning
Yu Zhang
Qiang Yang
NationalScienceReview, 2018, 5 (01) : 30 - 43
[49] Boosted multi-task learning
Chapelle, Olivier
Shivaswamy, Pannagadatta
Vadrevu, Srinivas
Weinberger, Kilian
Zhang, Ya
Tseng, Belle
MACHINE LEARNING, 2011, 85 (1-2) : 149 - 173
[50] Distributed Multi-Task Learning
Wang, Jialei
Kolar, Mladen
Srebro, Nathan
ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 51, 2016, 51 : 751 - 760

← 1 2 3 4 5 →