Learning from a Friend: Improving Event Extraction via Self-Training with Feedback from Abstract Meaning Representation

被引：0

作者：

Xu, Zhiyang ^{[1
]}

Lee, Jay-Yoon ^{[2
]}

Huang, Lifu ^{[1
]}

机构：

[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA

[2] Seoul Natl Univ, Grad Sch Data Sci, Seoul, South Korea

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023) | 2023年

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Data scarcity has been the main factor that hinders the progress of event extraction. To overcome this issue, we propose a Self-Training with Feedback (STF) framework that leverages the large-scale unlabeled data and acquires feedback for each new event prediction from the unlabeled data by comparing it to the Abstract Meaning Representation (AMR) graph of the same sentence. Specifically, STF consists of (1) a base event extraction model trained on existing event annotations and then applied to large-scale unlabeled corpora to predict new event mentions as pseudo training samples, and (2) a novel scoring model that takes in each new predicted event trigger, an argument, its argument role, as well as their paths in the AMR graph to estimate a compatibility score indicating the correctness of the pseudo label. The compatibility scores further act as feedback to encourage or discourage the model learning on the pseudo labels during self-training. Experimental results on three benchmark datasets, including ACE05-E, ACE05-E+, and ERE, demonstrate the effectiveness of the STF framework on event extraction, especially event argument extraction, with significant performance gain over the base event extraction models and strong baselines. Our experimental analysis further shows that STF is a generic framework as it can be applied to improve most, if not all, event extraction models by leveraging largescale unlabeled data, even when high-quality AMR graph annotations are not available.

引用

页码：10421 / 10437

页数：17

共 20 条

[1] Learning from Mistakes: Combining Ontologies via Self-Training for Dialogue Generation
Reed, Lena
Harrison, Vrindavan
Oraby, Shereen
Hakkani-Tur, Dilek
Walker, Marilyn
SIGDIAL 2020: 21ST ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2020), 2020, : 21 - 34
[2] Distantly Supervised Biomedical Relation Extraction via Negative Learning and Noisy Student Self-Training
Dai, Yuanfei
Zhang, Bin
Wang, Shiping
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 1697 - 1708
[3] Learning from Future: A Novel Self-Training Framework for Semantic Segmentation
Du, Ye
Shen, Yujun
Wang, Haochen
Fei, Jingjing
Li, Wei
Wu, Liwei
Zhao, Rui
Fu, Zehua
Liu, Qingjie
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[4] Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal Statement
Zuo, Xinyu
Cao, Pengfei
Chen, Yubo
Liu, Kang
Zhao, Jun
Peng, Weihua
Chen, Yuguang
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2162 - 2172
[5] Triple-Based Data Augmentation for Event Temporal Extraction via Reinforcement Learning from Human Feedback
Zhang, Xiaobin
Zang, Liangjun
Liu, Qianwen
Wei, Shuchong
Hu, Songlin
PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1627 - 1632
[6] Consistency Self-Training Semi-Supervised Method for Road Extraction from Remote Sensing Images
Gu, Xingjian
Yu, Supeng
Huang, Fen
Ren, Shougang
Fan, Chengcheng
REMOTE SENSING, 2024, 16 (21)
[7] From Keypoints to Object Landmarks via Self-Training Correspondence: A Novel Approach to Unsupervised Landmark Discovery
Mallis, Dimitrios
Sanchez, Enrique
Bell, Matt
Tzimiropoulos, Georgios
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8390 - 8404
[8] Self-Training for Label-Efficient Information Extraction from Semi-Structured Web-Pages
Sarkhel, Ritesh
Huang, Binxuan
Lockard, Cohn
Shiralkar, Prashant
PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (11): : 3098 - 3110
[9] Semi-FCMNet: Semi-Supervised Learning for Forest Cover Mapping from Satellite Imagery via Ensemble Self-Training and Perturbation
Chen, Beiqi
Wang, Liangjing
Fan, Xijian
Bo, Weihao
Yang, Xubing
Tjahjadi, Tardi
REMOTE SENSING, 2023, 15 (16)
[10] Learning from pseudo-labels: Self-training Electronic Components Detector for Waste Printed Circuit Boards
Junior, Agostinho A. F.
Silva, Leandro H. de S.
Fernandes, Bruno J. T.
Azevedo, George O. A.
Oliveira, Sergio C.
2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 252 - 257

← 1 2 →