Learning from a Friend: Improving Event Extraction via Self-Training with Feedback from Abstract Meaning Representation

被引:0
|
作者
Xu, Zhiyang [1 ]
Lee, Jay-Yoon [2 ]
Huang, Lifu [1 ]
机构
[1] Virginia Tech, Dept Comp Sci, Blacksburg, VA 24061 USA
[2] Seoul Natl Univ, Grad Sch Data Sci, Seoul, South Korea
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Data scarcity has been the main factor that hinders the progress of event extraction. To overcome this issue, we propose a Self-Training with Feedback (STF) framework that leverages the large-scale unlabeled data and acquires feedback for each new event prediction from the unlabeled data by comparing it to the Abstract Meaning Representation (AMR) graph of the same sentence. Specifically, STF consists of (1) a base event extraction model trained on existing event annotations and then applied to large-scale unlabeled corpora to predict new event mentions as pseudo training samples, and (2) a novel scoring model that takes in each new predicted event trigger, an argument, its argument role, as well as their paths in the AMR graph to estimate a compatibility score indicating the correctness of the pseudo label. The compatibility scores further act as feedback to encourage or discourage the model learning on the pseudo labels during self-training. Experimental results on three benchmark datasets, including ACE05-E, ACE05-E+, and ERE, demonstrate the effectiveness of the STF framework on event extraction, especially event argument extraction, with significant performance gain over the base event extraction models and strong baselines. Our experimental analysis further shows that STF is a generic framework as it can be applied to improve most, if not all, event extraction models by leveraging largescale unlabeled data, even when high-quality AMR graph annotations are not available.
引用
收藏
页码:10421 / 10437
页数:17
相关论文
共 20 条
  • [1] Learning from Mistakes: Combining Ontologies via Self-Training for Dialogue Generation
    Reed, Lena
    Harrison, Vrindavan
    Oraby, Shereen
    Hakkani-Tur, Dilek
    Walker, Marilyn
    SIGDIAL 2020: 21ST ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2020), 2020, : 21 - 34
  • [2] Distantly Supervised Biomedical Relation Extraction via Negative Learning and Noisy Student Self-Training
    Dai, Yuanfei
    Zhang, Bin
    Wang, Shiping
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 1697 - 1708
  • [3] Learning from Future: A Novel Self-Training Framework for Semantic Segmentation
    Du, Ye
    Shen, Yujun
    Wang, Haochen
    Fei, Jingjing
    Li, Wei
    Wu, Liwei
    Zhao, Rui
    Fu, Zehua
    Liu, Qingjie
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Improving Event Causality Identification via Self-Supervised Representation Learning on External Causal Statement
    Zuo, Xinyu
    Cao, Pengfei
    Chen, Yubo
    Liu, Kang
    Zhao, Jun
    Peng, Weihua
    Chen, Yuguang
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 2162 - 2172
  • [5] Triple-Based Data Augmentation for Event Temporal Extraction via Reinforcement Learning from Human Feedback
    Zhang, Xiaobin
    Zang, Liangjun
    Liu, Qianwen
    Wei, Shuchong
    Hu, Songlin
    PROCEEDINGS OF THE 2024 27 TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, CSCWD 2024, 2024, : 1627 - 1632
  • [6] Consistency Self-Training Semi-Supervised Method for Road Extraction from Remote Sensing Images
    Gu, Xingjian
    Yu, Supeng
    Huang, Fen
    Ren, Shougang
    Fan, Chengcheng
    REMOTE SENSING, 2024, 16 (21)
  • [7] From Keypoints to Object Landmarks via Self-Training Correspondence: A Novel Approach to Unsupervised Landmark Discovery
    Mallis, Dimitrios
    Sanchez, Enrique
    Bell, Matt
    Tzimiropoulos, Georgios
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) : 8390 - 8404
  • [8] Self-Training for Label-Efficient Information Extraction from Semi-Structured Web-Pages
    Sarkhel, Ritesh
    Huang, Binxuan
    Lockard, Cohn
    Shiralkar, Prashant
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (11): : 3098 - 3110
  • [9] Semi-FCMNet: Semi-Supervised Learning for Forest Cover Mapping from Satellite Imagery via Ensemble Self-Training and Perturbation
    Chen, Beiqi
    Wang, Liangjing
    Fan, Xijian
    Bo, Weihao
    Yang, Xubing
    Tjahjadi, Tardi
    REMOTE SENSING, 2023, 15 (16)
  • [10] Learning from pseudo-labels: Self-training Electronic Components Detector for Waste Printed Circuit Boards
    Junior, Agostinho A. F.
    Silva, Leandro H. de S.
    Fernandes, Bruno J. T.
    Azevedo, George O. A.
    Oliveira, Sergio C.
    2022 35TH SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2022), 2022, : 252 - 257