Enhance prototypical networks with hybrid attention and confusing loss function for few-shot relation classification

被引:10
|
作者
Li, Yibing [1 ,2 ,3 ]
Ma, Zuchang [1 ]
Gao, Lisheng [1 ]
Wu, Yichen [1 ,2 ,4 ]
Xie, Fei [3 ]
Ren, Xiaoye [3 ]
机构
[1] Chinese Acad Sci, Hefei Inst Phys Sci, Inst Intelligent Machines, Anhui Prov Key Lab Med Phys & Technol, Hefei 230031, Peoples R China
[2] Univ Sci & Technol China, Sci Isl Branch Grad Sch, Hefei 230026, Peoples R China
[3] Hefei Normal Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
[4] Anhui Jianzhu Univ, Sch Elect & Informat Engn, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
Relation classification; Few-shot learning; Hybrid attention; Loss; BERT;
D O I
10.1016/j.neucom.2022.04.067
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relation classification (RC) is a fundamental task to building knowledge graphs and describing semantic formalization. It aims to classify a relation between the head and the tail entities in a sentence. The existing RC method mainly adopts the distant supervision (DS) scheme. However, DS still has the problem of long-tail and suffers from data sparsity. Recently, few-shot learning (FSL) has attracted people's attention. It solves the long-tail problem by learning from few-shot samples. The prototypical networks have a better effect on FSL, which classifies a relation by distance. However, the prototypical networks and their related variants did not consider the critical role of entity words. In addition, not all sentences in support set equally contributed to classifying relations. Furthermore, an entity pair in a sentence may have true and confusing relations, which is difficult for the RC model to distinguish them. A new context encoder BERT_FE is proposed to address those problems, which uses the BERT model as pre-training and fuses the information of head and tail entities by entity word-level attention (WLA). At the same time, the sentence-level attention (SLA) is proposed to give more weight to sentences of the support set similar to the query instance and improve the classification accuracy. A confusing loss function (CLF) is designed to enhance the model's ability to distinguish between true and confusing relations. The experiment results demonstrate that our proposed model (HACLF) is better than several baseline models. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:362 / 372
页数:11
相关论文
共 50 条
  • [21] TRANSDUCTIVE PROTOTYPICAL NETWORK FOR FEW-SHOT CLASSIFICATION
    Liu, Xinyue
    Liu, Pengxin
    Zong, Linlin
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1671 - 1675
  • [22] Few-Shot Node Classification Method of Graph Adaptive Prototypical Networks
    Guo, Ruize
    Wei, Wei
    Cui, Junbiao
    Feng, Kai
    Moshi Shibie yu Rengong Zhineng/Pattern Recognition and Artificial Intelligence, 2022, 35 (08): : 743 - 753
  • [23] Multimodal Prototypical Networks for Few-shot Learning
    Pahde, Frederik
    Puscas, Mihai
    Klein, Tassilo
    Nabi, Moin
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, : 2643 - 2652
  • [24] Prototypical Siamese Networks for Few-shot Learning
    Wang, Junhua
    Zhai, Yongping
    PROCEEDINGS OF 2020 IEEE 10TH INTERNATIONAL CONFERENCE ON ELECTRONICS INFORMATION AND EMERGENCY COMMUNICATION (ICEIEC 2020), 2020, : 178 - 181
  • [25] On Episodes, Prototypical Networks, and Few-Shot Learning
    Laenen, Steinar
    Bertinetto, Luca
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [26] Few-Shot Keyword Spotting With Prototypical Networks
    Parnami, Archit
    Lee, Minwoo
    PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 277 - 283
  • [27] Modified Prototypical Networks for Few-Shot Text Classification Based on Class-Covariance Metric and Attention
    Yang, Jun
    Wang, Bin
    Huang, Ming
    Yuan, Xin
    Liu, Huaping
    2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 81 - 85
  • [28] Improved prototypical networks for few-Shot learninge
    Ji, Zhong
    Chai, Xingliang
    Yu, Yunlong
    Pang, Yanwei
    Zhang, Zhongfei
    PATTERN RECOGNITION LETTERS, 2020, 140 : 81 - 87
  • [29] Relational concept enhanced prototypical network for incremental few-shot relation classification
    Ma, Rong
    Ma, Bo
    Wang, Lei
    Zhou, Xi
    Wang, Zhen
    Yang, Yating
    KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [30] Few-Shot Relation Classification Research Based on Prototypical Network and Causal Intervention
    Li, Zhiming
    Ouyang, Feifan
    Zhou, Chunlong
    He, Yihao
    Shen, Limin
    IEEE ACCESS, 2022, 10 : 36995 - 37002