Few-shot relation classification based on the BERT model, hybrid attention and fusion networks

Cited by: 2
Authors
Li, Yibing [1 ,2 ,3 ]
Ding, Zenghui [1 ]
Ma, Zuchang [1 ]
Wu, Yichen [1 ,2 ]
Wang, Yu [1 ,2 ]
Zhang, Ruiqi [1 ,2 ]
Xie, Fei [3 ]
Ren, Xiaoye [3 ]
Affiliations
[1] Chinese Acad Sci, Inst Intelligent Machines, Inst Phys Sci, Hefei 230031, Peoples R China
[2] Univ Sci & Technol China, Sci Isl Branch, Grad Sch, Hefei 230026, Peoples R China
[3] Hefei Normal Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Relation classification; Few-shot learning; BERT; Attention; Rapidity of convergence; SUPERVISION;
DOI
10.1007/s10489-023-04634-0
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Relation classification (RC) is an essential task in information extraction. Distant supervision (DS) can exploit large amounts of unlabeled data and so alleviates the lack of training data for the RC task. However, DS suffers from long-tail and noise problems. Intuitively, these problems can be addressed with few-shot learning (FSL). Our work aims to improve both the accuracy and the speed of convergence on the few-shot RC task. We believe that entity pairs play an essential role in this task. We propose a new context encoder, based on the bidirectional encoder representations from transformers (BERT) model, that fuses entity pairs and their dependency information within instances. In addition, we design a hybrid attention mechanism that combines support instance-level and query instance-level attention. Support instance-level attention dynamically assigns a weight to each instance in the support set, which compensates for a shortcoming of prototypical networks: they weight all sentences equally. Query instance-level attention dynamically assigns weights to query instances according to their similarity to the prototype. An ablation study shows the effectiveness of the proposed method. Furthermore, a fusion network replaces the Euclidean distance used for class matching in previous work, which speeds up convergence and makes our model more suitable for industrial applications. The experimental results show that the proposed model is more accurate than several other models.
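The contrast between equally weighted prototypes and support instance-level attention can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes that instance embeddings have already been produced by some encoder. It scores each support instance by a dot product with the query embedding, which is an assumption, since the abstract does not specify the scoring function. All function names are hypothetical. Class matching here uses the squared Euclidean distance of earlier prototypical networks, which the paper reports replacing with a fusion network.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def mean_prototypes(support):
    """Plain prototypical-network prototypes.

    support: array of shape (N, K, D) for N classes, K shots, D dimensions.
    Every support instance contributes equally to its class prototype.
    """
    return support.mean(axis=1)


def attentive_prototypes(support, query):
    """Prototypes built with support instance-level attention (illustrative).

    Each support instance is weighted by a softmax over its dot-product
    similarity to the query (an assumed scoring function).
    Returns prototypes of shape (N, D) and weights of shape (N, K).
    """
    scores = support @ query                 # (N, K)
    weights = softmax(scores, axis=1)        # weights sum to 1 within a class
    protos = (weights[..., None] * support).sum(axis=1)
    return protos, weights


def classify(protos, query):
    """Return the index of the prototype nearest to the query
    (squared Euclidean distance)."""
    distances = ((protos - query) ** 2).sum(axis=1)
    return int(np.argmin(distances))
```

When all support instances of a class are identical, the attention weights are uniform and the attentive prototype reduces to the mean prototype. The two differ only when some instances are more similar to the query than others.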
Pages: 21448-21464
Number of pages: 17
Related papers
50 in total
  • [41] Induction Networks for Few-Shot Text Classification
    Geng, Ruiying
    Li, Binhua
    Li, Yongbin
    Zhu, Xiaodan
    Jian, Ping
    Sun, Jian
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 3904 - 3913
  • [42] Hybrid Pooling Networks for Few-shot Learning
    Tan, Shaoqing
    Yang, Ruoyu
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [43] Few-Shot Classification Study for Prototype Fusion and Completion
    Wang, Yuheng
    Sun, Yanguo
    Lan, Zhenping
    Wang, Nan
    Li, Jiansong
    Yang, Xincheng
    IEEE Access, 2024, 12 : 174133 - 174143
  • [44] Few-shot classification with multisemantic information fusion network
    Gao, Ruixuan
    Su, Han
    Prasad, Shitala
    Tang, Peisen
    IMAGE AND VISION COMPUTING, 2024, 141
  • [45] Incremental Few-Shot Learning with Attention Attractor Networks
    Ren, Mengye
    Liao, Renjie
    Fetaya, Ethan
    Zemel, Richard S.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [46] An angular shrinkage BERT model for few-shot relation extraction with none-of-the-above detection
    Wang, Junwen
    Gao, Yongbin
    Fang, Zhijun
    PATTERN RECOGNITION LETTERS, 2023, 166 : 151 - 158
  • [47] MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video Classification
    Liu, Xin
    Zhang, Huanle
    Pirsiavash, Hamed
    Liu, Xin
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 2507 - 2516
  • [48] Few-Shot Person Re-identification Based on Hybrid Pooling Fusion and Gaussian Relation Metric
    Chen, Guizhen
    Zou, Guofeng
    Li, Jinjie
    Zhang, Xiaofei
    BIOMETRIC RECOGNITION, CCBR 2023, 2023, 14463 : 249 - 258
  • [49] Hyperplane projection network for few-shot relation classification
    Wang, Wei
    Wei, Xueguang
    Wang, Bailing
    Li, Yan
    Xin, Guodong
    Wei, Yuliang
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [50] Modified Prototypical Networks for Few-Shot Text Classification Based on Class-Covariance Metric and Attention
    Yang, Jun
    Wang, Bin
    Huang, Ming
    Yuan, Xin
    Liu, Huaping
    2021 6TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2021), 2021, : 81 - 85