Few-shot relation classification based on the BERT model, hybrid attention and fusion networks

被引:2
|
作者
Li, Yibing [1 ,2 ,3 ]
Ding, Zenghui [1 ]
Ma, Zuchang [1 ]
Wu, Yichen [1 ,2 ]
Wang, Yu [1 ,2 ]
Zhang, Ruiqi [1 ,2 ]
Xie, Fei [3 ]
Ren, Xiaoye [3 ]
机构
[1] Chinese Acad Sci, Inst Intelligent Machines, Inst Phys Sci, Hefei 230031, Peoples R China
[2] Univ Sci & Technol China, Sci Isl Branch, Grad Sch, Hefei 230026, Peoples R China
[3] Hefei Normal Univ, Sch Comp Sci & Technol, Hefei 230601, Peoples R China
基金
中国国家自然科学基金;
关键词
Relation classification; Few-shot learning; BERT; Attention; Rapidity of convergence; SUPERVISION;
D O I
10.1007/s10489-023-04634-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relation classification (RC) is an essential task in information extraction. The distance supervision (DS) method can use many unlabeled data and solve the lack of training data on the RC task. However, the DS method has the problems of long tails and noise. Intuitively, people can solve these problems using few-shot learning (FSL). Our work aims to improve the accuracy and rapidity of convergence on the few-shot RC task. We believe that entity pairs have an essential role in the few-shot RC task. We propose a new context encoder, which is improved based on the bidirectional encoder representations from transformers (BERT) model to fuse entity pairs and their dependence information in instances. At the same time, we design hybrid attention, which includes support instance-level and query instance-level attention. The support instance level dynamically assigns the weight of each instance in the support set. It makes up for the insufficiency of prototypical networks, which distribute weights to sentences equally. Query instance-level attention is dynamically assigned weights to query instances by similarity with the prototype. The ablation study shows the effectiveness of our proposed method. In addition, a fusion network is designed to replace the Euclidean distance method of previous works when class matching is performed, improving the convergence's rapidity. This makes our model more suitable for industrial applications. The experimental results show that the proposed model's accuracy is better than that of several other models.
引用
收藏
页码:21448 / 21464
页数:17
相关论文
共 50 条
  • [31] Dynamic Prototype Selection by Fusing Attention Mechanism for Few-Shot Relation Classification
    Wu, Linfang
    Zhang, Hua-Ping
    Yang, Yaofei
    Liu, Xin
    Gao, Kai
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2020), PT I, 2020, 12033 : 431 - 441
  • [32] A lightweight dense relation network with attention for hyperspectral image few-shot classification
    Shi, Meilin
    Ren, Jiansi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 126
  • [33] Towards Hard Few-Shot Relation Classification
    Han, Jiale
    Cheng, Bo
    Wan, Zhiguo
    Lu, Wei
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 9476 - 9489
  • [34] Multi-Level Semantic Fusion Optimization for Few-shot Relation Classification
    Li, Peihong
    Cai, Fei
    Liu, Dengfeng
    Wang, Siyuan
    Liu, Shixian
    2024 7TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND BIG DATA, ICAIBD 2024, 2024, : 206 - 212
  • [35] A fusion spatial attention approach for few-shot learning
    Song, Heda
    Deng, Bowen
    Pound, Michael
    Ozcan, Ender
    Triguero, Isaac
    INFORMATION FUSION, 2022, 81 : 187 - 202
  • [36] Hybrid Adjacency Representation for Few-Shot Classification
    Zhao, Zhihao
    Liang, Xiaoyu
    Wu, Lu
    Xu, Ning
    Guo, Han
    2024 13TH INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS, ICCCAS 2024, 2024, : 514 - 518
  • [37] Few-Shot Few-Shot Learning and the role of Spatial Attention
    Lifchitz, Yann
    Avrithis, Yannis
    Picard, Sylvaine
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 2693 - 2700
  • [38] Compositional Prototypical Networks for Few-Shot Classification
    Lyu, Qiang
    Wang, Weiqiang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 9011 - 9019
  • [39] Focus Your Attention when Few-Shot Classification
    Wang, Haoqing
    Jie, Shibo
    Deng, Zhi-Hong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [40] Few-shot learning based on hierarchical classification via multi-granularity relation networks
    Su, Yuling
    Zhao, Hong
    Lin, Yaojin
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2022, 142 : 417 - 429