Label Attention Network for Structured Prediction

Cited by: 4
Authors
Cui, Leyang [1, 2]
Li, Yafu [1, 2]
Zhang, Yue [2, 3]
Affiliations
[1] Zhejiang Univ, Hangzhou 310007, Peoples R China
[2] Westlake Univ, Sch Engn, Hangzhou 310024, Peoples R China
[3] Westlake Inst Adv Study, Inst Adv Technol, Hangzhou 310024, Peoples R China
Funding
National Science Foundation (United States)
Keywords
Labeling; Task analysis; Tagging; Artificial neural networks; Machine translation; Natural language processing; Encoding; Label attention; label dependency; sequence labeling;
DOI
10.1109/TASLP.2022.3145311
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Sequence labeling, which assigns a label to each token in a sequence, is a fundamental problem in natural language processing (NLP). Many NLP tasks, including part-of-speech (POS) tagging and named entity recognition (NER), can be cast as sequence labeling problems. Other tasks, such as constituency parsing and non-autoregressive machine translation, can also be transformed into sequence labeling tasks. Neural models have been shown to be powerful for sequence labeling by employing a multi-layer sequence encoding network. The conditional random field (CRF) has been proposed to enrich information over label sequences, yet it suffers from high computational complexity and over-reliance on the Markov assumption. To this end, we propose the label attention network (LAN), which hierarchically refines representations of marginal label distributions bottom-up, enabling higher layers to learn a more informed label sequence distribution based on information from lower layers. We demonstrate the effectiveness of LAN through extensive experiments on various NLP tasks, including POS tagging, NER, CCG supertagging, constituency parsing, and non-autoregressive machine translation. Empirical results show that LAN not only improves overall tagging accuracy with a similar number of parameters, but also significantly speeds up training and testing compared to CRF.
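The idea of refining per-token label distributions layer by layer can be illustrated with a small sketch. This is not the authors' implementation: the abstract gives no architectural details, so the scaled dot-product attention, the residual update, the per-token tanh projection standing in for a sequence encoder, and every name below (label_attention, lan_forward, layer_weights) are assumptions made for illustration only.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def label_attention(hidden, label_emb):
    """Attend from each token to every label embedding.

    hidden:    (n_tokens, d) token representations
    label_emb: (n_labels, d) one embedding per label
    Returns the per-token distribution over labels and a label-aware
    context vector for each token.
    """
    d = hidden.shape[-1]
    scores = hidden @ label_emb.T / np.sqrt(d)  # (n_tokens, n_labels)
    dist = softmax(scores, axis=-1)             # marginal label distribution
    context = dist @ label_emb                  # (n_tokens, d)
    return dist, context


def lan_forward(tokens, label_emb, layer_weights):
    """Stack several encode-then-attend layers (hypothetical layout).

    Each layer re-encodes the tokens, attends over the labels, and feeds
    the label-aware context upward, so later layers see the label beliefs
    of earlier ones. The last layer's distribution is used for prediction.
    """
    hidden = tokens
    dist = None
    for w in layer_weights:
        hidden = np.tanh(hidden @ w)   # assumed stand-in for a sequence encoder
        dist, context = label_attention(hidden, label_emb)
        hidden = hidden + context      # assumed residual combination
    return dist.argmax(axis=-1), dist


rng = np.random.default_rng(0)
tokens = rng.normal(size=(5, 8))       # 5 tokens, hidden size 8
label_emb = rng.normal(size=(3, 8))    # 3 labels
layer_weights = [rng.normal(size=(8, 8)) for _ in range(2)]
preds, dist = lan_forward(tokens, label_emb, layer_weights)
```

Because each token's label is read off with an independent argmax rather than a search over whole label sequences, decoding in this sketch costs one matrix product per layer, which is the kind of saving over CRF decoding that the abstract refers to.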
Pages: 1235-1248
Page count: 14