Encoding Syntactic Knowledge in Transformer Encoder for Intent Detection and Slot Filling

Cited by: 0
Authors
Wang, Jixuan [1 ,2 ,3 ]
Wei, Kai [3 ]
Radfar, Martin [3 ]
Zhang, Weiwei [3 ]
Chung, Clement [3 ]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
[2] Vector Inst, Toronto, ON, Canada
[3] Amazon Alexa, Pittsburgh, PA 15205 USA
Keywords
NEURAL-NETWORKS;
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
We propose a novel Transformer encoder-based architecture that encodes syntactic knowledge for intent detection and slot filling. Specifically, we encode syntactic knowledge into the Transformer encoder by jointly training it, via multi-task learning, to predict the syntactic parse ancestors and part-of-speech tag of each token. Our model is based on self-attention and feed-forward layers and does not require external syntactic information to be available at inference time. Experiments show that, with only two Transformer encoder layers, our models achieve state-of-the-art results on two benchmark datasets. Compared to the previously best-performing model without pre-training, our models achieve absolute F1-score and accuracy improvements of 1.59% and 0.85% for slot filling and intent detection on the SNIPS dataset, respectively. Our models also achieve absolute F1-score and accuracy improvements of 0.1% and 0.34% for slot filling and intent detection on the ATIS dataset, respectively, over the previously best-performing model. Furthermore, visualization of the self-attention weights illustrates the benefits of incorporating syntactic information during training.
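The abstract describes the joint training setup only at a high level. The following is a minimal PyTorch sketch of the idea, under stated assumptions: the class name SyntaxAwareEncoder, the mean-pooled intent head, the linear per-token heads for part-of-speech and ancestor-position prediction, the function multitask_loss, and the weight aux_weight are all illustrative choices, not the paper's actual formulation (the paper may, for instance, supervise attention distributions toward parse ancestors rather than use a linear classifier).

# Hypothetical sketch of a two-layer Transformer encoder trained with
# auxiliary syntactic heads via multi-task learning. All dimensions and
# label-set sizes below are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F

class SyntaxAwareEncoder(nn.Module):
    def __init__(self, vocab_size, d_model=128, n_layers=2, n_heads=4,
                 n_intents=7, n_slots=72, n_pos=17, max_len=50):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        self.pos_embed = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=4 * d_model, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=n_layers)
        # Main task heads: utterance-level intent, per-token slot labels.
        self.intent_head = nn.Linear(d_model, n_intents)
        self.slot_head = nn.Linear(d_model, n_slots)
        # Auxiliary syntactic heads, supervised during training only.
        self.pos_head = nn.Linear(d_model, n_pos)         # part-of-speech tags
        self.ancestor_head = nn.Linear(d_model, max_len)  # position of parse ancestor

    def forward(self, token_ids):
        positions = torch.arange(token_ids.size(1), device=token_ids.device)
        h = self.encoder(self.embed(token_ids) + self.pos_embed(positions))
        return {
            "intent": self.intent_head(h.mean(dim=1)),  # mean pooling is an assumption
            "slots": self.slot_head(h),
            "pos": self.pos_head(h),
            "ancestor": self.ancestor_head(h),
        }

def multitask_loss(outputs, targets, aux_weight=0.5):
    # Cross-entropy over each task; the syntactic targets would come from
    # an external parser at training time. aux_weight is a made-up value.
    main = (F.cross_entropy(outputs["intent"], targets["intent"])
            + F.cross_entropy(outputs["slots"].flatten(0, 1), targets["slots"].flatten()))
    aux = (F.cross_entropy(outputs["pos"].flatten(0, 1), targets["pos"].flatten())
           + F.cross_entropy(outputs["ancestor"].flatten(0, 1), targets["ancestor"].flatten()))
    return main + aux_weight * aux

At inference time only the intent and slot heads would be read, consistent with the abstract's claim that no external syntactic information is needed once training is complete; the parser producing the ancestor and POS targets is assumed to run during training only.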
Pages: 13943-13951
Page count: 9
Related papers
(50 total)
  • [41] Conceptual Knowledge Enhanced Model for Multi-Intent Detection and Slot Filling
    He, Li
    Zhao, Jingxuan
    Duan, Jianyong
    Wang, Hao
    Li, Xin
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, E107D (04) : 468 - 476
  • [42] A Survey of Joint Intent Detection and Slot Filling Models in Natural Language Understanding
    Weld, Henry
    Huang, Xiaoqi
    Long, Siqu
    Poon, Josiah
    Han, Soyeon Caren
    ACM COMPUTING SURVEYS, 2023, 55 (08)
  • [43] Intent Classification and Slot Filling for Privacy Policies
    Ahmad, Wasi Uddin
    Chi, Jianfeng
    Le, Tu
    Norton, Thomas
    Tian, Yuan
    Chang, Kai-Wei
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4402 - 4417
  • [44] A Two-Stage Selective Fusion Framework for Joint Intent Detection and Slot Filling
    Ma, Ziyu
    Sun, Bin
    Li, Shutao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (03) : 3874 - 3885
  • [45] Joint intent detection and slot filling using weighted finite state transducer and BERT
    Abro, Waheed Ahmed
    Qi, Guilin
    Aamir, Muhammad
    Ali, Zafar
    APPLIED INTELLIGENCE, 2022, 52 (15) : 17356 - 17370
  • [46] SlotRefine: A Fast Non-Autoregressive Model for Joint Intent Detection and Slot Filling
    Wu, Di
    Ding, Liang
    Lu, Fan
    Xie, Jian
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1932 - 1937
  • [47] Joint Training Model of Intent Detection and Slot Filling for Multi Granularity Implicit Guidance
    Li, Bin
    Wang, Weihua
    Bao, Feilong
    2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 271 - 274
  • [48] A Novel Bi-directional Interrelated Model for Joint Intent Detection and Slot Filling
    E, Haihong
    Niu, Peiqing
    Chen, Zhongfu
    Song, Meina
    57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 5467 - 5471
  • [49] CONVOLUTIONAL NEURAL NETWORK BASED TRIANGULAR CRF FOR JOINT INTENT DETECTION AND SLOT FILLING
    Xu, Puyang
    Sarikaya, Ruhi
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 78 - 83