Hierarchical and Bidirectional Joint Multi-Task Classifiers for Natural Language Understanding

被引:0
|
作者
Ji, Xiaoyu [1 ,2 ]
Hu, Wanyang [3 ]
Liang, Yanyan [1 ,4 ]
机构
[1] Macau Univ Sci & Technol, Fac Innovat Engn, Sch Comp Sci & Engn, Macau, Peoples R China
[2] Guangxi Key Lab Machine Vis & Intelligent Control, Wuzhou 543002, Peoples R China
[3] Univ Svizzera Italiana, Dept Informat, CH-6962 Lugano, Switzerland
[4] CEI High Tech Res Inst Co Ltd, Macau, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-task classifier; hierarchical structure; bidirectional joint structure; MASSIVE dataset;
D O I
10.3390/math11244895
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
The MASSIVE dataset is a spoken-language comprehension resource package for slot filling, intent classification, and virtual assistant evaluation tasks. It contains multi-language utterances from human beings communicating with a virtual assistant. In this paper, we exploited the relationship between intent classification and slot filling to improve the exact match accuracy by proposing five models with hierarchical and bidirectional architectures. There are two variants for hierarchical architectures and three variants for bidirectional architectures. These are the hierarchical concatenation model, the hierarchical attention-based model, the bidirectional max-pooling model, the bidirectional LSTM model, and the bidirectional attention-based model. The results of our models showed a significant improvement in the averaged exact match accuracy. The hierarchical attention-based model improved the accuracy by 1.01 points for the full training dataset. As for the zero-shot setup, we observed that the exact match accuracy increased from 53.43 to 53.91. In this study, we observed that, for multi-task problems, utilizing the relevance between different tasks can help in improving the model's overall performance.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Joint multi-task cascade for instance segmentation
    Wen, Yaole
    Hu, Fuyuan
    Ren, Jinchang
    Shang, Xinru
    Li, Linyan
    Xi, Xuefeng
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2020, 17 (06) : 1983 - 1989
  • [42] Multi-task Joint Learning for Videos in the Wild
    Hong, Yong Won
    Kim, Hoseong
    Byun, Hyeran
    PROCEEDINGS OF THE 1ST WORKSHOP AND CHALLENGE ON COMPREHENSIVE VIDEO UNDERSTANDING IN THE WILD (COVIEW'18), 2018, : 27 - 30
  • [43] Multi-Task Model and Feature Joint Learning
    Li, Ya
    Tian, Xinmei
    Liu, Tongliang
    Tao, Dacheng
    PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), 2015, : 3643 - 3649
  • [44] MULTI-TASK RNN-T WITH SEMANTIC DECODER FOR STREAMABLE SPOKEN LANGUAGE UNDERSTANDING
    Fu, Xuandi
    Chang, Feng-Ju
    Radfar, Martin
    Wei, Kai
    Liu, Jing
    Strimel, Grant P.
    Sathyendra, Kanthashree Mysore
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7507 - 7511
  • [45] Multi-task Deep Learning for Image Understanding
    Yu, Bo
    Lane, Ian
    2014 6TH INTERNATIONAL CONFERENCE OF SOFT COMPUTING AND PATTERN RECOGNITION (SOCPAR), 2014, : 37 - 42
  • [46] Offensive language identification with multi-task learning
    Zampieri, Marcos
    Ranasinghe, Tharindu
    Sarkar, Diptanu
    Ororbia, Alex
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2023, 60 (03) : 613 - 630
  • [47] Multi-Task Learning for Multiple Language Translation
    Dong, Daxiang
    Wu, Hua
    He, Wei
    Yu, Dianhai
    Wang, Haifeng
    PROCEEDINGS OF THE 53RD ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 7TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1, 2015, : 1723 - 1732
  • [48] Offensive language identification with multi-task learning
    Marcos Zampieri
    Tharindu Ranasinghe
    Diptanu Sarkar
    Alex Ororbia
    Journal of Intelligent Information Systems, 2023, 60 : 613 - 630
  • [49] Hierarchical multi-task learning withself-supervised auxiliary task
    Lee, Seunghan
    Park, Taeyoung
    KOREAN JOURNAL OF APPLIED STATISTICS, 2024, 37 (05)
  • [50] Coded Distributed Computing for Hierarchical Multi-task Learning
    Hu, Haoyang
    Li, Songze
    Cheng, Minquan
    Wu, Youlong
    2023 IEEE INFORMATION THEORY WORKSHOP, ITW, 2023, : 480 - 485