End to End Spoken Language Understanding Using Partial Disentangled Slot Embedding

Authors
Liu, Tan [1]
Guo, Wu [1]
Affiliations
[1] Univ Sci & Technol China, Natl Engn Lab Speech & Language Informat Proc, Hefei, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
end to end; spoken language understanding; disentangled embedding;
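DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract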
Spoken language understanding (SLU) has recently shifted from pipeline approaches to end-to-end (E2E) ones. Most E2E approaches use neural networks to extract embeddings directly from the audio signal for final intent prediction. In this paper, we explore this method for intent classification on the Fluent Speech Commands (FSC) dataset, where each intent is a combination of three slots (action, object, and location). In the extracted embeddings, the information of different slots becomes entangled, which sometimes causes errors in the prediction of the current slot. To address this problem, we propose a partial disentangled slot embedding (PDSE) method based on adversarial training. Results show that the proposed method achieves an error rate of 0.53%, an error rate reduction of over 35.3% compared with the baseline.
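As a rough illustration of the intent structure described in the abstract, the sketch below scores an intent as correct only when all three of its slots are correct. This scoring rule, the names `Intent` and `intent_error_rate`, and the sample slot values are assumptions made for illustration; they are not taken from the paper.

```python
from typing import List, NamedTuple


class Intent(NamedTuple):
    """One intent expressed as a combination of three slot values."""
    action: str
    obj: str
    location: str


def intent_error_rate(predicted: List[Intent], reference: List[Intent]) -> float:
    """Fraction of utterances whose predicted intent differs from the reference.

    Assumption: an intent counts as correct only if all three slots match,
    so a single wrong slot makes the whole intent wrong.
    """
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        raise ValueError("at least one utterance is required")
    errors = sum(1 for p, r in zip(predicted, reference) if p != r)
    return errors / len(reference)


# Hypothetical slot values, chosen only to show the calculation.
reference = [
    Intent("activate", "lights", "kitchen"),
    Intent("increase", "volume", "none"),
    Intent("decrease", "heat", "bedroom"),
    Intent("deactivate", "music", "none"),
]
predicted = [
    Intent("activate", "lights", "kitchen"),
    Intent("increase", "volume", "none"),
    Intent("decrease", "heat", "washroom"),  # only the location slot is wrong
    Intent("deactivate", "music", "none"),
]

print(intent_error_rate(predicted, reference))  # 0.25
```

In this toy case a single wrong location slot turns one of four intents into an error, which is the kind of slot-level mistake the abstract attributes to entangled embeddings.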
Pages: 1062-1066
Number of pages: 5