Style Attuned Pre-training and Parameter Efficient Fine-tuning for Spoken Language Understanding

Cited by: 1
Authors
Cao, Jin [1 ]
Wang, Jun [1 ]
Hamza, Wael [1 ]
Vanee, Kelly [1 ]
Li, Shang-Wen [1 ]
Affiliations
[1] Amazon AI, Beijing, People's Republic of China
Source: INTERSPEECH 2020
Keywords: spoken language understanding (SLU); intent classification; slot labeling; transfer learning; networks
DOI: 10.21437/Interspeech.2020-2907
Chinese Library Classification: R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline codes: 100104; 100213
Abstract
Neural models have yielded state-of-the-art results in solving spoken language understanding (SLU) problems; however, these models require a significant amount of domain-specific labeled examples for training, which is prohibitively expensive. While pre-trained language models like BERT have been shown to capture a massive amount of knowledge by learning from unlabeled corpora and to solve SLU with fewer labeled examples for adaptation, the encoding of knowledge is implicit and agnostic to downstream tasks. Such encoding results in inefficient parameter usage: an entirely new model is required for every domain. To address these challenges, we introduce a novel SLU framework comprising a conversational language modeling (CLM) pre-training task and a light encoder architecture. The CLM pre-training enables networks to capture the representation of language in conversational style, in the presence of ASR errors. The light encoder architecture separates the shared pre-trained networks from the mappings of generally encoded knowledge to specific SLU domains, allowing domain adaptation to be performed solely at the light encoder and thus increasing efficiency. With this framework, we match state-of-the-art SLU results on Alexa internal datasets and on two public ones (ATIS, SNIPS), adding only 4.4% parameters per task.
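The parameter-efficiency idea in the abstract can be illustrated with a minimal sketch of the split between a frozen, shared pre-trained backbone and a small trainable per-domain module carrying the intent and slot heads. The sketch below is in PyTorch; every class name, layer size, and label count is an assumption made for illustration, and SharedEncoder is only a placeholder for the CLM-pre-trained network, not the authors' implementation.

import torch
import torch.nn as nn

class SharedEncoder(nn.Module):
    # Stand-in for the shared, CLM-pre-trained backbone; in practice its
    # pre-trained weights would be loaded and then kept frozen.
    def __init__(self, vocab_size=30000, hidden=768, layers=4, heads=8):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        layer = nn.TransformerEncoderLayer(d_model=hidden, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=layers)

    def forward(self, input_ids):                      # (batch, seq) -> (batch, seq, hidden)
        return self.encoder(self.embed(input_ids))

class LightDomainEncoder(nn.Module):
    # Small per-domain module: the only part trained during domain adaptation.
    def __init__(self, hidden=768, light=256, num_intents=20, num_slots=40):
        super().__init__()
        self.proj = nn.Sequential(nn.Linear(hidden, light), nn.Tanh())
        self.intent_head = nn.Linear(light, num_intents)    # utterance-level intent
        self.slot_head = nn.Linear(light, num_slots)        # token-level slot labels

    def forward(self, states):
        h = self.proj(states)
        return self.intent_head(h[:, 0]), self.slot_head(h) # first token summarizes the utterance

class SLUModel(nn.Module):
    def __init__(self, shared, light):
        super().__init__()
        self.shared, self.light = shared, light
        for p in self.shared.parameters():
            p.requires_grad = False                    # backbone stays fixed across all domains

    def forward(self, input_ids):
        with torch.no_grad():                          # no gradients through the shared encoder
            states = self.shared(input_ids)
        return self.light(states)

# Rough parameter accounting: each additional domain only adds a LightDomainEncoder.
shared, light = SharedEncoder(), LightDomainEncoder()
model = SLUModel(shared, light)
n_shared = sum(p.numel() for p in shared.parameters())
n_light = sum(p.numel() for p in light.parameters())
print(f"trainable per-domain parameters: {100 * n_light / (n_shared + n_light):.1f}% of total")
intent_logits, slot_logits = model(torch.randint(0, 30000, (2, 16)))  # toy batch of 2 utterances

Each new domain would instantiate its own LightDomainEncoder against the same frozen SharedEncoder, which is why the per-task parameter overhead stays small; the exact fraction depends on the backbone and light-encoder sizes (the paper reports 4.4% per task).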
Pages: 1570-1574 (5 pages)
Related papers (50 total)
  • [31] Parameter-efficient fine-tuning of large-scale pre-trained language models. Ding, Ning; Qin, Yujia; Yang, Guang; Wei, Fuchao; Yang, Zonghan; Su, Yusheng; Hu, Shengding; Chen, Yulin; Chan, Chi-Min; Chen, Weize; Yi, Jing; Zhao, Weilin; Wang, Xiaozhi; Liu, Zhiyuan; Zheng, Hai-Tao; Chen, Jianfei; Liu, Yang; Tang, Jie; Li, Juanzi; Sun, Maosong. Nature Machine Intelligence, 2023, 5(3): 220-235.
  • [33] Investigation of improving the pre-training and fine-tuning of BERT model for biomedical relation extraction. Su, Peng; Vijay-Shanker, K. BMC Bioinformatics, 2022, 23(1).
  • [35] Few-Shot Intent Detection via Contrastive Pre-Training and Fine-Tuning. Zhang, Jian-Guo; Bui, Trung; Yoon, Seunghyun; Chen, Xiang; Liu, Zhiwei; Xia, Congying; Tran, Quan Hung; Chang, Walter; Yu, Philip. 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021), 2021: 1906-1912.
  • [36] A New Pre-training Method for Training Deep Learning Models with Application to Spoken Language Understanding. Celikyilmaz, Asli; Sarikaya, Ruhi; Hakkani-Tur, Dilek; Liu, Xiaohu; Ramesh, Nikhil; Tur, Gokhan. 17th Annual Conference of the International Speech Communication Association (INTERSPEECH 2016), 2016: 3255-3259.
  • [37] Food Image Recognition Using Deep Convolutional Network with Pre-training and Fine-tuning. Yanai, Keiji; Kawano, Yoshiyuki. 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW), 2015.
  • [38] Robust Face Tracking Using Siamese-VGG with Pre-training and Fine-tuning. Yuan, Shuo; Yu, Xinguo; Majid, Abdul. 2019 4th International Conference on Control and Robotics Engineering (ICCRE), 2019: 170-174.
  • [39] ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation. Xiao, Dongling; Zhang, Han; Li, Yukun; Sun, Yu; Tian, Hao; Wu, Hua; Wang, Haifeng. Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, 2020: 3997-4003.
  • [40] On the Effectiveness of Parameter-Efficient Fine-Tuning. Fu, Zihao; Yang, Haoran; So, Anthony Man-Cho; Lam, Wai; Bing, Lidong; Collier, Nigel. Thirty-Seventh AAAI Conference on Artificial Intelligence, Vol. 37, No. 11, 2023: 12799-12807.