DialogueBERT: A Self-Supervised Learning based Dialogue Pre-training Encoder

Cited by: 10
Authors
Zhang, Zhenyu [1 ]
Guo, Tao [2 ]
Chen, Meng [3 ]
Affiliations
[1] JD AI, Chengdu, Peoples R China
[2] Xiaoduo AI, Chengdu, Peoples R China
[3] JD AI, Beijing, Peoples R China
Keywords
Dialogue Pre-training Model; Dialogue Representation; Intent Recognition; Emotion Recognition; Named Entity Recognition;
DOI
10.1145/3459637.3482085
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline Code
0812 ;
Abstract
With the rapid development of artificial intelligence, conversational bots have become prevalent on mainstream e-commerce platforms, where they provide timely and convenient customer service. To satisfy users, a conversational bot needs to understand the user's intention, detect the user's emotion, and extract the key entities from conversational utterances. However, understanding dialogues is regarded as a very challenging task. Unlike common language understanding, utterances in dialogues alternate between different roles and are usually organized in hierarchical structures. To facilitate the understanding of dialogues, in this paper we propose a novel contextual dialogue encoder (i.e., DialogueBERT) based on the popular pre-trained language model BERT. Five self-supervised learning pre-training tasks are devised to learn the particularities of dialogue utterances. Four different input embeddings are integrated to capture the relationships between utterances: turn embedding, role embedding, token embedding, and position embedding. DialogueBERT was pre-trained on 70 million real-world dialogues and then fine-tuned on three different downstream dialogue understanding tasks. Experimental results show that DialogueBERT achieves 88.63% accuracy for intent recognition, 94.25% accuracy for emotion recognition, and a 97.04% F1 score for named entity recognition, outperforming several strong baselines by a large margin.
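The abstract's four-way input scheme can be sketched as follows. This is an illustrative reconstruction, not the authors' code: it assumes the four embeddings (token, position, turn, role) are summed per token in the style of BERT's token/segment/position embeddings, and all table sizes and IDs below are invented for the example.

```python
import numpy as np

# Hypothetical sketch of DialogueBERT's input layer as described in the
# abstract: token + position + turn + role embeddings summed per token.
# Table sizes are illustrative, not taken from the paper.
rng = np.random.default_rng(0)
HIDDEN = 768
tables = {
    "token":    rng.normal(size=(21128, HIDDEN)),  # subword vocabulary
    "position": rng.normal(size=(512,   HIDDEN)),  # token index in dialogue
    "turn":     rng.normal(size=(32,    HIDDEN)),  # which utterance turn
    "role":     rng.normal(size=(2,     HIDDEN)),  # speaker: user vs. bot
}

def embed(token_ids, turn_ids, role_ids):
    """Sum the four embedding lookups for each token of a flattened dialogue."""
    pos_ids = np.arange(len(token_ids))
    return (tables["token"][token_ids] + tables["position"][pos_ids]
            + tables["turn"][turn_ids] + tables["role"][role_ids])

# Toy two-turn dialogue: 3 user tokens (role 0), then 2 bot tokens (role 1).
x = embed(token_ids=[101, 704, 102, 872, 102],
          turn_ids=[0, 0, 0, 1, 1],
          role_ids=[0, 0, 0, 1, 1])
print(x.shape)  # (5, 768)
```

The resulting per-token vectors would then feed a standard BERT encoder stack; the turn and role tables are what let the model distinguish who said what and when, which plain BERT's segment embedding cannot express for multi-turn dialogues.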
Pages: 3647-3651
Page count: 5