A semantic and emotion-based dual latent variable generation model for a dialogue system

被引:32
|
作者
Yan, Ming [1 ,2 ,3 ,6 ]
Lou, Xingrui [1 ,2 ]
Chan, Chien Aun [4 ]
Wang, Yan [5 ]
Jiang, Wei [3 ]
机构
[1] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
[2] Commun Univ China, Sch Informat & Commun Engn, Beijing, Peoples R China
[3] Commun Univ China, Key Lab Acoust Visual Technol & Intelligent Contro, Beijing, Peoples R China
[4] Univ Melbourne, Dept Elect & Elect Engn, Melbourne, Vic, Australia
[5] Commun Univ China, Sch Data Sci & Intelligent Media, Beijing, Peoples R China
[6] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing 100024, Peoples R China
基金
中国国家自然科学基金;
关键词
conditional variational autoencoder; dual latent space; emotional responses; latent variable generation;
D O I
10.1049/cit2.12153
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the development of intelligent agents pursuing humanisation, artificial intelligence must consider emotion, the most basic spiritual need in human interaction. Traditional emotional dialogue systems usually use an external emotional dictionary to select appropriate emotional words to add to the response or concatenate emotional tags and semantic features in the decoding step to generate appropriate responses. However, selecting emotional words from a fixed emotional dictionary may result in loss of the diversity and consistency of the response. We propose a semantic and emotion-based dual latent variable generation model (Dual-LVG) for dialogue systems, which is able to generate appropriate emotional responses without an emotional dictionary. Different from previous work, the conditional variational autoencoder (CVAE) adopts the standard transformer structure. Then, Dual-LVG regularises the CVAE latent space by introducing a dual latent space of semantics and emotion. The content diversity and emotional accuracy of the generated responses are improved by learning emotion and semantic features respectively. Moreover, the average attention mechanism is adopted to better extract semantic features at the sequence level, and the semi-supervised attention mechanism is used in the decoding step to strengthen the fusion of emotional features of the model. Experimental results show that Dual-LVG can successfully achieve the effect of generating different content by controlling emotional factors.
引用
收藏
页码:319 / 330
页数:12
相关论文
共 50 条
  • [1] DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation
    Lee, Jing Yang
    Lee, Kong Aik
    Gan, Woon Seng
    ICAART: PROCEEDINGS OF THE 14TH INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE - VOL 2, 2022, : 193 - 202
  • [2] Automatic Emotion-based Image Semantic Annotation
    Zhang, Jingjing
    Cao, Yan
    Mu, Xiangwei
    MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 624 - 628
  • [3] Emotion-based Method for Latent Followee Recommendation in Twitter
    Akiyama, Kazuhiro
    Kumamoto, Tadahiko
    Nadamoto, Akiyo
    19TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS2017), 2017, : 121 - 125
  • [4] A Dual Latent Variable Personalized Dialogue Agent
    Lee J.Y.
    Lee K.A.
    Gan W.S.
    SN Computer Science, 4 (2)
  • [5] Emotion-based Media Recommendation System
    Aote, Shailendra S.
    Muley, Aayush
    Kotgirwar, Adesh
    Daware, Yash
    Shukla, Gaurav
    Kapse, Jayesh
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2021, 12 (05): : 587 - 593
  • [6] Emotion-based smart recruitment system
    Khosla, R
    Lai, C
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2005, 3682 : 243 - 250
  • [7] CoLV: A Collaborative Latent Variable Model for Knowledge-Grounded Dialogue Generation
    Zhan, Haolan
    Shen, Lei
    Chen, Hongshen
    Zhang, Hainan
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 2250 - 2261
  • [8] PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable
    Bao, Siqi
    He, Huang
    Wang, Fan
    Wu, Hua
    Wang, Haifeng
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 85 - 96
  • [9] Continuous Emotion-Based Image-to-Music Generation
    Wang, Yajie
    Chen, Mulin
    Li, Xuelong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 5670 - 5679
  • [10] Emotion-Based Painting Image Display System
    Lee, Taemin
    Kang, Dongwann
    Yoon, Kyunghyun
    Seo, Sanghyun
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2020, 26 (01): : 181 - 192