Optimizing Policy via Deep Reinforcement Learning for Dialogue Management

被引:1
|
作者
Xu, Guanghao [1 ]
Lee, Hyunjung [2 ]
Koo, Myoung-Wan [1 ]
Seo, Jungyun [1 ]
机构
[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea
[2] Univ Leipzig, Inst Linguist, D-04107 Leipzig, Germany
关键词
Deep Reinforcement Learning; Dialogue Management; Dialogue Policy;
D O I
10.1109/BigComp.2018.00101
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose a dialogue manager model based on Deep Reinforcement Learning, which automatically optimizes a dialogue policy. The policy is trained within deep Q-learning algorithm, which efficiently approximates value of actions given a large space of dialogue state. Evaluation processes are conducted by comparing the performance of the proposed model to a rule-based one on the dialogue corpora of DSTC2 and 3 under three different levels of error rate in Spoken Language Understanding. Experimental results prove that given certain level of SLU error, the dialogue manager with self-learned policy shows higher completion rate and the robustness to SLU error. Overcoming the drawbacks of rule-based approach such as limited flexibility and high maintenance cost, our model shows the strength of self-learning algorithm in optimizing policy of dialogue manager without any hand-crafted features.
引用
收藏
页码:582 / 589
页数:8
相关论文
共 50 条
  • [41] Deep Reinforcement Learning for On-line Dialogue State Tracking
    Chen, Zhi
    Chen, Lu
    Zhou, Xiang
    Yu, Kai
    MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 278 - 292
  • [42] Diversity Evolutionary Policy Deep Reinforcement Learning
    Liu, Jian
    Feng, Liming
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
  • [43] Deep reinforcement learning for portfolio management
    Yang, Shantian
    KNOWLEDGE-BASED SYSTEMS, 2023, 278
  • [44] Resource Management with Deep Reinforcement Learning
    Mao, Hongzi
    Alizadeh, Mohammad
    Menache, Ishai
    Kandula, Srikanth
    PROCEEDINGS OF THE 15TH ACM WORKSHOP ON HOT TOPICS IN NETWORKS (HOTNETS '16), 2016, : 50 - 56
  • [45] Optimizing Data Center Energy Efficiency via Event-Driven Deep Reinforcement Learning
    Ran, Yongyi
    Zhou, Xin
    Hu, Han
    Wen, Yonggang
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (02) : 1296 - 1309
  • [46] Optimizing Drone Deployment for Maximized User Connectivity in Areas of Interest Via Deep Reinforcement Learning
    Kolichala Rajashekar
    Ashutosh Garg
    Anand M. Baswade
    Subhajit Sidhanta
    Journal of Network and Systems Management, 2025, 33 (3)
  • [47] Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator
    Li, Yining
    Yang, Tianpei
    Hao, Jianye
    Zheng, Yan
    Tang, Hongyao
    DISTRIBUTED ARTIFICIAL INTELLIGENCE, DAI 2022, 2023, 13824 : 29 - 44
  • [48] A Deep Reinforcement Learning Approach for Optimizing Inventory Management in the Agri-Food Supply Chain
    Murugeshwari, B.
    Mohanapriya, M. P.
    Merin, J. Brindha
    Akila, R.
    JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (04) : 2238 - 2247
  • [49] Optimizing Attention for Sequence Modeling via Reinforcement Learning
    Fei, Hao
    Zhang, Yue
    Ren, Yafeng
    Ji, Donghong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3612 - 3621
  • [50] Pixel-to-Action Policy for Underwater Pipeline Following via Deep Reinforcement Learning
    Liu, Yanan
    Wang, Fang
    Lv, Zeyu
    Cao, Kaihui
    Lin, Yuanshan
    2018 IEEE INTERNATIONAL CONFERENCE OF INTELLIGENT ROBOTICS AND CONTROL ENGINEERING (IRCE), 2018, : 135 - 139