Optimizing Policy via Deep Reinforcement Learning for Dialogue Management

被引：1

作者：

Xu, Guanghao ^{[1
]}

Lee, Hyunjung ^{[2
]}

Koo, Myoung-Wan ^{[1
]}

Seo, Jungyun ^{[1
]}

机构：

[1] Sogang Univ, Dept Comp Sci & Engn, Seoul, South Korea

[2] Univ Leipzig, Inst Linguist, D-04107 Leipzig, Germany

来源：

2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP) | 2018年

关键词：

Deep Reinforcement Learning; Dialogue Management; Dialogue Policy;

D O I：

10.1109/BigComp.2018.00101

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, we propose a dialogue manager model based on Deep Reinforcement Learning, which automatically optimizes a dialogue policy. The policy is trained within deep Q-learning algorithm, which efficiently approximates value of actions given a large space of dialogue state. Evaluation processes are conducted by comparing the performance of the proposed model to a rule-based one on the dialogue corpora of DSTC2 and 3 under three different levels of error rate in Spoken Language Understanding. Experimental results prove that given certain level of SLU error, the dialogue manager with self-learned policy shows higher completion rate and the robustness to SLU error. Overcoming the drawbacks of rule-based approach such as limited flexibility and high maintenance cost, our model shows the strength of self-learning algorithm in optimizing policy of dialogue manager without any hand-crafted features.

引用

页码：582 / 589

页数：8

共 50 条

[41] Deep Reinforcement Learning for On-line Dialogue State Tracking
Chen, Zhi
Chen, Lu
Zhou, Xiang
Yu, Kai
MAN-MACHINE SPEECH COMMUNICATION, NCMMSC 2022, 2023, 1765 : 278 - 292
[42] Diversity Evolutionary Policy Deep Reinforcement Learning
Liu, Jian
Feng, Liming
COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2021, 2021
[43] Deep reinforcement learning for portfolio management
Yang, Shantian
KNOWLEDGE-BASED SYSTEMS, 2023, 278
[44] Resource Management with Deep Reinforcement Learning
Mao, Hongzi
Alizadeh, Mohammad
Menache, Ishai
Kandula, Srikanth
PROCEEDINGS OF THE 15TH ACM WORKSHOP ON HOT TOPICS IN NETWORKS (HOTNETS '16), 2016, : 50 - 56
[45] Optimizing Data Center Energy Efficiency via Event-Driven Deep Reinforcement Learning
Ran, Yongyi
Zhou, Xin
Hu, Han
Wen, Yonggang
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (02) : 1296 - 1309
[46] Optimizing Drone Deployment for Maximized User Connectivity in Areas of Interest Via Deep Reinforcement Learning
Kolichala Rajashekar
Ashutosh Garg
Anand M. Baswade
Subhajit Sidhanta
Journal of Network and Systems Management, 2025, 33 (3)
[47] Efficient Deep Reinforcement Learning via Policy-Extended Successor Feature Approximator
Li, Yining
Yang, Tianpei
Hao, Jianye
Zheng, Yan
Tang, Hongyao
DISTRIBUTED ARTIFICIAL INTELLIGENCE, DAI 2022, 2023, 13824 : 29 - 44
[48] A Deep Reinforcement Learning Approach for Optimizing Inventory Management in the Agri-Food Supply Chain
Murugeshwari, B.
Mohanapriya, M. P.
Merin, J. Brindha
Akila, R.
JOURNAL OF ELECTRICAL SYSTEMS, 2024, 20 (04) : 2238 - 2247
[49] Optimizing Attention for Sequence Modeling via Reinforcement Learning
Fei, Hao
Zhang, Yue
Ren, Yafeng
Ji, Donghong
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (08) : 3612 - 3621
[50] Pixel-to-Action Policy for Underwater Pipeline Following via Deep Reinforcement Learning
Liu, Yanan
Wang, Fang
Lv, Zeyu
Cao, Kaihui
Lin, Yuanshan
2018 IEEE INTERNATIONAL CONFERENCE OF INTELLIGENT ROBOTICS AND CONTROL ENGINEERING (IRCE), 2018, : 135 - 139

← 1 2 3 4 5 →