A Collaborative Multi-agent Reinforcement Learning Framework for Dialog Action Decomposition

Cited by: 0

Authors:

Wang, Huimin [1]

Wong, Kam-Fai [1]

Affiliations:

[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China

Source:

2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021) | 2021

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Most reinforcement learning methods for dialog policy learning train a centralized agent that selects a predefined joint action concatenating domain name, intent type, and slot name. Because the resulting action space is large, the centralized dialog agent requires a great many user-agent interactions to learn. In addition, designing the concatenated actions is laborious for engineers and may struggle with edge cases. To solve these problems, we model the dialog policy learning problem with a novel multi-agent framework in which each part of the action is handled by a different agent. The framework reduces the labor cost of writing action templates and decreases the size of the action space for each agent. Furthermore, we relieve the non-stationarity problem, which arises because the environment dynamics change as the agents' policies evolve, by introducing a joint optimization process that lets agents exchange their policy information. In addition, an independent experience replay buffer mechanism is integrated to reduce the dependence between the gradients of samples and thereby improve training efficiency. The effectiveness of the proposed framework is demonstrated in a multi-domain environment with both user simulator evaluation and human evaluation.
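The action-space argument in the abstract can be illustrated with a small sketch. The domain, intent, and slot vocabularies below are hypothetical values chosen only for illustration, and the helper names are assumptions; none of them come from the paper. The sketch only compares how many options a centralized agent must choose among (the product of the component sizes) with how many each component agent faces after decomposition.

```python
from itertools import product

# Hypothetical vocabularies, chosen only for illustration (not from the paper).
DOMAINS = ["hotel", "restaurant", "train"]
INTENTS = ["inform", "request", "recommend", "book"]
SLOTS = ["area", "price", "name", "time", "none"]


def joint_action_space(domains, intents, slots):
    """All concatenated domain-intent-slot actions a centralized agent selects from."""
    return ["-".join(parts) for parts in product(domains, intents, slots)]


def decomposed_action_spaces(domains, intents, slots):
    """One smaller action space per agent, with one agent per action component."""
    return {"domain": list(domains), "intent": list(intents), "slot": list(slots)}


def compose(domain, intent, slot):
    """Combine the three agents' individual choices into one dialog action."""
    return f"{domain}-{intent}-{slot}"


joint = joint_action_space(DOMAINS, INTENTS, SLOTS)
per_agent = decomposed_action_spaces(DOMAINS, INTENTS, SLOTS)

# Centralized agent: one choice among |D| * |I| * |S| actions.
joint_size = len(joint)

# Decomposed agents: each chooses within its own component only.
per_agent_sizes = {name: len(space) for name, space in per_agent.items()}
total_outputs = sum(per_agent_sizes.values())

print("joint action space size:", joint_size)
print("per-agent action space sizes:", per_agent_sizes)
print("sum of per-agent sizes:", total_outputs)
```

With these assumed vocabularies the centralized agent chooses among 3 x 4 x 5 joint actions, while no single component agent chooses among more than 5. The sketch does not model the joint optimization or the replay buffers described in the abstract.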


Pages: 7882-7889

Number of pages: 8

50 related records in total

[41] Multi-Vehicle Collaborative Lane Changing Based on Multi-Agent Reinforcement Learning
Zhang, Xiang
Li, Shihao
Wang, Boyang
Xue, Mingxuan
Li, Zhiwei
Liu, Haiou
2024 35TH IEEE INTELLIGENT VEHICLES SYMPOSIUM, IEEE IV 2024, 2024, : 1214 - 1221
[42] A multi-agent collaborative learning scheme for young university teachers based on reinforcement learning
Jia, Fei
World Transactions on Engineering and Technology Education, 2013, 11 (04): : 495 - 499
[43] A MULTI-AGENT FRAMEWORK FOR A HYBRID DIALOG MANAGEMENT SYSTEM
Schwaerzler, Stefan
Schenk, Joachim
Ruske, Guenther
Wallhoff, Frank
ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 958 - 961
[44] Multi-agent reinforcement learning with bidding for automatic segmentation of action sequences
Sun, R
Sessions, C
FOURTH INTERNATIONAL CONFERENCE ON MULTIAGENT SYSTEMS, PROCEEDINGS, 2000, : 445 - 446
[45] Off-Policy Action Anticipation in Multi-Agent Reinforcement Learning
Bighashdel, Ariyan
de Geus, Daan
Jancura, Pavol
Dubbelman, Gijs
JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
[46] Macro-Action-Based Deep Multi-Agent Reinforcement Learning
Xiao, Yuchen
Hoffman, Joshua
Amato, Christopher
CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
[47] Intrinsic Action Tendency Consistency for Cooperative Multi-Agent Reinforcement Learning
Zhang, Junkai
Zhang, Yifan
Zhang, Xi Sheryl
Zang, Yifan
Cheng, Jian
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 17600 - 17608
[48] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
Wang, Huimu
Qiu, Tenghai
Liu, Zhen
Pu, Zhiqiang
Yi, Jianqiang
Yuan, Wanmai
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[49] A multi-agent deep reinforcement learning framework for automated driving on highways
Bakker, Louis
Grammatico, Sergio
2020 28TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2020, : 770 - 775
[50] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
Xu, Chi
Zhang, Hui
Zhang, Ya
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
