A Collaborative Multi-agent Reinforcement Learning Framework for Dialog Action Decomposition

Authors
Wang, Huimin [1 ]
Wong, Kam-Fai [1 ]
Affiliation
[1] Chinese Univ Hong Kong, Hong Kong, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Most reinforcement learning methods for dialog policy learning train a centralized agent that selects a predefined joint action concatenating domain name, intent type, and slot name. Because the resulting action space is large, the centralized dialog agent requires a large number of user-agent interactions. In addition, designing the concatenated actions is laborious for engineers and may struggle with edge cases. To solve these problems, we model the dialog policy learning problem with a novel multi-agent framework, in which each part of the action is led by a different agent. The framework reduces labor costs for action templates and decreases the size of the action space for each agent. Furthermore, we relieve the non-stationarity problem, caused by the environment dynamics changing as the agents' policies evolve, by introducing a joint optimization process that allows agents to exchange their policy information. Concurrently, an independent experience replay buffer mechanism is integrated to reduce the dependence between gradients of samples and improve training efficiency. The effectiveness of the proposed framework is demonstrated in a multi-domain environment with both user simulator evaluation and human evaluation.
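The action-space reduction described in the abstract can be illustrated with a minimal sketch. The vocabularies, agent names, and the `compose` helper below are hypothetical and chosen only for illustration; they are not taken from the paper, and the sketch shows only the size argument, not the learning procedure.

```python
from itertools import product

# Hypothetical vocabularies, for illustration only (not from the paper).
DOMAINS = ["hotel", "restaurant", "train"]
INTENTS = ["inform", "request", "confirm", "book"]
SLOTS = ["name", "area", "price", "time", "people"]


def joint_action_space(domains, intents, slots):
    """Enumerate every concatenated (domain, intent, slot) action
    that a single centralized agent would have to choose among."""
    return list(product(domains, intents, slots))


def decomposed_action_spaces(domains, intents, slots):
    """Give each of three agents only its own part of the action."""
    return {
        "domain_agent": list(domains),
        "intent_agent": list(intents),
        "slot_agent": list(slots),
    }


def compose(domain, intent, slot):
    """Join the three per-agent choices into one dialog action string."""
    return f"{domain}-{intent}-{slot}"


joint = joint_action_space(DOMAINS, INTENTS, SLOTS)
per_agent = decomposed_action_spaces(DOMAINS, INTENTS, SLOTS)

print(len(joint))                                  # 3 * 4 * 5 = 60
print({k: len(v) for k, v in per_agent.items()})   # sizes 3, 4, 5
print(compose("hotel", "inform", "area"))          # hotel-inform-area
```

Under these assumed vocabularies, the centralized agent chooses among 60 concatenated actions, while each decomposed agent chooses among at most 5; the joint space grows multiplicatively with each vocabulary, whereas the per-agent spaces grow only with their own part.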
Pages: 7882 - 7889
Number of pages: 8