Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation

被引:0
|
作者
Phan, Thomy [1 ]
Belzner, Lenz [1 ]
Gabor, Thomas [1 ]
Schmid, Kyrill [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Inst Informat, Munich, Germany
关键词
multi-agent planning; online planning; value function approximation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Making decisions is a great challenge in distributed autonomous environments due to enormous state spaces and uncertainty. Many online planning algorithms rely on statistical sampling to avoid searching the whole state space, while still being able to make acceptable decisions. However, planning often has to be performed under strict computational constraints making online planning in multi-agent systems highly limited, which could lead to poor system performance, especially in stochastic domains. In this paper, we propose Emergent Value function Approximation for Distributed Environments (EVADE), an approach to integrate global experience into multi-agent online planning in stochastic domains to consider global effects during local planning. For this purpose, a value function is approximated online based on the emergent system behaviour by using methods of reinforcement learning. We empirically evaluated EVADE with two statistical multi-agent online planning algorithms in a highly complex and stochastic smart factory environment, where multiple agents need to process various items at a shared set of machines. Our experiments show that EVADE can effectively improve the performance of multi-agent online planning while offering efficiency w.r.t. the breadth and depth of the planning process.
引用
收藏
页码:730 / 738
页数:9
相关论文
共 50 条
  • [1] Scalable Online Planning for Multi-Agent MDPs
    Choudhury S.
    Gupta J.K.
    Morales P.
    Kochenderfer M.J.
    Journal of Artificial Intelligence Research, 2022, 73 : 821 - 846
  • [2] Scalable Online Planning for Multi-Agent MDPs
    Choudhury, Shushman
    Gupta, Jayesh K.
    Morales, Peter
    Kochenderfer, Mykel J.
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2022, 73 : 821 - 846
  • [3] Online planning for multi-agent systems with bounded communication
    Wu, Feng
    Zilberstein, Shlomo
    Chen, Xiaoping
    ARTIFICIAL INTELLIGENCE, 2011, 175 (02) : 487 - 511
  • [4] Function approximation based multi-agent reinforcement learning
    Abul, O
    Polat, F
    Alhajj, R
    12TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, : 36 - 39
  • [5] Determining the value of information for collaborative multi-agent planning
    David Sarne
    Barbara J. Grosz
    Autonomous Agents and Multi-Agent Systems, 2013, 26 : 456 - 496
  • [6] Determining the value of information for collaborative multi-agent planning
    Sarne, David
    Grosz, Barbara J.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2013, 26 (03) : 456 - 496
  • [7] Multi-Agent Planning under Uncertainty with Monte Carlo Q-Value Function
    Zhang, Jian
    Pan, Yaozong
    Wang, Ruili
    Fang, Yuqiang
    Yang, Haitao
    APPLIED SCIENCES-BASEL, 2019, 9 (07):
  • [8] Statistical Planning: Building Models of Entropy of Centralized Planning for Multi-Agent Systems
    Jacopin, Eric
    2018 56TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2018, : 100 - 109
  • [9] Adaptive Fuzzy Function Approximation for Multi-Agent Reinforcement Learning
    Wu, Cheng
    Meleis, Waleed
    2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 2, 2009, : 169 - 176
  • [10] Multi-Agent Q-Learning with Joint State Value Approximation
    Chen Gang
    Cao Weihua
    Chen Xin
    Wu Min
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4878 - 4882