Fast Teammate Adaptation in the Presence of Sudden Policy Change

Cited by: 0
Authors
Zhang, Ziqian [1]
Yuan, Lei [1,2]
Li, Lihe [1]
Xue, Ke [1]
Jia, Chengxing [1,2]
Guan, Cong [1]
Qian, Chao [1]
Yu, Yang [1,2]
Affiliations
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Polixir Technol, Nanjing, Peoples R China
Source
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE | 2023 / Vol. 216
Funding
US National Science Foundation;
Keywords
REINFORCEMENT;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Cooperative multi-agent reinforcement learning (MARL), in which agents coordinate with teammates toward a shared goal, may suffer from non-stationarity caused by changes in teammates' policies. Prior work mainly concentrates on policy changes across episodes, ignoring the fact that teammates may undergo a sudden policy change within an episode, which can lead to miscoordination and poor performance. We formulate the problem as an open Dec-POMDP, in which we control some agents to coordinate with uncontrolled teammates whose policies may change within one episode. We then develop a new framework, Fast teammates adaptation (Fastap), to address the problem. Concretely, we first train versatile teammates' policies and assign them to different clusters via the Chinese Restaurant Process (CRP). Then, we train the controlled agent(s) to coordinate with the sampled uncontrolled teammates by capturing their identifications as context for fast adaptation. Finally, each agent uses its local information to anticipate the teammates' context and make decisions accordingly. This process proceeds alternately, leading to a robust policy that can adapt to any teammates during the decentralized execution phase. We show on multiple multi-agent benchmarks that Fastap achieves better performance than multiple baselines in both stationary and non-stationary scenarios.
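The abstract names the Chinese Restaurant Process (CRP) as the mechanism for assigning teammate policies to clusters. As a rough illustration of the CRP prior only, and not the paper's actual clustering procedure (which this record does not describe), the following minimal sketch draws sequential cluster assignments. The function name crp_assign, the concentration parameter alpha, and the seed argument are assumptions introduced here for illustration.

```python
import random
from typing import List


def crp_assign(num_items: int, alpha: float, seed: int = 0) -> List[int]:
    """Draw cluster labels for num_items items from a CRP prior.

    Hypothetical helper: item n joins existing cluster k with probability
    counts[k] / (n + alpha) and opens a new cluster with probability
    alpha / (n + alpha). alpha must be positive.
    """
    rng = random.Random(seed)
    counts: List[int] = []       # number of items currently in each cluster
    assignments: List[int] = []  # cluster label chosen for each item
    for _ in range(num_items):
        # Unnormalised weights: one per existing cluster, plus alpha for a new one.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)     # open a new cluster
        else:
            counts[k] += 1       # join an existing cluster
        assignments.append(k)
    return assignments


if __name__ == "__main__":
    print(crp_assign(num_items=10, alpha=1.0))
```

Larger values of alpha make new clusters more likely, so the number of clusters grows with the data rather than being fixed in advance. A full clustering method would combine this prior with a likelihood over policy features, which is omitted here.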
Pages: 2465-2476
Number of pages: 12