Scalable Decision-Theoretic Planning in Open and Typed Multiagent Systems

被引:0
|
作者
Eck, Adam [1 ]
Shah, Maulik [2 ]
Doshi, Prashant [2 ]
Soh, Leen-Kiat [3 ]
机构
[1] Oberlin Coll, Oberlin, OH 44074 USA
[2] Univ Georgia, Athens, GA 30602 USA
[3] Univ Nebraska, Lincoln, NE 68583 USA
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In open agent systems, the set of agents that are cooperating or competing changes over time and in ways that are nontrivial to predict. For example, if collaborative robots were tasked with fighting wildfires, they may run out of suppressants and be temporarily unavailable to assist their peers. We consider the problem of planning in these contexts with the additional challenges that the agents are unable to communicate with each other and that there are many of them. Because an agent's optimal action depends on the actions of others, each agent must not only predict the actions of its peers. but, before that, reason whether they are even present to perform an action. Addressing openness thus requires agents to model each other's presence, which becomes computationally intractable with high numbers of agents. We present a novel, principled, and scalable method in this context that enables an agent to reason about others' presence in its shared environment and their actions. Our method extrapolates models of a few peers to the overall behavior of the many-agent system. and combines it with a generalization of Monte Carlo tree search to perform individual agent reasoning in manyagent open environments. Theoretical analyses establish the number of agents to model in order to achieve acceptable worst case bounds on extrapolation error, as well as regret bounds on the agent's utility from modeling only some neighbors. Simulations of multiagent wildfire suppression problems demonstrate our approach's efficacy compared with alternative baselines.
引用
收藏
页码:7127 / 7134
页数:8
相关论文
共 50 条
  • [41] Decision-Theoretic Planning with Person Trajectory Prediction for Social Navigation
    Perez-Hurtado, Ignacio
    Capitan, Jesus
    Caballero, Fernando
    Merino, Luis
    ROBOT 2015: SECOND IBERIAN ROBOTICS CONFERENCE: ADVANCES IN ROBOTICS, VOL 2, 2016, 418 : 247 - 258
  • [42] A Decision-Theoretic Planning Approach for Clinical Practice Guideline Modelling
    Acosta, Dionisio
    Garcia-Gomez, Juan M.
    2014 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI), 2014, : 197 - 200
  • [43] Decision-theoretic inspection planning using imperfect and incomplete data
    Di Francesco, Domenic
    Chryssanthopoulos, Marios
    Faber, Michael Havbro
    Bharadwaj, Ujjwal
    DATA-CENTRIC ENGINEERING, 2021, 2 (01):
  • [44] A decision-theoretic approach to the evaluation of information retrieval systems
    Wang, YD
    Forgionne, G
    INFORMATION PROCESSING & MANAGEMENT, 2006, 42 (04) : 863 - 874
  • [45] Decision-Theoretic Monitoring of Cyber-Physical Systems
    Yavolovsky, Andrey
    Zefran, Milos
    Sistla, A. Prasad
    RUNTIME VERIFICATION, (RV 2016), 2016, 10012 : 404 - 419
  • [46] Decision-theoretic Clustering of Strategies
    Bard, Nolan
    Nicholas, Deon
    Szepesvari, Csaba
    Bowling, Michael
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 17 - 25
  • [47] On decision-theoretic foundations for defaults
    Brafman, RI
    Friedman, N
    ARTIFICIAL INTELLIGENCE, 2001, 133 (1-2) : 1 - 33
  • [48] DECISION-THEORETIC SPEAKER RECOGNIZER
    KEITHSMITH, JE
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1962, 34 (12): : 1988 - &
  • [49] A Decision-Theoretic Model of Assistance
    Fern, Alan
    Natarajan, Sriraam
    Judah, Kshitij
    Tadepalli, Prasad
    JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2014, 50 : 71 - 104
  • [50] Decision-theoretic image retrieval
    Vasconcelos, N
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS III, 2002, 4862 : 114 - 125