Multi-Agent Learning from Learners

被引：0

作者：

Caliskan, Mine Melodi ^{[1
]}

Chini, Francesco ^{[1
]}

Maghsudi, Setareh ^{[1
]}

机构：

[1] Univ Tubingen, Dept Comp Sci, Tubingen, Germany

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

A large body of the "Inverse Reinforcement Learning" (IRL) literature focuses on recovering the reward function from a set of demonstrations of an expert agent who acts optimally or noisily optimally. Nevertheless, some recent works move away from the optimality assumption to study the "Learning from a Learner (LfL)" problem, where the challenge is inferring the reward function of a learning agent from a sequence of demonstrations produced by progressively improving policies. In this work, we take one of the initial steps in addressing the multi-agent version of this problem and propose a new algorithm, MA-LfL (Multiagent Learning from a Learner). Unlike the state-of-the-art literature, which recovers the reward functions from trajectories produced by agents in some equilibrium, we study the problem of inferring the reward functions of interacting agents in a general sum stochastic game without assuming any equilibrium state. The MA-LfL algorithm is rigorously built on a theoretical result that ensures its validity in the case of agents learning according to a multi-agent soft policy iteration scheme. We empirically test MA-LfL and we observe high positive correlation between the recovered reward functions and the ground truth.

引用

页数：16

共 50 条

[41] Social learning in a multi-agent system
Noble, J
Franks, DW
COMPUTING AND INFORMATICS, 2003, 22 (06) : 561 - 574
[42] Intelligent Multi-agent Coordination and Learning
Chang, Yu-Cheng
Dostovalova, Anna
Lin, Chin-Teng
Kim, Jijoong
2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 1431 - 1436
[43] Language Learning in Multi-Agent Systems
Allen, Martin
Goldman, Claudia V.
Zilberstein, Shlomo
19TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-05), 2005, : 1649 - 1650
[44] The Dynamics of Multi-Agent Reinforcement Learning
Dickens, Luke
Broda, Krysia
Russo, Alessandra
ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 367 - 372
[45] Adaptive Learning for Multi-Agent Navigation
Godoy, Julio
Karamouzas, Ioannis
Guy, Stephen J.
Gini, Maria
PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 1577 - 1585
[46] Learning Fairness in Multi-Agent Systems
Jiang, Jiechuan
Lu, Zongqing
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[47] Learning multi-agent search strategies
Strens, MJA
ADAPTIVE AGENTS AND MULTI-AGENT SYSTEMS II: ADAPTATION AND MULTI-AGENT LEARNING, 2005, 3394 : 245 - 259
[48] Multi-agent reinforcement learning: A survey
Busoniu, Lucian
Babuska, Robert
De Schutter, Bart
2006 9TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION, VOLS 1- 5, 2006, : 1133 - +
[49] Multi-Agent Learning with Policy Prediction
Zhang, Chongjie
Lesser, Victor
PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 927 - 934
[50] Multi-Agent Reinforcement Learning for Microgrids
Dimeas, A. L.
Hatziargyriou, N. D.
IEEE POWER AND ENERGY SOCIETY GENERAL MEETING 2010, 2010,

← 1 2 3 4 5 →