Multi-Agent Learning from Learners

被引:0
|
作者
Caliskan, Mine Melodi [1 ]
Chini, Francesco [1 ]
Maghsudi, Setareh [1 ]
机构
[1] Univ Tubingen, Dept Comp Sci, Tubingen, Germany
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A large body of the "Inverse Reinforcement Learning" (IRL) literature focuses on recovering the reward function from a set of demonstrations of an expert agent who acts optimally or noisily optimally. Nevertheless, some recent works move away from the optimality assumption to study the "Learning from a Learner (LfL)" problem, where the challenge is inferring the reward function of a learning agent from a sequence of demonstrations produced by progressively improving policies. In this work, we take one of the initial steps in addressing the multi-agent version of this problem and propose a new algorithm, MA-LfL (Multiagent Learning from a Learner). Unlike the state-of-the-art literature, which recovers the reward functions from trajectories produced by agents in some equilibrium, we study the problem of inferring the reward functions of interacting agents in a general sum stochastic game without assuming any equilibrium state. The MA-LfL algorithm is rigorously built on a theoretical result that ensures its validity in the case of agents learning according to a multi-agent soft policy iteration scheme. We empirically test MA-LfL and we observe high positive correlation between the recovered reward functions and the ground truth.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Personalised learning object based on multi-agent model and learners' learning styles
    Pukkhem, Noppamas
    Vatanawood, Wiwat
    MAEJO INTERNATIONAL JOURNAL OF SCIENCE AND TECHNOLOGY, 2011, 5 (03) : 292 - 311
  • [2] ADAPTATION TO LEARNERS' LEARNING STYLES IN A MULTI-AGENT E-LEARNING SYSTEM
    Pham Quang Dung
    Florea, Adina Magda
    LEVERAGING TECHNOLOGY FOR LEARNING, VOL II, 2012, : 259 - 266
  • [3] A Collaborative Learning System Based on Learners' Individuality Using Multi-agent
    Fan, Yin
    ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 51 - 55
  • [4] Multi-agent learning
    Eduardo Alonso
    Autonomous Agents and Multi-Agent Systems, 2007, 15 : 3 - 4
  • [5] Multi-agent learning
    Alonso, Eduardo
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2007, 15 (01) : 3 - 4
  • [6] Multi-Agent Joint Learning from Argumentation
    Xu, Junyi
    Yao, Li
    Li, Le
    Li, Jinyang
    AGENTS AND DATA MINING INTERACTION (ADMI 2013), 2014, 8316 : 14 - 25
  • [7] Multi-Agent Cognition Difference Reinforcement Learning for Multi-Agent Cooperation
    Wang, Huimu
    Qiu, Tenghai
    Liu, Zhen
    Pu, Zhiqiang
    Yi, Jianqiang
    Yuan, Wanmai
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [8] Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning
    Chen, Hao
    Yang, Guangkai
    Zhang, Junge
    Yin, Qiyue
    Huang, Kaiqi
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [9] Multi-Agent Reinforcement Learning With Distributed Targeted Multi-Agent Communication
    Xu, Chi
    Zhang, Hui
    Zhang, Ya
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2915 - 2920
  • [10] Profiling Learners Behavior: A Multi-Agent Approach to Support Diagnosis in Learning Management System
    Chiu, Hsiao-Ya
    Third 2008 International Conference on Convergence and Hybrid Information Technology, Vol 2, Proceedings, 2008, : 1177 - 1181