Imitation learning using graphical models

Cited by: 0
Authors
Verma, Deepak [1]
Rao, Rajesh P. N. [1]
Affiliation
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98105 USA
Funding
US National Science Foundation
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Imitation-based learning is a general mechanism for rapid acquisition of new behaviors in autonomous agents and robots. In this paper, we propose a new approach to learning by imitation based on parameter learning in probabilistic graphical models. Graphical models are used not only to model an agent's own dynamics but also the dynamics of an observed teacher. Parameter tying between the agent-teacher models ensures consistency and facilitates learning. Given only observations of the teacher's states, we use the expectation-maximization (EM) algorithm to learn both dynamics and policies within graphical models. We present results demonstrating that EM-based imitation learning outperforms pure exploration-based learning on a benchmark problem (the FlagWorld domain). We additionally show that the graphical model representation can be leveraged to incorporate domain knowledge (e.g., state space factoring) to achieve significant speed-up in learning.
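The idea of recovering a policy from state-only observations can be illustrated with a small sketch. This is not the authors' implementation: it assumes a tabular setting, treats the transition model as already known (the paper learns dynamics as well), and uses a hypothetical three-state chain rather than FlagWorld. The names `make_chain_dynamics` and `em_policy_from_states` are invented for illustration. The teacher's actions are hidden variables; the E-step computes a posterior over the action behind each observed transition, and the M-step re-estimates the policy from the expected action counts.

```python
def make_chain_dynamics(n_states=3, p_move=0.9):
    """Hypothetical 1-D chain world: action 0 = left, action 1 = right.

    Returns T with T[s][a][s2] = P(s2 | s, a). A move succeeds with
    probability p_move; otherwise the agent stays where it is.
    """
    T = [[[0.0] * n_states for _ in range(2)] for _ in range(n_states)]
    for s in range(n_states):
        for a, step in enumerate((-1, 1)):
            target = min(max(s + step, 0), n_states - 1)  # clamp at the walls
            T[s][a][target] += p_move
            T[s][a][s] += 1.0 - p_move
    return T


def em_policy_from_states(states, T, n_actions, n_iters=50):
    """Estimate pi[s][a] from observed states when actions are hidden.

    Assumption for this sketch: the dynamics T are known and fixed.
    """
    n_states = len(T)
    # Start from a uniform policy.
    pi = [[1.0 / n_actions] * n_actions for _ in range(n_states)]
    for _ in range(n_iters):
        counts = [[0.0] * n_actions for _ in range(n_states)]
        # E-step: posterior over the hidden action for each transition.
        for s, s2 in zip(states[:-1], states[1:]):
            w = [pi[s][a] * T[s][a][s2] for a in range(n_actions)]
            z = sum(w)
            if z == 0.0:
                continue  # transition has zero probability under the model
            for a in range(n_actions):
                counts[s][a] += w[a] / z
        # M-step: normalize the expected action counts in each visited state.
        for s in range(n_states):
            total = sum(counts[s])
            if total > 0.0:
                pi[s] = [c / total for c in counts[s]]
    return pi


# Usage: a made-up teacher trajectory on the chain.
T = make_chain_dynamics()
teacher_states = [0, 1, 2, 1, 2, 1, 2]
pi = em_policy_from_states(teacher_states, T, n_actions=2)
for s, row in enumerate(pi):
    print(s, [round(p, 3) for p in row])
```

Because the states are fully observed in this sketch, the action posterior factorizes over time steps, so no forward-backward pass is needed. With partially observed states or unknown dynamics, the E-step would require inference over the full graphical model.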
Pages: 757 / +
Page count: 2