Imitation learning using graphical models

被引:0
|
作者
Verma, Deepak [1 ]
Rao, Rajesh P. N. [1 ]
机构
[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98105 USA
来源
基金
美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Imitation-based learning is a general mechanism for rapid acquisition of new behaviors in autonomous agents and robots. In this paper, we propose a new approach to learning by imitation based on parameter learning in probabilistic graphical models. Graphical models are used not only to model an agent's own dynamics but also the dynamics of an observed teacher. Parameter tying between the agent-teacher models ensures consistency and facilitates learning. Given only observations of the teacher's states, we use the expect at ion-maximization (EM) algorithm to learn both dynamics and policies within graphical models. We present results demonstrating that EM-based imitation learning outperforms pure exploration-based learning on a benchmark problem (the FlagWorld domain). We additionally show that the graphical model representation can be leveraged to incorporate domain knowledge (e.g., state space factoring) to achieve significant speed-up in learning.
引用
收藏
页码:757 / +
页数:2
相关论文
共 50 条
  • [41] Learning of Discrete Graphical Models with Neural Networks
    Abhijith, J.
    Lokhov, Andrey Y.
    Misra, Sidhant
    Vuffray, Marc
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [42] Learning graphical models for stationary time series
    Bach, FR
    Jordan, MI
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) : 2189 - 2199
  • [43] Greedy Learning of Graphical Models with Small Girth
    Ray, Avik
    Sanghavi, Sujay
    Shakkottai, Sanjay
    2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 2024 - 2031
  • [44] Decomposable Graphical Models on Learning, Fusion and Revision
    Schmidt, Fabian
    Gebhardt, Joerg
    Kruse, Rudolf
    RECENT DEVELOPMENTS AND THE NEW DIRECTION IN SOFT-COMPUTING FOUNDATIONS AND APPLICATIONS, 2018, 361 : 439 - 452
  • [45] Unsupervised Learning with Truncated Gaussian Graphical Models
    Su, Qinliang
    Liao, Xuejun
    Li, Chunyuan
    Gan, Zhe
    Carin, Lawrence
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2583 - 2589
  • [46] Improved Greedy Algorithms for Learning Graphical Models
    Ray, Avik
    Sanghavi, Sujay
    Shakkottai, Sanjay
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (06) : 3457 - 3468
  • [47] Probabilistic Graphical Models: On Learning, Fusion, and Revision
    Kruse, Rudolf
    Bouguila, Nizar
    Gregoire, Amphitheatre A.
    2019 6TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT 2019), 2019,
  • [48] Learning Graphical Models From the Glauber Dynamics
    Bresler, Guy
    Gamarnik, David
    Shah, Devavrat
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (06) : 4072 - 4080
  • [49] On the Difficulty of Learning Power Law Graphical Models
    Tandon, Rashish
    Ravikumar, Pradeep
    2013 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2013, : 2493 - 2497
  • [50] Empirical Bayesian learning in AR graphical models
    Zorzi, Mattia
    AUTOMATICA, 2019, 109