Imitation learning using graphical models

Cited by: 0

Authors:

Verma, Deepak [1]

Rao, Rajesh P. N. [1]

Affiliations:

[1] Univ Washington, Dept Comp Sci & Engn, Seattle, WA 98105 USA

Source:

MACHINE LEARNING: ECML 2007, PROCEEDINGS | 2007 / Vol. 4701

Funding:

National Science Foundation (USA);

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Imitation-based learning is a general mechanism for rapid acquisition of new behaviors in autonomous agents and robots. In this paper, we propose a new approach to learning by imitation based on parameter learning in probabilistic graphical models. Graphical models are used not only to model an agent's own dynamics but also the dynamics of an observed teacher. Parameter tying between the agent-teacher models ensures consistency and facilitates learning. Given only observations of the teacher's states, we use the expectation-maximization (EM) algorithm to learn both dynamics and policies within graphical models. We present results demonstrating that EM-based imitation learning outperforms pure exploration-based learning on a benchmark problem (the FlagWorld domain). We additionally show that the graphical model representation can be leveraged to incorporate domain knowledge (e.g., state space factoring) to achieve significant speed-up in learning.

Citation

Pages: 757 / +

Page count: 2

50 items in total

[41] Learning of Discrete Graphical Models with Neural Networks
Abhijith, J.
Lokhov, Andrey Y.
Misra, Sidhant
Vuffray, Marc
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[42] Learning graphical models for stationary time series
Bach, FR
Jordan, MI
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) : 2189 - 2199
[43] Greedy Learning of Graphical Models with Small Girth
Ray, Avik
Sanghavi, Sujay
Shakkottai, Sanjay
2012 50TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2012, : 2024 - 2031
[44] Decomposable Graphical Models on Learning, Fusion and Revision
Schmidt, Fabian
Gebhardt, Joerg
Kruse, Rudolf
RECENT DEVELOPMENTS AND THE NEW DIRECTION IN SOFT-COMPUTING FOUNDATIONS AND APPLICATIONS, 2018, 361 : 439 - 452
[45] Unsupervised Learning with Truncated Gaussian Graphical Models
Su, Qinliang
Liao, Xuejun
Li, Chunyuan
Gan, Zhe
Carin, Lawrence
THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2583 - 2589
[46] Improved Greedy Algorithms for Learning Graphical Models
Ray, Avik
Sanghavi, Sujay
Shakkottai, Sanjay
IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (06) : 3457 - 3468
[47] Probabilistic Graphical Models: On Learning, Fusion, and Revision
Kruse, Rudolf
Bouguila, Nizar
Gregoire, Amphitheatre A.
2019 6TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT 2019), 2019,
[48] Learning Graphical Models From the Glauber Dynamics
Bresler, Guy
Gamarnik, David
Shah, Devavrat
IEEE TRANSACTIONS ON INFORMATION THEORY, 2018, 64 (06) : 4072 - 4080
[49] On the Difficulty of Learning Power Law Graphical Models
Tandon, Rashish
Ravikumar, Pradeep
2013 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY PROCEEDINGS (ISIT), 2013, : 2493 - 2497
[50] Empirical Bayesian learning in AR graphical models
Zorzi, Mattia
AUTOMATICA, 2019, 109
