Contrastive Example-Based Control

被引:0
|
作者
Hatch, Kyle [1 ]
Eysenbach, Benjamin [2 ]
Rafailov, Rafael [1 ]
Yu, Tianhe [1 ]
Salakhutdinov, Ruslan [2 ]
Levine, Sergey [3 ]
Finn, Chelsea [1 ]
机构
[1] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
[2] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA USA
[3] Univ Calif Berkeley, Dept Elect Engn & Comp Sci, Berkeley, CA USA
关键词
reinforcement learning; offline RL; robot learning; reward learning; contrastive learning; model-based reinforcement learning; example-based control; reward-free learning;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While many real-world problems that might benefit from reinforcement learning, these problems rarely fit into the MDP mold: interacting with the environment is often expensive and specifying reward functions is challenging. Motivated by these challenges, prior work has developed data-driven approaches that learn entirely from samples from the transition dynamics and examples of high-return states. These methods typically learn a reward function from high-return states, use that reward function to label the transitions, and then apply an offline RL algorithm to these transitions. While these methods can achieve good results on many tasks, they can be complex, often requiring regularization and temporal difference updates. In this paper, we propose a method for offline, example-based control that learns an implicit model of multi-step transitions, rather than a reward function. We show that this implicit model can represent the Q-values for the example-based control problem. Across a range of state-based and image-based offline control tasks, our method outperforms baselines that use learned reward functions; additional experiments demonstrate improved robustness and scaling with dataset size.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Example-based caricature generation with exaggeration control
    Yang, Wei
    Toyoura, Masahiro
    Xu, Jiayi
    Ohnuma, Fumio
    Mao, Xiaoyang
    VISUAL COMPUTER, 2016, 32 (03): : 383 - 392
  • [2] Example-based caricature generation with exaggeration control
    Wei Yang
    Masahiro Toyoura
    Jiayi Xu
    Fumio Ohnuma
    Xiaoyang Mao
    The Visual Computer, 2016, 32 : 383 - 392
  • [3] Example-based Antialiasing
    Han, Jian-Wei
    Yang, Bai-Lin
    Jiang, Zhao-Yi
    Wang, Xun
    INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMMUNICATION ENGINEERING (CSCE 2015), 2015, : 1177 - 1182
  • [4] Example-based automatic portraiture
    Chen, Hong
    Zheng, Nan-Ning
    Liang, Lin
    Xu, Ying-Qing
    Shum, Heung-Yeung
    Jisuanji Xuebao/Chinese Journal of Computers, 2003, 26 (02): : 147 - 152
  • [5] Interactive example-based hatching
    Gerl, Moritz
    Isenberg, Tobias
    COMPUTERS & GRAPHICS-UK, 2013, 37 (1-2): : 65 - 80
  • [6] An Example-Based Face Relighting
    Shim, Hyunjung
    Chen, Tsuhan
    ENGINEERING REALITY OF VIRTUAL REALITY 2012, 2012, 8289
  • [7] Example-based cosmetic transfer
    Tong, Wai-Shun
    Tang, Chi-Keung
    Brown, Michael S.
    Xu, Ying-Qing
    PACIFIC GRAPHICS 2007: 15TH PACIFIC CONFERENCE ON COMPUTER GRAPHICS AND APPLICATIONS, 2007, : 211 - +
  • [8] Example-Based Program Transformation
    Robbes, Romain
    Lanza, Michele
    MODEL DRIVEN ENGINEERING LANGUAGES AND SYSTEMS, PROCEEDINGS, 2008, 5301 : 174 - 188
  • [9] Example-based style synthesis
    Drori, I
    Cohen-Or, D
    Yeshurun, H
    2003 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL II, PROCEEDINGS, 2003, : 143 - 150
  • [10] Example-based head tracking
    Niyogi, S
    Freeman, WT
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, 1996, : 374 - 378