Mask-based Latent Reconstruction for Reinforcement Learning

被引:0
|
作者
Yu, Tao [1 ]
Zhang, Zhizheng [2 ]
Lan, Cuiling [2 ]
Lu, Yan [2 ]
Chen, Zhibo [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For deep reinforcement learning (RL) from pixels, learning effective state representations is crucial for achieving high performance. However, in practice, limited experience and high-dimensional inputs prevent effective representation learning. To address this, motivated by the success of mask-based modeling in other research fields, we introduce mask-based reconstruction to promote state representation learning in RL. Specifically, we propose a simple yet effective self-supervised method, Mask-based Latent Reconstruction (MLR), to predict complete state representations in the latent space from the observations with spatially and temporally masked pixels. MLR enables better use of context information when learning state representations to make them more informative, which facilitates the training of RL agents. Extensive experiments show that our MLR significantly improves the sample efficiency in RL and outperforms the state-of-the-art sample-efficient RL methods on multiple continuous and discrete control benchmarks. Our code is available at https://github.com/microsoft/Mask-based-Latent-Reconstruction.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Image reconstruction with transformer for mask-based lensless imaging
    Pan, Xiuxi
    Chen, Xiao
    Takeyama, Saori
    Yamaguchi, Masahiro
    OPTICS LETTERS, 2022, 47 (07) : 1843 - 1846
  • [2] DeepLIR: Attention-based approach for Mask-Based Lensless Image Reconstruction
    Poudel, Arpan
    Nakarmi, Ukash
    2024 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WORKSHOPS, WACVW 2024, 2024, : 431 - 439
  • [3] MASK-BASED MICROSPHERE PHOTOLITHOGRAPHY
    Qu, Chuang
    Kinzel, Edward C.
    PROCEEDINGS OF THE ASME 13TH INTERNATIONAL MANUFACTURING SCIENCE AND ENGINEERING CONFERENCE, 2018, VOL 4, 2018,
  • [4] Disentangled Image Attribute Editing in Latent Space via Mask-based Retention Loss
    Ohaga, Shunya
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    PROCEEDINGS OF THE 4TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA IN ASIA, MMASIA 2022, 2022,
  • [5] Dreaming: Model-based Reinforcement Learning by Latent Imagination without Reconstruction
    Okada, Masashi
    Taniguchi, Tadahiro
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 4209 - 4215
  • [6] Student-Teacher Learning for BLSTM Mask-based Speech Enhancement
    Subramanian, Aswin Shanmugam
    Chen, Szu-Jui
    Watanabe, Shinji
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 3249 - 3253
  • [7] Gait Recognition with Mask-based Regularization
    Shen, Chuanfu
    Lin, Beibei
    Zhang, Shunli
    Yu, Xin
    Huang, George Q.
    Yu, Shiqi
    2023 IEEE INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS, IJCB, 2023,
  • [8] Analysis of mask-based nanowire decoders
    Rachlin, Eric
    Savage, John E.
    IEEE TRANSACTIONS ON COMPUTERS, 2008, 57 (02) : 175 - 187
  • [9] Analysis of a mask-based nanowire decoder
    Rachlin, E
    Savage, JE
    Gojman, B
    IEEE COMPUTER SOCIETY ANNUAL SYMPOSIUM ON VLSI, PROCEEDINGS: NEW FRONTIERS IN VLSI DESIGN, 2005, : 6 - 13
  • [10] A Mask-Based Adversarial Defense Scheme
    Xu, Weizhen
    Zhang, Chenyi
    Zhao, Fangzhen
    Fang, Liangda
    ALGORITHMS, 2022, 15 (12)