Mask-based Latent Reconstruction for Reinforcement Learning

被引:0
|
作者
Yu, Tao [1 ]
Zhang, Zhizheng [2 ]
Lan, Cuiling [2 ]
Lu, Yan [2 ]
Chen, Zhibo [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
[2] Microsoft Res Asia, Beijing, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For deep reinforcement learning (RL) from pixels, learning effective state representations is crucial for achieving high performance. However, in practice, limited experience and high-dimensional inputs prevent effective representation learning. To address this, motivated by the success of mask-based modeling in other research fields, we introduce mask-based reconstruction to promote state representation learning in RL. Specifically, we propose a simple yet effective self-supervised method, Mask-based Latent Reconstruction (MLR), to predict complete state representations in the latent space from the observations with spatially and temporally masked pixels. MLR enables better use of context information when learning state representations to make them more informative, which facilitates the training of RL agents. Extensive experiments show that our MLR significantly improves the sample efficiency in RL and outperforms the state-of-the-art sample-efficient RL methods on multiple continuous and discrete control benchmarks. Our code is available at https://github.com/microsoft/Mask-based-Latent-Reconstruction.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] A Mask-based Model for Mandarin Chinese Polyphone Disambiguation
    Zhang, Haiteng
    Pan, Huashan
    Li, Xiulin
    INTERSPEECH 2020, 2020, : 1728 - 1732
  • [22] MASK-BASED ENHANCEMENT FOR VERY LOW QUALITY SPEECH
    Gonzalez, Sira
    Brookes, Mike
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [23] Mask-based generative adversarial networking for crowd counting
    Duan, Guoxiu
    Zhu, Aichun
    Zhao, Lu
    Zhu, Xiaomei
    Hu, Fangqiang
    Guan, Xinjie
    JOURNAL OF ELECTRONIC IMAGING, 2021, 30 (04)
  • [24] Mask-Based Panoptic LiDAR Segmentation for Autonomous Driving
    Marcuzzi, Rodrigo
    Nunes, Lucas
    Wiesmann, Louis
    Behley, Jens
    Stachniss, Cyrill
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 1141 - 1148
  • [25] Learned reconstructions for practical mask-based lensless imaging
    Monakhova, Kristina
    Yurtsever, Joshua
    Kuo, Grace
    Antipa, Nick
    Yanny, Kyrollos
    Waller, Laura
    OPTICS EXPRESS, 2019, 27 (20): : 28075 - 28090
  • [26] Fabrication of micro/nanotubes by mask-based diffraction lithography
    Tan, Xianhua
    Shi, Tielin
    Gao, Yang
    Sheng, Wenjun
    Sun, Bo
    Liao, Guanglan
    JOURNAL OF MICROMECHANICS AND MICROENGINEERING, 2014, 24 (05)
  • [27] Achieving mask-based imaging with optical maskless lithography
    Stone, Elizabeth M.
    Hintersteiner, Jason D.
    Cebuhar, Wenceslao A.
    Albright, Ronald
    Eib, Nicholas K.
    Latypovi, Azat
    Baba-Ali, Nabila
    Poultney, Sherman K.
    Croffie, Ebo H.
    EMERGING LITHOGRAPHIC TECHNOLOGIES X, PTS 1 AND 2, 2006, 6151
  • [28] Modeling effects of oxygen inhibition in mask-based stereolithography
    Jariwala, Amit S.
    Ding, Fei
    Boddapati, Aparna
    Breedveld, Victor
    Grover, Martha A.
    Henderson, Clifford L.
    Rosen, David W.
    RAPID PROTOTYPING JOURNAL, 2011, 17 (03) : 168 - 175
  • [29] Mask-based fingerprinting scheme for digital video broadcasting
    Emmanuel, Sabu
    Kankanhalli, Mohan S.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2006, 31 (02) : 145 - 170
  • [30] Mask-based fingerprinting scheme for digital video broadcasting
    Sabu Emmanuel
    Mohan S. Kankanhalli
    Multimedia Tools and Applications, 2006, 31 : 145 - 170