Efficient DDPG via the Self-Supervised Method

Cited by: 0
Authors
Zhang, Guanghao [1]
Chen, Hongliang [2]
Li, Jianxun [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] AVIC, Inst Electroopt Equipment, Luoyang 4710009, Peoples R China
Keywords
Efficient DDPG; Self-Supervised Method; Inverse and Forward Model; MODEL;
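DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract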
This paper investigates an efficient DDPG (Deep Deterministic Policy Gradient) reinforcement learning algorithm with an embedded self-supervised learning network. To capture more of the essential characteristics of the observed data, the inputs of the DDPG actor network and critic network are replaced by the high-dimensional outputs of the feature-extraction layers and the forward network, respectively. The parameters of these auxiliary layers are optimized in a self-supervised manner by minimizing prediction errors, so the two optimization processes can run in parallel. Finally, an adversarial air-combat simulation with a novel customized training index is introduced to demonstrate the effectiveness and improved efficiency of the self-supervised DDPG algorithm.
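The data flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the class name `SelfSupervisedEncoder`, the single-layer linear/tanh networks, the dimensions, and the unweighted sum of forward and inverse prediction errors are all assumptions, since the record does not give the architecture or the loss weighting. The inverse model is included only because the keywords mention "Inverse and Forward Model". No gradient update is shown; the sketch only computes the quantities that would be fed to the actor and critic and the self-supervised loss that would be minimized.

```python
import math
import random


def linear(weights, x):
    """Apply a bias-free linear layer; `weights` is a list of rows."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def tanh_vec(x):
    return [math.tanh(v) for v in x]


def init_weights(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def mse(a, b):
    return sum((p - q) ** 2 for p, q in zip(a, b)) / len(a)


class SelfSupervisedEncoder:
    """Hypothetical auxiliary layers: feature extractor, forward model, inverse model."""

    def __init__(self, obs_dim, act_dim, feat_dim, seed=0):
        rng = random.Random(seed)
        self.w_feat = init_weights(feat_dim, obs_dim, rng)
        self.w_fwd = init_weights(feat_dim, feat_dim + act_dim, rng)
        self.w_inv = init_weights(act_dim, 2 * feat_dim, rng)

    def features(self, obs):
        # Feature-extraction layers: this output replaces the raw actor input.
        return tanh_vec(linear(self.w_feat, obs))

    def forward_prediction(self, obs, action):
        # Forward model: predicts next-step features from (features, action).
        # This output replaces the raw critic input.
        return linear(self.w_fwd, self.features(obs) + list(action))

    def inverse_prediction(self, obs, next_obs):
        # Inverse model: predicts the action taken between two observations.
        joint = self.features(obs) + self.features(next_obs)
        return tanh_vec(linear(self.w_inv, joint))

    def self_supervised_loss(self, obs, action, next_obs):
        # Prediction errors to be minimized (assumed unweighted sum).
        fwd_err = mse(self.forward_prediction(obs, action), self.features(next_obs))
        inv_err = mse(self.inverse_prediction(obs, next_obs), action)
        return fwd_err + inv_err


# Usage on one made-up transition (obs, action, next_obs).
enc = SelfSupervisedEncoder(obs_dim=4, act_dim=2, feat_dim=8)
obs, action, next_obs = [0.1, -0.2, 0.3, 0.0], [0.5, -0.5], [0.2, -0.1, 0.3, 0.1]
actor_input = enc.features(obs)                      # fed to the actor
critic_input = enc.forward_prediction(obs, action)   # fed to the critic
loss = enc.self_supervised_loss(obs, action, next_obs)
```

Because the auxiliary loss depends only on stored transitions and not on rewards, it could be minimized on the same replay batches as the DDPG actor and critic losses, which is one way to read the abstract's remark that the two optimization processes run in parallel.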
Pages: 4636-4642
Page count: 7