Efficient DDPG via the Self-Supervised Method

被引:0
|
作者
Zhang, Guanghao [1 ]
Chen, Hongliang [2 ]
Li, Jianxun [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] AVIC, Inst Electroopt Equipment, Luoyang 4710009, Peoples R China
关键词
Efficient DDPG; Self-Supervised Method; Inverse and Forward Model; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, embedded with self-supervised learning network, an efficient DDPG(Deep Deterministic Policy Gradient) RL algorithm is investigated. With more essential characteristics of observing data included, the inputs of actor network and critic network of DDPG are replaced by the high-dimensional outputs from feature extracting layers and forward network respectively. Additionally, the parameters of these auxiliary layers are optimized with a self-supervised method by minimizing predicting errors, and thus both optimizing progresses can run parallelly and simultaneously. Lastly, an antagonistic air-fight simulation with a novel customized training index is introduced to perform the effectiveness and rising efficiency of our self-supervised DDPG RL algorithm.
引用
收藏
页码:4636 / 4642
页数:7
相关论文
共 50 条
  • [1] Accelerating Self-Supervised Learning via Efficient Training Strategies
    Kocyigit, Mustafa Taha
    Hospedales, Timothy M.
    Bilen, Hakan
    2023 IEEE/CVF WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2023, : 5643 - 5653
  • [2] Efficient Medical Image Assessment via Self-supervised Learning
    Huang, Chun-Yin
    Lei, Qi
    Li, Xiaoxiao
    DATA AUGMENTATION, LABELLING, AND IMPERFECTIONS (DALI 2022), 2022, 13567 : 102 - 111
  • [3] A New Self-supervised Method for Supervised Learning
    Yang, Yuhang
    Ding, Zilin
    Cheng, Xuan
    Wang, Xiaomin
    Liu, Ming
    INTERNATIONAL CONFERENCE ON COMPUTER VISION, APPLICATION, AND DESIGN (CVAD 2021), 2021, 12155
  • [4] Self-Supervised Exploration via Disagreement
    Pathak, Deepak
    Gandhi, Dhiraj
    Gupta, Abhinav
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [5] A SELF-SUPERVISED NETWORK EMBEDDING METHOD
    Wang, Dongqi
    Chen, Xin
    Chen, Dongming
    JOURNAL OF NONLINEAR AND CONVEX ANALYSIS, 2022, 23 (10) : 2319 - 2328
  • [6] Sample Efficient Detection and Classification of Adversarial Attacks via Self-Supervised Embeddings
    Moayeri, Mazda
    Feizi, Soheil
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 7657 - 7666
  • [7] Efficient Multi-agent Communication via Self-supervised Information Aggregation
    Guan, Cong
    Chen, Feng
    Yuan, Lei
    Wang, Chenghe
    Yin, Hao
    Zhang, Zongzhang
    Yu, Yang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [8] Self-Supervised Pretraining Improves Self-Supervised Pretraining
    Reed, Colorado J.
    Yue, Xiangyu
    Nrusimha, Ani
    Ebrahimi, Sayna
    Vijaykumar, Vivek
    Mao, Richard
    Li, Bo
    Zhang, Shanghang
    Guillory, Devin
    Metzger, Sean
    Keutzer, Kurt
    Darrell, Trevor
    2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 1050 - 1060
  • [9] A self-supervised deep learning method for data-efficient training in genomics
    Hüseyin Anil Gündüz
    Martin Binder
    Xiao-Yin To
    René Mreches
    Bernd Bischl
    Alice C. McHardy
    Philipp C. Münch
    Mina Rezaei
    Communications Biology, 6
  • [10] A self-supervised deep learning method for data-efficient training in genomics
    Guenduez, Hueseyin Anil
    Binder, Martin
    To, Xiao-Yin
    Mreches, Rene
    Bischl, Bernd
    McHardy, Alice C.
    Muench, Philipp C.
    Rezaei, Mina
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)