Efficient DDPG via the Self-Supervised Method

Authors
Zhang, Guanghao [1]
Chen, Hongliang [2]
Li, Jianxun [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai, Peoples R China
[2] AVIC, Inst Electroopt Equipment, Luoyang 4710009, Peoples R China
Keywords
Efficient DDPG; Self-Supervised Method; Inverse and Forward Model; MODEL
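DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract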
This paper investigates an efficient DDPG (Deep Deterministic Policy Gradient) reinforcement learning (RL) algorithm with an embedded self-supervised learning network. To capture more of the essential characteristics of the observed data, the inputs of the DDPG actor network and critic network are replaced by the high-dimensional outputs of the feature-extracting layers and of the forward network, respectively. The parameters of these auxiliary layers are optimized with a self-supervised method that minimizes prediction errors, so the two optimization processes can run in parallel. Finally, an adversarial air-combat simulation with a novel customized training index is introduced to demonstrate the effectiveness and improved efficiency of the self-supervised DDPG RL algorithm.
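The data flow described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no layer sizes, activations, or loss weights, so the class name `SelfSupervisedHeads`, the purely linear maps, the dimensions, and the unweighted sum of forward and inverse prediction errors are all assumptions made for illustration. The inverse model is inferred from the keyword "Inverse and Forward Model". The sketch covers only the forward pass and the self-supervised loss, not the gradient updates.

```python
import random


def linear(weights, x):
    """Apply a weight matrix (list of rows) to a vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def init_matrix(rows, cols, rng):
    """Small random weights; the initialisation scheme is an assumption."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def mse(pred, target):
    """Mean squared prediction error between two equal-length vectors."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


class SelfSupervisedHeads:
    """Hypothetical layout: encoder, forward model, inverse model, actor, critic."""

    def __init__(self, obs_dim, act_dim, feat_dim, seed=0):
        rng = random.Random(seed)
        self.encoder = init_matrix(feat_dim, obs_dim, rng)
        self.forward_model = init_matrix(feat_dim, feat_dim + act_dim, rng)
        self.inverse_model = init_matrix(act_dim, 2 * feat_dim, rng)
        self.actor = init_matrix(act_dim, feat_dim, rng)
        self.critic = init_matrix(1, feat_dim, rng)

    def features(self, obs):
        # Feature-extracting layers: raw observation -> feature vector.
        return linear(self.encoder, obs)

    def predict_next_features(self, feat, action):
        # Forward network: (features, action) -> predicted next features.
        return linear(self.forward_model, feat + action)

    def predict_action(self, feat, next_feat):
        # Inverse network: (features, next features) -> predicted action.
        return linear(self.inverse_model, feat + next_feat)

    def act(self, obs):
        # The actor consumes extracted features instead of the raw observation.
        return linear(self.actor, self.features(obs))

    def q_value(self, obs, action):
        # The critic consumes the forward network's output.
        feat = self.features(obs)
        return linear(self.critic, self.predict_next_features(feat, action))[0]

    def self_supervised_loss(self, obs, action, next_obs):
        # Prediction errors that the auxiliary layers would minimise.
        feat, next_feat = self.features(obs), self.features(next_obs)
        forward_err = mse(self.predict_next_features(feat, action), next_feat)
        inverse_err = mse(self.predict_action(feat, next_feat), action)
        return forward_err + inverse_err


# Usage on one made-up transition (values are arbitrary).
model = SelfSupervisedHeads(obs_dim=4, act_dim=2, feat_dim=8)
obs = [0.1, -0.2, 0.3, 0.0]
next_obs = [0.2, -0.1, 0.3, 0.1]
action = model.act(obs)
loss = model.self_supervised_loss(obs, action, next_obs)
q = model.q_value(obs, action)
```

Because the self-supervised loss depends only on stored transitions, it could in principle be minimized alongside the usual DDPG actor and critic updates, which is the parallelism the abstract refers to.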
Pages: 4636-4642
Number of pages: 7