Vibration Control with Reinforcement Learning Based on Multi-Reward Lightweight Networks

被引:0
|
作者
Shu, Yucheng [1 ]
He, Chaogang [1 ]
Qiao, Lihong [1 ]
Xiao, Bin [1 ]
Li, Weisheng [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Sch Comp Sci & Technol, Chongqing 400065, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 09期
基金
中国国家自然科学基金;
关键词
active vibration control; reinforcement learning; lightweight; neural network; prioritized experience replaying; reward function; ALGORITHM;
D O I
10.3390/app14093853
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper proposes a reinforcement learning method using a deep residual shrinkage network based on multi-reward priority experience playback for high-frequency and high-dimensional continuous vibration control. Firstly, we keep the underlying equipment unchanged and construct a vibration system simulator using FIR filters to ensure the complete fidelity of the physical model. Then, by interacting with the simulator using our proposed algorithm, we identify the optimal control strategy, which is directly applied to real-world scenarios in the form of a neural network. A multi-reward mechanism is proposed to assist the lightweight network to find a near-optimal control strategy, and a priority experience playback mechanism is used to prioritize the data to accelerate the convergence speed of the neural network and improve the data utilization efficiency. At the same time, the deep residual shrinkage network is introduced to realize adaptive denoising and lightweightness of the neural network. The experimental results indicate that under narrowband white-noise excitation ranging from 0 to 100 Hz, the DDPG algorithm achieved a vibration reduction effect of 12.728 dB, while our algorithm achieved a vibration reduction effect of 20.240 dB. Meanwhile, the network parameters were reduced by more than 7.5 times.
引用
收藏
页数:28
相关论文
共 50 条
  • [41] Reinforcement Learning Based Interference Control Scheme in Heterogeneous Networks
    Lee, Yunseong
    Park, Laihyuk
    Noh, Wonjong
    Cho, Sungrae
    2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), 2020, : 83 - 85
  • [42] Robustness Verification of Deep Reinforcement Learning Based Control Systems Using Reward Martingales
    Zhi, Dapeng
    Wang, Peixin
    Chen, Cheng
    Zhang, Min
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 18, 2024, : 19992 - 20000
  • [43] Distributed Control using Reinforcement Learning with Temporal-Logic-Based Reward Shaping
    Zhang, Ningyuan
    Liu, Wenliang
    Belta, Calin
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168, 2022, 168
  • [44] Adaptive reward shaping based reinforcement learning for docking control of autonomous underwater vehicles
    Chu, Shuguang
    Lin, Mingwei
    Li, Dejun
    Lin, Ri
    Xiao, Sa
    OCEAN ENGINEERING, 2025, 318
  • [45] LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning
    Du, Yali
    Han, Lei
    Fang, Meng
    Dai, Tianhong
    Liu, Ji
    Tao, Dacheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [46] A Multi-Agent Reinforcement Learning Based Approach to Quality of Experience Control in Future Internet Networks
    Battilotti, Stefano
    Delli Priscoli, Francesco
    Gori Giorgi, Claudio
    Monaco, Salvatore
    Panfili, Martina
    Pietrabissa, Antonio
    Ricciardi Celsi, Lorenzo
    Suraci, Vincenzo
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 6495 - 6500
  • [47] Congestion Control in SDN-Based Networks via Multi-Task Deep Reinforcement Learning
    Lei, Kai
    Liang, Yuzhi
    Li, Wei
    IEEE NETWORK, 2020, 34 (04): : 28 - 34
  • [48] Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks
    Chen, Binqiang
    Liu, Dong
    Hanzo, Lajos
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 3983 - 3988
  • [49] Multi-Agent Deep Reinforcement Learning based Power Control for Large Energy Harvesting Networks
    Sharma, Mohit K.
    Zappone, Alessio
    Debbah, Merouane
    Assaad, Mohamad
    17TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2019), 2019, : 163 - 169
  • [50] Generalizing multi-reward functions aimed at identifying the best locations to install flow control devices in sewer systems
    Munoz, David F.
    Simoes, Nuno E.
    de Sousa, Luis M.
    Maluf, Lucas
    Marques, Alfeu Sa
    Leitao, Joao P.
    URBAN WATER JOURNAL, 2019, 16 (08) : 564 - 574