Vibration Control with Reinforcement Learning Based on Multi-Reward Lightweight Networks

被引:0
|
作者
Shu, Yucheng [1 ]
He, Chaogang [1 ]
Qiao, Lihong [1 ]
Xiao, Bin [1 ]
Li, Weisheng [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Sch Comp Sci & Technol, Chongqing 400065, Peoples R China
来源
APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 09期
基金
中国国家自然科学基金;
关键词
active vibration control; reinforcement learning; lightweight; neural network; prioritized experience replaying; reward function; ALGORITHM;
D O I
10.3390/app14093853
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper proposes a reinforcement learning method using a deep residual shrinkage network based on multi-reward priority experience playback for high-frequency and high-dimensional continuous vibration control. Firstly, we keep the underlying equipment unchanged and construct a vibration system simulator using FIR filters to ensure the complete fidelity of the physical model. Then, by interacting with the simulator using our proposed algorithm, we identify the optimal control strategy, which is directly applied to real-world scenarios in the form of a neural network. A multi-reward mechanism is proposed to assist the lightweight network to find a near-optimal control strategy, and a priority experience playback mechanism is used to prioritize the data to accelerate the convergence speed of the neural network and improve the data utilization efficiency. At the same time, the deep residual shrinkage network is introduced to realize adaptive denoising and lightweightness of the neural network. The experimental results indicate that under narrowband white-noise excitation ranging from 0 to 100 Hz, the DDPG algorithm achieved a vibration reduction effect of 12.728 dB, while our algorithm achieved a vibration reduction effect of 20.240 dB. Meanwhile, the network parameters were reduced by more than 7.5 times.
引用
收藏
页数:28
相关论文
共 50 条
  • [31] Graph convolutional recurrent networks for reward shaping in reinforcement learning
    Sami, Hani
    Bentahar, Jamal
    Mourad, Azzam
    Otrok, Hadi
    Damiani, Ernesto
    INFORMATION SCIENCES, 2022, 608 : 63 - 80
  • [32] A Modified Average Reward Reinforcement Learning Based on Fuzzy Reward Function
    Zhai, Zhenkun
    Chen, Wei
    Li, Xiong
    Guo, Jing
    IMECS 2009: INTERNATIONAL MULTI-CONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2009, : 113 - 117
  • [33] Intelligent vibration control of tensile cable based on deep reinforcement learning
    Zhang, Enqi
    Xiang, Sheng
    Cheng, Bin
    Li, Derui
    ADVANCES IN STRUCTURAL ENGINEERING, 2025, 28 (04) : 736 - 752
  • [34] Reward shaping for knowledge-based multi-objective multi-agent reinforcement learning
    Mannion, Patrick
    Devlin, Sam
    Duggan, Jim
    Howley, Enda
    KNOWLEDGE ENGINEERING REVIEW, 2018, 33
  • [35] Lightweight IDS For UAV Networks: A Periodic Deep Reinforcement Learning-based Approach
    Bouhamed, Omar
    Bouachir, Ouns
    Aloqaily, Moayad
    Al Ridhawi, Ismaeel
    2021 IFIP/IEEE INTERNATIONAL SYMPOSIUM ON INTEGRATED NETWORK MANAGEMENT (IM 2021), 2021, : 1032 - 1037
  • [36] Rationality of reward sharing in multi-agent reinforcement learning
    Kazuteru Miyazaki
    Shigenobu Kobayashi
    New Generation Computing, 2001, 19 : 157 - 172
  • [37] Multi-Objectivization of Reinforcement Learning Problems by Reward Shaping
    Brys, Tim
    Harutyunyan, Anna
    Vrancx, Peter
    Taylor, Matthew E.
    Kudenko, Daniel
    Nowe, Ann
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 2315 - 2322
  • [38] Rationality of reward sharing in multi-agent reinforcement learning
    Miyazaki, K
    Kobayashi, S
    NEW GENERATION COMPUTING, 2001, 19 (02) : 157 - 172
  • [39] Individual Reward Assisted Multi-Agent Reinforcement Learning
    Wang, Li
    Zhang, Yupeng
    Hu, Yujing
    Wang, Weixun
    Zhang, Chongjie
    Gao, Yang
    Hao, Jianye
    Lv, Tangjie
    Fan, Changjie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [40] Distributional Reinforcement Learning for Multi-Dimensional Reward Functions
    Zhang, Pushi
    Chen, Xiaoyu
    Zhao, Li
    Xiong, Wei
    Qin, Tao
    Liu, Tie-Yan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34