SOFT ACTOR-CRITIC ALGORITHM WITH ADAPTIVE NORMALIZATION

被引：0

作者：

Gao, Xiaonan ^{[1
]}

Wu, Ziyi ^{[1
]}

Zhu, Xianchao ^{[1
]}

Cai, Lei ^{[2
]}

机构：

[1] Henan Univ Technol, Sch Artificial Intelligence & Big Data, Zhengzhou 450001, Peoples R China

[2] Henan Inst Sci & Technol, Sch Artificial Intelligence, Xinxiang 453003, Peoples R China

来源：

JOURNAL OF NONLINEAR FUNCTIONAL ANALYSIS | 2025年 / 2025卷

基金：

中国国家自然科学基金;

关键词：

Adaptive normalization; Deep reinforcement learning; Reward mechanism; Soft actor-critic algorithm; GAME; GO;

D O I：

10.23952/jnfa.2025.6

中图分类号：

O29 [应用数学];

学科分类号：

070104 ;

摘要：

In recent years, breakthroughs were made in the field of deep reinforcement learning, but, their applications in the real world were seriously affected due to the instability of algorithms and the difficulty in ensuring convergence. As a typical algorithm in reinforcement learning, although the SAC algorithm enhances the robustness and agent's exploration ability by introducing the concept of maximum entropy, it still has the disadvantage of instability in the training process. In order to solve the problems, this paper proposes an Adaptive Normalization-based SAC (AN-SAC) algorithm. By introducing the adaptive normalized reward mechanism into the SAC algorithm, our method can dynamically adjust the normalized parameters of the reward during the training process so that the reward value has zero mean and unit variance. Thus it better adapts to the reward distribution and improves the performance and stability of the algorithm. Experimental results demonstrate that the performance and stability of the AN-SAC algorithm are significantly improved compared with the SAC algorithm.

引用

页数：10

共 50 条

[1] A Soft Actor-Critic Algorithm for Sequential Recommendation
Hong, Hyejin
Kimurn, Yusuke
Hatano, Kenji
DATABASE AND EXPERT SYSTEMS APPLICATIONS, PT I, DEXA 2024, 2024, 14910 : 258 - 266
[2] An Adaptive Threshold for the Canny Edge With Actor-Critic Algorithm
Choi, Keong-Hun
Ha, Jong-Eun
IEEE ACCESS, 2023, 11 : 67058 - 67069
[3] A Hessian Actor-Critic Algorithm
Wang, Jing
Paschalidis, Ioannis Ch
2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1131 - 1136
[4] An Actor-Critic Algorithm With Second-Order Actor and Critic
Wang, Jing
Paschalidis, Ioannis Ch.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (06) : 2689 - 2703
[5] Generative Adversarial Soft Actor-Critic
Hwang, Hyo-Seok
Kim, Yoojoong
Seok, Junhee
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024,
[6] Soft Actor-Critic With Integer Actions
Fan, Ting-Han
Wang, Yubo
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2611 - 2616
[7] ADAPTIVE ACTOR-CRITIC BILATERAL FILTER
Chen, Bo-Hao
Cheng, Hsiang-Yin
Yin, Jia-Li
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 1675 - 1679
[8] An Actor-Critic Algorithm for SVM Hyperparameters
Kim, Chayoung
Park, Jung-min
Kim, Hye-young
INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 653 - 661
[9] A soft actor-critic reinforcement learning algorithm for network intrusion detection
Li, Zhengfa
Huang, Chuanhe
Deng, Shuhua
Qiu, Wanyu
Gao, Xieping
COMPUTERS & SECURITY, 2023, 135
[10] Soft Actor-Critic for Navigation of Mobile Robots
de Jesus, Junior Costa
Kich, Victor Augusto
Kolling, Alisson Henrique
Grando, Ricardo Bedin
Cuadros, Marco Antonio de Souza Leite
Gamarra, Daniel Fernando Tello
Journal of Intelligent and Robotic Systems: Theory and Applications, 2021, 102 (02):

← 1 2 3 4 5 →