Actor-critic algorithm as multi-time-scale stochastic approximation

被引:0
|
作者
Indian Inst of Science, Bangalore, India [1 ]
机构
来源
Sadhana | / pt 4卷 / 525-543期
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [21] A modified actor-critic reinforcement learning algorithm
    Mustapha, SM
    Lachiver, G
    2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA, 2000, : 605 - 609
  • [22] SOFT ACTOR-CRITIC ALGORITHM WITH ADAPTIVE NORMALIZATION
    Gao, Xiaonan
    Wu, Ziyi
    Zhu, Xianchao
    Cai, Lei
    JOURNAL OF NONLINEAR FUNCTIONAL ANALYSIS, 2025, 2025
  • [23] Multi-actor mechanism for actor-critic reinforcement learning
    Li, Lin
    Li, Yuze
    Wei, Wei
    Zhang, Yujia
    Liang, Jiye
    INFORMATION SCIENCES, 2023, 647
  • [24] Actor-Critic With Synthesis Loss for Solving Approximation Biases
    Guo, Bo-Wen
    Chao, Fei
    Chang, Xiang
    Shang, Changjing
    Shen, Qiang
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (09) : 5323 - 5336
  • [25] Multi-agent actor-critic with time dynamical opponent model
    Tian, Yuan
    Kladny, Klaus -Rudolf
    Wang, Qin
    Huang, Zhiwu
    Fink, Olga
    NEUROCOMPUTING, 2023, 517 : 165 - 172
  • [27] Real-Time 'Actor-Critic' Tracking
    Chen, Boyu
    Wang, Dong
    Li, Peixia
    Wang, Shuang
    Lu, Huchuan
    COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 328 - 345
  • [28] Forward Actor-Critic for Nonlinear Function Approximation in Reinforcement Learning
    Veeriah, Vivek
    van Seijen, Harm
    Sutton, Richard S.
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 556 - 564
  • [29] An actor-critic algorithm for constrained Markov decision processes
    Borkar, VS
    SYSTEMS & CONTROL LETTERS, 2005, 54 (03) : 207 - 213
  • [30] Pseudorehearsal in actor-critic agents with neural network function approximation
    Marochko, Vladimir
    Johard, Leonard
    Mazzara, Manuel
    Longo, Luca
    PROCEEDINGS 2018 IEEE 32ND INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS (AINA), 2018, : 644 - 650