Convergence of Momentum-based Distributed Stochastic Approximation with RL Applications

被引:0
|
作者
Naskar, Ankur [1 ]
Thoppe, Gugan [1 ]
机构
[1] Indian Inst Sci, Dept Comp Sci & Automat, Bengaluru 560012, India
关键词
D O I
10.1109/ICC61519.2023.10442992
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We develop a novel proof strategy for deriving almost sure convergence of momentum-based distributed stochastic approximation (DSA) schemes. Popular momentum-based schemes such as Polyak's heavy-ball and Nesterov's Accelerated SGD can be analyzed using our template. Our technique enables us to do away with three restrictive assumptions of existing approaches. One, we do not need the communication matrix to be doubly stochastic. Two, we do not need the noise to be uniformly bounded. Lastly, our approach can handle cases where there are multiple or non-point attractors. As an application, we use our technique to derive convergence for momentum-based extensions of the multi-agent TD(0) algorithm, where the above restrictive assumptions do not hold.
引用
收藏
页码:178 / 179
页数:2
相关论文
共 50 条
  • [21] Momentum-Based Policy Gradient Methods
    Huang, Feihu
    Gao, Shangqian
    Pei, Jian
    Huang, Heng
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [22] Adaptive Optics for Orbital Angular Momentum-Based Internet of Underwater Things Applications
    Zhu, Lei
    Yao, Haipeng
    Chang, Huan
    Tian, Qinghua
    Zhang, Qi
    Xin, Xiangjun
    Yu, F. Richard
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (23): : 24281 - 24299
  • [23] Momentum-based wavelet and double wavelet neural networks for power system applications
    Subramaniam Nachimuthu Deepa
    John Basha Rizwana
    Neural Computing and Applications, 2018, 29 : 495 - 511
  • [24] Momentum-based wavelet and double wavelet neural networks for power system applications
    Deepa, Subramaniam Nachimuthu
    Rizwana, John Basha
    NEURAL COMPUTING & APPLICATIONS, 2018, 29 (07): : 495 - 511
  • [25] Unbiased quasi-hyperbolic nesterov-gradient momentum-based optimizers for accelerating convergence
    Cheng, Weiwei
    Yang, Xiaochun
    Wang, Bin
    Wang, Wei
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2023, 26 (04): : 1323 - 1344
  • [26] Finite-time convergence rates of distributed local stochastic approximation
    Doan, Thinh T.
    AUTOMATICA, 2023, 158
  • [27] Convergence and applications of stochastic approximation with state-dependent noise
    Chen, HF
    PROCEEDINGS OF THE 2001 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2001, : 744 - 749
  • [28] Unbiased quasi-hyperbolic nesterov-gradient momentum-based optimizers for accelerating convergence
    Weiwei Cheng
    Xiaochun Yang
    Bin Wang
    Wei Wang
    World Wide Web, 2023, 26 : 1323 - 1344
  • [29] Noise amplifiation of momentum-based optimization algorithms
    Mohammadi, Hesameddin
    Razaviyayn, Meisam
    Jovanovic, Mihailo R.
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 849 - 854
  • [30] A momentum-based deformation system for granular material
    Zeng, Ya-Lun
    Tan, Charlie Irawan
    Tai, Wen-Kai
    Yang, Mau-Tsuen
    Chiang, Cheng-Chin
    Chang, Chin-Chen
    COMPUTER ANIMATION AND VIRTUAL WORLDS, 2007, 18 (4-5) : 289 - 300