A stochastic gradient type algorithm for closed-loop problems

被引:0
|
作者
Kengy Barty
Jean-Sébastien Roy
Cyrille Strugarek
机构
[1] Recherche et Développement,Électricité de France
来源
Mathematical Programming | 2009年 / 119卷
关键词
Stochastic quasi-gradient; Perturbed gradient; Closed-loop problems; Primary: 62L20; Secondary: 93E20; 93E35;
D O I
暂无
中图分类号
学科分类号
摘要
We focus on the numerical solution of closed-loop stochastic problems, and propose a perturbed gradient algorithm to achieve this goal. The main hurdle in such problems is the fact that the control variables are infinite-dimensional, due to, e.g., the information constraints. Alternatively said, control variables are feedbacks, i.e., functions. Such controls have hence to be represented in a finite way in order to solve the problem numerically. In the same way, the gradient of the criterion is itself an infinite-dimensional object. Our algorithm replaces this exact (and unknown) gradient by a perturbed one, which consists of the product of the true gradient evaluated at a random point and a kernel function which extends this gradient to the neighbourhood of the random point. Proceeding this way, we explore the whole space iteration after iteration through random points. Since each kernel function is perfectly known by a small number of parameters, say N, the control at iteration k is perfectly known as an infinite-dimensional object by at most N × k parameters. The main strength of this method is that it avoids any discretization of the underlying space, provided that we can sample as many points as needed in this space. Moreover, our algorithm can take into account the possible measurability constraints of the problem in a new way. Finally, the randomized strategy implemented by the algorithm causes the most probable parts of the space to be the most explored ones, which is a priori an interesting feature. In this paper, we first prove two convergence results of this algorithm in the strongly convex and convex cases, and then give some numerical examples showing the interest of this method for practical stochastic optimization problems.
引用
收藏
页码:51 / 78
页数:27
相关论文
共 50 条
  • [31] A Carrier Synchronization Algorithm Combine Open-loop and Closed-loop
    Wang, Hong-zhuo
    Yan, Jun-ji
    Liu, Ce-lun
    INTERNATIONAL CONFERENCE ON ADVANCES IN MANAGEMENT SCIENCE AND ENGINEERING (AMSE 2015), 2015, : 164 - 168
  • [32] Development of closed-loop intraperitoneal insulin infusion algorithm
    Nishida, K
    Sakakida, M
    Shimoda, S
    Matuo, Y
    Araki, E
    DIABETOLOGIA, 2001, 44 : A45 - A45
  • [33] TEST OF CLOSED-LOOP DEGAUSSING ALGORITHM ON A MINESWEEPER ENGINE
    IZAT, PF
    WATTS, KT
    WINGO, RA
    HOLMES, JJ
    LACKEY, MH
    NAVAL ENGINEERS JOURNAL, 1992, 104 (04) : 116 - 117
  • [34] AN ALGORITHM FOR CONSTRAINT STABILIZATION OF PLANAR MULTIBODYS WITH CLOSED-LOOP
    Zhang, Lina
    Zhang, Jianshu
    Rui, Xiaoting
    Gu, Junjie
    Zheng, Huaqing
    Zhang, Xizhe
    IET Conference Proceedings, 2022, 2022 (13): : 507 - 513
  • [36] CLOSED-LOOP DETECTION ALGORITHM USING VISUAL WORDS
    Liang, Zhiwei
    Chen, Yanyan
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 2014, 29 (02): : 155 - 161
  • [37] NEW ALGORITHM FOR CLOSED-LOOP MODEL-MATCHING
    AGUIRRE, LA
    ELECTRONICS LETTERS, 1991, 27 (24) : 2260 - 2262
  • [38] SIMPLE ALGORITHM FOR CLOSED-LOOP CONTROL OF STEPPING MOTORS
    GRIMBLEBY, JB
    IEE PROCEEDINGS-ELECTRIC POWER APPLICATIONS, 1995, 142 (01): : 5 - 13
  • [39] PERSONALIZED RULE-BASED CLOSED-LOOP CONTROL ALGORITHM FOR TYPE 1 DIABETES
    Rodriguez-Herrero, A.
    Garcia-Saez, G.
    Garcia-Garcia, F.
    Perez-Gandia, C.
    Rigla, M.
    Hernando, M. E.
    DIABETES TECHNOLOGY & THERAPEUTICS, 2014, 16 : A104 - A105
  • [40] PERSONALIZED RULE-BASED CLOSED-LOOP CONTROL ALGORITHM FOR TYPE 1 DIABETES
    Rodriguez-Herrero, A. R. H. A.
    Garcia-Saez, G. G. S. G.
    Garcia-Garcia, F. G. G. F.
    Perez-Gandia, C. P. G. C.
    Rigla, M. R. M.
    Hernando, M. E. H. M. E.
    DIABETES TECHNOLOGY & THERAPEUTICS, 2014, 16 : A101 - A102