Model-free Reinforcement Learning with a Non-linear Reconstructor for closed-loop Adaptive Optics control with a pyramid wavefront sensor

被引:7
|
作者
Pou, B. [1 ,2 ]
Smith, J. [3 ]
Quinones, E. [1 ]
Martin, M. [2 ]
Gratadour, D. [4 ]
机构
[1] Barcelona Supercomputing Ctr BSC, C Jordi Girona 29, Barcelona 08034, Spain
[2] Univ Politecn Catalunya UPC, Comp Sci Dept, C Jordi Girona 31, Barcelona 08034, Spain
[3] Australian Natl Univ, Sch Comp, Canberra, Australia
[4] Univ PSL, Sorbonne Univ, Univ Paris Diderot, CNRS,LESIA,Observ Paris, Sorbonne Paris Cite,5 Pl Jules Janssen, F-92195 Meudon, France
来源
ADAPTIVE OPTICS SYSTEMS VIII | 2022年 / 12185卷
关键词
Reinforcement Learning; AO Control; Machine Learning; Pyramid Wavefront Sensor; NEURAL-NETWORKS;
D O I
10.1117/12.2627849
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We present a model-free reinforcement learning (RL) predictive model with a supervised learning non-linear reconstructor for adaptive optics (AO) control with a pyramid wavefront sensor (P-WFS). First, we analyse the additional problems of training an RL control method with a P-WFS compared to the Shack-Hartmann WFS. From those observations, we propose our solution: a combination of model-free RL for prediction with a non-linear reconstructor based on neural networks with a U-net architecture. We test the proposed method in simulation of closed-loop AO for an 8m telescope equipped with a 32x32 P-WFS and observe that both the predictive and non-linear reconstruction add additional benefits over an optimised integrator.
引用
收藏
页数:14
相关论文
共 50 条
  • [31] Non-linear position control of a pneumatic actuator with closed-loop stiffness and damping tuning
    Abry, Frederic
    Brun, Xavier
    Sesmat, Sylvie
    Bideaux, Eric
    2013 EUROPEAN CONTROL CONFERENCE (ECC), 2013, : 1089 - 1094
  • [32] Event-based model-free adaptive control for discrete-time non-linear processes
    Liu, Dong
    Yang, Guang-Hong
    IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (15): : 2531 - 2538
  • [33] Revisiting Model Reference Adaptive Control: Linear-Like Closed-Loop Behavior
    Shahab, Mohamad T.
    Miller, Daniel E.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2025, 70 (03) : 1483 - 1498
  • [34] Effect of pinhole diameter on correction accuracy of closed-loop adaptive optics system using self-referencing interferometer wavefront sensor
    Bai Fu-Zhong
    Rao Chang-Hui
    ACTA PHYSICA SINICA, 2010, 59 (11) : 8280 - 8286
  • [35] A Comparison of Closed-Loop Performance of Multirotor Configurations Using Non-Linear Dynamic Inversion Control
    Ireland, Murray L.
    Vargas, Aldo
    Anderson, David
    AEROSPACE, 2015, 2 (02): : 325 - 352
  • [36] Iterative learning control of linear induction motor based on model-free adaptive control
    Lu, Zhiqiang
    Zhang, Qiang
    Cao, Gaofeng
    Liu, Zhaorui
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2578 - 2583
  • [37] Study on Model-free Learning Adaptive Control in Permanent Magnet Linear Motor
    Cao, Rongmin
    Bai, Lianping
    Hou, Zhongsheng
    2008 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-11, 2008, : 2946 - +
  • [38] An Adaptive Model-Free Control Method for Metro Train Based on Deep Reinforcement Learning
    Lai, Wenzhu
    Chen, Dewang
    Huang, Yunhu
    Huang, Benzun
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 263 - 273
  • [39] Model-Based Reinforcement Learning for Closed-Loop Dynamic Control of Soft Robotic Manipulators
    Thuruthel, Thomas George
    Falotico, Egidio
    Renda, Federico
    Laschi, Cecilia
    IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (01) : 124 - 134
  • [40] Linearized Bregman iteration based model-free adaptive sliding mode control for a class of non-linear systems
    Gao, Shouli
    Zhao, Dongya
    Yan, Xinggang
    Spurgeon, Sarah K.
    IET CONTROL THEORY AND APPLICATIONS, 2021, 15 (02): : 281 - 296