Tunnel ventilation control via an actor-critic algorithm employing nonparametric policy gradients

被引:0
|
作者
Division of Mechanical Engineering, Korea University, Seoul, 136-701, Korea, Republic of [1 ]
不详 [2 ]
机构
来源
J. Mech. Sci. Technol. | 2009年 / 2卷 / 311-323期
关键词
Ventilation - Automobile drivers - Pollution - Learning algorithms;
D O I
暂无
中图分类号
学科分类号
摘要
The appropriate operation of a tunnel ventilation system provides drivers passing through the tunnel with comfortable and safe driving conditions. Tunnel ventilation involves maintaining CO pollutant concentration and VI (visibility index) under an adequate level with operating highly energy-consuming facilities such as jet-fans. Therefore, it is significant to have an efficient operating algorithm in aspects of a safe driving environment as well as saving energy. In this research, a reinforcement learning (RL) method based on the actor-critic architecture and nonparametric policy gradients is applied as the control algorithm. The two objectives listed above, maintaining an adequate level of pollutants and minimizing power consumption, are included into a reward formulation that is a performance index to be maximized in the RL methodology. In this paper, a nonparametric approach is adopted as a promising route to perform a rigorous gradient search in a function space of policies to improve the efficacy of the actor module. Extensive simulation studies performed with real data collected from an existing tunnel system confirm that with the suggested algorithm, the control purposes were well accomplished and improved when compared to a previously developed RL-based control algorithm. © KSME & Springer 2009.
引用
收藏
相关论文
共 50 条
  • [1] Tunnel ventilation control via an actor-critic algorithm employing nonparametric policy gradients
    Chu, Baeksuk
    Hong, Daehie
    Park, Jooyoung
    JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2009, 23 (02) : 311 - 323
  • [2] Tunnel ventilation control via an actor-critic algorithm employing nonparametric policy gradients
    Baeksuk Chu
    Daehie Hong
    Jooyoung Park
    Journal of Mechanical Science and Technology, 2009, 23 : 311 - 323
  • [3] Tunnel ventilation controller design using an RLS-based natural actor-critic algorithm
    Chu, Baeksuk
    Park, Jooyoung
    Hong, Daehie
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2010, 11 (06) : 829 - 838
  • [4] Tunnel ventilation controller design using an RLS-based natural actor-critic algorithm
    Baeksuk Chu
    Jooyoung Park
    Daehie Hong
    International Journal of Precision Engineering and Manufacturing, 2010, 11 : 829 - 838
  • [5] A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients
    Grondman, Ivo
    Busoniu, Lucian
    Lopes, Gabriel A. D.
    Babuska, Robert
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 2012, 42 (06): : 1291 - 1307
  • [6] A Hessian Actor-Critic Algorithm
    Wang, Jing
    Paschalidis, Ioannis Ch
    2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1131 - 1136
  • [7] An Actor-Critic Algorithm With Second-Order Actor and Critic
    Wang, Jing
    Paschalidis, Ioannis Ch.
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (06) : 2689 - 2703
  • [8] Actor-critic algorithm with incremental dual natural policy gradient
    Zhang P.
    Liu Q.
    Zhong S.
    Zhai J.-W.
    Qian W.-S.
    2017, Editorial Board of Journal on Communications (38): : 166 - 177
  • [9] An Actor-Critic Algorithm for SVM Hyperparameters
    Kim, Chayoung
    Park, Jung-min
    Kim, Hye-young
    INFORMATION SCIENCE AND APPLICATIONS 2018, ICISA 2018, 2019, 514 : 653 - 661
  • [10] Noisy Importance Sampling Actor-Critic: An Off-Policy Actor-Critic With Experience Replay
    Tasfi, Norman
    Capretz, Miriam
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,