A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy

被引:6
|
作者
Guan, Xiaoshu [1 ,2 ,3 ]
Sun, Huabin [1 ,2 ,3 ]
Hou, Rongrong [1 ,2 ,3 ]
Xu, Yang [1 ,2 ,3 ]
Bao, Yuequan [1 ,2 ,3 ]
Li, Hui [1 ,2 ,3 ]
机构
[1] Harbin Inst Technol, Minist Educ, Key Lab Struct Dynam Behav & Control, Harbin 150090, Peoples R China
[2] Harbin Inst Technol, Minist Ind & Informat Technol, Key Lab Smart Prevent & Mitigat Civil Engn Disaste, Harbin 150090, Peoples R China
[3] Harbin Inst Technol, Sch Civil Engn, Harbin 150090, Peoples R China
基金
中国国家自然科学基金;
关键词
Structural reliability analysis; Dominant failure modes; Deep reinforcement learning; Self-play strategy; Monte Carlo tree search; RELIABILITY; GAME; GO;
D O I
10.1016/j.ress.2023.109093
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the research area of structural reliability analysis (SRA), the dominant failure modes (DFMs) of a structural system make significant contributions to life-span failure prediction and safety assessment. However, the high computational cost caused by the combinatorial explosion is the main problem in DFMs searching that hinders its application and further development. Recently, many successful applications have proved that the self-play deep reinforcement learning (DRL) has a strong ability to obtain action policy in the face of combinatorial explosion problems. Inspired by this, a self-play strategy is designed to optimize the DRL-based DFMs searching process and reduce the computational effort. A scoring function is designed and used as the refereeing standard of the self-play games and helps improve the efficiency of Monte Carlo tree search (MCTS) in an asynchronous training process. In comparison with the beta-unzipping method and exploration-based DFMs searching method, the pro-posed method significantly improved training efficiency with an accuracy of over 95% and a lower requirement of the number of finite element analysis (FEA), both of which contribute to the policy learning of failure component selection. In summary, the method shows potential applications for actual structures and makes valuable contributions to the problem with high computing costs.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Structural dominant failure modes searching method based on deep reinforcement learning
    Guan, Xiaoshu
    Xiang, Zhengliang
    Bao, Yuequan
    Li, Hui
    Reliability Engineering and System Safety, 2022, 219
  • [2] Structural dominant failure modes searching method based on deep reinforcement learning
    Guan, Xiaoshu
    Xiang, Zhengliang
    Bao, Yuequan
    Li, Hui
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2022, 219
  • [3] Mastering construction heuristics with self-play deep reinforcement learning
    Wang, Qi
    He, Yuqing
    Tang, Chunlei
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (06): : 4723 - 4738
  • [4] Mastering construction heuristics with self-play deep reinforcement learning
    Qi Wang
    Yuqing He
    Chunlei Tang
    Neural Computing and Applications, 2023, 35 : 4723 - 4738
  • [5] DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning
    Zha, Daochen
    Xie, Jingru
    Ma, Wenye
    Zhang, Sheng
    Lian, Xiangru
    Hu, Xia
    Liu, Ji
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [6] Air combat intelligent decision-making method based on self-play and deep reinforcement learning
    Shan, Shengzhe
    Zhang, Weiwei
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2024, 45 (04):
  • [7] Mastering Fighting Game Using Deep Reinforcement Learning With Self-play
    Kim, Dae-Wook
    Park, Sungyun
    Yang, Seong-il
    2020 IEEE CONFERENCE ON GAMES (IEEE COG 2020), 2020, : 576 - 583
  • [8] Self-play Reinforcement Learning for Video Transmission
    Huang, Tianchi
    Zhang, Rui-Xiao
    Sun, Lifeng
    NOSSDAV '20: PROCEEDINGS OF THE 2020 WORKSHOP ON NETWORK AND OPERATING SYSTEM SUPPORT FOR DIGITAL AUDIO AND VIDEO, 2020, : 7 - 13
  • [9] A Proposal of Score Distribution Predictive Model in Self-Play Deep Reinforcement Learning
    Kagoshima, Kazuya
    Sakaji, Hiroki
    Noda, Itsuki
    Transactions of the Japanese Society for Artificial Intelligence, 2024, 39 (05)
  • [10] A Sharp Analysis of Model-based Reinforcement Learning with Self-Play
    Liu, Qinghua
    Yu, Tiancheng
    Bai, Yu
    Jin, Chi
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139