A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy

被引:6
|
作者
Guan, Xiaoshu [1 ,2 ,3 ]
Sun, Huabin [1 ,2 ,3 ]
Hou, Rongrong [1 ,2 ,3 ]
Xu, Yang [1 ,2 ,3 ]
Bao, Yuequan [1 ,2 ,3 ]
Li, Hui [1 ,2 ,3 ]
机构
[1] Harbin Inst Technol, Minist Educ, Key Lab Struct Dynam Behav & Control, Harbin 150090, Peoples R China
[2] Harbin Inst Technol, Minist Ind & Informat Technol, Key Lab Smart Prevent & Mitigat Civil Engn Disaste, Harbin 150090, Peoples R China
[3] Harbin Inst Technol, Sch Civil Engn, Harbin 150090, Peoples R China
基金
中国国家自然科学基金;
关键词
Structural reliability analysis; Dominant failure modes; Deep reinforcement learning; Self-play strategy; Monte Carlo tree search; RELIABILITY; GAME; GO;
D O I
10.1016/j.ress.2023.109093
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the research area of structural reliability analysis (SRA), the dominant failure modes (DFMs) of a structural system make significant contributions to life-span failure prediction and safety assessment. However, the high computational cost caused by the combinatorial explosion is the main problem in DFMs searching that hinders its application and further development. Recently, many successful applications have proved that the self-play deep reinforcement learning (DRL) has a strong ability to obtain action policy in the face of combinatorial explosion problems. Inspired by this, a self-play strategy is designed to optimize the DRL-based DFMs searching process and reduce the computational effort. A scoring function is designed and used as the refereeing standard of the self-play games and helps improve the efficiency of Monte Carlo tree search (MCTS) in an asynchronous training process. In comparison with the beta-unzipping method and exploration-based DFMs searching method, the pro-posed method significantly improved training efficiency with an accuracy of over 95% and a lower requirement of the number of finite element analysis (FEA), both of which contribute to the policy learning of failure component selection. In summary, the method shows potential applications for actual structures and makes valuable contributions to the problem with high computing costs.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Do as you teach: a multi-teacher approach to self-play in deep reinforcement learning
    Chaitanya Kharyal
    Sai Krishna Gottipati
    Tanmay Kumar Sinha
    Fatemeh Abdollahi
    Srijita Das
    Matthew E. Taylor
    Neural Computing and Applications, 2025, 37 (8) : 5945 - 5956
  • [22] Maneuvering strategy generation algorithm for multi-UAV in close-range air combat based on deep reinforcement learning and self-play
    Kong W.-R.
    Zhou D.-Y.
    Zhao Y.-Y.
    Yang W.-S.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2022, 39 (02): : 352 - 362
  • [23] Finding Effective Security Strategies through Reinforcement Learning and Self-Play
    Hammar, Kim
    Stadler, Rolf
    2020 16TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM), 2020,
  • [24] TIYUNTSONG: A SELF-PLAY REINFORCEMENT LEARNING APPROACH FOR ABR VIDEO STREAMING
    Huang, Tianchi
    Yao, Xin
    Wu, Chenglei
    Zhang, Rui-Xiao
    Pang, Zhengyuan
    Sun, Lifeng
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1678 - 1683
  • [25] Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services
    Huang, Tianchi
    Zhang, Rui-Xiao
    Sun, Lifeng
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1350 - 1365
  • [26] Research on the Difficulty of Mobile Node Deployment's Self-Play in Wireless Ad Hoc Networks Based on Deep Reinforcement Learning
    Wang, Huitao
    Yang, Ruopeng
    Yin, Changsheng
    Zou, Xiaofei
    Wang, Xuefeng
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [27] Learning a game strategy using pattern-weights and self-play
    Shapiro, A
    Fuchs, G
    Levinson, R
    COMPUTERS AND GAMES, 2003, 2883 : 42 - 60
  • [28] Reinforcement Learning in the Game of Othello: Learning Against a Fixed Opponent and Learning from Self-Play
    van der Ree, Michiel
    Wiering, Marco
    PROCEEDINGS OF THE 2013 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2013, : 108 - 115
  • [29] A deep reinforcement learning based searching method for source localization
    College of Systems Engineering, National University of Defense Technology, 109 Deya Road, Kaifu District, Changsha City
    Hunan Province, China
    不详
    不详
    Inf Sci, 2022, (67-81): : 67 - 81
  • [30] A deep reinforcement learning based searching method for source localization
    Zhao, Yong
    Chen, Bin
    Wang, XiangHan
    Zhu, Zhengqiu
    Wang, Yiduo
    Cheng, Guangquan
    Wang, Rui
    Wang, Rongxiao
    He, Ming
    Liu, Yu
    INFORMATION SCIENCES, 2022, 588 : 67 - 81