A deep reinforcement learning method for structural dominant failure modes searching based on self-play strategy

被引:6
|
作者
Guan, Xiaoshu [1 ,2 ,3 ]
Sun, Huabin [1 ,2 ,3 ]
Hou, Rongrong [1 ,2 ,3 ]
Xu, Yang [1 ,2 ,3 ]
Bao, Yuequan [1 ,2 ,3 ]
Li, Hui [1 ,2 ,3 ]
机构
[1] Harbin Inst Technol, Minist Educ, Key Lab Struct Dynam Behav & Control, Harbin 150090, Peoples R China
[2] Harbin Inst Technol, Minist Ind & Informat Technol, Key Lab Smart Prevent & Mitigat Civil Engn Disaste, Harbin 150090, Peoples R China
[3] Harbin Inst Technol, Sch Civil Engn, Harbin 150090, Peoples R China
基金
中国国家自然科学基金;
关键词
Structural reliability analysis; Dominant failure modes; Deep reinforcement learning; Self-play strategy; Monte Carlo tree search; RELIABILITY; GAME; GO;
D O I
10.1016/j.ress.2023.109093
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In the research area of structural reliability analysis (SRA), the dominant failure modes (DFMs) of a structural system make significant contributions to life-span failure prediction and safety assessment. However, the high computational cost caused by the combinatorial explosion is the main problem in DFMs searching that hinders its application and further development. Recently, many successful applications have proved that the self-play deep reinforcement learning (DRL) has a strong ability to obtain action policy in the face of combinatorial explosion problems. Inspired by this, a self-play strategy is designed to optimize the DRL-based DFMs searching process and reduce the computational effort. A scoring function is designed and used as the refereeing standard of the self-play games and helps improve the efficiency of Monte Carlo tree search (MCTS) in an asynchronous training process. In comparison with the beta-unzipping method and exploration-based DFMs searching method, the pro-posed method significantly improved training efficiency with an accuracy of over 95% and a lower requirement of the number of finite element analysis (FEA), both of which contribute to the policy learning of failure component selection. In summary, the method shows potential applications for actual structures and makes valuable contributions to the problem with high computing costs.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Autonomous air combat decision-making of UAV based on parallel self-play reinforcement learning
    Li, Bo
    Huang, Jingyi
    Bai, Shuangxia
    Gan, Zhigang
    Liang, Shiyang
    Evgeny, Neretin
    Yao, Shouwen
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2023, 8 (01) : 64 - 81
  • [32] Advancing Air Combat Tactics with Improved Neural Fictitious Self-play Reinforcement Learning
    He, Shaoqin
    Gao, Yang
    Zhang, Baofeng
    Chang, Hui
    Zhang, Xinchen
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT V, 2023, 14090 : 653 - 666
  • [33] Mastering table tennis with hierarchy: a reinforcement learning approach with progressive self-play training
    Ma, Hongxu
    Fan, Jianyin
    Xu, Haoran
    Wang, Qiang
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [34] A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
    Silver, David
    Hubert, Thomas
    Schrittwieser, Julian
    Antonoglou, Ioannis
    Lai, Matthew
    Guez, Arthur
    Lanctot, Marc
    Sifre, Laurent
    Kumaran, Dharshan
    Graepel, Thore
    Lillicrap, Timothy
    Simonyan, Karen
    Hassabis, Demis
    SCIENCE, 2018, 362 (6419) : 1140 - +
  • [35] Cyber Attack-Defense Game Strategy Solving Based on Reinforcement Learning and Self-play Cyber Attack-Defense Game Solver
    Zhang, Jie
    Luo, Yunfeng
    PROCEEDINGS OF 2024 3RD INTERNATIONAL CONFERENCE ON CRYPTOGRAPHY, NETWORK SECURITY AND COMMUNICATION TECHNOLOGY, CNSCT 2024, 2024, : 135 - 141
  • [36] Learning Diverse Risk Preferences in Population-Based Self-Play
    Jiang, Yuhua
    Liu, Qihan
    Ma, Xiaoteng
    Li, Chenghao
    Yang, Yiqin
    Yang, Jun
    Liang, Bin
    Zhao, Qianchuan
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 11, 2024, : 12910 - 12918
  • [37] Multiagent Reinforcement Learning for Strategic Decision Making and Control in Robotic Soccer Through Self-Play
    Brandao, Bruno
    De Lima, Telma Woerle
    Soares, Anderson
    Melo, Luckeciano
    Maximo, Marcos R. O. A.
    IEEE ACCESS, 2022, 10 : 72628 - 72642
  • [38] Transforming Cybersecurity Dynamics: Enhanced Self-Play Reinforcement Learning in Intrusion Detection and Prevention System
    Jaber, Aws
    18TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE, SYSCON 2024, 2024,
  • [39] Dogfight Advantage Occupancy Method Based on Imperfect Information Self-play
    Wang, Dinghan
    Ji, Longmeng
    Wang, Jingbo
    Shi, Zhuoyong
    Zhang, Jiandong
    Yang, Qiming
    Shi, Guoqing
    Wu, Yong
    Zhu, Yan
    Hu, Jinwen
    2024 IEEE 18TH INTERNATIONAL CONFERENCE ON CONTROL & AUTOMATION, ICCA 2024, 2024, : 845 - 849
  • [40] Hierarchical reinforcement learning from competitive self-play for dual-aircraft formation air combat
    Kong, Wei-ren
    Zhou, De-yun
    Zhou, Ying
    Zhao, Yi-yang
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2023, 10 (02) : 830 - 859