Adaptive disassembly sequence planning for VR maintenance training via deep reinforcement learning

Cited by: 21
Authors
Mao, Haoyang [1]
Liu, Zhenyu [1]
Qiu, Chan [1]
Affiliations
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Disassembly sequence planning; Deep reinforcement learning; Genetic algorithm; VR maintenance training; VIRTUAL-REALITY; ALGORITHM; INTELLIGENCE; SYSTEM;
DOI
10.1007/s00170-021-08290-x
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
VR training equipped with meta-heuristic disassembly planning algorithms has been widely applied to pre-employment training in recent years. However, these algorithms are usually authored for specific sequences of a single product, and generalizing them to maintenance training with unpredictable disassembly targets remains a challenge. As a promising method for dynamic and stochastic problems, deep reinforcement learning (DRL) offers a new way to generate optimal sequences dynamically. This study introduces the deep Q-network (DQN), a successful DRL method, to achieve adaptive disassembly sequence planning (DSP) for VR maintenance training. A disassembly Petri net is established to describe the disassembly process, and the DSP problem is then defined as a Markov decision process that can be solved by DQN. Two neural networks are designed and updated asynchronously, and the DQN is trained by backpropagation of errors. In particular, we replace the long-term return in DQN with the fitness function of the genetic algorithm to avoid dependence on the immediate reward. Several experiments have been carried out to demonstrate the potential of our method in on-site maintenance where the fault is uncertain.
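The idea summarized in the abstract can be illustrated with a small sketch. It is not the authors' implementation: it swaps the two neural networks for two lookup tables (an online table and a lagged target copy that is synchronized periodically), and it interprets "replacing the long-term return with a fitness function" as giving no per-step reward and scoring only the completed sequence. The product, its precedence constraints, the removal costs, the form of `fitness`, and all hyperparameters are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical product: PRECEDENCE[p] lists the parts that must be removed before p.
PRECEDENCE = {"cover": [], "fan": ["cover"], "board": ["cover"], "battery": ["board"]}
# Hypothetical removal cost per part.
COST = {"cover": 1.0, "fan": 2.0, "board": 3.0, "battery": 1.5}


def feasible_actions(removed):
    """Parts not yet removed whose predecessors have all been removed."""
    return [p for p in PRECEDENCE
            if p not in removed and all(q in removed for q in PRECEDENCE[p])]


def fitness(sequence):
    """Assumed GA-style fitness of a finished sequence: cheaper is better."""
    return 1.0 / (1.0 + sum(COST[p] for p in sequence))


def train(target, episodes=5000, alpha=0.2, gamma=1.0, eps=0.3,
          sync_every=20, seed=0):
    """Tabular stand-in for DQN: online table q, lagged copy q_target."""
    rng = random.Random(seed)
    q, q_target = {}, {}
    for episode in range(episodes):
        removed, seq = frozenset(), []
        while target not in removed:
            actions = feasible_actions(removed)
            if rng.random() < eps:  # explore
                a = rng.choice(actions)
            else:                   # exploit current estimates
                a = max(actions, key=lambda x: q.get((removed, x), 0.0))
            nxt = removed | {a}
            seq.append(a)
            if target in nxt:
                # No immediate reward: the finished sequence is scored once.
                td_target = fitness(seq)
            else:
                # Bootstrap from the lagged copy, as a target network would.
                td_target = gamma * max(q_target.get((nxt, b), 0.0)
                                        for b in feasible_actions(nxt))
            old = q.get((removed, a), 0.0)
            q[(removed, a)] = old + alpha * (td_target - old)
            removed = nxt
        if (episode + 1) % sync_every == 0:
            q_target = dict(q)  # periodic synchronization of the target copy
    return q


def plan(q, target):
    """Greedy rollout of the learned table until the target part is removed."""
    removed, seq = frozenset(), []
    while target not in removed:
        a = max(feasible_actions(removed), key=lambda x: q.get((removed, x), 0.0))
        seq.append(a)
        removed = removed | {a}
    return seq
```

Calling `plan(train("battery"), "battery")` returns a precedence-feasible removal order ending at the requested part; changing the `target` argument mimics a different fault location without redefining the product model.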
Pages: 3039-3048
Page count: 10