Adaptive disassembly sequence planning for VR maintenance training via deep reinforcement learning

被引:21
|
作者
Mao, Haoyang [1 ]
Liu, Zhenyu [1 ]
Qiu, Chan [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Disassembly sequence planning; Deep reinforcement learning; Genetic algorithm; VR maintenance training; VIRTUAL-REALITY; ALGORITHM; INTELLIGENCE; SYSTEM;
D O I
10.1007/s00170-021-08290-x
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
VR training equipped with meta-heuristic disassembly planning algorithms has been widely applied in pre-employment training in recent years. However, these algorithms are usually authored for specific sequences of a single product, and it remains a challenge to generalize them to maintenance training with unpredictable disassembly targets. As a promising method for settling dynamic and stochastic problems, deep reinforcement learning (DRL) provides a new insight to dynamically generate optimal sequences. This study introduces the deep Q-network (DQN), a successful DRL method, to fulfill adaptive disassembly sequence planning (DSP) for the VR maintenance training. Disassembly Petri net is established to describe the disassembly process, and then the DSP problem is defined as a Markov decision process that can be solved by DQN. Two neural networks are designed and updated asynchronously, and the training of DQN is further achieved by backpropagation of errors. Especially, we replace the long-term return in DQN with the fitness function of the genetic algorithm to avoid dependence on the immediate reward. Several experiments have been carried out to exhibit great potentials of our method in on-site maintenance where the fault is uncertain.
引用
收藏
页码:3039 / 3048
页数:10
相关论文
共 50 条
  • [41] Multi-UAV Adaptive Path Planning Using Deep Reinforcement Learning
    Westheider, Jonas
    Rueckin, Julius
    Popovic, Marija
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 649 - 656
  • [42] Network Planning with Deep Reinforcement Learning
    Zhu, Hang
    Gupta, Varun
    Ahuja, Satyajeet Singh
    Tian, Yuandong
    Zhang, Ying
    Jin, Xin
    SIGCOMM '21: PROCEEDINGS OF THE 2021 ACM SIGCOMM 2021 CONFERENCE, 2021, : 258 - 271
  • [43] Deep Reinforcement Learning for Adaptive Learning Systems
    Li, Xiao
    Xu, Hanchen
    Zhang, Jinming
    Chang, Hua-hua
    JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICS, 2023, 48 (02) : 220 - 243
  • [44] Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints
    Andriotis, C. P.
    Papakonstantinou, K. G.
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2021, 212
  • [45] Adaptive Cognitive Training with Reinforcement Learning
    Zini, Floriano
    Le Piane, Fabio
    Gaspari, Mauro
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2022, 12 (01)
  • [46] Reinforcement Learning-Based Selective Disassembly Sequence Planning for the End-of-Life Products With Structure Uncertainty
    Zhao, Xikun
    Li, Congbo
    Tang, Ying
    Cui, Jiabin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 7807 - 7814
  • [47] Guided Reinforcement Learning via Sequence Learning
    Ramamurthy, Rajkumar
    Sifa, Rafet
    Luebbering, Max
    Bauckhage, Christian
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 335 - 345
  • [48] Curiosity-driven recommendation strategy for adaptive learning via deep reinforcement learning
    Han, Ruijian
    Chen, Kani
    Tan, Chunxi
    BRITISH JOURNAL OF MATHEMATICAL & STATISTICAL PSYCHOLOGY, 2020, 73 (03): : 522 - 540
  • [49] Deep Reinforcement Learning for Sequence-to-Sequence Models
    Keneshloo, Yaser
    Shi, Tian
    Ramakrishnan, Naren
    Reddy, Chandan K.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (07) : 2469 - 2489
  • [50] Cutting path planning using reinforcement learning with adaptive sequence adjustment and attention mechanisms
    Wang, Kaiqi
    Zhang, Shijin
    Wu, Yuqiang
    Jiang, Fengyang
    INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2025, 136 (11-12): : 5599 - 5612