Adaptive disassembly sequence planning for VR maintenance training via deep reinforcement learning

被引：21

作者：

Mao, Haoyang ^{[1
]}

Liu, Zhenyu ^{[1
]}

Qiu, Chan ^{[1
]}

机构：

[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY | 2023年 / 124卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Disassembly sequence planning; Deep reinforcement learning; Genetic algorithm; VR maintenance training; VIRTUAL-REALITY; ALGORITHM; INTELLIGENCE; SYSTEM;

D O I：

10.1007/s00170-021-08290-x

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

VR training equipped with meta-heuristic disassembly planning algorithms has been widely applied in pre-employment training in recent years. However, these algorithms are usually authored for specific sequences of a single product, and it remains a challenge to generalize them to maintenance training with unpredictable disassembly targets. As a promising method for settling dynamic and stochastic problems, deep reinforcement learning (DRL) provides a new insight to dynamically generate optimal sequences. This study introduces the deep Q-network (DQN), a successful DRL method, to fulfill adaptive disassembly sequence planning (DSP) for the VR maintenance training. Disassembly Petri net is established to describe the disassembly process, and then the DSP problem is defined as a Markov decision process that can be solved by DQN. Two neural networks are designed and updated asynchronously, and the training of DQN is further achieved by backpropagation of errors. Especially, we replace the long-term return in DQN with the fitness function of the genetic algorithm to avoid dependence on the immediate reward. Several experiments have been carried out to exhibit great potentials of our method in on-site maintenance where the fault is uncertain.

引用

页码：3039 / 3048

页数：10

共 50 条

[31] Adaptive Selection of Informative Path Planning Strategies via Reinforcement Learning
Choi, Taeyeong
Cielniak, Grzegorz
10TH EUROPEAN CONFERENCE ON MOBILE ROBOTS (ECMR 2021), 2021,
[32] Deep reinforcement learning applied to an assembly sequence planning problem with user preferences
Neves, Miguel
Neto, Pedro
INTERNATIONAL JOURNAL OF ADVANCED MANUFACTURING TECHNOLOGY, 2022, 122 (11-12): : 4235 - 4245
[33] Project Disassembly Sequence Planning Based on Adaptive Genetic Algorithm
Xu, Da
Jiao, Qing Long
Li, Chuang
FRONTIERS OF MANUFACTURING SCIENCE AND MEASURING TECHNOLOGY V, 2015, : 372 - 375
[34] Deep reinforcement learning applied to an assembly sequence planning problem with user preferences
Miguel Neves
Pedro Neto
The International Journal of Advanced Manufacturing Technology, 2022, 122 : 4235 - 4245
[35] Reinforcement and deep reinforcement learning-based solutions for machine maintenance planning, scheduling policies, and optimization
Ogunfowora, Oluwaseyi
Najjaran, Homayoun
JOURNAL OF MANUFACTURING SYSTEMS, 2023, 70 : 244 - 263
[36] Disassembly and Reassembly Sequence Planning Tradeoffs Under Uncertainty for Product Maintenance
Behdad, Sara
Thurston, Deborah
JOURNAL OF MECHANICAL DESIGN, 2012, 134 (04)
[37] Author Correction: Stable training via elastic adaptive deep reinforcement learning for autonomous navigation of intelligent vehicles
Yujiao Zhao
Yong Ma
Guibing Zhu
Songlin Hu
Xinping Yan
Communications Engineering, 3 (1):
[38] Adaptive Optimization of Traffic Signal Timing via Deep Reinforcement Learning
Ma, Zibo
Cui, Tongchao
Deng, Wenxing
Jiang, Fengyao
Zhang, Liguo
JOURNAL OF ADVANCED TRANSPORTATION, 2021, 2021
[39] Adaptive Droplet Routing for MEDA Biochips via Deep Reinforcement Learning
Elfar, Mahmoud
Liang, Tung-Che
Chakrabarty, Krishnendu
Pajic, Miroslav
PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022), 2022, : 640 - 645
[40] Deep reinforcement learning for adaptive path planning and control of an autonomous underwater vehicle
Hadi, Behnaz
Khosravi, Alireza
Sarhadi, Pouria
APPLIED OCEAN RESEARCH, 2022, 129

← 1 2 3 4 5 →