TRANSFER REINFORCEMENT LEARNING: FEATURE TRANSFERABILITY IN SHIP COLLISION AVOIDANCE

被引：0

作者：

Wang, Xinrui ^{[1
]}

Jin, Yan ^{[1
]}

机构：

[1] Univ Southern Calif, Dept Aerosp & Mech Engn, Los Angeles, CA 90007 USA

来源：

PROCEEDINGS OF ASME 2023 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2023, VOL 3B | 2023年

关键词：

Artificial intelligence; deep learning; transfer learning; reinforcement learning; collision avoidance; RISK;

D O I：

暂无

中图分类号：

T [工业技术];

学科分类号：

08 ;

摘要：

The integration of artificial intelligence into engineering work has become increasingly prevalent. Engineering work processes can be highly complex, and learning from scratch requires large computation resources. Transfer learning has emerged as a promising technique for improving learning efficiency by leveraging knowledge gained from related tasks to the target task. To achieve optimal performance, one of the key challenges is to figure out how transferrable the features are among different work processes and within training networks. Simulation-based ship collision avoidance is used for case studies due to its inherent complexity and diversity. Two transfer reinforcement learning methods, feature extraction, and finetuning, are implemented and evaluated against the baseline. Instead of introducing large-scaled pre-trained models as the backbone, a light CNN model pre-trained in a related base case has been proven to transfer essential features to target cases. Simplified ship dynamics is introduced into the training process to make it more realistic and applicable, and the delay caused by the large moment of inertia is addressed by modifying the model-environment interaction mechanism. Work process features for the ship collision avoidance process are concluded from crucial aspects. The effects on transferability are displayed by experimental results discussed from the feature category and similarity perspective.

引用

页数：14

共 50 条

[41] Deep reinforcement learning based collision avoidance system for autonomous ships
Wang, Yong
Xu, Haixiang
Feng, Hui
He, Jianhua
Yang, Haojie
Li, Fen
Yang, Zhen
OCEAN ENGINEERING, 2024, 292
[42] Collision avoidance for an unmanned surface vehicle using deep reinforcement learning
Woo, Joohyun
Kim, Nakwan
OCEAN ENGINEERING, 2020, 199
[43] Multigoal Visual Navigation With Collision Avoidance via Deep Reinforcement Learning
Xiao, Wendong
Yuan, Liang
He, Li
Ran, Teng
Zhang, Jianbo
Cui, Jianping
IEEE Transactions on Instrumentation and Measurement, 2022, 71
[44] Learning-Based Navigation and Collision Avoidance Through Reinforcement for UAVs
Azzam, Rana
Chehadeh, Mohamad
Hay, Oussama Abdul
Humais, Muhammad Ahmed
Boiko, Igor
Zweiri, Yahya
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (03) : 2614 - 2628
[45] Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning
Everett, Michael
Chen, Yu Fan
How, Jonathan P.
IEEE ACCESS, 2021, 9 : 10357 - 10377
[46] Multigoal Visual Navigation With Collision Avoidance via Deep Reinforcement Learning
Xiao, Wendong
Yuan, Liang
He, Li
Ran, Teng
Zhang, Jianbo
Cui, Jianping
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
[47] Reinforcement Learning Training Environment for Fixed Wing UAV Collision Avoidance
D'Apolito, Francesco
IFAC PAPERSONLINE, 2022, 55 (39): : 281 - 285
[48] Safe Reinforcement Learning for Pedestrian Collision Avoidance in Connected and Autonomous Vehicles
He, Ying
Zou, Guangyuan
Zhou, Guang
Pan, Weike
Ming, Zhong
AD HOC & SENSOR WIRELESS NETWORKS, 2025, 60 (1-2) : 141 - 169
[49] Spacecraft Proximity Maneuvering and Rendezvous With Collision Avoidance Based on Reinforcement Learning
Qu, Qingyu
Liu, Kexin
Wang, Wei
Lu, Jinhu
IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (06) : 5823 - 5834
[50] SSRL: A Safe and Smooth Reinforcement Learning Approach for Collision Avoidance in Navigation
Zhang, Ruixian
Yang, Jianan
Liang, Ye
Lu, Shengao
Zhang, Lixian
2023 2ND CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, CFASTA, 2023, : 681 - 686

← 1 2 3 4 5 →