TRANSFER REINFORCEMENT LEARNING: FEATURE TRANSFERABILITY IN SHIP COLLISION AVOIDANCE

被引:0
|
作者
Wang, Xinrui [1 ]
Jin, Yan [1 ]
机构
[1] Univ Southern Calif, Dept Aerosp & Mech Engn, Los Angeles, CA 90007 USA
关键词
Artificial intelligence; deep learning; transfer learning; reinforcement learning; collision avoidance; RISK;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The integration of artificial intelligence into engineering work has become increasingly prevalent. Engineering work processes can be highly complex, and learning from scratch requires large computation resources. Transfer learning has emerged as a promising technique for improving learning efficiency by leveraging knowledge gained from related tasks to the target task. To achieve optimal performance, one of the key challenges is to figure out how transferrable the features are among different work processes and within training networks. Simulation-based ship collision avoidance is used for case studies due to its inherent complexity and diversity. Two transfer reinforcement learning methods, feature extraction, and finetuning, are implemented and evaluated against the baseline. Instead of introducing large-scaled pre-trained models as the backbone, a light CNN model pre-trained in a related base case has been proven to transfer essential features to target cases. Simplified ship dynamics is introduced into the training process to make it more realistic and applicable, and the delay caused by the large moment of inertia is addressed by modifying the model-environment interaction mechanism. Work process features for the ship collision avoidance process are concluded from crucial aspects. The effects on transferability are displayed by experimental results discussed from the feature category and similarity perspective.
引用
收藏
页数:14
相关论文
共 50 条
  • [41] Deep reinforcement learning based collision avoidance system for autonomous ships
    Wang, Yong
    Xu, Haixiang
    Feng, Hui
    He, Jianhua
    Yang, Haojie
    Li, Fen
    Yang, Zhen
    OCEAN ENGINEERING, 2024, 292
  • [42] Collision avoidance for an unmanned surface vehicle using deep reinforcement learning
    Woo, Joohyun
    Kim, Nakwan
    OCEAN ENGINEERING, 2020, 199
  • [43] Multigoal Visual Navigation With Collision Avoidance via Deep Reinforcement Learning
    Xiao, Wendong
    Yuan, Liang
    He, Li
    Ran, Teng
    Zhang, Jianbo
    Cui, Jianping
    IEEE Transactions on Instrumentation and Measurement, 2022, 71
  • [44] Learning-Based Navigation and Collision Avoidance Through Reinforcement for UAVs
    Azzam, Rana
    Chehadeh, Mohamad
    Hay, Oussama Abdul
    Humais, Muhammad Ahmed
    Boiko, Igor
    Zweiri, Yahya
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2024, 60 (03) : 2614 - 2628
  • [45] Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning
    Everett, Michael
    Chen, Yu Fan
    How, Jonathan P.
    IEEE ACCESS, 2021, 9 : 10357 - 10377
  • [46] Multigoal Visual Navigation With Collision Avoidance via Deep Reinforcement Learning
    Xiao, Wendong
    Yuan, Liang
    He, Li
    Ran, Teng
    Zhang, Jianbo
    Cui, Jianping
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 71
  • [47] Reinforcement Learning Training Environment for Fixed Wing UAV Collision Avoidance
    D'Apolito, Francesco
    IFAC PAPERSONLINE, 2022, 55 (39): : 281 - 285
  • [48] Safe Reinforcement Learning for Pedestrian Collision Avoidance in Connected and Autonomous Vehicles
    He, Ying
    Zou, Guangyuan
    Zhou, Guang
    Pan, Weike
    Ming, Zhong
    AD HOC & SENSOR WIRELESS NETWORKS, 2025, 60 (1-2) : 141 - 169
  • [49] Spacecraft Proximity Maneuvering and Rendezvous With Collision Avoidance Based on Reinforcement Learning
    Qu, Qingyu
    Liu, Kexin
    Wang, Wei
    Lu, Jinhu
    IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS, 2022, 58 (06) : 5823 - 5834
  • [50] SSRL: A Safe and Smooth Reinforcement Learning Approach for Collision Avoidance in Navigation
    Zhang, Ruixian
    Yang, Jianan
    Liang, Ye
    Lu, Shengao
    Zhang, Lixian
    2023 2ND CONFERENCE ON FULLY ACTUATED SYSTEM THEORY AND APPLICATIONS, CFASTA, 2023, : 681 - 686