TRANSFER REINFORCEMENT LEARNING: FEATURE TRANSFERABILITY IN SHIP COLLISION AVOIDANCE

Cited by: 0
Authors
Wang, Xinrui [1]
Jin, Yan [1]
Affiliations
[1] Univ Southern Calif, Dept Aerosp & Mech Engn, Los Angeles, CA 90007 USA
Keywords
Artificial intelligence; deep learning; transfer learning; reinforcement learning; collision avoidance; risk
DOI
Not available
Chinese Library Classification number
T [Industrial Technology]
Discipline classification code
08
Abstract
The integration of artificial intelligence into engineering work has become increasingly prevalent. Engineering work processes can be highly complex, and learning them from scratch requires substantial computational resources. Transfer learning has emerged as a promising technique for improving learning efficiency by applying knowledge gained from related tasks to the target task. A key challenge in achieving optimal performance is determining how transferable features are across different work processes and within training networks. Simulation-based ship collision avoidance is used for the case studies because of its inherent complexity and diversity. Two transfer reinforcement learning methods, feature extraction and fine-tuning, are implemented and evaluated against a baseline. Instead of using a large-scale pre-trained model as the backbone, a light CNN model pre-trained on a related base case is shown to transfer essential features to the target cases. Simplified ship dynamics are introduced into the training process to make it more realistic and applicable, and the delay caused by the ship's large moment of inertia is addressed by modifying the model-environment interaction mechanism. Work process features for ship collision avoidance are identified from its crucial aspects. Their effects on transferability are shown by experimental results, which are discussed from the perspectives of feature category and feature similarity.
Pages: 14