Deep reinforcement learning based online lifting path planning for tower cranes in unknown dynamic environments

Cited by: 0
Authors
Wang, Kai [1]
Li, Jing [2]
Yin, Zhiyuan [1]
Zhang, Jiankang [2]
Ma, Xin [1]
Affiliations
[1] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Shandong, Peoples R China
[2] Shandong Fenghui Equipment Technol Co Ltd, Zhangqiu, Shandong, Peoples R China
Source
Keywords
Lifting path planning; TD3; HER; tower cranes; hybrid action space
DOI
10.1177/17298806241283176
Chinese Library Classification (CLC) number
TP24 [Robotics]
Discipline classification code
080202; 1405
Abstract
Lifting path planning is critical for the safety and efficiency of tower cranes operating in dynamic construction environments. This paper proposes a deep reinforcement learning (DRL) lifting path planner that efficiently generates safe and smooth lifting paths for tower cranes in unknown construction environments. Built on the Twin Delayed Deep Deterministic Policy Gradient (TD3) framework, the planner plans a lifting path under collision-avoidance and operational constraints, using only local environmental information measured by lidar. A Long Short-Term Memory (LSTM) network is incorporated into the planner to capture the motion of obstacles on the construction site, so that the lifting path remains collision-free in the presence of dynamic obstacles. A discrete-continuous hybrid action space for tower cranes is proposed to make the planned lifting paths more suitable for practical engineering operations. Moreover, a novel reward function is introduced to improve the smoothness of the lifting path, which raises the success rate and reduces energy and time costs. A new Hindsight Experience Replay (HER) algorithm is proposed to address the reward sparsity problem in lifting path planning, which speeds up training. Simulation results on the Webots platform show that the presented method reduces planning time and outperforms existing DRL path planning methods in online path planning.
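To illustrate the reward sparsity problem mentioned in the abstract, the following is a minimal sketch of the standard Hindsight Experience Replay idea with the "final" relabeling strategy. It is not the new HER variant the paper proposes, whose details are not given here. The `Transition` type, the `sparse_reward` function, the tolerance `tol`, and the 3D load positions are all hypothetical and chosen only for illustration.

```python
import math
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float, float]


@dataclass(frozen=True)
class Transition:
    state: Point       # load position before the step (hypothetical state)
    next_state: Point  # load position after the step
    goal: Point        # target position the episode was aiming for
    reward: float


def sparse_reward(achieved: Point, goal: Point, tol: float = 0.5) -> float:
    """Assumed sparse reward: 0 within `tol` of the goal, otherwise -1."""
    return 0.0 if math.dist(achieved, goal) <= tol else -1.0


def her_relabel_final(episode: List[Transition], tol: float = 0.5) -> List[Transition]:
    """Copy an episode, replacing the goal with the position actually reached.

    A failed episode then contains at least one successful transition,
    which gives the learner a non-trivial reward signal.
    """
    if not episode:
        return []
    new_goal = episode[-1].next_state
    return [
        Transition(t.state, t.next_state, new_goal,
                   sparse_reward(t.next_state, new_goal, tol))
        for t in episode
    ]


if __name__ == "__main__":
    goal = (10.0, 10.0, 5.0)
    path = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (2.0, 1.0, 0.0), (3.0, 1.0, 1.0)]
    episode = [
        Transition(s, s2, goal, sparse_reward(s2, goal))
        for s, s2 in zip(path, path[1:])
    ]
    relabeled = her_relabel_final(episode)
    replay_buffer = episode + relabeled  # store both original and relabeled data
    print([t.reward for t in episode])    # every step fails against the real goal
    print([t.reward for t in relabeled])  # the last step succeeds against the new goal
```

In an off-policy method such as TD3, both the original and the relabeled transitions would typically be stored in the replay buffer, so the critic sees successful outcomes even when the real target is rarely reached early in training.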
Pages: 12
Related papers
50 in total
  • [41] Lifting path planning of mobile cranes based on an improved RRT algorithm
    Zhou, Ying
    Zhang, Endong
    Guo, Hongling
    Fang, Yihai
    Li, Heng
    ADVANCED ENGINEERING INFORMATICS, 2021, 50
  • [42] Bayesian reinforcement learning for navigation planning in unknown environments
    Alali, Mohammad
    Imani, Mahdi
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [43] OPTIMAL AND EFFICIENT PATH PLANNING FOR UNKNOWN AND DYNAMIC ENVIRONMENTS
    STENTZ, A
    INTERNATIONAL JOURNAL OF ROBOTICS & AUTOMATION, 1995, 10 (03): 89 - 100
  • [44] Robot path planning in dynamic environment based on reinforcement learning
    Zhuang, Xiao-Dong
    Meng, Qing-Chun
    Wei, Tian-Bin
    Wang, Xu-Zhu
    Tan, Rui
    Li, Xiao-Jing
    Journal of Harbin Institute of Technology (New Series), 2001, 8 (03) : 253 - 255
  • [45] Deep Reinforcement Learning-Based Path Planning with Dynamic Collision Probability for Mobile Robots
    Tariq, Muhammad Taha
    Wang, Congqing
    Hussain, Yasir
    2024 WRC SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION, WRC SARA, 2024: 9 - 14
  • [46] Research on Dynamic Path Planning of Wheeled Robot Based on Deep Reinforcement Learning on the Slope Ground
    Wang, Peng
    Li, Xiaoqiang
    Song, Chunxiao
    Zhai, Shipeng
    JOURNAL OF ROBOTICS, 2020, 2020
  • [47] A graph-based path planning algorithm for the control of tower cranes
    Burkhardt, Mark
    Sawodny, Oliver
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021: 1736 - 1741
  • [48] Path Planning of Mobile Robot in Dynamic Obstacle Avoidance Environment Based on Deep Reinforcement Learning
    Zhang, Qingfeng
    Ma, Wenpeng
    Zheng, Qingchun
    Zhai, Xiaofan
    Zhang, Wenqian
    Zhang, Tianchang
    Wang, Shuo
    IEEE ACCESS, 2024, 12 : 189136 - 189152
  • [49] Lift path planning for tower cranes based on environmental point clouds
    Lin, Xiao
    Han, Yu
    Guo, Hongling
    Luo, Zhubang
    Guo, Ziyang
    AUTOMATION IN CONSTRUCTION, 2023, 155
  • [50] Online Path Planning for Autonomous Underwater Vehicles in Unknown Environments
    Hernandez, Juan David
    Vidal, Eduard
    Vallicrosa, Guillem
    Galceran, Enric
    Carreras, Marc
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015: 1152 - 1157