Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning

被引:1
|
作者
Liu, Gaoyuan [1 ,2 ]
de Winter, Joris [1 ]
Durodie, Yuri [1 ,2 ]
Steckelmacher, Denis [3 ]
Nowe, Ann [3 ]
Vanderborght, Bram [1 ,2 ]
机构
[1] Vrije Univ Brussel, Brubot, B-1050 Brussels, Belgium
[2] IMEC, B-3001 Leuven, Belgium
[3] Vrije Univ Brussel, Artificial Intelligence AI Lab, B-1050 Brussels, Belgium
来源
关键词
Manipulation planning; reinforcement learning; task and motion planning; SAMPLING-BASED METHODS;
D O I
10.1109/LRA.2024.3398402
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Task and motion planning (TAMP) for robotics manipulation necessitates long-horizon reasoning involving versatile actions and skills. While deterministic actions can be crafted by sampling or optimizing with certain constraints, planning actions with uncertainty, i.e., probabilistic actions, remains a challenge for TAMP. On the contrary, Reinforcement Learning (RL) excels in acquiring versatile, yet short-horizon, manipulation skills that are robust with uncertainties. In this letter, we design a method that integrates RL skills into TAMP pipelines. Besides the policy, a RL skill is defined with data-driven logical components that enable the skill to be deployed by symbolic planning. A plan refinement sub-routine is designed to further tackle the inevitable effect uncertainties. In the experiments, we compare our method with baseline hierarchical planning from both TAMP and RL fields and illustrate the strength of the method. The results show that by embedding RL skills, we extend the capability of TAMP to domains with probabilistic skills, and improve the planning efficiency compared to the previous methods.
引用
收藏
页码:5974 / 5981
页数:8
相关论文
共 50 条
  • [1] Synergistic Task and Motion Planning With Reinforcement Learning-Based Non-Prehensile Actions
    Liu, Gaoyuan
    de Winter, Joris
    Steckelmacher, Denis
    Hota, Roshan Kumar
    Nowe, Ann
    Vanderborght, Bram
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (05) : 2764 - 2771
  • [2] Reinforcement Learning-Based Motion Planning for Automatic Parking System
    Zhang, Jiren
    Chen, Hui
    Song, Shaoyu
    Hu, Fengwei
    IEEE ACCESS, 2020, 8 : 154485 - 154501
  • [3] Reinforcement learning based motion planning of dynamic manipulation task for manipulator
    Eng. Training Center, Shanghai Jiaotong Univ., Shanghai 200240, China
    Xitong Fangzhen Xuebao, 2006, 9 (2537-2540):
  • [4] Hierarchical Task and Motion Planning through Deep Reinforcement Learning
    Newaz, Abdullah Al Redwan
    Alam, Tauhidul
    2021 FIFTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2021), 2021, : 100 - 105
  • [5] A review of mobile robot motion planning methods: from classical motion planning workflows to reinforcement learning-based architectures
    Dong, Lu
    He, Zichen
    Song, Chunwei
    Sun, Changyin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2023, 34 (02) : 439 - 459
  • [6] A review of mobile robot motion planning methods:from classical motion planning workflows to reinforcement learning-based architectures
    DONG Lu
    HE Zichen
    SONG Chunwei
    SUN Changyin
    JournalofSystemsEngineeringandElectronics, 2023, 34 (02) : 439 - 459
  • [7] Reinforcement Learning-based Motion Planning of a Triangular Floating Platform under Environmental Disturbances
    Tziortziotis, Konstantinos
    Vlachos, Kostas
    Blekas, Konstandnos
    2016 24TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2016, : 1014 - 1019
  • [8] A Reinforcement Learning-Based Framework for Robot Manipulation Skill Acquisition
    Liu, Dong
    Wang, Zitu
    Lu, Binpeng
    Cong, Ming
    Yu, Honghua
    Zou, Qiang
    IEEE ACCESS, 2020, 8 : 108429 - 108437
  • [9] A survey of learning-based robot motion planning
    Wang, Jiankun
    Zhang, Tianyi
    Ma, Nachuan
    Li, Zhaoting
    Ma, Han
    Meng, Fei
    Meng, Max Q. -H.
    IET CYBER-SYSTEMS AND ROBOTICS, 2021, 3 (04) : 302 - 314
  • [10] A Reinforcement Learning-based Path Planning for Collaborative UAVs
    Rahim, Shahnila
    Razaq, Mian Muaz
    Chang, Shih Yu
    Peng, Limei
    37TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2022, : 1938 - 1943