Optimistic Reinforcement Learning-Based Skill Insertions for Task and Motion Planning

被引:1
|
作者
Liu, Gaoyuan [1 ,2 ]
de Winter, Joris [1 ]
Durodie, Yuri [1 ,2 ]
Steckelmacher, Denis [3 ]
Nowe, Ann [3 ]
Vanderborght, Bram [1 ,2 ]
机构
[1] Vrije Univ Brussel, Brubot, B-1050 Brussels, Belgium
[2] IMEC, B-3001 Leuven, Belgium
[3] Vrije Univ Brussel, Artificial Intelligence AI Lab, B-1050 Brussels, Belgium
来源
关键词
Manipulation planning; reinforcement learning; task and motion planning; SAMPLING-BASED METHODS;
D O I
10.1109/LRA.2024.3398402
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Task and motion planning (TAMP) for robotics manipulation necessitates long-horizon reasoning involving versatile actions and skills. While deterministic actions can be crafted by sampling or optimizing with certain constraints, planning actions with uncertainty, i.e., probabilistic actions, remains a challenge for TAMP. On the contrary, Reinforcement Learning (RL) excels in acquiring versatile, yet short-horizon, manipulation skills that are robust with uncertainties. In this letter, we design a method that integrates RL skills into TAMP pipelines. Besides the policy, a RL skill is defined with data-driven logical components that enable the skill to be deployed by symbolic planning. A plan refinement sub-routine is designed to further tackle the inevitable effect uncertainties. In the experiments, we compare our method with baseline hierarchical planning from both TAMP and RL fields and illustrate the strength of the method. The results show that by embedding RL skills, we extend the capability of TAMP to domains with probabilistic skills, and improve the planning efficiency compared to the previous methods.
引用
收藏
页码:5974 / 5981
页数:8
相关论文
共 50 条
  • [31] Reinforcement learning-based dynamic obstacle avoidance and integration of path planning
    Choi, Jaewan
    Lee, Geonhee
    Lee, Chibum
    INTELLIGENT SERVICE ROBOTICS, 2021, 14 (05) : 663 - 677
  • [32] Deep reinforcement learning-based reactive trajectory planning method for UAVs
    Cao, Lijia
    Wang, Lin
    Liu, Yang
    Xu, Weihong
    Geng, Chuang
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART G-JOURNAL OF AEROSPACE ENGINEERING, 2024, 238 (10) : 1018 - 1037
  • [33] ReLeaPS : Reinforcement Learning-based Illumination Planning for Generalized Photometric Stereo
    Chan, Jun Hoong
    Yu, Bohan
    Guo, Heng
    Ren, Jieji
    Lu, Zongqing
    Shi, Boxin
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 9133 - 9141
  • [34] Reinforcement learning-based dynamic obstacle avoidance and integration of path planning
    Jaewan Choi
    Geonhee Lee
    Chibum Lee
    Intelligent Service Robotics, 2021, 14 : 663 - 677
  • [35] Self-learning UAV Motion Planning Based on Meta Reinforcement Learning
    Wang, Minchun
    Jiang, Bo
    Xie, Jinhui
    2024 9TH INTERNATIONAL CONFERENCE ON ELECTRONIC TECHNOLOGY AND INFORMATION SCIENCE, ICETIS 2024, 2024, : 225 - 231
  • [36] A reinforcement learning-based hyper-heuristic for AGV task assignment and route planning in parts-to-picker warehouses
    Li, Kunpeng
    Liu, Tengbo
    Kumar, P. N. Ram
    Han, Xuefang
    TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2024, 185
  • [37] Bayesian Optimistic Optimization: Optimistic Exploration for Model-based Reinforcement Learning
    Wu, Chenyang
    Li, Tianci
    Zhang, Zongzhang
    Yu, Yang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [38] Integration of Reinforcement Learning Based Behavior Planning With Sampling Based Motion Planning for Automated Driving
    Klimke, Marvin
    Voelz, Benjamin
    Buchholz, Michael
    2023 IEEE INTELLIGENT VEHICLES SYMPOSIUM, IV, 2023,
  • [39] Online Reinforcement Learning-Based Pedagogical Planning for Narrative-Centered Learning Environments
    Fahid, Fahmid Morshed
    Rowe, Jonathan
    Kim, Yeojin
    Srivastava, Shashank
    Lester, James
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23191 - 23199
  • [40] Reinforcement and deep reinforcement learning-based solutions for machine maintenance planning, scheduling policies, and optimization
    Ogunfowora, Oluwaseyi
    Najjaran, Homayoun
    JOURNAL OF MANUFACTURING SYSTEMS, 2023, 70 : 244 - 263