Task-Directed Exploration in Continuous POMDPs for Robotic Manipulation of Articulated Objects

被引:1
|
作者
Curtis, Aidan [1 ,2 ]
Kaelbling, Leslie [1 ]
Jain, Siddarth [3 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA USA
[2] MERL, Cambridge, MA USA
[3] Mitsubishi Elect Res Labs MERL, Cambridge, MA 02139 USA
关键词
D O I
10.1109/ICRA48891.2023.10160306
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Representing and reasoning about uncertainty is crucial for autonomous agents acting in partially observable environments with noisy sensors. Partially observable Markov decision processes (POMDPs) serve as a general framework for representing problems in which uncertainty is an important factor. Online sample-based POMDP methods have emerged as efficient approaches to solving large POMDPs and have been shown to extend to continuous domains. However, these solutions struggle to find long-horizon plans in problems with significant uncertainty. Exploration heuristics can help guide planning, but many real-world settings contain significant task-irrelevant uncertainty that might distract from the task objective. In this paper, we propose STRUG, an online POMDP solver capable of handling domains that require long-horizon planning with significant task-relevant and task-irrelevant uncertainty. We demonstrate our solution on several temporally extended versions of toy POMDP problems as well as robotic manipulation of articulated objects using a neural perception frontend to construct a distribution of possible models. Our results show that STRUG outperforms the current sample-based online POMDP solvers on several tasks.
引用
收藏
页码:3721 / 3728
页数:8
相关论文
共 6 条
  • [1] Task-directed configuration of networked robotic agents
    McKee, GT
    Baker, DI
    Schenker, PS
    2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS, 2002, : 2752 - 2757
  • [2] Interlinked Visual Tracking and Robotic Manipulation of Articulated Objects
    Paolillo, Antonio
    Chappellet, Kevin
    Bolotnikova, Anastasia
    Kheddar, Abderrahmane
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2018, 3 (04): : 2746 - 2753
  • [3] Collaborative Robotic Manipulation: A Use Case of Articulated Objects in Three-dimensions with Gravity
    Bertolucci, Riccardo
    Capitanelli, Alessio
    Maratea, Marco
    Mastrogiovanni, Fulvio
    Vallati, Mauro
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 1167 - 1174
  • [4] Learning Robotic Manipulation Tasks via Task Progress Based Gaussian Reward and Loss Adjusted Exploration
    Kumra, Sulabh
    Joshi, Shirin
    Sahin, Ferat
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (01) : 534 - 541
  • [5] Quantification of task-dependent cortical activation evoked by robotic continuous wrist joint manipulation in chronic hemiparetic stroke
    Vlaar, Martijn P.
    Solis-Escalante, Teodoro
    Dewald, Julius P. A.
    van Wegen, Erwin E. H.
    Schouten, Alfred C.
    Kwakkel, Gert
    van der Helm, Frans C. T.
    JOURNAL OF NEUROENGINEERING AND REHABILITATION, 2017, 14
  • [6] Quantification of task-dependent cortical activation evoked by robotic continuous wrist joint manipulation in chronic hemiparetic stroke
    Martijn P. Vlaar
    Teodoro Solis-Escalante
    Julius P. A. Dewald
    Erwin E. H. van Wegen
    Alfred C. Schouten
    Gert Kwakkel
    Frans C. T. van der Helm
    Journal of NeuroEngineering and Rehabilitation, 14