Data-Efficient Policy Selection for Navigation in Partial Maps via Subgoal-Based Abstraction

被引:0
|
作者
Paudel, Abhishek [1 ]
Stein, Gregory J. [1 ]
机构
[1] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA
基金
美国国家科学基金会;
关键词
D O I
10.1109/IROS55552.2023.10342047
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a novel approach for fast and reliable policy selection for navigation in partial maps. Leveraging the recent learning-augmented model-based Learning over Subgoals Planning (LSP) abstraction to plan, our robot reuses data collected during navigation to evaluate how well other alternative policies could have performed via a procedure we call offline alt-policy replay. Costs from offline alt-policy replay constrain policy selection among the LSP-based policies during deployment, allowing for improvements in convergence speed, cumulative regret and average navigation cost. With only limited prior knowledge about the nature of unseen environments, we achieve at least 67% and as much as 96% improvements on cumulative regret over the baseline bandit approach in our experiments in simulated maze and office-like environments.
引用
收藏
页码:11281 / 11288
页数:8
相关论文
共 13 条
  • [1] Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search
    Pautrat, Remi
    Chatzilygeroudis, Konstantinos
    Mouret, Jean-Baptiste
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 7571 - 7578
  • [2] Fast Model Identification via Physics Engines for Data-Efficient Policy Search
    Zhu, Shaojun
    Kimmel, Andrew
    Bekris, Kostas E.
    Boularias, Abdeslam
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3249 - 3256
  • [3] Model-based contextual policy search for data-efficient generalization of robot skills
    Kupcsik, Andras
    Deisenroth, Marc Peter
    Peters, Jan
    Poh, Loh Ai
    Vadakkepat, Prahlad
    Neumann, Gerhard
    ARTIFICIAL INTELLIGENCE, 2017, 247 : 415 - 439
  • [4] Data-Efficient Task Generalization via Probabilistic Model-Based Meta Reinforcement Learning
    Bhardwaj, Arjun
    Rothfuss, Jonas
    Sukhija, Bhavya
    As, Yarden
    Hutter, Marco
    Coros, Stelian
    Krause, Andreas
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3918 - 3925
  • [5] Sparse Gaussian Processes-based Black-Box Data-efficient Policy Search for Robotics
    Rong, Chunyan
    Huang, Jingyi
    Rosendo, Andre
    2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 468 - 473
  • [6] Optimizing Traffic Control with Model-Based Learning: A Pessimistic Approach to Data-Efficient Policy Inference
    Kunjir, Mayuresh
    Chawla, Sanjay
    Chandrasekar, Siddarth
    Jay, Devika
    Ravindran, Balaraman
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1176 - 1187
  • [7] BSDR: A Data-Efficient Deep Learning-Based Hyperspectral Band Selection Algorithm Using Discrete Relaxation
    Rahman, Mohammad
    Teng, Shyh Wei
    Murshed, Manzur
    Paul, Manoranjan
    Brennan, David
    SENSORS, 2024, 24 (23)
  • [8] Towards Personalized Plasma Medicine via Data-Efficient Adaptation of Fast Deep Learning-based MPC Policies
    Chan, Kimberly J.
    Makrygiorgos, Georgios
    Mesbah, Ali
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 2769 - 2775
  • [9] Data-Efficient Static Cost Optimization via Extremum-Seeking Control with Kernel-Based Function Approximation
    Weekers, Wouter
    Saccon, Alessandro
    van de Wouw, Nathan
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6761 - 6767
  • [10] Active learning for efficient data selection in radio-signal-based positioning via deep learning
    Corlay, Vincent
    Courcoux-Caro, Milan
    ELECTRONICS LETTERS, 2024, 60 (20)