Data-Efficient Policy Selection for Navigation in Partial Maps via Subgoal-Based Abstraction

被引：0

作者：

Paudel, Abhishek ^{[1
]}

Stein, Gregory J. ^{[1
]}

机构：

[1] George Mason Univ, Dept Comp Sci, Fairfax, VA 22030 USA

来源：

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/IROS55552.2023.10342047

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present a novel approach for fast and reliable policy selection for navigation in partial maps. Leveraging the recent learning-augmented model-based Learning over Subgoals Planning (LSP) abstraction to plan, our robot reuses data collected during navigation to evaluate how well other alternative policies could have performed via a procedure we call offline alt-policy replay. Costs from offline alt-policy replay constrain policy selection among the LSP-based policies during deployment, allowing for improvements in convergence speed, cumulative regret and average navigation cost. With only limited prior knowledge about the nature of unseen environments, we achieve at least 67% and as much as 96% improvements on cumulative regret over the baseline bandit approach in our experiments in simulated maze and office-like environments.

引用

页码：11281 / 11288

页数：8

共 13 条

[1] Bayesian Optimization with Automatic Prior Selection for Data-Efficient Direct Policy Search
Pautrat, Remi
Chatzilygeroudis, Konstantinos
Mouret, Jean-Baptiste
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 7571 - 7578
[2] Fast Model Identification via Physics Engines for Data-Efficient Policy Search
Zhu, Shaojun
Kimmel, Andrew
Bekris, Kostas E.
Boularias, Abdeslam
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 3249 - 3256
[3] Model-based contextual policy search for data-efficient generalization of robot skills
Kupcsik, Andras
Deisenroth, Marc Peter
Peters, Jan
Poh, Loh Ai
Vadakkepat, Prahlad
Neumann, Gerhard
ARTIFICIAL INTELLIGENCE, 2017, 247 : 415 - 439
[4] Data-Efficient Task Generalization via Probabilistic Model-Based Meta Reinforcement Learning
Bhardwaj, Arjun
Rothfuss, Jonas
Sukhija, Bhavya
As, Yarden
Hutter, Marco
Coros, Stelian
Krause, Andreas
IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (04) : 3918 - 3925
[5] Sparse Gaussian Processes-based Black-Box Data-efficient Policy Search for Robotics
Rong, Chunyan
Huang, Jingyi
Rosendo, Andre
2021 20TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS (ICAR), 2021, : 468 - 473
[6] Optimizing Traffic Control with Model-Based Learning: A Pessimistic Approach to Data-Efficient Policy Inference
Kunjir, Mayuresh
Chawla, Sanjay
Chandrasekar, Siddarth
Jay, Devika
Ravindran, Balaraman
PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 1176 - 1187
[7] BSDR: A Data-Efficient Deep Learning-Based Hyperspectral Band Selection Algorithm Using Discrete Relaxation
Rahman, Mohammad
Teng, Shyh Wei
Murshed, Manzur
Paul, Manoranjan
Brennan, David
SENSORS, 2024, 24 (23)
[8] Towards Personalized Plasma Medicine via Data-Efficient Adaptation of Fast Deep Learning-based MPC Policies
Chan, Kimberly J.
Makrygiorgos, Georgios
Mesbah, Ali
2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 2769 - 2775
[9] Data-Efficient Static Cost Optimization via Extremum-Seeking Control with Kernel-Based Function Approximation
Weekers, Wouter
Saccon, Alessandro
van de Wouw, Nathan
2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 6761 - 6767
[10] Active learning for efficient data selection in radio-signal-based positioning via deep learning
Corlay, Vincent
Courcoux-Caro, Milan
ELECTRONICS LETTERS, 2024, 60 (20)

← 1 2 →