A Bayesian exploration-exploitation approach for optimal online sensing and planning with a visually guided mobile robot

被引：142

作者：

Martinez-Cantin, Ruben ^{[1
]}

de Freitas, Nando ^{[2
]}

Brochu, Eric ^{[2
]}

Castellanos, Jose ^{[3
]}

Doucet, Arnaud ^{[2
]}

机构：

[1] Inst Super Tecn, Inst Syst & Robot, Lisbon, Portugal

[2] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1W5, Canada

[3] Univ Zaragoza, Dept Comp Sci & Syst Engn, Zaragoza, Spain

来源：

AUTONOMOUS ROBOTS | 2009年 / 27卷 / 02期

基金：

加拿大自然科学与工程研究理事会;

关键词：

Bayesian optimization; Online path planning; Sequential experimental design; Attention and gaze planning; Active vision; Dynamic sensor networks; Active learning; Policy search; Active SLAM; Model predictive control; Reinforcement learning; GLOBAL OPTIMIZATION; REINFORCEMENT; ALGORITHMS;

D O I：

10.1007/s10514-009-9130-2

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We address the problem of online path planning for optimal sensing with a mobile robot. The objective of the robot is to learn the most about its pose and the environment given time constraints. We use a POMDP with a utility function that depends on the belief state to model the finite horizon planning problem. We replan as the robot progresses throughout the environment. The POMDP is high-dimensional, continuous, non-differentiable, nonlinear, non-Gaussian and must be solved in real-time. Most existing techniques for stochastic planning and reinforcement learning are therefore inapplicable. To solve this extremely complex problem, we propose a Bayesian optimization method that dynamically trades off exploration (minimizing uncertainty in unknown parts of the policy space) and exploitation (capitalizing on the current best solution). We demonstrate our approach with a visually-guide mobile robot. The solution proposed here is also applicable to other closely-related domains, including active vision, sequential experimental design, dynamic sensing and calibration with mobile sensors.

引用

页码：93 / 103

页数：11

共 18 条

[1] A Bayesian exploration-exploitation approach for optimal online sensing and planning with a visually guided mobile robot
Ruben Martinez-Cantin
Nando de Freitas
Eric Brochu
José Castellanos
Arnaud Doucet
Autonomous Robots, 2009, 27 : 93 - 103
[2] Online learning robust MPC: an exploration-exploitation approach
Manzano, J. M.
Calliess, J.
de la Pena, D. Munoz
Limon, D.
IFAC PAPERSONLINE, 2020, 53 (02): : 5292 - 5297
[3] Finding the optimal exploration-exploitation trade-off online through Bayesian risk estimation and minimization
Jamieson, Stewart
How, Jonathan P.
Girdhar, Yogesh
ARTIFICIAL INTELLIGENCE, 2024, 330
[4] Selection Criterion Based on an Exploration-Exploitation Approach for Optimal Design of Experiments
Atamturktur, Sez
Hegenderfer, Joshua
Williams, Brian
Unal, Cetin
JOURNAL OF ENGINEERING MECHANICS, 2015, 141 (01)
[5] Anytime Planning of Optimal Schedules for a Mobile Sensing Robot
Yu, Jingjin
Aslam, Javed
Karaman, Sertac
Rus, Daniela
2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 5279 - 5286
[6] An online trajectory planning method for visually guided assisted reaching through a rehabilitation robot
Loconsole, C.
Bartalucci, R.
Frisoli, A.
Bergamasco, M.
2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011, : 1445 - 1450
[7] Near-Optimal Trajectory Planning of a Spherical Mobile Robot for Environment Exploration
Zhan, Qiang
Cai, Yao
Liu, Zengbo
2008 IEEE CONFERENCE ON ROBOTICS, AUTOMATION, AND MECHATRONICS, VOLS 1 AND 2, 2008, : 314 - 319
[8] An online path planning approach of mobile robot based on particle filter
Gao, Yang
Sun, Shu-dong
Hu, Da-wei
Wang, Lai-jun
INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2013, 40 (04): : 305 - 319
[9] Efficient Online Planning and Robust Optimal Control for Nonholonomic Mobile Robot in Unstructured Environments
Hu, Yingbai
Zhou, Wei
Liu, Yueyue
Zeng, Minghao
Ding, Weiping
Li, Shu
Li, Guoxin
Li, Zheng
Knoll, Alois
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 3559 - 3575
[10] OGPR: An Obstacle-Guided Path Refinement Approach for Mobile Robot Path Planning
Atia, Mohamed G. B.
Salah, Omar
El-Hussieny, Haitham
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 844 - 849

← 1 2 →