A Bayesian exploration-exploitation approach for optimal online sensing and planning with a visually guided mobile robot

被引:142
|
作者
Martinez-Cantin, Ruben [1 ]
de Freitas, Nando [2 ]
Brochu, Eric [2 ]
Castellanos, Jose [3 ]
Doucet, Arnaud [2 ]
机构
[1] Inst Super Tecn, Inst Syst & Robot, Lisbon, Portugal
[2] Univ British Columbia, Dept Comp Sci, Vancouver, BC V6T 1W5, Canada
[3] Univ Zaragoza, Dept Comp Sci & Syst Engn, Zaragoza, Spain
基金
加拿大自然科学与工程研究理事会;
关键词
Bayesian optimization; Online path planning; Sequential experimental design; Attention and gaze planning; Active vision; Dynamic sensor networks; Active learning; Policy search; Active SLAM; Model predictive control; Reinforcement learning; GLOBAL OPTIMIZATION; REINFORCEMENT; ALGORITHMS;
D O I
10.1007/s10514-009-9130-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of online path planning for optimal sensing with a mobile robot. The objective of the robot is to learn the most about its pose and the environment given time constraints. We use a POMDP with a utility function that depends on the belief state to model the finite horizon planning problem. We replan as the robot progresses throughout the environment. The POMDP is high-dimensional, continuous, non-differentiable, nonlinear, non-Gaussian and must be solved in real-time. Most existing techniques for stochastic planning and reinforcement learning are therefore inapplicable. To solve this extremely complex problem, we propose a Bayesian optimization method that dynamically trades off exploration (minimizing uncertainty in unknown parts of the policy space) and exploitation (capitalizing on the current best solution). We demonstrate our approach with a visually-guide mobile robot. The solution proposed here is also applicable to other closely-related domains, including active vision, sequential experimental design, dynamic sensing and calibration with mobile sensors.
引用
收藏
页码:93 / 103
页数:11
相关论文
共 18 条
  • [1] A Bayesian exploration-exploitation approach for optimal online sensing and planning with a visually guided mobile robot
    Ruben Martinez-Cantin
    Nando de Freitas
    Eric Brochu
    José Castellanos
    Arnaud Doucet
    Autonomous Robots, 2009, 27 : 93 - 103
  • [2] Online learning robust MPC: an exploration-exploitation approach
    Manzano, J. M.
    Calliess, J.
    de la Pena, D. Munoz
    Limon, D.
    IFAC PAPERSONLINE, 2020, 53 (02): : 5292 - 5297
  • [3] Finding the optimal exploration-exploitation trade-off online through Bayesian risk estimation and minimization
    Jamieson, Stewart
    How, Jonathan P.
    Girdhar, Yogesh
    ARTIFICIAL INTELLIGENCE, 2024, 330
  • [4] Selection Criterion Based on an Exploration-Exploitation Approach for Optimal Design of Experiments
    Atamturktur, Sez
    Hegenderfer, Joshua
    Williams, Brian
    Unal, Cetin
    JOURNAL OF ENGINEERING MECHANICS, 2015, 141 (01)
  • [5] Anytime Planning of Optimal Schedules for a Mobile Sensing Robot
    Yu, Jingjin
    Aslam, Javed
    Karaman, Sertac
    Rus, Daniela
    2015 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2015, : 5279 - 5286
  • [6] An online trajectory planning method for visually guided assisted reaching through a rehabilitation robot
    Loconsole, C.
    Bartalucci, R.
    Frisoli, A.
    Bergamasco, M.
    2011 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2011, : 1445 - 1450
  • [7] Near-Optimal Trajectory Planning of a Spherical Mobile Robot for Environment Exploration
    Zhan, Qiang
    Cai, Yao
    Liu, Zengbo
    2008 IEEE CONFERENCE ON ROBOTICS, AUTOMATION, AND MECHATRONICS, VOLS 1 AND 2, 2008, : 314 - 319
  • [8] An online path planning approach of mobile robot based on particle filter
    Gao, Yang
    Sun, Shu-dong
    Hu, Da-wei
    Wang, Lai-jun
    INDUSTRIAL ROBOT-THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH AND APPLICATION, 2013, 40 (04): : 305 - 319
  • [9] Efficient Online Planning and Robust Optimal Control for Nonholonomic Mobile Robot in Unstructured Environments
    Hu, Yingbai
    Zhou, Wei
    Liu, Yueyue
    Zeng, Minghao
    Ding, Weiping
    Li, Shu
    Li, Guoxin
    Li, Zheng
    Knoll, Alois
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2024, : 3559 - 3575
  • [10] OGPR: An Obstacle-Guided Path Refinement Approach for Mobile Robot Path Planning
    Atia, Mohamed G. B.
    Salah, Omar
    El-Hussieny, Haitham
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 844 - 849