Exploration-Exploitation Tradeoff in the Adaptive Information Sampling of Unknown Spatial Fields with Mobile Robots

Cited by: 2
Authors
Munir, Aiman [1]
Parasuraman, Ramviyas [1]
Affiliations
[1] Univ Georgia, Sch Comp, Athens, GA 30602 USA
Keywords
mobile robots; exploration; informative path planning; adaptive sampling; mapping; Gaussian process regression; optimization
DOI
10.3390/s23239600
Chinese Library Classification number
O65 [Analytical Chemistry]
Discipline classification code
070302; 081704
Abstract
Adaptive information-sampling approaches enable efficient selection of mobile robots' waypoints through which the accurate sensing and mapping of a physical process, such as the radiation or field intensity, can be obtained. A key parameter in the informative sampling objective function could be optimized to balance the need to explore new information where the uncertainty is very high and to exploit the data sampled so far, from which a great deal about the underlying spatial field can be obtained, such as the source locations or modalities of the physical process. However, works in the literature have either assumed the robot's energy is unconstrained or used a homogeneous availability of energy capacity among different robots. Therefore, this paper analyzes the impact of the adaptive information-sampling algorithm's information function used in exploration and exploitation to achieve a tradeoff among the mapping, localization, and energy efficiency objectives. We use Gaussian process regression (GPR) to predict and estimate confidence bounds, thereby determining each point's informativeness. Through extensive experimental data, we provide a deeper and holistic perspective on the effect of information function parameters on the prediction map's accuracy (RMSE), confidence bound (variance), energy consumption (distance), and time spent (sample count) in both single- and multi-robot scenarios. The results provide meaningful insights into choosing the appropriate energy-aware information function parameters based on sensing objectives (e.g., source localization or mapping). Based on our analysis, we can conclude that it would be detrimental to give importance only to the uncertainty of the information function (which would explode the energy needs) or to the predictive mean of the information (which would jeopardize the mapping accuracy). By assigning more importance to the information uncertainty with some non-zero importance to the information value (e.g., 75:25 ratio), it is possible to achieve an optimal tradeoff between exploration and exploitation objectives while keeping the energy requirements manageable.
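The weighting described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the RBF kernel, its hyperparameters, the unnormalized sum of standard deviation and mean, and the names gp_predict, information, next_waypoint, and w_uncertainty are all assumptions made here for illustration. The sketch fits a zero-mean Gaussian process to the samples collected so far, scores each candidate waypoint by a weighted sum of predictive uncertainty (exploration) and predictive mean (exploitation), and returns the best-scoring candidate.

```python
import numpy as np


def rbf_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel with unit signal variance (assumed here)."""
    sq_dist = np.sum((a[:, None, :] - b[None, :, :]) ** 2, axis=-1)
    return np.exp(-0.5 * sq_dist / length_scale ** 2)


def gp_predict(x_train, y_train, x_query, noise_var=1e-4, length_scale=1.0):
    """Zero-mean GP regression: predictive mean and standard deviation."""
    k_tt = rbf_kernel(x_train, x_train, length_scale)
    k_tt = k_tt + noise_var * np.eye(len(x_train))
    k_qt = rbf_kernel(x_query, x_train, length_scale)
    chol = np.linalg.cholesky(k_tt)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train))
    mean = k_qt @ alpha
    v = np.linalg.solve(chol, k_qt.T)
    var = 1.0 - np.sum(v ** 2, axis=0)  # prior variance is 1 for this kernel
    std = np.sqrt(np.clip(var, 0.0, None))
    return mean, std


def information(mean, std, w_uncertainty=0.75):
    """Hypothetical information function: weighted uncertainty plus mean.

    w_uncertainty=0.75 mirrors the 75:25 ratio mentioned in the abstract;
    the exact functional form used in the paper is not given here.
    """
    return w_uncertainty * std + (1.0 - w_uncertainty) * mean


def next_waypoint(x_train, y_train, candidates, w_uncertainty=0.75):
    """Return the candidate location with the highest information score."""
    mean, std = gp_predict(x_train, y_train, candidates)
    scores = information(mean, std, w_uncertainty)
    return candidates[int(np.argmax(scores))]


# Two samples taken so far and three candidate waypoints (made-up values).
x_train = np.array([[0.0, 0.0], [1.0, 0.0]])
y_train = np.array([1.0, 0.2])
candidates = np.array([[0.1, 0.0], [5.0, 5.0], [1.0, 0.1]])

explore_only = next_waypoint(x_train, y_train, candidates, w_uncertainty=1.0)
exploit_only = next_waypoint(x_train, y_train, candidates, w_uncertainty=0.0)
```

With the weight fully on uncertainty, the sketch selects the candidate farthest from the existing samples; with the weight fully on the mean, it selects the candidate next to the strongest reading. Intermediate weights trade off the two, and a travel-distance penalty could be added to the score to reflect the energy concern the abstract raises.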
Pages: 21
Related papers
50 in total
  • [31] Exploration-Exploitation Duality with Both Tradeoff and Synergy: The Curvilinear Interaction Effects of Learning Modes on Innovation Types
    Li, Peter Ping
    Liu, Heng
    Li, Yuan
    Wang, Haifeng
    MANAGEMENT AND ORGANIZATION REVIEW, 2023, 19 (03) : 498 - 532
  • [32] Adaptive network approach to exploration-exploitation trade-off in reinforcement learning
    Moradi, Mohammadamin
    Zhai, Zheng-Meng
    Panahi, Shirin
    Lai, Ying-Cheng
    CHAOS, 2024, 34 (12)
  • [33] A Thompson Sampling Approach to Channel Exploration-Exploitation Problem in Multihop Cognitive Radio Networks
    Toldov, Viktor
    Clavier, Laurent
    Loscri, Valeria
    Mitton, Nathalie
    2016 IEEE 27TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2016, : 1355 - 1360
  • [34] An intelligent metaphor-free spatial information sampling algorithm for balancing exploitation and exploration
    Yang, Haichuan
    Yu, Yang
    Cheng, Jiujun
    Lei, Zhenyu
    Cai, Zonghui
    Zhang, Zihang
    Gao, Shangce
    KNOWLEDGE-BASED SYSTEMS, 2022, 250
  • [35] Collaborative exploration of unknown environments with teams of mobile robots
    Burgard, W
    Moors, M
    Schneider, F
    ADVANCES IN PLAN-BASED CONTROL OF ROBOTIC AGENTS, 2002, 2466 : 52 - 70
  • [36] Autonomous Exploration of Unknown Terrain for Groups of Mobile Robots
    Lopes, Samuel
    Frisch, Brian
    Boeing, Adrian
    Vinsen, Kevin
    Braeunl, Thomas
    2011 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2011, : 157 - 162
  • [37] Daisee: Adaptive importance sampling by balancing exploration and exploitation
    Lu, Xiaoyu
    Rainforth, Tom
    Teh, Yee Whye
    SCANDINAVIAN JOURNAL OF STATISTICS, 2023, 50 (03) : 1298 - 1324
  • [38] Learning the value of information and reward over time when solving exploration-exploitation problems
    Irene Cogliati Dezza
    Angela J. Yu
    Axel Cleeremans
    William Alexander
    Scientific Reports, 7
  • [39] Optimal Stochastic Process Optimizer: A New Metaheuristic Algorithm With Adaptive Exploration-Exploitation Property
    Xu, Jiahong
    Xu, Lihong
    IEEE ACCESS, 2021, 9 : 108640 - 108664
  • [40] Improved exploration-exploitation trade-off through adaptive prioritized experience replay
    Hassani, Hossein
    Nikan, Soodeh
    Shami, Abdallah
    NEUROCOMPUTING, 2025, 614