Exploration-Exploitation Tradeoff in the Adaptive Information Sampling of Unknown Spatial Fields with Mobile Robots

被引：2

作者：

Munir, Aiman ^{[1
]}

Parasuraman, Ramviyas ^{[1
]}

机构：

[1] Univ Georgia, Sch Comp, Athens, GA 30602 USA

来源：

SENSORS | 2023年 / 23卷 / 23期

关键词：

mobile robots; exploration; informative path planning; adaptive sampling; mapping; GAUSSIAN PROCESS REGRESSION; OPTIMIZATION;

D O I：

10.3390/s23239600

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Adaptive information-sampling approaches enable efficient selection of mobile robots' waypoints through which the accurate sensing and mapping of a physical process, such as the radiation or field intensity, can be obtained. A key parameter in the informative sampling objective function could be optimized balance the need to explore new information where the uncertainty is very high and to exploit the data sampled so far, with which a great deal of the underlying spatial fields can be obtained, such as the source locations or modalities of the physical process. However, works in the literature have either assumed the robot's energy is unconstrained or used a homogeneous availability of energy capacity among different robots. Therefore, this paper analyzes the impact of the adaptive information-sampling algorithm's information function used in exploration and exploitation to achieve a tradeoff between balancing the mapping, localization, and energy efficiency objectives. We use Gaussian process regression (GPR) to predict and estimate confidence bounds, thereby determining each point's informativeness. Through extensive experimental data, we provide a deeper and holistic perspective on the effect of information function parameters on the prediction map's accuracy (RMSE), confidence bound (variance), energy consumption (distance), and time spent (sample count) in both single- and multi-robot scenarios. The results provide meaningful insights into choosing the appropriate energy-aware information function parameters based on sensing objectives (e.g., source localization or mapping). Based on our analysis, we can conclude that it would be detrimental to give importance only to the uncertainty of the information function (which would explode the energy needs) or to the predictive mean of the information (which would jeopardize the mapping accuracy). By assigning more importance to the information uncertainly with some non-zero importance to the information value (e.g., 75:25 ratio), it is possible to achieve an optimal tradeoff between exploration and exploitation objectives while keeping the energy requirements manageable.

引用

页数：21

共 50 条

[31] Exploration-Exploitation Duality with Both Tradeoff and Synergy: The Curvilinear Interaction Effects of Learning Modes on Innovation Types
Li, Peter Ping
Liu, Heng
Li, Yuan
Wang, Haifeng
MANAGEMENT AND ORGANIZATION REVIEW, 2023, 19 (03) : 498 - 532
[32] Adaptive network approach to exploration-exploitation trade-off in reinforcement learning
Moradi, Mohammadamin
Zhai, Zheng-Meng
Panahi, Shirin
Lai, Ying-Cheng
CHAOS, 2024, 34 (12)
[33] A Thompson Sampling Approach to Channel Exploration-Exploitation Problem in Multihop Cognitive Radio Networks
Toldov, Viktor
Clavier, Laurent
Loscri, Valeria
Mitton, Nathalie
2016 IEEE 27TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2016, : 1355 - 1360
[34] An intelligent metaphor-free spatial information sampling algorithm for balancing exploitation and exploration
Yang, Haichuan
Yu, Yang
Cheng, Jiujun
Lei, Zhenyu
Cai, Zonghui
Zhang, Zihang
Gao, Shangce
KNOWLEDGE-BASED SYSTEMS, 2022, 250
[35] Collaborative exploration of unknown environments with teams of mobile robots
Burgard, W
Moors, M
Schneider, F
ADVANCES IN PLAN-BASED CONTROL OF ROBOTIC AGENTS, 2002, 2466 : 52 - 70
[36] Autonomous Exploration of Unknown Terrain for Groups of Mobile Robots
Lopes, Samuel
Frisch, Brian
Boeing, Adrian
Vinsen, Kevin
Braeunl, Thomas
2011 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), 2011, : 157 - 162
[37] Daisee: Adaptive importance sampling by balancing exploration and exploitation
Lu, Xiaoyu
Rainforth, Tom
Teh, Yee Whye
SCANDINAVIAN JOURNAL OF STATISTICS, 2023, 50 (03) : 1298 - 1324
[38] Learning the value of information and reward over time when solving exploration-exploitation problems
Irene Cogliati Dezza
Angela J. Yu
Axel Cleeremans
William Alexander
Scientific Reports, 7
[39] Optimal Stochastic Process Optimizer: A New Metaheuristic Algorithm With Adaptive Exploration-Exploitation Property
Xu, Jiahong
Xu, Lihong
IEEE ACCESS, 2021, 9 : 108640 - 108664
[40] Improved exploration-exploitation trade-off through adaptive prioritized experience replay
Hassani, Hossein
Nikan, Soodeh
Shami, Abdallah
NEUROCOMPUTING, 2025, 614

← 1 2 3 4 5 →