Exploration-Exploitation Tradeoff in the Adaptive Information Sampling of Unknown Spatial Fields with Mobile Robots

被引：2

作者：

Munir, Aiman ^{[1
]}

Parasuraman, Ramviyas ^{[1
]}

机构：

[1] Univ Georgia, Sch Comp, Athens, GA 30602 USA

来源：

SENSORS | 2023年 / 23卷 / 23期

关键词：

mobile robots; exploration; informative path planning; adaptive sampling; mapping; GAUSSIAN PROCESS REGRESSION; OPTIMIZATION;

D O I：

10.3390/s23239600

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Adaptive information-sampling approaches enable efficient selection of mobile robots' waypoints through which the accurate sensing and mapping of a physical process, such as the radiation or field intensity, can be obtained. A key parameter in the informative sampling objective function could be optimized balance the need to explore new information where the uncertainty is very high and to exploit the data sampled so far, with which a great deal of the underlying spatial fields can be obtained, such as the source locations or modalities of the physical process. However, works in the literature have either assumed the robot's energy is unconstrained or used a homogeneous availability of energy capacity among different robots. Therefore, this paper analyzes the impact of the adaptive information-sampling algorithm's information function used in exploration and exploitation to achieve a tradeoff between balancing the mapping, localization, and energy efficiency objectives. We use Gaussian process regression (GPR) to predict and estimate confidence bounds, thereby determining each point's informativeness. Through extensive experimental data, we provide a deeper and holistic perspective on the effect of information function parameters on the prediction map's accuracy (RMSE), confidence bound (variance), energy consumption (distance), and time spent (sample count) in both single- and multi-robot scenarios. The results provide meaningful insights into choosing the appropriate energy-aware information function parameters based on sensing objectives (e.g., source localization or mapping). Based on our analysis, we can conclude that it would be detrimental to give importance only to the uncertainty of the information function (which would explode the energy needs) or to the predictive mean of the information (which would jeopardize the mapping accuracy). By assigning more importance to the information uncertainly with some non-zero importance to the information value (e.g., 75:25 ratio), it is possible to achieve an optimal tradeoff between exploration and exploitation objectives while keeping the energy requirements manageable.

引用

页数：21

共 50 条

[21] From Arms to Trees: Opportunity Costs and Path Dependence and the Exploration-Exploitation Tradeoff
Levinthal, Daniel A.
STRATEGY SCIENCE, 2021, 6 (04) : 331 - 337
[22] A novel sequential exploration-exploitation sampling strategy for global metamodeling
Jiang, Ping
Shu, Leshi
Zhou, Qi
Zhou, Hui
Shao, Xinyu
Xu, Junnan
IFAC PAPERSONLINE, 2015, 48 (28): : 532 - 537
[23] Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
Audibert, Jean-Yves
Munos, Remi
Szepesvari, Csaba
THEORETICAL COMPUTER SCIENCE, 2009, 410 (19) : 1876 - 1902
[24] An adaptive human learning optimization with enhanced exploration-exploitation balance
Du, Jiaojie
Wen, Yalan
Wang, Ling
Zhang, Pinggai
Fei, Minrui
Pardalos, Panos M.
ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2023, 91 (2-3) : 177 - 216
[25] How Relative Performance Information Affects Exploration-Exploitation Decisions
Newman, Andrew H.
Stikeleather, Bryan R.
Waddoups, Nathan J.
JOURNAL OF MANAGEMENT ACCOUNTING RESEARCH, 2022, 34 (01) : 75 - 95
[26] Disentangling the roles of dopamine and noradrenaline in the exploration-exploitation tradeoff during human decision-making
Cremer, Anna
Kalbe, Felix
Mueller, Jana Christina
Wiedemann, Klaus
Schwabe, Lars
NEUROPSYCHOPHARMACOLOGY, 2023, 48 (07) : 1078 - 1086
[27] An adaptive approach for the exploration-exploitation dilemma and its application to economic systems
Rejeb, Lilia
Guessoum, Zahia
M'Hallah, Rym
LEARNING AND ADAPTION IN MULTI-AGENT SYSTEMS, 2006, 3898 : 165 - 176
[28] Disentangling the roles of dopamine and noradrenaline in the exploration-exploitation tradeoff during human decision-making
Anna Cremer
Felix Kalbe
Jana Christina Müller
Klaus Wiedemann
Lars Schwabe
Neuropsychopharmacology, 2023, 48 : 1078 - 1086
[29] Adaptive exploration policy for exploration–exploitation tradeoff in continuous action control optimization
Min Li
Tianyi Huang
William Zhu
International Journal of Machine Learning and Cybernetics, 2021, 12 : 3491 - 3501
[30] An Exploration-Exploitation Compromise-Based Adaptive Operator Selection for Local Search
Veerapen, Nadarajen
Maturana, Jorge
Saubion, Frederic
PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2012, : 1277 - 1284

← 1 2 3 4 5 →