Exploration-Exploitation Tradeoff in the Adaptive Information Sampling of Unknown Spatial Fields with Mobile Robots

被引:2
|
作者
Munir, Aiman [1 ]
Parasuraman, Ramviyas [1 ]
机构
[1] Univ Georgia, Sch Comp, Athens, GA 30602 USA
关键词
mobile robots; exploration; informative path planning; adaptive sampling; mapping; GAUSSIAN PROCESS REGRESSION; OPTIMIZATION;
D O I
10.3390/s23239600
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Adaptive information-sampling approaches enable efficient selection of mobile robots' waypoints through which the accurate sensing and mapping of a physical process, such as the radiation or field intensity, can be obtained. A key parameter in the informative sampling objective function could be optimized balance the need to explore new information where the uncertainty is very high and to exploit the data sampled so far, with which a great deal of the underlying spatial fields can be obtained, such as the source locations or modalities of the physical process. However, works in the literature have either assumed the robot's energy is unconstrained or used a homogeneous availability of energy capacity among different robots. Therefore, this paper analyzes the impact of the adaptive information-sampling algorithm's information function used in exploration and exploitation to achieve a tradeoff between balancing the mapping, localization, and energy efficiency objectives. We use Gaussian process regression (GPR) to predict and estimate confidence bounds, thereby determining each point's informativeness. Through extensive experimental data, we provide a deeper and holistic perspective on the effect of information function parameters on the prediction map's accuracy (RMSE), confidence bound (variance), energy consumption (distance), and time spent (sample count) in both single- and multi-robot scenarios. The results provide meaningful insights into choosing the appropriate energy-aware information function parameters based on sensing objectives (e.g., source localization or mapping). Based on our analysis, we can conclude that it would be detrimental to give importance only to the uncertainty of the information function (which would explode the energy needs) or to the predictive mean of the information (which would jeopardize the mapping accuracy). By assigning more importance to the information uncertainly with some non-zero importance to the information value (e.g., 75:25 ratio), it is possible to achieve an optimal tradeoff between exploration and exploitation objectives while keeping the energy requirements manageable.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] From Arms to Trees: Opportunity Costs and Path Dependence and the Exploration-Exploitation Tradeoff
    Levinthal, Daniel A.
    STRATEGY SCIENCE, 2021, 6 (04) : 331 - 337
  • [22] A novel sequential exploration-exploitation sampling strategy for global metamodeling
    Jiang, Ping
    Shu, Leshi
    Zhou, Qi
    Zhou, Hui
    Shao, Xinyu
    Xu, Junnan
    IFAC PAPERSONLINE, 2015, 48 (28): : 532 - 537
  • [23] Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
    Audibert, Jean-Yves
    Munos, Remi
    Szepesvari, Csaba
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (19) : 1876 - 1902
  • [24] An adaptive human learning optimization with enhanced exploration-exploitation balance
    Du, Jiaojie
    Wen, Yalan
    Wang, Ling
    Zhang, Pinggai
    Fei, Minrui
    Pardalos, Panos M.
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2023, 91 (2-3) : 177 - 216
  • [25] How Relative Performance Information Affects Exploration-Exploitation Decisions
    Newman, Andrew H.
    Stikeleather, Bryan R.
    Waddoups, Nathan J.
    JOURNAL OF MANAGEMENT ACCOUNTING RESEARCH, 2022, 34 (01) : 75 - 95
  • [26] Disentangling the roles of dopamine and noradrenaline in the exploration-exploitation tradeoff during human decision-making
    Cremer, Anna
    Kalbe, Felix
    Mueller, Jana Christina
    Wiedemann, Klaus
    Schwabe, Lars
    NEUROPSYCHOPHARMACOLOGY, 2023, 48 (07) : 1078 - 1086
  • [27] An adaptive approach for the exploration-exploitation dilemma and its application to economic systems
    Rejeb, Lilia
    Guessoum, Zahia
    M'Hallah, Rym
    LEARNING AND ADAPTION IN MULTI-AGENT SYSTEMS, 2006, 3898 : 165 - 176
  • [28] Disentangling the roles of dopamine and noradrenaline in the exploration-exploitation tradeoff during human decision-making
    Anna Cremer
    Felix Kalbe
    Jana Christina Müller
    Klaus Wiedemann
    Lars Schwabe
    Neuropsychopharmacology, 2023, 48 : 1078 - 1086
  • [29] Adaptive exploration policy for exploration–exploitation tradeoff in continuous action control optimization
    Min Li
    Tianyi Huang
    William Zhu
    International Journal of Machine Learning and Cybernetics, 2021, 12 : 3491 - 3501
  • [30] An Exploration-Exploitation Compromise-Based Adaptive Operator Selection for Local Search
    Veerapen, Nadarajen
    Maturana, Jorge
    Saubion, Frederic
    PROCEEDINGS OF THE FOURTEENTH INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2012, : 1277 - 1284