Active inference tree search in large POMDPs

被引:0
|
作者
Maisto, Domenico [1 ]
Gregoretti, Francesco [2 ]
Friston, Karl J. [3 ,4 ]
Pezzulo, Giovanni [1 ]
机构
[1] CNR, Inst Cognit Sci & Technol, Via Gian Domen Romagnosi 18-A, I-00196 Rome, Italy
[2] CNR, Inst High Performance Comp & Networking, Via Pietro Castellino 111, I-80131 Naples, Italy
[3] UCL, Inst Neurol, Wellcome Ctr Human Neuroimaging, London WC1N 3AR, England
[4] VERSES Res Lab, Los Angeles, CA 90016 USA
基金
欧盟地平线“2020”; 欧洲研究理事会;
关键词
Active inference; Tree search; Model-based planning; POMDP; PLANNING-ALGORITHMS; PREFRONTAL CORTEX; DECISION-MAKING; SEQUENCES; MODELS; UNCERTAINTY; TIME;
D O I
10.1016/j.neucom.2024.129319
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ability to plan ahead efficiently is key for both living organisms and artificial systems. Model-based planning and prospection are widely studied in cognitive neuroscience and artificial intelligence (AI), but from different perspectives-and with different desiderata in mind (biological realism versus scalability) that are difficult to reconcile. Here, we introduce a novel method to plan in POMDPs-Active Inference Tree Search (AcT)-that combines the normative character and biological realism of a leading planning theory in neuroscience (Active Inference) and the scalability of tree search methods in AI. This unification enhances both approaches. On the one hand, tree searches enable the biologically grounded, first principle method of active inference to be applied to large-scale problems. On the other hand, active inference provides a principled solution to the exploration-exploitation dilemma, which is often addressed heuristically in tree search methods. Our simulations show that AcT successfully navigates binary trees that are challenging for sampling-based methods, problems that require adaptive exploration, and the large POMDP problem ' RockSample' - in which AcT reproduces state-ofthe-art POMDP solutions. Furthermore, we illustrate how AcT can simulate neurophysiological responses (e.g., in the hippocampus and prefrontal cortex) of humans and other animals that solve large planning problems. These numerical analyses show that Active Tree Search is a principled realisation of neuroscientific and AI planning theories, offering biological realism and scalability.
引用
收藏
页数:21
相关论文
共 50 条
  • [1] Simulated Annealing Monte Carlo Tree Search for large POMDPs
    Xiong, Kai
    Jiang, Hong
    2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 1, 2014, : 140 - 143
  • [2] Learning in POMDPs with Monte Carlo Tree Search
    Katt, Sammie
    Oliehoek, Frans A.
    Amato, Christopher
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [3] Information gathering in POMDPs using active inference
    Walraven, Erwin
    Sijs, Joris
    Burghouts, Gertjan J.
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2025, 39 (01)
  • [4] Preference learning for guiding the tree search in continuous POMDPs
    Ahn, Jiyong
    Son, Sanghyeon
    Lee, Dongryung
    Han, Jisu
    Son, Dongwon
    Kim, Beomjoon
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [5] Monte-Carlo Tree Search for Constrained POMDPs
    Lee, Jongmin
    Kim, Geon-Hyeong
    Poupart, Pascal
    Kim, Kee-Eung
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [6] Sparse Tree Search Optimality Guarantees in POMDPs with Continuous Observation Spaces
    Lim, Michael H.
    Tomlin, Claire J.
    Sunberg, Zachary N.
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4135 - 4142
  • [7] PLEASE: Palm Leaf Search for POMDPs with Large Observation Spaces
    Zhang, Zongzhang
    Hsu, David
    Lee, Wee Sun
    Lim, Zhan Wei
    Bai, Aijun
    PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2015, : 249 - 257
  • [8] Online Planning for Interactive-POMDPs using Nested Monte Carlo Tree Search
    Schwartz, Jonathon
    Zhou, Ruijia
    Kurniawati, Hanna
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8770 - 8777
  • [9] AEMS: An Anytime Online Search Algorithm for Approximate Policy Refinement in Large POMDPs
    Ross, Steephane
    Chaib-draa, Brahim
    20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2592 - 2598
  • [10] Emergent bluffing and inference with Monte Carlo Tree Search
    Cowling, Peter I.
    Whitehouse, Daniel
    Powley, Edward J.
    2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2015, : 114 - 121