Active inference tree search in large POMDPs

被引：0

作者：

Maisto, Domenico ^{[1
]}

Gregoretti, Francesco ^{[2
]}

Friston, Karl J. ^{[3
,4
]}

Pezzulo, Giovanni ^{[1
]}

机构：

[1] CNR, Inst Cognit Sci & Technol, Via Gian Domen Romagnosi 18-A, I-00196 Rome, Italy

[2] CNR, Inst High Performance Comp & Networking, Via Pietro Castellino 111, I-80131 Naples, Italy

[3] UCL, Inst Neurol, Wellcome Ctr Human Neuroimaging, London WC1N 3AR, England

[4] VERSES Res Lab, Los Angeles, CA 90016 USA

来源：

NEUROCOMPUTING | 2025年 / 623卷

基金：

欧盟地平线“2020”; 欧洲研究理事会;

关键词：

Active inference; Tree search; Model-based planning; POMDP; PLANNING-ALGORITHMS; PREFRONTAL CORTEX; DECISION-MAKING; SEQUENCES; MODELS; UNCERTAINTY; TIME;

D O I：

10.1016/j.neucom.2024.129319

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The ability to plan ahead efficiently is key for both living organisms and artificial systems. Model-based planning and prospection are widely studied in cognitive neuroscience and artificial intelligence (AI), but from different perspectives-and with different desiderata in mind (biological realism versus scalability) that are difficult to reconcile. Here, we introduce a novel method to plan in POMDPs-Active Inference Tree Search (AcT)-that combines the normative character and biological realism of a leading planning theory in neuroscience (Active Inference) and the scalability of tree search methods in AI. This unification enhances both approaches. On the one hand, tree searches enable the biologically grounded, first principle method of active inference to be applied to large-scale problems. On the other hand, active inference provides a principled solution to the exploration-exploitation dilemma, which is often addressed heuristically in tree search methods. Our simulations show that AcT successfully navigates binary trees that are challenging for sampling-based methods, problems that require adaptive exploration, and the large POMDP problem ' RockSample' - in which AcT reproduces state-ofthe-art POMDP solutions. Furthermore, we illustrate how AcT can simulate neurophysiological responses (e.g., in the hippocampus and prefrontal cortex) of humans and other animals that solve large planning problems. These numerical analyses show that Active Tree Search is a principled realisation of neuroscientific and AI planning theories, offering biological realism and scalability.

引用

页数：21

共 50 条

[1] Simulated Annealing Monte Carlo Tree Search for large POMDPs
Xiong, Kai
Jiang, Hong
2014 SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN-MACHINE SYSTEMS AND CYBERNETICS (IHMSC), VOL 1, 2014, : 140 - 143
[2] Learning in POMDPs with Monte Carlo Tree Search
Katt, Sammie
Oliehoek, Frans A.
Amato, Christopher
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[3] Information gathering in POMDPs using active inference
Walraven, Erwin
Sijs, Joris
Burghouts, Gertjan J.
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2025, 39 (01)
[4] Preference learning for guiding the tree search in continuous POMDPs
Ahn, Jiyong
Son, Sanghyeon
Lee, Dongryung
Han, Jisu
Son, Dongwon
Kim, Beomjoon
CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
[5] Monte-Carlo Tree Search for Constrained POMDPs
Lee, Jongmin
Kim, Geon-Hyeong
Poupart, Pascal
Kim, Kee-Eung
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[6] Sparse Tree Search Optimality Guarantees in POMDPs with Continuous Observation Spaces
Lim, Michael H.
Tomlin, Claire J.
Sunberg, Zachary N.
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4135 - 4142
[7] PLEASE: Palm Leaf Search for POMDPs with Large Observation Spaces
Zhang, Zongzhang
Hsu, David
Lee, Wee Sun
Lim, Zhan Wei
Bai, Aijun
PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON AUTOMATED PLANNING AND SCHEDULING, 2015, : 249 - 257
[8] Online Planning for Interactive-POMDPs using Nested Monte Carlo Tree Search
Schwartz, Jonathon
Zhou, Ruijia
Kurniawati, Hanna
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 8770 - 8777
[9] AEMS: An Anytime Online Search Algorithm for Approximate Policy Refinement in Large POMDPs
Ross, Steephane
Chaib-draa, Brahim
20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2007, : 2592 - 2598
[10] Emergent bluffing and inference with Monte Carlo Tree Search
Cowling, Peter I.
Whitehouse, Daniel
Powley, Edward J.
2015 IEEE CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND GAMES (CIG), 2015, : 114 - 121

← 1 2 3 4 5 →