Branching Time Active Inference: The theory and its generality

Champion, Théophile, Da Costa, Lancelot, Bowman, Howard, Grześ, Marek (2022) Branching Time Active Inference: The theory and its generality. Neural Networks, 151 . pp. 295-316. ISSN 0893-6080. (doi:10.1016/j.neunet.2022.03.036) (KAR id:93990)

PDF Publisher pdf Language: English This work is licensed under a Creative Commons Attribution 4.0 International License.
Download this file (PDF/908kB)	Preview
Request a format suitable for use with assistive technology e.g. a screenreader
PDF Author's Accepted Manuscript Language: English Restricted to Repository staff only This work is licensed under a Creative Commons Attribution 4.0 International License.
Contact us about this Publication
Official URL: https://doi.org/10.1016/j.neunet.2022.03.036

Abstract

Over the last 10 to 15 years, active inference has helped to explain various brain mechanisms from habit formation to dopaminergic discharge and even modelling curiosity. However, the current implementations suffer from an exponential (space and time) complexity class when computing the prior over all the possible policies up to the time-horizon. Fountas et al. (2020) used Monte Carlo tree search to address this problem, leading to impressive results in two different tasks. In this paper, we present an alternative framework that aims to unify tree search and active inference by casting planning as a structure learning problem. Two tree search algorithms are then presented. The first propagates the expected free energy forward in time (i.e., towards the leaves), while the second propagates it backward (i.e., towards the root). Then, we demonstrate that forward and backward propagations are related to active inference and sophisticated inference, respectively, thereby clarifying the differences between those two planning strategies.

Item Type:	Article
DOI/Identification number:	10.1016/j.neunet.2022.03.036
Uncontrolled keywords:	POMDP; Markov process; active inference; variational inference
Subjects:	Q Science > QA Mathematics (inc Computing science) > QA276 Mathematical statistics Q Science > QA Mathematics (inc Computing science) > QA 76 Software, computer programming, > QA76.87 Neural computers, neural networks
Divisions:	Divisions > Division of Computing, Engineering and Mathematical Sciences > School of Computing
Depositing User:	Marek Grzes
Date Deposited:	11 Apr 2022 21:01 UTC
Last Modified:	05 Nov 2024 12:59 UTC
Resource URI:	https://kar.kent.ac.uk/id/eprint/93990 (The current URI for this page, for reference purposes)

University of Kent Author Information

Champion, Théophile.

Creator's ORCID:
CReDIT Contributor Roles:

Bowman, Howard.

Creator's ORCID:	https://orcid.org/0000-0003-4736-1869
CReDIT Contributor Roles:

Grześ, Marek.

Creator's ORCID:	https://orcid.org/0000-0003-4901-1539
CReDIT Contributor Roles:

Depositors only (login required):

Altmetric

Total Views

Total unique views of this page since July 2020. For more details click on the image.