Prior preferences in active inference agents: soft, hard, and goal shaping

2512.03293v1 cs.AI, q-bio.NC 2025-12-05

Авторы:

Filippo Torresan, Ryota Kanai, Manuel Baltieri

Abstract

Active inference proposes expected free energy as an objective for planning and decision-making to adequately balance exploitative and explorative drives in learning agents. The exploitative drive, or what an agent wants to achieve, is formalised as the Kullback-Leibler divergence between a variational probability distribution, updated at each inference step, and a preference probability distribution that indicates what states or observations are more likely for the agent, hence determining the agent's goal in a certain environment. In the literature, the questions of how the preference distribution should be specified and of how a certain specification impacts inference and learning in an active inference agent have been given hardly any attention. In this work, we consider four possible ways of defining the preference distribution, either providing the agents with hard or soft goals and either involving or not goal shaping (i.e., intermediate goals). We compare the performances of four agents, each given one of the possible preference distributions, in a grid world navigation task. Our results show that goal shaping enables the best performance overall (i.e., it promotes exploitation) while sacrificing learning about the environment's transition dynamics (i.e., it hampers exploration).

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Prior preferences in active inference agents: soft, hard, and goal shaping

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Fast dynamical similarity analysis

From generative AI to the brain: five takeaways

Predictive Coding Enhances Meta-RL To Achieve Interpretable Bayes-Optimal Belief...

Meta-Learning Theory-Informed Inductive Biases using Deep Kernel Gaussian Proces...

The Principles of Human-like Conscious Machine

Навигация