Prior preferences in active inference agents: soft, hard, and goal shaping

2512.03293v1 cs.AI, q-bio.NC 2025-12-05
Авторы:

Filippo Torresan, Ryota Kanai, Manuel Baltieri

Abstract

Active inference proposes expected free energy as an objective for planning and decision-making to adequately balance exploitative and explorative drives in learning agents. The exploitative drive, or what an agent wants to achieve, is formalised as the Kullback-Leibler divergence between a variational probability distribution, updated at each inference step, and a preference probability distribution that indicates what states or observations are more likely for the agent, hence determining the agent's goal in a certain environment. In the literature, the questions of how the preference distribution should be specified and of how a certain specification impacts inference and learning in an active inference agent have been given hardly any attention. In this work, we consider four possible ways of defining the preference distribution, either providing the agents with hard or soft goals and either involving or not goal shaping (i.e., intermediate goals). We compare the performances of four agents, each given one of the possible preference distributions, in a grid world navigation task. Our results show that goal shaping enables the best performance overall (i.e., it promotes exploitation) while sacrificing learning about the environment's transition dynamics (i.e., it hampers exploration).

Ссылки и действия

Связанные статьи

Meta-Learning Theory-Informed Inductive Biases using Deep Kernel Gaussian Proces...

#### Контекст Нейробиология становится все более нуждающейся в сформулированных теориях, которые могут объяснить сложны...

2025-10-01

The Principles of Human-like Conscious Machine

## Контекст Область исследования по концепции сознательных машин затрагивает ключевые вопросы о сознании, антропоцентрич...

2025-09-24