Category learning in deep neural networks: Information content and geometry of internal representations
2510.19021v1
cs.LG, cs.IT, math.IT, q-bio.NC
2025-10-25
Авторы:
Laurent Bonnasse-Gahot, Jean-Pierre Nadal
Abstract
In animals, category learning enhances discrimination between stimuli close
to the category boundary. This phenomenon, called categorical perception, was
also empirically observed in artificial neural networks trained on
classification tasks. In previous modeling works based on neuroscience data, we
show that this expansion/compression is a necessary outcome of efficient
learning. Here we extend our theoretical framework to artificial networks. We
show that minimizing the Bayes cost (mean of the cross-entropy loss) implies
maximizing the mutual information between the set of categories and the neural
activities prior to the decision layer. Considering structured data with an
underlying feature space of small dimension, we show that maximizing the mutual
information implies (i) finding an appropriate projection space, and, (ii)
building a neural representation with the appropriate metric. The latter is
based on a Fisher information matrix measuring the sensitivity of the neural
activity to changes in the projection space. Optimal learning makes this neural
Fisher information follow a category-specific Fisher information, measuring the
sensitivity of the category membership. Category learning thus induces an
expansion of neural space near decision boundaries. We characterize the
properties of the categorical Fisher information, showing that its eigenvectors
give the most discriminant directions at each point of the projection space. We
find that, unexpectedly, its maxima are in general not exactly at, but near,
the class boundaries. Considering toy models and the MNIST dataset, we
numerically illustrate how after learning the two Fisher information matrices
match, and essentially align with the category boundaries. Finally, we relate
our approach to the Information Bottleneck one, and we exhibit a bias-variance
decomposition of the Bayes cost, of interest on its own.