Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding
2510.07342v1
q-bio.NC, cs.LG, eess.IV
2025-10-12
Авторы:
Haomiao Chen, Keith W Jamison, Mert R. Sabuncu, Amy Kuceyeski
Abstract
Neural encoding models aim to predict fMRI-measured brain responses to
natural images. fMRI data is acquired as a 3D volume of voxels, where each
voxel has a defined spatial location in the brain. However, conventional
encoding models often flatten this volume into a 1D vector and treat voxel
responses as independent outputs. This removes spatial context, discards
anatomical information, and ties each model to a subject-specific voxel grid.
We introduce the Neural Response Function (NRF), a framework that models fMRI
activity as a continuous function over anatomical space rather than a flat
vector of voxels. NRF represents brain activity as a continuous implicit
function: given an image and a spatial coordinate (x, y, z) in standardized MNI
space, the model predicts the response at that location. This formulation
decouples predictions from the training grid, supports querying at arbitrary
spatial resolutions, and enables resolution-agnostic analyses. By grounding the
model in anatomical space, NRF exploits two key properties of brain responses:
(1) local smoothness -- neighboring voxels exhibit similar response patterns;
modeling responses continuously captures these correlations and improves data
efficiency, and (2) cross-subject alignment -- MNI coordinates unify data
across individuals, allowing a model pretrained on one subject to be fine-tuned
on new subjects. In experiments, NRF outperformed baseline models in both
intrasubject encoding and cross-subject adaptation, achieving high performance
while reducing the data size needed by orders of magnitude. To our knowledge,
NRF is the first anatomically aware encoding model to move beyond flattened
voxels, learning a continuous mapping from images to brain responses in 3D
space.