MAD: Manifold Attracted Diffusion
2509.24710v1
stat.ML, cs.LG, cs.NA, math.NA
2025-10-01
Authors:
Dennis Elbrächter, Giovanni S. Alberti, Matteo Santacesaria
Abstract
Score-based diffusion models are a highly effective method for generating
samples from a distribution of images. We consider scenarios where the training
data comes from a noisy version of the target distribution, and present an
efficiently implementable modification of the inference procedure to generate
noiseless samples. Our approach is motivated by the manifold hypothesis,
according to which meaningful data is concentrated around some low-dimensional
manifold of a high-dimensional ambient space. The central idea is that noise
manifests as low-magnitude variation in off-manifold directions, in contrast to
the relevant variation of the desired distribution, which is mostly confined to
on-manifold directions. We introduce the notion of an extended score and show
that, in a simplified setting, it can be used to reduce small variations to
zero, while leaving large variations mostly unchanged. We describe how its
approximation can be computed efficiently from an approximation to the standard
score and demonstrate its efficacy on toy problems, synthetic data, and real
data.
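
The central idea can be illustrated with a toy example. The sketch below is not the authors' method and does not implement the extended score, which the abstract does not define. It only shows the premise: when points on a one-dimensional line in the plane are perturbed by small noise, the variance along the line is large and the variance orthogonal to it is small. The line direction, the standard deviations, and all function names are assumptions chosen for illustration.

```python
import random

# Hypothetical toy setup (not from the paper): the "manifold" is a line
# through the origin in 2D, and noise acts along its unit normal.
ON_DIRECTION = (0.6, 0.8)    # unit vector along the line
OFF_DIRECTION = (-0.8, 0.6)  # unit vector orthogonal to the line


def sample_noisy_line(n, on_std=1.0, off_std=0.05, seed=0):
    """Draw n points on the line, perturbed by small off-manifold noise."""
    rng = random.Random(seed)
    points = []
    for _ in range(n):
        t = rng.gauss(0.0, on_std)   # large on-manifold variation
        e = rng.gauss(0.0, off_std)  # small off-manifold variation (noise)
        points.append((t * ON_DIRECTION[0] + e * OFF_DIRECTION[0],
                       t * ON_DIRECTION[1] + e * OFF_DIRECTION[1]))
    return points


def variance_along(points, direction):
    """Empirical variance of the points projected onto a unit direction."""
    proj = [p[0] * direction[0] + p[1] * direction[1] for p in points]
    mean = sum(proj) / len(proj)
    return sum((v - mean) ** 2 for v in proj) / len(proj)


points = sample_noisy_line(5000)
on_var = variance_along(points, ON_DIRECTION)
off_var = variance_along(points, OFF_DIRECTION)
print(f"on-manifold variance:  {on_var:.4f}")
print(f"off-manifold variance: {off_var:.6f}")
```

In this toy setting, a procedure that reduces the small off-manifold variation to zero while leaving the on-manifold variation mostly unchanged would return samples lying on the line itself.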