Test-Time Anchoring for Discrete Diffusion Posterior Sampling

2510.02291v1 cs.LG, cs.CV, stat.ML 2025-10-04

Авторы:

Litu Rout, Andreas Lugmayr, Yasamin Jafarian, Srivatsan Varadharajan, Constantine Caramanis, Sanjay Shakkottai, Ira Kemelmacher-Shlizerman

Abstract

We study the problem of posterior sampling using pretrained discrete diffusion foundation models, aiming to recover images from noisy measurements without retraining task-specific models. While diffusion models have achieved remarkable success in generative modeling, most advances rely on continuous Gaussian diffusion. In contrast, discrete diffusion offers a unified framework for jointly modeling categorical data such as text and images. Beyond unification, discrete diffusion provides faster inference, finer control, and principled training-free Bayesian inference, making it particularly well-suited for posterior sampling. However, existing approaches to discrete diffusion posterior sampling face severe challenges: derivative-free guidance yields sparse signals, continuous relaxations limit applicability, and split Gibbs samplers suffer from the curse of dimensionality. To overcome these limitations, we introduce Anchored Posterior Sampling (APS) for masked diffusion foundation models, built on two key innovations -- quantized expectation for gradient-like guidance in discrete embedding space, and anchored remasking for adaptive decoding. Our approach achieves state-of-the-art performance among discrete diffusion samplers across linear and nonlinear inverse problems on the standard benchmarks. We further demonstrate the benefits of our approach in training-free stylization and text-guided editing.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Test-Time Anchoring for Discrete Diffusion Posterior Sampling

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Sampling Control for Imbalanced Calibration in Semi-Supervised Learning

Shortcut Invariance: Targeted Jacobian Regularization in Disentangled Latent Spa...

Self-Supervised Learning by Curvature Alignment

Coordinate Descent for Network Linearization

Matricial Free Energy as a Gaussianizing Regularizer: Enhancing Autoencoders for...

Навигация