Authentic Discrete Diffusion Model
2510.01047v1
cs.CV, cs.AI, cs.CL, cs.LG
2025-10-04
Авторы:
Xiao Li, Jiaqi Zhang, Shuxiang Zhang, Tianshui Chen, Liang Lin, Guangrun Wang
Abstract
We propose an Authentic Discrete Diffusion (ADD) framework that fundamentally
redefines prior pseudo-discrete approaches by preserving core diffusion
characteristics directly in the one-hot space through a suite of coordinated
mechanisms. Unlike conventional "pseudo" discrete diffusion (PDD) methods, ADD
reformulates the diffusion input by directly using float-encoded one-hot class
data, without relying on diffusing in the continuous latent spaces or masking
policies. At its core, a timestep-conditioned cross-entropy loss is introduced
between the diffusion model's outputs and the original one-hot labels. This
synergistic design establishes a bridge between discriminative and generative
learning. Our experiments demonstrate that ADD not only achieves superior
performance on classification tasks compared to the baseline, but also exhibits
excellent text generation capabilities on Image captioning. Extensive ablations
validate the measurable gains of each component.