Learning Energy-based Variational Latent Prior for VAEs
2510.00260v1
cs.LG, cs.AI, cs.CV
2025-10-04
Авторы:
Debottam Dutta, Chaitanya Amballa, Zhongweiyang Xu, Yu-Lin Wei, Romit Roy Choudhury
Abstract
Variational Auto-Encoders (VAEs) are known to generate blurry and
inconsistent samples. One reason for this is the "prior hole" problem. A prior
hole refers to regions that have high probability under the VAE's prior but low
probability under the VAE's posterior. This means that during data generation,
high probability samples from the prior could have low probability under the
posterior, resulting in poor quality data. Ideally, a prior needs to be
flexible enough to match the posterior while retaining the ability to generate
samples fast. Generative models continue to address this tradeoff. This paper
proposes to model the prior as an energy-based model (EBM). While EBMs are
known to offer the flexibility to match posteriors (and also improving the
ELBO), they are traditionally slow in sample generation due to their dependency
on MCMC methods. Our key idea is to bring a variational approach to tackle the
normalization constant in EBMs, thus bypassing the expensive MCMC approaches.
The variational form can be approximated with a sampler network, and we show
that such an approach to training priors can be formulated as an alternating
optimization problem. Moreover, the same sampler reduces to an implicit
variational prior during generation, providing efficient and fast sampling. We
compare our Energy-based Variational Latent Prior (EVaLP) method to multiple
SOTA baselines and show improvements in image generation quality, reduced prior
holes, and better sampling efficiency.
Ссылки и действия
Дополнительные ресурсы: