Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment
2510.00430v1
cs.LG, cs.AI, cs.CV
2025-10-04
Authors:
Suhyeon Lee, Jong Chul Ye
Abstract
Despite recent progress, reinforcement learning (RL)-based fine-tuning of
diffusion models often struggles with generalization, composability, and
robustness against reward hacking. Recent studies have explored prompt
refinement as a modular alternative, but most adopt a feed-forward approach
that applies a single refined prompt throughout the entire sampling trajectory,
thereby failing to fully leverage the sequential nature of reinforcement
learning. To address this, we introduce PromptLoop, a plug-and-play RL
framework that incorporates latent feedback into step-wise prompt refinement.
Rather than modifying diffusion model weights, a multimodal large language
model (MLLM) is trained with RL to iteratively update prompts based on
intermediate latent states of diffusion models. This design achieves a
structural analogy to the Diffusion RL approach, while retaining the
flexibility and generality of prompt-based alignment. Extensive experiments
across diverse reward functions and diffusion backbones demonstrate that
PromptLoop (i) achieves effective reward optimization, (ii) generalizes
seamlessly to unseen models, (iii) composes orthogonally with existing
alignment methods, and (iv) mitigates over-optimization and reward hacking.
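
The step-wise refinement loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: `toy_denoise_step` and `toy_refine_prompt` are hypothetical stand-ins for a diffusion sampler step and the RL-trained MLLM policy. The refinement schedule (`refine_every`, with no refinement at step 0) is also an assumption made only for illustration. The RL training of the refiner is not shown.

```python
from typing import Callable, List, Tuple

Latent = List[float]


def toy_denoise_step(latent: Latent, prompt: str, t: int) -> Latent:
    # Hypothetical stand-in for one diffusion sampling step conditioned on the prompt.
    # It moves the latent halfway toward a prompt-dependent target value.
    target = float(len(prompt) % 7)
    return [x + 0.5 * (target - x) for x in latent]


def toy_refine_prompt(prompt: str, latent: Latent, t: int) -> str:
    # Hypothetical stand-in for the MLLM policy. A real policy would inspect
    # the intermediate latent; this stub only tags the prompt with the step index.
    return f"{prompt} [refined at step {t}]"


def sample_with_latent_feedback(
    prompt: str,
    init_latent: Latent,
    num_steps: int,
    refine_every: int,
    denoise_step: Callable[[Latent, str, int], Latent] = toy_denoise_step,
    refine_prompt: Callable[[str, Latent, int], str] = toy_refine_prompt,
) -> Tuple[Latent, List[str]]:
    # The diffusion model weights are never modified; only the prompt changes.
    latent = list(init_latent)
    current_prompt = prompt
    prompts_used: List[str] = []
    for t in range(num_steps):
        # Assumed schedule: refine every `refine_every` steps, skipping step 0.
        if t > 0 and t % refine_every == 0:
            current_prompt = refine_prompt(current_prompt, latent, t)
        latent = denoise_step(latent, current_prompt, t)
        prompts_used.append(current_prompt)
    return latent, prompts_used


final_latent, history = sample_with_latent_feedback(
    "a red cube", [0.0, 0.0], num_steps=6, refine_every=2
)
```

In contrast, the feed-forward approach mentioned above would refine the prompt once before sampling and reuse it at every step, so the refiner never sees an intermediate latent.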