SONA: Learning Conditional, Unconditional, and Mismatching-Aware Discriminator

2510.04576v1 cs.LG, cs.AI, cs.CV, stat.ML 2025-10-08

Авторы:

Yuhta Takida, Satoshi Hayakawa, Takashi Shibuya, Masaaki Imaizumi, Naoki Murata, Bac Nguyen, Toshimitsu Uesaka, Chieh-Hsin Lai, Yuki Mitsufuji

Abstract

Deep generative models have made significant advances in generating complex content, yet conditional generation remains a fundamental challenge. Existing conditional generative adversarial networks often struggle to balance the dual objectives of assessing authenticity and conditional alignment of input samples within their conditional discriminators. To address this, we propose a novel discriminator design that integrates three key capabilities: unconditional discrimination, matching-aware supervision to enhance alignment sensitivity, and adaptive weighting to dynamically balance all objectives. Specifically, we introduce Sum of Naturalness and Alignment (SONA), which employs separate projections for naturalness (authenticity) and alignment in the final layer with an inductive bias, supported by dedicated objective functions and an adaptive weighting mechanism. Extensive experiments on class-conditional generation tasks show that \ours achieves superior sample quality and conditional alignment compared to state-of-the-art methods. Furthermore, we demonstrate its effectiveness in text-to-image generation, confirming the versatility and robustness of our approach.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

SONA: Learning Conditional, Unconditional, and Mismatching-Aware Discriminator

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Terminal Velocity Matching

LeJEPA: Provable and Scalable Self-Supervised Learning Without the Heuristics

On Flow Matching KL Divergence

Soft Task-Aware Routing of Experts for Equivariant Representation Learning

Gaussian Embeddings: How JEPAs Secretly Learn Your Data Density

Навигация