FlowOpt: Fast Optimization Through Whole Flow Processes for Training-Free Editing

2510.22010v1 cs.CV, cs.LG, eess.IV 2025-10-29

Авторы:

Or Ronai, Vladimir Kulikov, Tomer Michaeli

Abstract

The remarkable success of diffusion and flow-matching models has ignited a surge of works on adapting them at test time for controlled generation tasks. Examples range from image editing to restoration, compression and personalization. However, due to the iterative nature of the sampling process in those models, it is computationally impractical to use gradient-based optimization to directly control the image generated at the end of the process. As a result, existing methods typically resort to manipulating each timestep separately. Here we introduce FlowOpt - a zero-order (gradient-free) optimization framework that treats the entire flow process as a black box, enabling optimization through the whole sampling path without backpropagation through the model. Our method is both highly efficient and allows users to monitor the intermediate optimization results and perform early stopping if desired. We prove a sufficient condition on FlowOpt's step-size, under which convergence to the global optimum is guaranteed. We further show how to empirically estimate this upper bound so as to choose an appropriate step-size. We demonstrate how FlowOpt can be used for image editing, showcasing two options: (i) inversion (determining the initial noise that generates a given image), and (ii) directly steering the edited image to be similar to the source image while conforming to a target text prompt. In both cases, FlowOpt achieves state-of-the-art results while using roughly the same number of neural function evaluations (NFEs) as existing methods. Code and examples are available on the project's webpage.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

FlowOpt: Fast Optimization Through Whole Flow Processes for Training-Free Editing

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Blinking Beyond EAR: A Stable Eyelid Angle Metric for Driver Drowsiness Detectio...

Neighbor GRPO: Contrastive ODE Policy Optimization Aligns Flow Models

CYPRESS: Crop Yield Prediction via Regression on Prithvi's Encoder for Satellite...

AngularFuse: A Closer Look at Angle-based Perception for Spatial-Sensitive Multi...

YOLOv11-Litchi: Efficient Litchi Fruit Detection based on UAV-Captured Agricultu...

Навигация