Yesnt: Are Diffusion Relighting Models Ready for Capture Stage Compositing? A Hybrid Alternative to Bridge the Gap

2510.23494v1 cs.CV, cs.GR 2025-10-29

Авторы:

Elisabeth Jüttner, Leona Krath, Stefan Korfhage, Hannah Dröge, Matthias B. Hullin, Markus Plack

Abstract

Volumetric video relighting is essential for bringing captured performances into virtual worlds, but current approaches struggle to deliver temporally stable, production-ready results. Diffusion-based intrinsic decomposition methods show promise for single frames, yet suffer from stochastic noise and instability when extended to sequences, while video diffusion models remain constrained by memory and scale. We propose a hybrid relighting framework that combines diffusion-derived material priors with temporal regularization and physically motivated rendering. Our method aggregates multiple stochastic estimates of per-frame material properties into temporally consistent shading components, using optical-flow-guided regularization. For indirect effects such as shadows and reflections, we extract a mesh proxy from Gaussian Opacity Fields and render it within a standard graphics pipeline. Experiments on real and synthetic captures show that this hybrid strategy achieves substantially more stable relighting across sequences than diffusion-only baselines, while scaling beyond the clip lengths feasible for video diffusion. These results indicate that hybrid approaches, which balance learned priors with physically grounded constraints, are a practical step toward production-ready volumetric video relighting.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Yesnt: Are Diffusion Relighting Models Ready for Capture Stage Compositing? A Hybrid Alternative to Bridge the Gap

Авторы:

Abstract

Ссылки и действия

Связанные статьи

UTrice: Unifying Primitives in Differentiable Ray Tracing and Rasterization via ...

Back to Basics: Motion Representation Matters for Human Motion Generation Using ...

SplatFont3D: Structure-Aware Text-to-3D Artistic Font Generation with Part-Level...

Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gau...

Attention-guided reference point shifting for Gaussian-mixture-based partial poi...

Навигация