Novel View Synthesis from A Few Glimpses via Test-Time Natural Video Completion

2511.17932v1 cs.CV, cs.GR 2025-11-25

Authors:

Yan Xu, Yixing Wang, Stella X. Yu

Abstract

Given just a few glimpses of a scene, can you imagine the movie playing out as the camera glides through it? That's the lens we take on sparse-input novel view synthesis, not only as filling spatial gaps between widely spaced views, but also as completing a natural video unfolding through space. We recast the task as test-time natural video completion, using powerful priors from pretrained video diffusion models to hallucinate plausible in-between views. Our zero-shot, generation-guided framework produces pseudo views at novel camera poses, modulated by an uncertainty-aware mechanism for spatial coherence. These synthesized frames densify supervision for 3D Gaussian Splatting (3D-GS) for scene reconstruction, especially in under-observed regions. An iterative feedback loop lets 3D geometry and 2D view synthesis inform each other, improving both the scene reconstruction and the generated views. The result is coherent, high-fidelity renderings from sparse inputs without any scene-specific training or fine-tuning. On LLFF, DTU, DL3DV, and MipNeRF-360, our method significantly outperforms strong 3D-GS baselines under extreme sparsity.
