Towards Efficient 3D Gaussian Human Avatar Compression: A Prior-Guided Framework
2510.10492v1
eess.IV, cs.CV, cs.MM, I.4; I.5
2025-10-15
Авторы:
Shanzhi Yin, Bolin Chen, Xinju Wu, Ru-Ling Liao, Jie Chen, Shiqi Wang, Yan Ye
Abstract
This paper proposes an efficient 3D avatar coding framework that leverages
compact human priors and canonical-to-target transformation to enable
high-quality 3D human avatar video compression at ultra-low bit rates. The
framework begins by training a canonical Gaussian avatar using articulated
splatting in a network-free manner, which serves as the foundation for avatar
appearance modeling. Simultaneously, a human-prior template is employed to
capture temporal body movements through compact parametric representations.
This decomposition of appearance and temporal evolution minimizes redundancy,
enabling efficient compression: the canonical avatar is shared across the
sequence, requiring compression only once, while the temporal parameters,
consisting of just 94 parameters per frame, are transmitted with minimal
bit-rate. For each frame, the target human avatar is generated by deforming
canonical avatar via Linear Blend Skinning transformation, facilitating
temporal coherent video reconstruction and novel view synthesis. Experimental
results demonstrate that the proposed method significantly outperforms
conventional 2D/3D codecs and existing learnable dynamic 3D Gaussian splatting
compression method in terms of rate-distortion performance on mainstream
multi-view human video datasets, paving the way for seamless immersive
multimedia experiences in meta-verse applications.