Wireless Video Semantic Communication with Decoupled Diffusion Multi-frame Compensation

2511.02478v1 cs.MM, cs.AI 2025-11-06

Авторы:

Bingyan Xie, Yongpeng Wu, Yuxuan Shi, Biqian Feng, Wenjun Zhang, Jihong Park, Tony Quek

Abstract

Existing wireless video transmission schemes directly conduct video coding in pixel level, while neglecting the inner semantics contained in videos. In this paper, we propose a wireless video semantic communication framework with decoupled diffusion multi-frame compensation (DDMFC), abbreviated as WVSC-D, which integrates the idea of semantic communication into wireless video transmission scenarios. WVSC-D first encodes original video frames as semantic frames and then conducts video coding based on such compact representations, enabling the video coding in semantic level rather than pixel level. Moreover, to further reduce the communication overhead, a reference semantic frame is introduced to substitute motion vectors of each frame in common video coding methods. At the receiver, DDMFC is proposed to generate compensated current semantic frame by a two-stage conditional diffusion process. With both the reference frame transmission and DDMFC frame compensation, the bandwidth efficiency improves with satisfying video transmission performance. Experimental results verify the performance gain of WVSC-D over other DL-based methods e.g. DVSC about 1.8 dB in terms of PSNR.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Wireless Video Semantic Communication with Decoupled Diffusion Multi-frame Compensation

Авторы:

Abstract

Ссылки и действия

Связанные статьи

PSA-MF: Personality-Sentiment Aligned Multi-Level Fusion for Multimodal Sentimen...

Real-Time Mobile Video Analytics for Pre-arrival Emergency Medical Services

EVER: Edge-Assisted Auto-Verification for Mobile MR-Aided Operation

CPCLDETECTOR: Knowledge Enhancement and Alignment Selection for Chinese Patroniz...

MM-HSD: Multi-Modal Hate Speech Detection in Videos

Навигация