Video Object Recognition in Mobile Edge Networks: Local Tracking or Edge Detection?

2511.20716v1 cs.CV, eess.IV 2025-11-27

Авторы:

Kun Guo, Yun Shen, Xijun Wang, Chaoqun You, Yun Rui, Tony Q. S. Quek

Abstract

Fast and accurate video object recognition, which relies on frame-by-frame video analytics, remains a challenge for resource-constrained devices such as traffic cameras. Recent advances in mobile edge computing have made it possible to offload computation-intensive object detection to edge servers equipped with high-accuracy neural networks, while lightweight and fast object tracking algorithms run locally on devices. This hybrid approach offers a promising solution but introduces a new challenge: deciding when to perform edge detection versus local tracking. To address this, we formulate two long-term optimization problems for both single-device and multi-device scenarios, taking into account the temporal correlation of consecutive frames and the dynamic conditions of mobile edge networks. Based on the formulation, we propose the LTED-Ada in single-device setting, a deep reinforcement learning-based algorithm that adaptively selects between local tracking and edge detection, according to the frame rate as well as recognition accuracy and delay requirement. In multi-device setting, we further enhance LTED-Ada using federated learning to enable collaborative policy training across devices, thereby improving its generalization to unseen frame rates and performance requirements. Finally, we conduct extensive hardware-in-the-loop experiments using multiple Raspberry Pi 4B devices and a personal computer as the edge server, demonstrating the superiority of LTED-Ada.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Video Object Recognition in Mobile Edge Networks: Local Tracking or Edge Detection?

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Ultra-lightweight Neural Video Representation Compression

TinyViT: Field Deployable Transformer Pipeline for Solar Panel Surface Fault and...

Data Augmentation Strategies for Robust Lane Marking Detection

The Determinant Ratio Matrix Approach to Solving 3D Matching and 2D Orthographic...

Motion Transfer-Enhanced StyleGAN for Generating Diverse Macaque Facial Expressi...

Навигация