Camera Movement Classification in Historical Footage: A Comparative Study of Deep Video Models
2510.14713v1
cs.CV, cs.AI, eess.IV
2025-10-18
Авторы:
Tingyu Lin, Armin Dadras, Florian Kleber, Robert Sablatnig
Abstract
Camera movement conveys spatial and narrative information essential for
understanding video content. While recent camera movement classification (CMC)
methods perform well on modern datasets, their generalization to historical
footage remains unexplored. This paper presents the first systematic evaluation
of deep video CMC models on archival film material. We summarize representative
methods and datasets, highlighting differences in model design and label
definitions. Five standard video classification models are assessed on the
HISTORIAN dataset, which includes expert-annotated World War II footage. The
best-performing model, Video Swin Transformer, achieves 80.25% accuracy,
showing strong convergence despite limited training data. Our findings
highlight the challenges and potential of adapting existing models to
low-quality video and motivate future work combining diverse input modalities
and temporal architectures.
Ссылки и действия
Дополнительные ресурсы: