Seeing My Future: Predicting Situated Interaction Behavior in Virtual Reality
2510.10742v1
cs.CV, cs.LG
2025-10-15
Authors:
Yuan Xu, Zimu Zhang, Xiaoxuan Ma, Wentao Zhu, Yu Qiao, Yizhou Wang
Abstract
Virtual and augmented reality systems increasingly demand intelligent
adaptation to user behaviors for enhanced interaction experiences. Achieving
this requires accurately understanding human intentions and predicting future
situated behaviors, such as gaze direction and object interactions, which is
vital for creating responsive VR/AR environments and applications like
personalized assistants. However, accurate behavioral prediction demands
modeling the underlying cognitive processes that drive human-environment
interactions. In this work, we introduce a hierarchical, intention-aware
framework that models human intentions and predicts detailed situated behaviors
by leveraging cognitive mechanisms. Given historical human dynamics and the
observation of scene contexts, our framework first identifies potential
interaction targets and then forecasts fine-grained future behaviors. We propose a
dynamic Graph Convolutional Network (GCN) to effectively capture
human-environment relationships. Extensive experiments on challenging
real-world benchmarks and in a live VR environment demonstrate the effectiveness of
our approach, achieving superior performance across all metrics and enabling
practical applications for proactive VR systems that anticipate user behaviors
and adapt virtual environments accordingly.