OneOcc: Semantic Occupancy Prediction for Legged Robots with a Single Panoramic Camera

2511.03571v1 cs.RO, cs.CV, eess.IV 2025-11-07

Authors:

Hao Shi, Ze Wang, Shangwei Guo, Mengfei Duan, Song Wang, Teng Chen, Kailun Yang, Lin Wang, Kaiwei Wang

Abstract

Robust 3D semantic occupancy is crucial for legged/humanoid robots, yet most semantic scene completion (SSC) systems target wheeled platforms with forward-facing sensors. We present OneOcc, a vision-only panoramic SSC framework designed for gait-introduced body jitter and 360° continuity. OneOcc combines: (i) Dual-Projection fusion (DP-ER) to exploit the annular panorama and its equirectangular unfolding, preserving 360° continuity and grid alignment; (ii) Bi-Grid Voxelization (BGV) to reason in Cartesian and cylindrical-polar spaces, reducing discretization bias and sharpening free/occupied boundaries; (iii) a lightweight decoder with Hierarchical AMoE-3D for dynamic multi-scale fusion and better long-range/occlusion reasoning; and (iv) plug-and-play Gait Displacement Compensation (GDC) learning feature-level motion correction without extra sensors. We also release two panoramic occupancy benchmarks: QuadOcc (real quadruped, first-person 360°) and Human360Occ (H3O) (CARLA human-ego 360° with RGB, Depth, semantic occupancy; standardized within-/cross-city splits). OneOcc sets new state-of-the-art (SOTA): on QuadOcc it beats strong vision baselines and popular LiDAR ones; on H3O it gains +3.83 mIoU (within-city) and +8.08 (cross-city). Modules are lightweight, enabling deployable full-surround perception for legged/humanoid robots. Datasets and code will be publicly available at https://github.com/MasterHow/OneOcc.
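
The abstract describes Bi-Grid Voxelization only at a high level (reasoning in both Cartesian and cylindrical-polar spaces). The sketch below is not the authors' implementation; it only illustrates the general idea of indexing the same 3D point in two grids. The function names, grid extents, and bin counts are all hypothetical values chosen for illustration.

```python
import math

def cartesian_index(x, y, z, origin, voxel_size, dims):
    """Return the (i, j, k) voxel of a regular Cartesian grid, or None if outside.

    origin, voxel_size and dims are illustrative assumptions, not values from the paper.
    """
    idx = tuple(math.floor((p - o) / voxel_size) for p, o in zip((x, y, z), origin))
    if all(0 <= i < n for i, n in zip(idx, dims)):
        return idx
    return None

def cylindrical_index(x, y, z, r_max, z_min, z_max, dims):
    """Return the (radial, angular, height) bin of a cylindrical grid, or None if outside."""
    n_r, n_theta, n_z = dims
    r = math.hypot(x, y)
    if r >= r_max or not (z_min <= z < z_max):
        return None
    # Wrap the azimuth to [0, 2*pi) so bins are continuous across the 360-degree seam.
    theta = math.atan2(y, x) % (2.0 * math.pi)
    i_r = min(int(r / r_max * n_r), n_r - 1)
    i_t = int(theta / (2.0 * math.pi) * n_theta) % n_theta
    i_z = min(int((z - z_min) / (z_max - z_min) * n_z), n_z - 1)
    return (i_r, i_t, i_z)

# Hypothetical grid configuration used only for this example.
ORIGIN = (-4.0, -4.0, -1.0)
CART_DIMS = (16, 16, 4)
CYL_DIMS = (8, 8, 4)

point = (1.0, 0.0, 0.0)
cart = cartesian_index(*point, origin=ORIGIN, voxel_size=0.5, dims=CART_DIMS)
cyl = cylindrical_index(*point, r_max=4.0, z_min=-1.0, z_max=1.0, dims=CYL_DIMS)
print(cart, cyl)
```

In a cylindrical grid the angular bin width is fixed, so cells near the sensor are small and cells far away are large, whereas Cartesian cells have uniform size everywhere. This difference is one plausible reading of why combining the two grids might reduce discretization bias, though the abstract does not spell out the mechanism.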

Related papers

DRCP: Diffusion on Reinforced Cooperative Perception for Perceiving Beyond Limit...

QuaDreamer: Controllable Panoramic Video Generation for Quadruped Robots

UniFucGrasp: Human-Hand-Inspired Unified Functional Grasp Annotation Strategy an...
