Dejavu: Post-Deployment Learning for Embodied Agents via Experience Feedback
2510.10181v1
cs.RO, cs.AI, cs.CV
2025-10-15
Authors:
Shaokai Wu, Yanbiao Ji, Qiuchang Li, Zhiyi Zhang, Qichen He, Wenyuan Xie, Guodong Zhang, Bayram Bayramli, Yue Ding, Hongtao Lu
Abstract
Embodied agents face a fundamental limitation: once deployed in real-world
environments to perform specific tasks, they are unable to acquire new useful
knowledge to enhance task performance. In this paper, we propose a general
post-deployment learning framework called Dejavu, which employs an Experience
Feedback Network (EFN) and augments the frozen Vision-Language-Action (VLA)
policy with retrieved execution memories. EFN automatically identifies
contextually successful prior action experiences and conditions action
prediction on this retrieved guidance. We adopt reinforcement learning with
semantic similarity rewards on EFN to ensure that the predicted actions align
with past successful behaviors under current observations. During deployment,
EFN continually enriches its memory with new trajectories, enabling the agent
to exhibit "learning from experience" despite fixed weights. Experiments across
diverse embodied tasks show that EFN significantly improves adaptability,
robustness, and success rates over frozen baselines. These results highlight a
promising path toward embodied agents that continually refine their behavior
after deployment.
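
The retrieval idea described in the abstract can be illustrated with a small sketch. This is not the authors' EFN: it assumes observations are already embedded as plain vectors, uses cosine similarity as a stand-in for whatever retrieval the paper actually uses, and omits the reinforcement learning stage entirely. The names `Experience`, `ExperienceMemory`, `add`, and `retrieve` are hypothetical and chosen only for illustration.

```python
import math
from dataclasses import dataclass, field
from typing import List


@dataclass
class Experience:
    """One stored step (hypothetical structure, not from the paper)."""
    embedding: List[float]  # assumed embedding of the observation
    action: List[float]     # action that was executed
    success: bool           # whether the episode it came from succeeded


def cosine(a: List[float], b: List[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class ExperienceMemory:
    """Append-only memory that grows during deployment; no weights change."""
    items: List[Experience] = field(default_factory=list)

    def add(self, experience: Experience) -> None:
        self.items.append(experience)

    def retrieve(self, query: List[float], k: int = 1) -> List[Experience]:
        """Return the k successful experiences most similar to the query."""
        successes = [e for e in self.items if e.success]
        ranked = sorted(
            successes,
            key=lambda e: cosine(query, e.embedding),
            reverse=True,
        )
        return ranked[:k]


memory = ExperienceMemory()
memory.add(Experience([1.0, 0.0], [0.1], True))
memory.add(Experience([0.0, 1.0], [0.9], True))
memory.add(Experience([0.9, 0.1], [0.5], False))  # failed, never retrieved

# The retrieved action would serve as guidance for the frozen policy.
guidance = memory.retrieve([1.0, 0.1], k=1)
```

In this sketch, adding new entries to `memory` changes what `retrieve` returns for later queries, which is the sense in which behavior can change while the policy weights stay fixed.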