MorphoSim: An Interactive, Controllable, and Editable Language-guided 4D World Simulator
2510.04390v1
cs.CV, cs.AI, cs.CL
2025-10-08
Авторы:
Xuehai He, Shijie Zhou, Thivyanth Venkateswaran, Kaizhi Zheng, Ziyu Wan, Achuta Kadambi, Xin Eric Wang
Abstract
World models that support controllable
and editable spatiotemporal environments are valuable
for robotics, enabling scalable training data, repro ducible evaluation, and
flexible task design. While
recent text-to-video models generate realistic dynam ics, they are
constrained to 2D views and offer limited
interaction. We introduce MorphoSim, a language guided framework that
generates 4D scenes with
multi-view consistency and object-level controls. From
natural language instructions, MorphoSim produces
dynamic environments where objects can be directed,
recolored, or removed, and scenes can be observed
from arbitrary viewpoints. The framework integrates
trajectory-guided generation with feature field dis tillation, allowing edits
to be applied interactively
without full re-generation. Experiments show that Mor phoSim maintains high
scene fidelity while enabling
controllability and editability. The code is available
at https://github.com/eric-ai-lab/Morph4D.
Ссылки и действия
Дополнительные ресурсы: