GAOT: Generating Articulated Objects Through Text-Guided Diffusion Models

2512.03566v1 cs.CV, cs.MM 2025-12-04

Authors:

Hao Sun, Lei Fan, Donglin Di, Shaohui Liu

Abstract

Articulated object generation has advanced considerably, yet existing models often cannot be conditioned on text prompts. To bridge the significant gap between textual descriptions and 3D articulated object representations, we propose GAOT, a three-phase framework that generates articulated objects from text prompts using diffusion models and hypergraph learning. First, we fine-tune a point cloud generation model to produce a coarse representation of objects from text prompts. Second, given the inherent connection between articulated objects and graph structures, we design a hypergraph-based learning method that refines these coarse representations, with object parts represented as graph vertices. Finally, a diffusion model generates the joints of the articulated object, represented as graph edges, based on the object parts. Extensive qualitative and quantitative experiments on the PartNet-Mobility dataset demonstrate the effectiveness of our approach, which outperforms previous methods.
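The graph view in the abstract (parts as vertices, joints as edges) can be illustrated with a small data-structure sketch. This is not the authors' implementation: the class names, fields, joint kinds ("revolute", "prismatic"), and the cabinet example are all hypothetical, chosen only to show how such a representation might be held in memory.

```python
from dataclasses import dataclass, field

Vec3 = tuple[float, float, float]


@dataclass
class Part:
    """A graph vertex: one rigid part of the object (hypothetical fields)."""
    name: str
    points: list[Vec3] = field(default_factory=list)  # coarse point cloud


@dataclass
class Joint:
    """A graph edge: connects a parent part to a child part (hypothetical fields)."""
    parent: int  # index into ArticulatedObject.parts
    child: int   # index into ArticulatedObject.parts
    kind: str    # assumed labels, e.g. "revolute" or "prismatic"
    axis: Vec3   # joint axis direction


@dataclass
class ArticulatedObject:
    parts: list[Part] = field(default_factory=list)
    joints: list[Joint] = field(default_factory=list)

    def add_part(self, part: Part) -> int:
        """Append a part and return its vertex index."""
        self.parts.append(part)
        return len(self.parts) - 1

    def add_joint(self, parent: int, child: int, kind: str, axis: Vec3) -> None:
        """Add an edge after checking that both endpoints are valid and distinct."""
        n = len(self.parts)
        if not (0 <= parent < n and 0 <= child < n) or parent == child:
            raise ValueError("joint must connect two distinct existing parts")
        self.joints.append(Joint(parent, child, kind, axis))

    def children_of(self, index: int) -> list[int]:
        """Indices of parts attached to the given part by a joint."""
        return [j.child for j in self.joints if j.parent == index]


def build_example_cabinet() -> ArticulatedObject:
    """Toy example: a cabinet body with a hinged door and a sliding drawer."""
    obj = ArticulatedObject()
    body = obj.add_part(Part("body"))
    door = obj.add_part(Part("door"))
    drawer = obj.add_part(Part("drawer"))
    obj.add_joint(body, door, "revolute", (0.0, 0.0, 1.0))
    obj.add_joint(body, drawer, "prismatic", (1.0, 0.0, 0.0))
    return obj
```

In this sketch, the first two phases described above would populate `parts` (coarse, then refined), and the final phase would populate `joints`; how GAOT actually encodes either is not specified in the abstract.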

