Social World Model-Augmented Mechanism Design Policy Learning
2510.19270v1
cs.CY, cs.AI
2025-10-24
Авторы:
Xiaoyuan Zhang, Yizhe Huang, Chengdong Ma, Zhixun Chen, Long Ma, Yali Du, Song-Chun Zhu, Yaodong Yang, Xue Feng
Abstract
Designing adaptive mechanisms to align individual and collective interests
remains a central challenge in artificial social intelligence. Existing methods
often struggle with modeling heterogeneous agents possessing persistent latent
traits (e.g., skills, preferences) and dealing with complex multi-agent system
dynamics. These challenges are compounded by the critical need for high sample
efficiency due to costly real-world interactions. World Models, by learning to
predict environmental dynamics, offer a promising pathway to enhance mechanism
design in heterogeneous and complex systems. In this paper, we introduce a
novel method named SWM-AP (Social World Model-Augmented Mechanism Design Policy
Learning), which learns a social world model hierarchically modeling agents'
behavior to enhance mechanism design. Specifically, the social world model
infers agents' traits from their interaction trajectories and learns a
trait-based model to predict agents' responses to the deployed mechanisms. The
mechanism design policy collects extensive training trajectories by interacting
with the social world model, while concurrently inferring agents' traits online
during real-world interactions to further boost policy learning efficiency.
Experiments in diverse settings (tax policy design, team coordination, and
facility location) demonstrate that SWM-AP outperforms established model-based
and model-free RL baselines in cumulative rewards and sample efficiency.
Ссылки и действия
Дополнительные ресурсы: