Complex Instruction Following with Diverse Style Policies in Football Games

2511.19885v1 cs.MA, cs.LG 2025-11-27

Авторы:

Chenglu Sun, Shuo Shen, Haonan Hu, Wei Zhou, Chen Chen

Abstract

Despite advancements in language-controlled reinforcement learning (LC-RL) for basic domains and straightforward commands (e.g., object manipulation and navigation), effectively extending LC-RL to comprehend and execute high-level or abstract instructions in complex, multi-agent environments, such as football games, remains a significant challenge. To address this gap, we introduce Language-Controlled Diverse Style Policies (LCDSP), a novel LC-RL paradigm specifically designed for complex scenarios. LCDSP comprises two key components: a Diverse Style Training (DST) method and a Style Interpreter (SI). The DST method efficiently trains a single policy capable of exhibiting a wide range of diverse behaviors by modulating agent actions through style parameters (SP). The SI is designed to accurately and rapidly translate high-level language instructions into these corresponding SP. Through extensive experiments in a complex 5v5 football environment, we demonstrate that LCDSP effectively comprehends abstract tactical instructions and accurately executes the desired diverse behavioral styles, showcasing its potential for complex, real-world applications.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Complex Instruction Following with Diverse Style Policies in Football Games

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Complementary Characterization of Agent-Based Models via Computational Mechanics...

SocialDriveGen: Generating Diverse Traffic Scenarios with Controllable Social In...

Multi-agent In-context Coordination via Decentralized Memory Retrieval

Structuring Collective Action with LLM-Guided Evolution: From Ill-Structured Pro...

Bayesian Ego-graph inference for Networked Multi-Agent Reinforcement Learning

Навигация