Complex Instruction Following with Diverse Style Policies in Football Games

2511.19885v1 cs.MA, cs.LG 2025-11-27
Авторы:

Chenglu Sun, Shuo Shen, Haonan Hu, Wei Zhou, Chen Chen

Abstract

Despite advancements in language-controlled reinforcement learning (LC-RL) for basic domains and straightforward commands (e.g., object manipulation and navigation), effectively extending LC-RL to comprehend and execute high-level or abstract instructions in complex, multi-agent environments, such as football games, remains a significant challenge. To address this gap, we introduce Language-Controlled Diverse Style Policies (LCDSP), a novel LC-RL paradigm specifically designed for complex scenarios. LCDSP comprises two key components: a Diverse Style Training (DST) method and a Style Interpreter (SI). The DST method efficiently trains a single policy capable of exhibiting a wide range of diverse behaviors by modulating agent actions through style parameters (SP). The SI is designed to accurately and rapidly translate high-level language instructions into these corresponding SP. Through extensive experiments in a complex 5v5 football environment, we demonstrate that LCDSP effectively comprehends abstract tactical instructions and accurately executes the desired diverse behavioral styles, showcasing its potential for complex, real-world applications.

Ссылки и действия

Связанные статьи

Structuring Collective Action with LLM-Guided Evolution: From Ill-Structured Pro...

## Контекст Коллективные действия, требующие выравнивания личных интересов со стратегическими целями на уровне группы, я...

2025-09-27

Bayesian Ego-graph inference for Networked Multi-Agent Reinforcement Learning

#### Контекст Сетевая многоагентная reinforcement learning (Networked-MARL) — это область исследований, где децентрализо...

2025-09-24