Curriculum Imitation Learning of Distributed Multi-Robot Policies

arXiv:2509.25097v2 | cs.RO, cs.LG, cs.MA | 2025-10-03

Authors:

Jesús Roche, Eduardo Sebastián, Eduardo Montijano

Abstract

Learning control policies for multi-robot systems (MRS) remains a major challenge due to long-term coordination and the difficulty of obtaining realistic training data. In this work, we address both limitations within an imitation learning framework. First, we shift the typical role of Curriculum Learning in MRS, from scalability with the number of robots, to focus on improving long-term coordination. We propose a curriculum strategy that gradually increases the length of expert trajectories during training, stabilizing learning and enhancing the accuracy of long-term behaviors. Second, we introduce a method to approximate the egocentric perception of each robot using only third-person global state demonstrations. Our approach transforms idealized trajectories into locally available observations by filtering neighbors, converting reference frames, and simulating onboard sensor variability. Both contributions are integrated into a physics-informed technique to produce scalable, distributed policies from observations. We conduct experiments across two tasks with varying team sizes and noise levels. Results show that our curriculum improves long-term accuracy, while our perceptual estimation method yields policies that are robust to realistic uncertainty. Together, these strategies enable the learning of robust, distributed controllers from global demonstrations, even in the absence of expert actions or onboard measurements.
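The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the linear schedule, the planar `(x, y, heading)` state layout, the sensing `radius`, the Gaussian noise model, and the function names `curriculum_horizon` and `egocentric_observation` are all assumptions made for illustration.

```python
import math
import random


def curriculum_horizon(epoch, total_epochs, min_len, max_len):
    """Trajectory length to train on at a given epoch.

    Assumed schedule: linear growth from min_len to max_len.
    The abstract only says the length increases gradually.
    """
    frac = min(1.0, epoch / max(1, total_epochs - 1))
    return min_len + round(frac * (max_len - min_len))


def egocentric_observation(states, i, radius, noise_std=0.0, rng=None):
    """Approximate what robot i would perceive, given a global state.

    states: list of (x, y, heading) tuples in a shared world frame
            (hypothetical layout).
    Returns a list of (neighbor_index, x_body, y_body) tuples.
    """
    rng = rng or random.Random(0)
    xi, yi, heading = states[i]
    c, s = math.cos(heading), math.sin(heading)
    observation = []
    for j, (xj, yj, _) in enumerate(states):
        if j == i:
            continue
        dx, dy = xj - xi, yj - yi
        # Step 1: filter neighbors outside the assumed sensing radius.
        if math.hypot(dx, dy) > radius:
            continue
        # Step 2: rotate the world-frame offset into robot i's body frame.
        x_body = c * dx + s * dy
        y_body = -s * dx + c * dy
        # Step 3: simulate sensor variability (assumed zero-mean Gaussian).
        x_body += rng.gauss(0.0, noise_std)
        y_body += rng.gauss(0.0, noise_std)
        observation.append((j, x_body, y_body))
    return observation


if __name__ == "__main__":
    # Horizon grows from 5 to 50 steps over 10 epochs.
    print([curriculum_horizon(e, 10, 5, 50) for e in range(10)])
    # Robot 0 faces +y; robot 1 is directly ahead, robot 2 is out of range.
    world = [(0.0, 0.0, math.pi / 2), (0.0, 1.0, 0.0), (5.0, 5.0, 0.0)]
    print(egocentric_observation(world, 0, radius=2.0))
```

In a training loop, each epoch would truncate expert trajectories to `curriculum_horizon(...)` steps and pass every global state through `egocentric_observation` before it reaches the policy, so that the learned controller only ever sees locally available, noisy inputs.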
