R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations

2510.18085v1 cs.RO, cs.AI, cs.MA 2025-10-23

Авторы:

Connor Mattson, Varun Raveendra, Ellen Novoseller, Nicholas Waytowich, Vernon J. Lawhern, Daniel S. Brown

Abstract

Imitation Learning (IL) is a natural way for humans to teach robots, particularly when high-quality demonstrations are easy to obtain. While IL has been widely applied to single-robot settings, relatively few studies have addressed the extension of these methods to multi-agent systems, especially in settings where a single human must provide demonstrations to a team of collaborating robots. In this paper, we introduce and study Round-Robin Behavior Cloning (R2BC), a method that enables a single human operator to effectively train multi-robot systems through sequential, single-agent demonstrations. Our approach allows the human to teleoperate one agent at a time and incrementally teach multi-agent behavior to the entire system, without requiring demonstrations in the joint multi-agent action space. We show that R2BC methods match, and in some cases surpass, the performance of an oracle behavior cloning approach trained on privileged synchronized demonstrations across four multi-agent simulated tasks. Finally, we deploy R2BC on two physical robot tasks trained using real human demonstrations.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations

Авторы:

Abstract

Ссылки и действия

Связанные статьи

An Analysis of Constraint-Based Multi-Agent Pathfinding Algorithms

AVOID-JACK: Avoidance of Jackknifing for Swarms of Long Heavy Articulated Vehicl...

ScheduleStream: Temporal Planning with Samplers for GPU-Accelerated Multi-Arm Ta...

Policies over Poses: Reinforcement Learning based Distributed Pose-Graph Optimiz...

Destination-to-Chutes Task Mapping Optimization for Multi-Robot Coordination in ...

Навигация