GRAND: Guidance, Rebalancing, and Assignment for Networked Dispatch in Multi-Agent Path Finding

2512.03194v1 cs.RO, cs.LG, cs.MA 2025-12-04

Авторы:

Johannes Gaber, Meshal Alharbi, Daniele Gammelli, Gioele Zardini

Abstract

Large robot fleets are now common in warehouses and other logistics settings, where small control gains translate into large operational impacts. In this article, we address task scheduling for lifelong Multi-Agent Pickup-and-Delivery (MAPD) and propose a hybrid method that couples learning-based global guidance with lightweight optimization. A graph neural network policy trained via reinforcement learning outputs a desired distribution of free agents over an aggregated warehouse graph. This signal is converted into region-to-region rebalancing through a minimum-cost flow, and finalized by small, local assignment problems, preserving accuracy while keeping per-step latency within a 1 s compute budget. On congested warehouse benchmarks from the League of Robot Runners (LRR) with up to 500 agents, our approach improves throughput by up to 10% over the 2024 winning scheduler while maintaining real-time execution. The results indicate that coupling graph-structured learned guidance with tractable solvers reduces congestion and yields a practical, scalable blueprint for high-throughput scheduling in large fleets.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

GRAND: Guidance, Rebalancing, and Assignment for Networked Dispatch in Multi-Agent Path Finding

Авторы:

Abstract

Ссылки и действия

Связанные статьи

LEARN: Learning End-to-End Aerial Resource-Constrained Multi-Robot Navigation

Debate2Create: Robot Co-design via Large Language Model Debates

Curriculum Imitation Learning of Distributed Multi-Robot Policies

Prompting Robot Teams with Natural Language

Curriculum Imitation Learning of Distributed Multi-Robot Policies

Навигация