Stop Wasting Your Tokens: Towards Efficient Runtime Multi-Agent Systems

2510.26585v1 cs.MA, cs.AI 2025-11-01

Авторы:

Fulin Lin, Shaowen Chen, Ruishan Fang, Hongwei Wang, Tao Lin

Abstract

While Multi-Agent Systems (MAS) excel at complex tasks, their growing autonomy with operational complexity often leads to critical inefficiencies, such as excessive token consumption and failures arising from misinformation. Existing methods primarily focus on post-hoc failure attribution, lacking proactive, real-time interventions to enhance robustness and efficiency. To this end, we introduce SupervisorAgent, a lightweight and modular framework for runtime, adaptive supervision that operates without altering the base agent's architecture. Triggered by an LLM-free adaptive filter, SupervisorAgent intervenes at critical junctures to proactively correct errors, guide inefficient behaviors, and purify observations. On the challenging GAIA benchmark, SupervisorAgent reduces the token consumption of the Smolagent framework by an average of 29.45% without compromising its success rate. Extensive experiments across five additional benchmarks (math reasoning, code generation, and question answering) and various SoTA foundation models validate the broad applicability and robustness of our approach. The code is available at https://github.com/LINs-lab/SupervisorAgent.

Ссылки и действия

Читать на arXiv Скачать PDF

Дополнительные ресурсы:

Stop Wasting Your Tokens: Towards Efficient Runtime Multi-Agent Systems

Авторы:

Abstract

Ссылки и действия

Связанные статьи

Strategic Self-Improvement for Competitive Agents in AI Labour Markets

AsymPuzl: An Asymmetric Puzzle for multi-agent cooperation

EZYer: A simulacrum of high school with generative agent

Beyond Single-Agent Safety: A Taxonomy of Risks in LLM-to-LLM Interactions

AgentODRL: A Large Language Model-based Multi-agent System for ODRL Generation

Навигация