Verification-Aware Planning for Multi-Agent Systems

2510.17109v1 cs.CL, cs.AI, cs.LG, cs.MA 2025-10-22
Авторы:

Tianyang Xu, Dan Zhang, Kushan Mitra, Estevam Hruschka

Abstract

Large language model (LLM) agents are increasingly deployed to tackle complex tasks, often necessitating collaboration among multiple specialized agents. However, multi-agent collaboration introduces new challenges in planning, coordination, and verification. Execution failures frequently arise not from flawed reasoning alone, but from subtle misalignments in task interpretation, output format, or inter-agent handoffs. To address these challenges, we present VeriMAP, a framework for multi-agent collaboration with verification-aware planning. The VeriMAP planner decomposes tasks, models subtask dependencies, and encodes planner-defined passing criteria as subtask verification functions (VFs) in Python and natural language. We evaluate VeriMAP on diverse datasets, demonstrating that it outperforms both single- and multi-agent baselines while enhancing system robustness and interpretability. Our analysis highlights how verification-aware planning enables reliable coordination and iterative refinement in multi-agent systems, without relying on external labels or annotations.

Ссылки и действия

Связанные статьи

Multi-Objective Reinforcement Learning for Large Language Model Optimization: Vi...

## Контекст Оптимизация больших языковых моделей (LLMs) представляет собой сложную задачу, включающую в себя несколько ц...

2025-09-30

Memp: Exploring Agent Procedural Memory

## Контекст Large Language Models (LLMs) становятся все более успешными в решении разнообразных задач, но их процедурна...

2025-08-12