Beyond Majority Voting: LLM Aggregation by Leveraging Higher-Order Information

2510.01499v1 cs.LG, cs.AI, cs.GT 2025-10-04
Авторы:

Rui Ai, Yuqi Pan, David Simchi-Levi, Milind Tambe, Haifeng Xu

Abstract

With the rapid progress of multi-agent large language model (LLM) reasoning, how to effectively aggregate answers from multiple LLMs has emerged as a fundamental challenge. Standard majority voting treats all answers equally, failing to consider latent heterogeneity and correlation across models. In this work, we design two new aggregation algorithms called Optimal Weight (OW) and Inverse Surprising Popularity (ISP), leveraging both first-order and second-order information. Our theoretical analysis shows these methods provably mitigate inherent limitations of majority voting under mild assumptions, leading to more reliable collective decisions. We empirically validate our algorithms on synthetic datasets, popular LLM fine-tuning benchmarks such as UltraFeedback and MMLU, and a real-world healthcare setting ARMMAN. Across all cases, our methods consistently outperform majority voting, offering both practical performance gains and conceptual insights for the design of robust multi-agent LLM pipelines.

Ссылки и действия

Связанные статьи

SpinGPT: A Large-Language-Model Approach to Playing Poker Correctly

## Контекст Область исследования — искусственный интеллект (ИИ) в играх, специально в покере. Игры, которым необходима с...

2025-09-30

From Leiden to Pleasure Island: The Constant Potts Model for Community Detection...

#### Контекст Community detection является одной из основных задач в области data science, состоящей в разбиении узлов г...

2025-09-06

Meta-Inverse Reinforcement Learning for Mean Field Games via Probabilistic Conte...

## Контекст Инверсное обучение наград (IRL) в играх с многими агентами (mean field games, MFGs) является важной задачей ...

2025-09-06