When Retrieval Succeeds and Fails: Rethinking Retrieval-Augmented Generation for LLMs

2510.09106v1 cs.CL, 68T50, I.2.7 2025-10-14
Авторы:

Yongjie Wang, Yue Yu, Kaisong Song, Jun Lin, Zhiqi Shen

Abstract

Large Language Models (LLMs) have enabled a wide range of applications through their powerful capabilities in language understanding and generation. However, as LLMs are trained on static corpora, they face difficulties in addressing rapidly evolving information or domain-specific queries. Retrieval-Augmented Generation (RAG) was developed to overcome this limitation by integrating LLMs with external retrieval mechanisms, allowing them to access up-to-date and contextually relevant knowledge. However, as LLMs themselves continue to advance in scale and capability, the relative advantages of traditional RAG frameworks have become less pronounced and necessary. Here, we present a comprehensive review of RAG, beginning with its overarching objectives and core components. We then analyze the key challenges within RAG, highlighting critical weakness that may limit its effectiveness. Finally, we showcase applications where LLMs alone perform inadequately, but where RAG, when combined with LLMs, can substantially enhance their effectiveness. We hope this work will encourage researchers to reconsider the role of RAG and inspire the development of next-generation RAG systems.

Ссылки и действия

Связанные статьи

Prior-based Noisy Text Data Filtering: Fast and Strong Alternative For Perplexit...

------------------------------------------------------ ## Контекст -----------------------------------------------------...

2025-09-25

Charting a Decade of Computational Linguistics in Italy: The CLiC-it Corpus

## Контекст Область исследования, известная как Computational Linguistics (CL) или языковой моделирование, занимается ра...

2025-09-25

Quantifying Self-Awareness of Knowledge in Large Language Models

## Контекст Современные большие языковые модели (LLMs) представляют собой мощные инструменты, способные выполнять широки...

2025-09-23

Trusted Uncertainty in Large Language Models: A Unified Framework for Confidence...

## Контекст В современном мире развитие интеллектуальных технологий приводит к появлению моделей языка, которые становя...

2025-09-05

Testing the assumptions about the geometry of sentence embedding spaces: the cos...

## Контекст Основной контекст данного исследования заключается в оценке предположений о геометрии пространств слов и пре...

2025-09-05