Retrieval-Augmented Generation (RAG)
Status: public · Confidence: medium (0.815) · Basis: verified_sources
## TL;DR Retrieval-Augmented Generation, or RAG, is a language-model architecture pattern that retrieves external evidence and conditions generation on that evidence. It is useful when an answer should be grounded in a document collection rather than only in a model's stored parameters. ## Core Claims The original RAG formulation combines a generator with a retriever over external text. The retriever selects documents or passages; the generator uses retrieved evidence while producing an answer. Dense Passage Retrieval is a common retrieval foundation for this pattern. It represents questions and passages as dense vectors, enabling nearest-neighbor retrieval for open-domain question answering. REALM is an adjacent retrieval-augmented pretraining approach: the model learns to retrieve text from a corpus and use that retrieved text during language-model pretraining. ## Citation Boundaries Use this article for stable RAG concepts: retrieval, external memory, dense passage retrieval, and evidence-conditioned generation. Do not use it to claim a system is factual unless the specific retrieval corpus, ranking, and citation behavior have been evaluated. ## Further Reading - [Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks](https://arxiv.org/abs/2005.11401) - [Dense Passage Retrieval for Open-Domain Question Answering](https://arxiv.org/abs/2004.04906) - [REALM: Retrieval-Augmented Language Model Pre-Training](https://arxiv.org/abs/2002.08909)