Retrieval-Augmented Generation (RAG) is a technique that enhances LLM responses by first retrieving relevant information from external sources, then including that information in the prompt for the model to use when generating its response.
RAG solves key LLM limitations: knowledge cutoff dates, inability to access private data, and hallucination of facts. It grounds responses in actual documents.