Retrieval-augmented generation

Retrieval augmented generation (RAG) is a type of generative artificial intelligence that has information retrieval capabilities. It modifies interactions with a large language model (LLM) so that the model responds to user queries with reference to a specified set of documents, using this information in preference to information drawn from its own vast, static training data. This allows LLMs to use domain-specific and/or updated information.^[1] Use cases include providing chatbot access to internal company data, or giving factual information only from an authoritative source.^[2]

^ Cite error: The named reference Survey was invoked but never defined (see the help page).
^ Cite error: The named reference AWS was invoked but never defined (see the help page).

[Survey-1] Cite error: The named reference Survey was invoked but never defined (see the help page).

[AWS-2] Cite error: The named reference AWS was invoked but never defined (see the help page).

[1]

[2]