Question 1

What is Retrieval-Augmented Generation (RAG)?

Accepted Answer

A framework that pairs a retriever, which fetches relevant documents from a knowledge base, with a generator that conditions on those documents so its answers are grounded in retrieved evidence rather than only in model parameters.
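The retrieve-then-generate flow can be sketched in a few lines. This is a toy illustration, not a production design: the word-overlap scoring, the helper names (`tokenize`, `retrieve`, `build_prompt`), and the prompt wording are all assumptions made for the example, and the final call to a language model is left out.

```python
from collections import Counter


def tokenize(text: str) -> list[str]:
    # Lowercase and strip basic punctuation; a real system would use a proper tokenizer.
    return [w.strip(".,?!").lower() for w in text.split()]


def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Toy retriever: rank documents by how many query tokens they share."""
    q = Counter(tokenize(query))
    scored = []
    for doc in docs:
        d = Counter(tokenize(doc))
        overlap = sum(min(q[t], d[t]) for t in q)
        scored.append((overlap, doc))
    # Stable sort, so ties keep their original order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:k]]


def build_prompt(query: str, passages: list[str]) -> str:
    """Assemble the text a generator would condition on (the model call itself is omitted)."""
    context = "\n".join(f"[{i + 1}] {p}" for i, p in enumerate(passages))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    )


docs = [
    "Paris is the capital of France.",
    "The Nile is a river in Africa.",
    "France borders Spain.",
]
question = "What is the capital of France?"
prompt = build_prompt(question, retrieve(question, docs, k=1))
```

The resulting `prompt` would then be passed to whatever generator the system uses.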

Question 2

What are the main components of a RAG architecture?

Accepted Answer

Retriever (finds relevant passages), Generator/Reader (produces the answer using retrieved docs), and a knowledge source (document store). Optional: fusion or ranking modules to combine evidence.
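As one possible shape for the optional fusion module, the sketch below merges ranked lists from two retrievers with reciprocal rank fusion (RRF). RRF is a swapped-in example of a fusion technique, not something the answer above prescribes; the function name, the document IDs, and the constant `k = 60` (a commonly used default) are assumptions for illustration.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Merge several ranked lists of document IDs into one fused ranking."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes 1 / (k + rank); higher ranks contribute more.
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=lambda d: scores[d], reverse=True)


sparse_ranking = ["d1", "d2", "d3"]  # hypothetical output of a sparse retriever
dense_ranking = ["d3", "d1", "d4"]   # hypothetical output of a dense retriever
fused = reciprocal_rank_fusion([sparse_ranking, dense_ranking])
```

Documents that appear near the top of both lists (here `d1` and `d3`) end up ahead of documents found by only one retriever.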

Question 3

What are RAG-Token and RAG-Sequence variants?

Accepted Answer

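RAG-Sequence conditions the whole answer on one retrieved document at a time and marginalizes over the top-k documents at the sequence level. RAG-Token marginalizes over the retrieved documents at every generation step, so each token can draw on a different document. Both ground generation in evidence but differ in where that evidence is combined.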
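In the original RAG formulation (Lewis et al., 2020), the difference comes down to where the sum over retrieved documents sits. Here x is the input, y the output of length N, z a retrieved document, p_η the retriever, and p_θ the generator:

```latex
% RAG-Sequence: the same document z is used for the whole output,
% and the sum over the top-k documents is taken once per sequence.
\[
p_{\text{RAG-Sequence}}(y \mid x) \approx
  \sum_{z \in \operatorname{top-}k(p_\eta(\cdot \mid x))}
  p_\eta(z \mid x) \prod_{i=1}^{N} p_\theta(y_i \mid x, z, y_{1:i-1})
\]

% RAG-Token: the sum over the top-k documents is taken at every step,
% so different tokens can rely on different documents.
\[
p_{\text{RAG-Token}}(y \mid x) \approx
  \prod_{i=1}^{N}
  \sum_{z \in \operatorname{top-}k(p_\eta(\cdot \mid x))}
  p_\eta(z \mid x)\, p_\theta(y_i \mid x, z, y_{1:i-1})
\]
```

The only change between the two is the order of the product over tokens and the sum over documents.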

Question 4

How does retrieval improve accuracy and reduce hallucinations?

Accepted Answer

Retrieval grounds answers in actual documents, which reduces unsupported claims. It does not eliminate them: effectiveness depends on the quality of the knowledge base and the retriever's accuracy.

Question 5

What retrieval methods are common in RAG systems?

Accepted Answer

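Dense retrieval with learned embeddings (e.g., DPR) searched through a vector index (e.g., FAISS), sparse retrieval such as BM25, or hybrids of the two. The retriever can also be trained jointly with the generator; in the original RAG setup only the query encoder is fine-tuned, while the document encoder and index stay fixed.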
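To make the sparse option concrete, here is a minimal BM25 scorer in pure Python. It is a sketch under stated assumptions: documents are already tokenized, the IDF uses the "+1 inside the log" variant that keeps scores non-negative, and `k1 = 1.5`, `b = 0.75` are typical defaults rather than tuned values. The function name `bm25_scores` is hypothetical and does not mirror any particular library's API.

```python
import math
from collections import Counter


def bm25_scores(
    query: list[str], docs: list[list[str]], k1: float = 1.5, b: float = 0.75
) -> list[float]:
    """Score each tokenized document against the query with Okapi BM25."""
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log((n_docs - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            # Term-frequency saturation (k1) and document-length normalization (b).
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


corpus = [
    ["dense", "retrieval", "uses", "embeddings"],
    ["bm25", "is", "sparse", "retrieval"],
    ["cats", "sleep"],
]
scores = bm25_scores(["sparse", "retrieval"], corpus)
```

A dense retriever would replace the term-matching score with a similarity between learned query and document embeddings, and a hybrid would combine the two rankings.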

RAG Fundamentals: Architecture & Core Concepts
