Question 1

What is Retrieval-Augmented Generation (RAG)?

Accepted Answer

RAG is a framework that combines a retriever to fetch relevant documents with a generator to produce an answer. The retrieved passages ground the response, improving accuracy and reducing hallucinations.

Question 2

How does RAG incorporate retrieved context into generation?

Accepted Answer

The generator conditions on the retrieved passages—often by appending them to the prompt or using them in attention mechanisms—so the produced text cites and relies on real evidence.

Question 3

What are the main components of a RAG system?

Accepted Answer

A retriever (dense embeddings or sparse methods like BM25), a generator/reader to craft the answer from the retrieved text, and optional fusion or reranking steps to select the best evidence and present a coherent response.

Question 4

What are dense and sparse retrieval in RAG, and how do they differ?

Accepted Answer

Dense retrieval uses learned vector embeddings to find semantically similar passages, while sparse retrieval relies on keyword-based indexes (e.g., BM25). Dense captures meaning beyond exact terms; sparse emphasizes exact term matches.

Question 5

When is RAG particularly beneficial?

Accepted Answer

RAG shines when up-to-date or domain-specific knowledge is needed, or when factual grounding is important. It improves accuracy, supports citations, and handles questions that go beyond the model’s training data.

Context Retrieval Basics (RAG)

Context Retrieval Basics (RAG)

💡 Key Takeaways

❓ Frequently Asked Questions