Question 1

What is sparse retrieval?

Accepted Answer

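Sparse retrieval represents queries and documents lexically (e.g., BM25, TF-IDF) and uses an inverted index to match query terms against document terms. It's fast, scalable, and interpretable, since you can see exactly which terms produced a match. Its main weakness is vocabulary mismatch: a document that expresses the same idea in different words (say, "automobile" for a query about "car") will not be matched.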
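To make the inverted-index idea concrete, here is a minimal sketch of BM25 scoring in plain Python. The function names (`build_index`, `bm25_search`), the whitespace tokenizer, and the parameter defaults `k1=1.5` and `b=0.75` are assumptions chosen for illustration; production systems use more careful tokenization and a tuned search engine.

```python
import math
from collections import Counter, defaultdict


def build_index(docs):
    """Build an inverted index: term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    lengths = []
    for doc_id, text in enumerate(docs):
        tokens = text.lower().split()  # naive whitespace tokenizer (assumption)
        lengths.append(len(tokens))
        for term, tf in Counter(tokens).items():
            index[term][doc_id] = tf
    return index, lengths


def bm25_search(query, index, lengths, k1=1.5, b=0.75):
    """Score only the documents that share at least one term with the query."""
    n_docs = len(lengths)
    avg_len = sum(lengths) / n_docs
    scores = defaultdict(float)
    for term in query.lower().split():
        postings = index.get(term, {})
        if not postings:
            continue  # term absent from the collection: contributes nothing
        df = len(postings)
        idf = math.log(1 + (n_docs - df + 0.5) / (df + 0.5))
        for doc_id, tf in postings.items():
            # Length normalization: longer documents are penalized.
            norm = tf + k1 * (1 - b + b * lengths[doc_id] / avg_len)
            scores[doc_id] += idf * tf * (k1 + 1) / norm
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


DOCS = [
    "the cat sat on the mat",
    "dogs chase the cat",
    "automobile repair manual",
]
INDEX, LENGTHS = build_index(DOCS)

cat_hits = bm25_search("cat", INDEX, LENGTHS)  # matches documents 0 and 1
car_hits = bm25_search("car", INDEX, LENGTHS)  # no lexical match at all
```

The query "car" returns nothing even though the third document is about automobiles, which is the vocabulary-mismatch limitation described above.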

Question 2

What is dense retrieval?

Accepted Answer

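Dense retrieval uses neural embedding models to represent queries and documents as dense vectors; relevance is measured with cosine similarity or the dot product between those vectors. It captures semantic relationships such as synonyms and paraphrases, but requires a vector index and more compute than sparse retrieval, both to embed the text and to search the vectors.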
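As a rough illustration of vector scoring, the sketch below ranks documents by cosine similarity. The three-dimensional vectors are hand-made, hypothetical stand-ins for real embeddings (which would come from a trained model and have hundreds of dimensions), and the brute-force scan stands in for a real vector index.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def dense_search(query_vec, doc_vecs, top_k=2):
    """Brute-force nearest-neighbor search over all document vectors."""
    scored = [(doc_id, cosine(query_vec, vec)) for doc_id, vec in doc_vecs.items()]
    return sorted(scored, key=lambda kv: kv[1], reverse=True)[:top_k]


# Hypothetical embeddings: made-up numbers, not output of any real model.
DOC_VECS = {
    "automobile repair manual": [0.9, 0.1, 0.0],
    "cat care guide": [0.0, 0.2, 0.9],
    "engine maintenance tips": [0.8, 0.3, 0.1],
}
# Stand-in for the embedding of the query "car repair".
QUERY_VEC = [1.0, 0.2, 0.0]

dense_hits = dense_search(QUERY_VEC, DOC_VECS)
```

Here the "automobile" document ranks first although it shares no word with "car repair", because the (assumed) vectors place the two close together. At scale, the exhaustive scan is usually replaced by an approximate nearest-neighbor index.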

Question 3

What is hybrid retrieval?

Accepted Answer

Hybrid retrieval combines sparse and dense signals to select and rank results, typically by running both retrievers and merging their scores or ranked lists. It leverages both exact keyword matching and semantic similarity to improve recall and precision.
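One common way to combine the two signals is reciprocal rank fusion (RRF), which merges ranked lists without needing the sparse and dense scores to be on the same scale. The answer above does not prescribe a fusion method, so treat this as one possible sketch; the document IDs are hypothetical, and `k=60` is a conventional default rather than a tuned value.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse ranked lists: each list adds 1 / (k + rank) to a document's score."""
    fused = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            fused[doc_id] += 1.0 / (k + rank)
    # Highest fused score first.
    return sorted(fused, key=fused.get, reverse=True)


# Hypothetical outputs of a sparse and a dense retriever for the same query.
sparse_ranking = ["d1", "d2", "d3"]
dense_ranking = ["d3", "d1", "d4"]

fused_ranking = reciprocal_rank_fusion([sparse_ranking, dense_ranking])
```

Documents that appear in both lists ("d1" and "d3") rise above those found by only one retriever, while "d2" and "d4" are still kept, which is how fusion can help recall as well as precision.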

Question 4

When should I use sparse, dense, or hybrid retrieval?

Accepted Answer

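Use sparse retrieval when you need fast, keyword-driven results, for example exact matches on names, codes, or identifiers. Use dense retrieval when queries and documents may express the same idea in different words. Use hybrid retrieval when you want the benefits of both, with the understanding that running two retrievers and a fusion step adds complexity and resource cost.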

Sparse vs Dense vs Hybrid Retrieval

