Negative sampling for retriever training in advanced RAG (Retrieval-Augmented Generation) techniques involves selecting non-relevant documents, or "negatives," during model training so the retriever learns to distinguish relevant from irrelevant information. Exposure to challenging negative examples teaches the retriever to identify and rank truly relevant documents more accurately, improving retrieval accuracy and overall system performance in tasks like question answering or document search.
What is negative sampling for retriever training?
Negative sampling selects non-relevant documents to pair with a query during training so the model learns to distinguish relevant from non-relevant results, usually using contrastive or ranking losses.
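As a concrete illustration, here is a minimal PyTorch sketch of one training step on a (query, positive, negative) triple with a margin ranking objective. The TinyEncoder, the token dimensions, and the 0.2 margin are toy assumptions for the sketch, not a specific retriever's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyEncoder(nn.Module):
    """Toy text encoder: mean-pools an embedding table over token ids."""
    def __init__(self, vocab_size=1000, dim=64):
        super().__init__()
        self.emb = nn.Embedding(vocab_size, dim)

    def forward(self, token_ids):               # token_ids: (batch, seq_len)
        return self.emb(token_ids).mean(dim=1)  # (batch, dim)

encoder = TinyEncoder()

# One training example: a query, a relevant passage, and a sampled negative.
query    = torch.randint(0, 1000, (1, 16))
positive = torch.randint(0, 1000, (1, 32))
negative = torch.randint(0, 1000, (1, 32))   # non-relevant passage

q, p, n = encoder(query), encoder(positive), encoder(negative)

# Ranking objective: score(query, positive) should exceed score(query, negative).
pos_score = F.cosine_similarity(q, p)
neg_score = F.cosine_similarity(q, n)
loss = F.relu(0.2 + neg_score - pos_score).mean()  # margin of 0.2 is arbitrary
loss.backward()
```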
What are common negative sampling strategies in retriever training?
Strategies include in-batch negatives, random negatives, hard negatives (similar to the query but not relevant), and semi-hard negatives mined from weaker retrievers or BM25; many setups combine several sources.
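A rough sketch of combining negative sources for a single query follows; the corpus, the bm25_top_k helper, and the sample sizes are hypothetical placeholders standing in for a real lexical retriever and dataset.

```python
import random

corpus = {f"doc{i}": f"passage text {i}" for i in range(1000)}
positives_for_query = {"doc42"}

def bm25_top_k(query_text, k):
    # Placeholder for a lexical retriever (e.g., BM25) used to mine harder negatives.
    return [f"doc{i}" for i in range(k)]

def sample_negatives(query_text, n_random=4, n_bm25=4):
    # Random negatives: cheap, mostly easy examples.
    random_negs = random.sample(
        [d for d in corpus if d not in positives_for_query], n_random
    )
    # BM25-mined negatives: lexically similar to the query but not labeled relevant.
    bm25_negs = [
        d for d in bm25_top_k(query_text, k=20) if d not in positives_for_query
    ][:n_bm25]
    return random_negs + bm25_negs

print(sample_negatives("what is negative sampling?"))
```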
What is hard negative sampling and why is it useful?
Hard negatives are non-relevant documents that are highly similar to the query. They push the model to learn finer distinctions, improving ranking, but can cause training instability if not managed carefully.
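One common way to mine hard negatives is to score the corpus with the current dense retriever and keep top-ranked passages that are not labeled relevant. The sketch below uses random embeddings as stand-ins for real encoder outputs.

```python
import torch

torch.manual_seed(0)
passage_embs = torch.randn(500, 64)   # stand-in passage embeddings: (num_passages, dim)
query_emb    = torch.randn(64)        # stand-in query embedding
gold_ids     = {7, 123}               # labeled relevant passages

scores = passage_embs @ query_emb                    # dot-product similarity, shape (500,)
ranked = torch.argsort(scores, descending=True)      # most similar first

# Top-ranked but non-relevant passages are the hard negatives; keeping only a
# few and mixing them with easier negatives helps avoid training instability.
hard_negatives = [int(i) for i in ranked if int(i) not in gold_ids][:5]
print(hard_negatives)
```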
How does negative sampling relate to the loss function in retriever training?
Negatives are used with positives in contrastive or cross-entropy losses (e.g., InfoNCE, softmax). More informative negatives yield stronger gradients and better discrimination.
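Below is a minimal InfoNCE-style loss with in-batch negatives, assuming one positive passage per query; the batch size and the 0.05 temperature are illustrative choices, not prescribed values.

```python
import torch
import torch.nn.functional as F

batch, dim = 8, 64
q = F.normalize(torch.randn(batch, dim), dim=-1)   # query embeddings
p = F.normalize(torch.randn(batch, dim), dim=-1)   # their positive passages

sim = q @ p.T / 0.05             # (batch, batch) similarity matrix, temperature 0.05
labels = torch.arange(batch)     # diagonal entries are the true (query, positive) pairs

# Cross-entropy over each row: the positive competes against the other
# in-batch passages, which act as negatives.
loss = F.cross_entropy(sim, labels)
print(loss.item())
```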
What are common pitfalls with negative sampling and how can I mitigate them?
False negatives (relevant docs treated as negatives) and stale negatives can hurt performance. Use diverse sources, refresh negatives periodically, and balance hard negatives with easier ones.
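One mitigation for false negatives is to re-score mined candidates with a stronger model (for example a cross-encoder) and drop any that score close to the positive. The sketch below assumes a placeholder relevance_score function; the margin and the refresh advice in the final comment are illustrative.

```python
import random

def relevance_score(query: str, passage: str) -> float:
    # Placeholder returning a score in [0, 1]; replace with a real reranker or teacher model.
    return random.random()

def filter_false_negatives(query, positive, candidates, margin=0.1):
    # Keep a mined candidate only if it scores clearly below the positive;
    # candidates scoring near the positive may be unlabeled relevant passages.
    pos_score = relevance_score(query, positive)
    return [c for c in candidates if relevance_score(query, c) < pos_score - margin]

candidates = [f"candidate passage {i}" for i in range(10)]
kept = filter_false_negatives("example query", "gold passage", candidates)
print(f"kept {len(kept)} of {len(candidates)} mined negatives")
# To avoid stale negatives, re-run mining and filtering every few epochs so the
# negative pool tracks the current model.
```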