Active Learning Loops for Hard Negative Mining in Retrieval-Augmented Generation (RAG) refer to an iterative process in which the system identifies hard negatives, that is, retrieved passages or responses that look relevant or correct but are not, during training. These hard negatives are then used to retrain and improve the model's retrieval and generation capabilities. By repeatedly focusing on such difficult examples, the system improves its accuracy and robustness, yielding more relevant and precise retrieval and generation in RAG frameworks.
What is active learning?
Active learning is a training approach where the model requests labels for the most informative unlabeled examples from an oracle (e.g., a human annotator). This helps achieve higher accuracy with fewer labeled instances.
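As a concrete illustration, one common way to decide which unlabeled examples are "most informative" is to rank them by the entropy of the model's predicted class probabilities. The sketch below assumes a hypothetical model has already produced probability distributions for each unlabeled example; the example indices and values are made up for illustration.

```python
import math

def entropy(probs):
    """Shannon entropy of a predicted class distribution (higher = less certain)."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def select_most_informative(predictions, k=2):
    """Return indices of the k unlabeled examples the model is least sure about.

    `predictions` maps example index -> predicted class probabilities
    (hypothetical model output; any probabilistic classifier could supply these).
    """
    ranked = sorted(predictions, key=lambda i: entropy(predictions[i]), reverse=True)
    return ranked[:k]

# Hypothetical model outputs for four unlabeled examples.
preds = {
    0: [0.95, 0.05],   # confident -> low entropy
    1: [0.55, 0.45],   # uncertain -> high entropy
    2: [0.50, 0.50],   # maximally uncertain
    3: [0.90, 0.10],
}
print(select_most_informative(preds, k=2))  # → [2, 1]
```

The two near-50/50 examples are selected, which matches the intuition that labeling them teaches the model more than labeling examples it already classifies confidently.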
What is hard negative mining?
Hard negatives are negative examples that the model misclassifies or finds confusing (close to the decision boundary). Mining them focuses training on difficult cases to improve precision and reduce false positives.
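In a retrieval setting, a simple proxy for "close to the decision boundary" is embedding similarity: among passages known to be irrelevant, the ones most similar to the query are the hardest. The sketch below uses toy two-dimensional embeddings and a dot-product score; a real system would use vectors from a trained encoder.

```python
def dot(a, b):
    """Dot-product similarity between two embedding vectors."""
    return sum(x * y for x, y in zip(a, b))

def mine_hard_negatives(query_vec, negatives, k=1):
    """Rank known-negative passages by similarity to the query.

    The most similar negatives are the 'hard' ones the retriever is most
    likely to confuse with true positives. `negatives` is a list of
    (passage_id, embedding) pairs -- toy values here for illustration.
    """
    ranked = sorted(negatives, key=lambda item: dot(query_vec, item[1]), reverse=True)
    return [pid for pid, _ in ranked[:k]]

query = [1.0, 0.0]
negatives = [
    ("easy_neg", [-0.9, 0.1]),   # clearly unrelated to the query
    ("hard_neg", [0.8, 0.2]),    # looks similar to the query -> hard
]
print(mine_hard_negatives(query, negatives))  # → ['hard_neg']
```

Training the retriever to push `hard_neg` away from the query, rather than the trivially dissimilar `easy_neg`, is what sharpens the decision boundary.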
How do active learning loops work with hard negatives?
Initialize with a labeled set, train the model, select informative or hard negative samples from the unlabeled pool, label them, add to training data, retrain, and repeat until performance converges.
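The loop above can be sketched end to end with a deliberately tiny 1-D classifier. Everything here is an assumption for illustration: `ThresholdModel`, the margin-based selector, and the `oracle` callable standing in for a human annotator.

```python
class ThresholdModel:
    """Toy 1-D classifier: predicts positive if x >= the midpoint between classes."""
    def fit(self, labeled):
        pos = [x for x, y in labeled if y == 1]
        neg = [x for x, y in labeled if y == 0]
        self.threshold = (max(neg) + min(pos)) / 2
    def predict(self, x):
        return int(x >= self.threshold)

def by_margin(model, pool):
    """Rank unlabeled points by distance to the decision boundary (closest first)."""
    return sorted(pool, key=lambda x: abs(x - model.threshold))

def active_learning_loop(model, labeled, unlabeled, oracle, rounds=3, batch=2):
    """Train, select hard examples, query the oracle, retrain -- repeated."""
    for _ in range(rounds):
        model.fit(labeled)
        if not unlabeled:
            break
        for x in by_margin(model, unlabeled)[:batch]:
            unlabeled.remove(x)
            labeled.append((x, oracle(x)))   # label request to the oracle
    model.fit(labeled)
    return model

oracle = lambda x: int(x >= 5.0)             # ground truth the annotator knows
labeled = [(1.0, 0), (9.0, 1)]               # small initial labeled set
pool = [3.0, 4.8, 5.2, 8.0]                  # unlabeled pool
model = active_learning_loop(ThresholdModel(), labeled, pool, oracle)
print(model.predict(4.9), model.predict(5.1))  # → 0 1
```

Note how the selector spends its labeling budget on 4.8 and 5.2 first: they sit nearest the current boundary, so labeling them tightens the threshold fastest.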
What are common strategies to pick hard negatives?
Uncertainty sampling (highest entropy or smallest margin), margin-based selection, diversity-aware sampling, and methods that explicitly mine negatives near the boundary.
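Two of these strategies can be sketched in a few lines: margin-based selection (pick examples where the top two predicted probabilities are nearly tied) and diversity-aware sampling (avoid selecting a batch of near-duplicates). The greedy farthest-point heuristic and the 1-D feature values below are illustrative assumptions, not a specific library's API.

```python
def margin(probs):
    """Gap between the top two predicted probabilities; small gap = uncertain."""
    top = sorted(probs, reverse=True)
    return top[0] - top[1]

def smallest_margin(predictions, k=2):
    """Margin-based selection: examples whose top-two gap is smallest."""
    return sorted(predictions, key=lambda i: margin(predictions[i]))[:k]

def diverse_subset(candidates, features, k=2):
    """Greedy farthest-point selection: start from the first candidate, then
    repeatedly add the candidate farthest from everything chosen so far."""
    chosen = [candidates[0]]
    while len(chosen) < k:
        best = max((c for c in candidates if c not in chosen),
                   key=lambda c: min(abs(features[c] - features[s]) for s in chosen))
        chosen.append(best)
    return chosen

preds = {0: [0.90, 0.10], 1: [0.52, 0.48], 2: [0.60, 0.40]}
features = {0: 0.0, 1: 5.0, 2: 5.1}          # toy 1-D feature values
uncertain = smallest_margin(preds, k=2)       # → [1, 2] (nearly tied examples)
print(diverse_subset([1, 2, 0], features))    # → [1, 0] (skips 2, too close to 1)
```

Combining the two, first shortlisting by uncertainty and then enforcing diversity within the shortlist, is a common way to avoid labeling several redundant borderline examples.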
What should you watch out for when using this approach?
Labeling cost and quality, class imbalance, potential overfitting to hard negatives, and computational overhead from repeated retraining.