Label Noise Modeling and Rater Bias Correction in LLM evaluations refers to techniques that address inaccuracies and inconsistencies in human-annotated data used to assess large language models. Label noise arises when raters make mistakes or interpret evaluation criteria differently, leading to unreliable labels. Modeling this noise and correcting for individual rater biases help ensure that evaluation metrics more accurately reflect true model performance, resulting in fairer and more robust assessments of LLM capabilities.
What is label noise in machine learning data?
Label noise occurs when the labels in a dataset are incorrect or unreliable, often due to human error, ambiguous annotation guidelines, or automated labeling.
What is rater bias in annotation?
Rater bias is a systematic tendency of annotators to favor certain labels or interpretations, leading to consistent mislabeling across samples.
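One simple way to surface such bias is to compare each rater against a majority-vote consensus. The sketch below uses hypothetical annotations in which rater "C" systematically favors the "bad" label; the per-rater disagreement rate with the consensus exposes the bias.

```python
from collections import Counter

# Hypothetical annotations: 3 raters label 4 items as "good"/"bad".
# Rater C systematically favors "bad" (a harshness bias).
annotations = {
    "A": ["good", "bad", "good", "good"],
    "B": ["good", "bad", "good", "good"],
    "C": ["bad",  "bad", "bad",  "good"],
}
n_items = 4

# Majority-vote consensus per item.
consensus = []
for i in range(n_items):
    votes = Counter(labels[i] for labels in annotations.values())
    consensus.append(votes.most_common(1)[0][0])

# Per-rater disagreement with the consensus reveals systematic bias.
bias = {}
for rater, labels in annotations.items():
    bias[rater] = sum(l != c for l, c in zip(labels, consensus)) / n_items
```

Here rater C disagrees with the consensus on half the items while A and B never do, flagging C's labels for closer review or down-weighting.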
What does label noise modeling involve?
It models how true labels become observed labels, typically via a noise transition matrix giving the probability of each observed label conditioned on each true label, together with algorithms that estimate the likely true labels from the noisy ones.
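As a minimal illustration of the transition-matrix idea, the sketch below assumes a known two-class matrix T (rows = true label, columns = observed label) and a class prior, both hypothetical, and inverts the noise with Bayes' rule to get a posterior over the true label.

```python
# Hypothetical 2-class noise transition matrix T[true][observed]:
# T[k][j] = P(observed label j | true label k).
T = [
    [0.9, 0.1],  # true class 0 is observed as 0 with prob 0.9
    [0.2, 0.8],  # true class 1 is observed as 1 with prob 0.8
]
prior = [0.5, 0.5]  # assumed class prior

def posterior_true(observed):
    """P(true = k | observed) via Bayes' rule under the noise model."""
    unnorm = [prior[k] * T[k][observed] for k in range(2)]
    z = sum(unnorm)
    return [u / z for u in unnorm]

p = posterior_true(1)  # posterior over true classes given observed label 1
```

In practice T is unknown and must itself be estimated, e.g. from a small set of expert-verified labels or jointly with the true labels via EM.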
How can we mitigate label noise and rater bias in practice?
Collect multiple annotations per item, use consensus or probabilistic models (e.g., Dawid-Skene or EM) to infer true labels, and train with noise-robust loss functions or perform data cleaning.
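The consensus step above can be sketched with a minimal Dawid-Skene-style EM for binary labels, run on hypothetical data. It alternates between estimating per-rater reliability (M-step) and re-weighting each item's label posterior by how reliable its raters are (E-step).

```python
# Minimal Dawid-Skene-style EM sketch (binary labels, hypothetical data).
# labels[i][r] = label that rater r gave to item i.
labels = [
    [1, 1, 0],
    [1, 1, 1],
    [0, 0, 0],
    [0, 1, 0],
    [1, 1, 1],
]
n_items, n_raters = len(labels), len(labels[0])

# Initialize item posteriors P(true = 1) with the per-item vote fraction.
post = [sum(row) / n_raters for row in labels]

for _ in range(20):  # EM iterations
    # M-step: class prior and per-rater reliability parameters.
    prior1 = sum(post) / n_items
    # acc[r][c] = estimated P(rater r reports 1 | true class c)
    acc = []
    for r in range(n_raters):
        num1 = sum(post[i] * labels[i][r] for i in range(n_items))
        num0 = sum((1 - post[i]) * labels[i][r] for i in range(n_items))
        acc.append([num0 / (n_items - sum(post)), num1 / sum(post)])
    # E-step: recompute each item's posterior P(true = 1 | votes).
    for i in range(n_items):
        like1, like0 = prior1, 1 - prior1
        for r in range(n_raters):
            if labels[i][r] == 1:
                like1 *= acc[r][1]
                like0 *= acc[r][0]
            else:
                like1 *= 1 - acc[r][1]
                like0 *= 1 - acc[r][0]
        post[i] = like1 / (like1 + like0)

inferred = [int(p > 0.5) for p in post]
```

Unlike plain majority vote, this weighting lets a consistently accurate rater outvote two noisy ones; production implementations add smoothing and convergence checks omitted here for brevity.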