Question 1

What is majority vote in crowdsourcing?

Accepted Answer

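The simplest way to combine labels: for each item, assign the label that the most workers chose. It’s fast and needs no training, but it weights every worker equally regardless of how reliable each one is, and ties need an explicit tie-breaking rule.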
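As a rough illustration, majority vote can be sketched in a few lines of Python. The function name `majority_vote`, the input layout (a dict from item id to a list of worker labels), and the tie-breaking rule (smallest label in sort order) are all assumptions made for this example, not part of any particular library.

```python
from collections import Counter


def majority_vote(labels_by_item):
    """Return the most frequent label for each item.

    labels_by_item: dict mapping an item id to a list of worker labels.
    Ties are broken by taking the smallest label in sort order, which is
    an arbitrary but deterministic choice made for this sketch.
    """
    result = {}
    for item, labels in labels_by_item.items():
        counts = Counter(labels)
        top = max(counts.values())
        # Keep only the labels that reached the top count, then pick one.
        result[item] = min(label for label, c in counts.items() if c == top)
    return result


votes = {"q1": ["cat", "cat", "dog"], "q2": ["dog", "cat"]}
print(majority_vote(votes))
```

Here `q1` resolves to `cat` by a 2-to-1 vote, while `q2` is a tie that only the tie-breaking rule decides, which is one reason majority vote works best with several labels per item.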

Question 2

What is the Dawid-Skene model?

Accepted Answer

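A probabilistic model that treats each item’s true label as a latent variable and describes each worker’s labeling behavior with a confusion matrix: the probability that the worker reports one label when the true label is another. An EM algorithm jointly estimates the true-label posteriors, the class priors, and the per-worker confusion matrices. It often improves on majority vote when worker quality varies, because labels from reliable workers end up carrying more weight.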
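The EM loop can be sketched in plain Python as follows. This is a minimal illustration, not a reference implementation: the function name `dawid_skene`, the `(item, worker, label)` input format, the soft majority-vote initialization, and the `smoothing` constant are assumptions chosen for the example, and the products are not computed in log space, so it would underflow on items with very many labels.

```python
def dawid_skene(annotations, n_classes, n_iter=50, smoothing=0.01):
    """Sketch of Dawid-Skene EM.

    annotations: list of (item, worker, label), label in range(n_classes).
    Returns (post, conf): post[item] is a list of class probabilities and
    conf[worker][true][observed] is an estimated confusion probability.
    """
    items = sorted({i for i, _, _ in annotations})
    workers = sorted({w for _, w, _ in annotations})

    # Initialise posteriors from per-item vote proportions.
    post = {i: [0.0] * n_classes for i in items}
    for i, _, l in annotations:
        post[i][l] += 1.0
    for i in items:
        total = sum(post[i])
        post[i] = [p / total for p in post[i]]

    for _ in range(n_iter):
        # M-step: class priors and smoothed per-worker confusion matrices.
        prior = [sum(post[i][k] for i in items) / len(items)
                 for k in range(n_classes)]
        conf = {w: [[smoothing] * n_classes for _ in range(n_classes)]
                for w in workers}
        for i, w, l in annotations:
            for k in range(n_classes):
                conf[w][k][l] += post[i][k]
        for w in workers:
            for k in range(n_classes):
                row_sum = sum(conf[w][k])
                conf[w][k] = [v / row_sum for v in conf[w][k]]

        # E-step: posterior over each item's true label.
        like = {i: list(prior) for i in items}
        for i, w, l in annotations:
            for k in range(n_classes):
                like[i][k] *= conf[w][k][l]
        for i in items:
            total = sum(like[i])
            post[i] = [v / total for v in like[i]]
    return post, conf


# Workers A and B always agree; worker C is unrelated noise.
data = []
for item, (a, b, c) in enumerate([(0, 0, 1), (1, 1, 1), (0, 0, 0), (1, 1, 0)]):
    data += [(item, "A", a), (item, "B", b), (item, "C", c)]
post, conf = dawid_skene(data, n_classes=2)
print({i: max(range(2), key=lambda k: p[k]) for i, p in post.items()})
```

In this toy data the estimated confusion matrices for A and B become nearly diagonal while C's stays close to uniform, so C's labels have little influence on the final posteriors.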

Question 3

What is MACE in aggregation methods?

Accepted Answer

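MACE (Multi-Annotator Competence Estimation) is a probabilistic model that jointly infers the true labels and how trustworthy each annotator is. For every annotation, it assumes the annotator either reports the true label or “spams” by drawing a label from their own guessing distribution, unrelated to the item. Parameters are typically fit with EM or a variational Bayesian variant, yielding a most likely label per item and a competence estimate per annotator.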
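To make the copy-or-guess idea concrete, the sketch below computes the posterior over one item's true label under a MACE-style model when the annotator parameters are already known. It does not fit those parameters. The function name `mace_posterior`, the names `theta` (probability of reporting the true label) and `xi` (the annotator's guessing distribution), and the uniform prior over true labels are assumptions made for this illustration.

```python
def mace_posterior(item_annotations, theta, xi, n_classes):
    """Posterior over one item's true label under a MACE-style model.

    item_annotations: list of (annotator, label) pairs for a single item.
    theta[annotator]: probability the annotator reports the true label.
    xi[annotator]: label probabilities used when the annotator is guessing.
    A uniform prior over true labels is assumed.
    """
    scores = [1.0 / n_classes] * n_classes
    for annotator, label in item_annotations:
        for t in range(n_classes):
            # Either the annotator copied the true label t ...
            copy_term = theta[annotator] if label == t else 0.0
            # ... or guessed and happened to produce this label.
            guess_term = (1.0 - theta[annotator]) * xi[annotator][label]
            scores[t] *= copy_term + guess_term
    total = sum(scores)
    return [s / total for s in scores]


theta = {"careful": 0.9, "spammer": 0.0}
xi = {"careful": [0.5, 0.5], "spammer": [0.0, 1.0]}  # spammer always says 1
print(mace_posterior([("careful", 0), ("spammer", 1)], theta, xi, n_classes=2))
```

Because the hypothetical spammer's label is equally likely under either true label, it cancels out of the posterior, and the result is driven entirely by the careful annotator.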

Question 4

When should I use each method?

Accepted Answer

Use majority vote for simplicity and speed when you have many labels per item and reasonably reliable workers. Use Dawid-Skene when worker quality varies, you want a principled estimate of true labels, and each worker has labeled enough items to estimate a confusion matrix. Use MACE when you suspect some annotators are guessing or spamming and you want a per-annotator competence estimate that can be used to down-weight or filter them.

Aggregation Methods: Majority Vote, Dawid-Skene, MACE
