A random forest is an ensemble machine learning technique used for classification and regression tasks. It works by building many decision trees during training and combining their outputs to improve accuracy and reduce overfitting. Each tree is trained on a random bootstrap sample of the data, and each split considers only a random subset of the features, which introduces diversity and robustness. The final prediction is made by averaging the trees' results (for regression) or taking a majority vote (for classification).
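A minimal sketch of training and using a forest, assuming scikit-learn; the dataset, parameter values, and variable names here are illustrative, not part of the notes above:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Toy binary-classification data: 500 samples, 10 features.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# 100 trees, each fit on a bootstrap sample with per-split feature subsampling.
clf = RandomForestClassifier(n_estimators=100, random_state=0)
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```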
What is a random forest?
An ensemble model that builds many decision trees on bootstrap samples with random feature selection, and combines their outputs to improve accuracy.
How are predictions made in a random forest?
For classification, the forest outputs the class by majority vote; for regression, it returns the average of all trees' predictions.
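A small check of the aggregation rule, assuming scikit-learn; it averages each tree's output by hand and compares the result against the forest's own prediction (the data is synthetic and illustrative):

```python
import numpy as np
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor

X, y = make_regression(n_samples=200, n_features=5, random_state=0)
reg = RandomForestRegressor(n_estimators=50, random_state=0).fit(X, y)

# Average the individual trees' predictions manually...
per_tree = np.stack([tree.predict(X) for tree in reg.estimators_])
manual_mean = per_tree.mean(axis=0)

# ...and confirm it matches the forest's regression output.
assert np.allclose(manual_mean, reg.predict(X))
```

One implementation detail worth noting: scikit-learn's RandomForestClassifier actually averages the trees' predicted class probabilities (soft voting) rather than counting hard votes, though the two usually agree.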
What are bagging and feature randomness in random forests?
Bagging (bootstrap aggregating) trains each tree on a bootstrap sample of the data; at each split, a random subset of features is considered, increasing diversity and reducing overfitting.
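A from-scratch sketch of bagging plus feature randomness, assuming scikit-learn decision trees as the base learners; the tree count and the max_features="sqrt" choice are illustrative:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=10, random_state=0)
rng = np.random.default_rng(0)

trees = []
for i in range(25):
    # Bagging: draw a bootstrap sample (n rows with replacement).
    idx = rng.integers(0, len(X), size=len(X))
    # Feature randomness: each split considers only sqrt(n_features) features.
    tree = DecisionTreeClassifier(max_features="sqrt", random_state=i)
    trees.append(tree.fit(X[idx], y[idx]))

# Aggregate by majority vote (25 trees, so no ties on binary labels).
votes = np.stack([t.predict(X) for t in trees])
pred = (votes.mean(axis=0) > 0.5).astype(int)
print("training accuracy:", (pred == y).mean())
```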
Which hyperparameters matter in a random forest and what do they do?
Key settings include n_estimators (the number of trees), max_features (how many features are considered at each split), and max_depth or min_samples_leaf to limit tree growth; out-of-bag error can also estimate generalization without a separate validation set.
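A configuration sketch showing those knobs together, assuming scikit-learn; the specific values are illustrative starting points to tune, not recommendations (oob_score=True relies on bootstrap sampling, which is on by default):

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

clf = RandomForestClassifier(
    n_estimators=200,      # number of trees in the forest
    max_features="sqrt",   # features considered at each split
    max_depth=None,        # let trees grow fully...
    min_samples_leaf=2,    # ...but require at least 2 samples per leaf
    oob_score=True,        # score each tree on samples left out of its bootstrap
    random_state=0,
)
clf.fit(X, y)
print("out-of-bag accuracy estimate:", clf.oob_score_)
```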