Question 1

What is gradient descent?

Accepted Answer

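Gradient descent is an iterative optimization algorithm that minimizes a differentiable function by repeatedly stepping in the direction opposite to the gradient. Each step lowers the function's value (the loss, in machine learning) and moves the parameters toward a minimum, which may be only a local one.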
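
As a minimal sketch of that loop, the following assumes a hypothetical one-dimensional objective f(x) = (x - 3)^2; the function name, starting point, learning rate, and step count are illustrative choices, not part of the answer above.

```python
def gradient_descent(grad, x0, learning_rate=0.1, steps=100):
    """Repeatedly step against the gradient, starting from x0."""
    x = x0
    for _ in range(steps):
        x = x - learning_rate * grad(x)
    return x


# Assumed objective: f(x) = (x - 3)^2, whose gradient is 2 * (x - 3).
x_min = gradient_descent(lambda x: 2 * (x - 3), x0=0.0)
print(x_min)  # ends up very close to the minimizer x = 3
```

Each iteration shrinks the distance to the minimizer by a constant factor here, because the objective is a simple quadratic; real loss surfaces are rarely this well behaved.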

Question 2

How does gradient descent apply to neural networks?

Accepted Answer

In neural networks, gradient descent minimizes the loss over training data by computing gradients of the loss with respect to weights and biases using backpropagation, and updating parameters to reduce error.
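
To make this concrete, here is a small sketch of backpropagation written out by hand. It assumes a toy network with one tanh hidden unit, y_hat = w2 * tanh(w1 * x + b1) + b2, a mean squared error loss, and made-up data; none of these choices come from the answer above, and real networks rely on a framework to compute the gradients.

```python
import math

# Assumed toy data: learn y = x on a few points.
xs = [-1.0, -0.5, 0.0, 0.5, 1.0]
ys = [-1.0, -0.5, 0.0, 0.5, 1.0]


def loss_and_grads(params):
    """Mean squared error and its gradients for the toy network."""
    w1, b1, w2, b2 = params
    n = len(xs)
    loss = 0.0
    g_w1 = g_b1 = g_w2 = g_b2 = 0.0
    for x, y in zip(xs, ys):
        # Forward pass.
        h = math.tanh(w1 * x + b1)
        y_hat = w2 * h + b2
        loss += (y_hat - y) ** 2 / n
        # Backward pass: chain rule from the loss back to each parameter.
        d_y_hat = 2 * (y_hat - y) / n
        g_w2 += d_y_hat * h
        g_b2 += d_y_hat
        d_z = d_y_hat * w2 * (1 - h * h)  # derivative of tanh is 1 - tanh^2
        g_w1 += d_z * x
        g_b1 += d_z
    return loss, [g_w1, g_b1, g_w2, g_b2]


def train(params, learning_rate=0.1, steps=200):
    """Full-batch gradient descent; returns final parameters and loss history."""
    history = []
    for _ in range(steps):
        loss, grads = loss_and_grads(params)
        history.append(loss)
        params = [p - learning_rate * g for p, g in zip(params, grads)]
    return params, history


params, history = train([0.5, 0.0, 0.5, 0.0])
print(history[0], history[-1])  # the loss after training is lower than at the start
```

The backward pass reuses values from the forward pass (h and y_hat), which is the bookkeeping that backpropagation automates for deeper networks.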

Question 3

What is learning rate and why is it important?

Accepted Answer

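The learning rate sets how large each parameter update is. A rate that is too large can overshoot minima or cause training to diverge; a rate that is too small makes training slow and can leave it stalled on plateaus or in poor local minima.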
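
The effect is easy to see on a toy problem. The sketch below assumes the objective f(x) = x^2 (gradient 2x) and three illustrative learning rates; the specific values are chosen only to show the three regimes.

```python
def run(learning_rate, x0=1.0, steps=20):
    """Minimize f(x) = x^2 (gradient 2x) and return the final x."""
    x = x0
    for _ in range(steps):
        x -= learning_rate * 2 * x
    return x


too_small = run(0.01)   # creeps toward 0 and is still far away after 20 steps
reasonable = run(0.4)   # converges quickly
too_large = run(1.1)    # every step overshoots and |x| grows
print(too_small, reasonable, too_large)
```

For this objective each update multiplies x by (1 - 2 * learning_rate), so any rate above 1.0 makes the iterates grow instead of shrink.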

Question 4

What are batch, stochastic, and mini-batch gradient descent?

Accepted Answer

Batch gradient descent uses the full dataset for each update, stochastic gradient descent uses a single example, and mini-batch gradient descent uses a small subset. Mini-batch is the common choice because it balances the accuracy of the gradient estimate against the cost of each update.
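
In code, the three variants can differ only in the batch size. The sketch below assumes a one-parameter model y = w * x, noise-free made-up data, and illustrative hyperparameters; the `fit` function is hypothetical.

```python
import random


def fit(xs, ys, batch_size, learning_rate=0.05, epochs=50, seed=0):
    """Fit y = w * x by gradient descent on squared error with the given batch size."""
    rng = random.Random(seed)
    indices = list(range(len(xs)))
    w = 0.0
    for _ in range(epochs):
        rng.shuffle(indices)
        for start in range(0, len(indices), batch_size):
            batch = indices[start:start + batch_size]
            # Average gradient of (w * x - y)^2 over the examples in this batch.
            grad = sum(2 * (w * xs[i] - ys[i]) * xs[i] for i in batch) / len(batch)
            w -= learning_rate * grad
    return w


# Assumed data: y = 3x sampled at 20 points.
xs = [i / 10 for i in range(1, 21)]
ys = [3 * x for x in xs]

w_batch = fit(xs, ys, batch_size=len(xs))  # batch: one update per epoch
w_sgd = fit(xs, ys, batch_size=1)          # stochastic: one update per example
w_mini = fit(xs, ys, batch_size=5)         # mini-batch: one update per 5 examples
print(w_batch, w_sgd, w_mini)              # all three approach w = 3
```

With the same number of epochs, smaller batches perform more updates, each based on a noisier estimate of the full gradient.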

Question 5

What is a gradient, and what does the negative gradient do?

Accepted Answer

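The gradient is the vector of partial derivatives of the loss with respect to each parameter; it shows how the loss changes as each parameter changes. The negative gradient points in the direction of steepest local decrease in the loss, so a small step along it guides parameter updates toward lower loss.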
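
As a small illustration, the sketch below assumes a hypothetical two-parameter loss, w1^2 + 3 * w2^2, approximates each partial derivative with a central difference, and then takes one small step along the negative gradient. The loss, the step size, and the helper names are all illustrative.

```python
def loss(w):
    """Assumed loss with two parameters: w1^2 + 3 * w2^2."""
    return w[0] ** 2 + 3 * w[1] ** 2


def numerical_gradient(f, w, eps=1e-6):
    """Approximate each partial derivative with a central difference."""
    grad = []
    for i in range(len(w)):
        up = list(w)
        down = list(w)
        up[i] += eps
        down[i] -= eps
        grad.append((f(up) - f(down)) / (2 * eps))
    return grad


w = [1.0, 2.0]
grad = numerical_gradient(loss, w)  # close to the exact gradient [2.0, 12.0]
# One small step along the negative gradient.
stepped = [wi - 0.1 * gi for wi, gi in zip(w, grad)]
print(loss(w), loss(stepped))  # the loss is lower after the step
```

The exact gradient of this loss is (2 * w1, 6 * w2), so the finite-difference result can be checked by hand.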

Gradient Descent Basics

