Steepest descent algorithm
Statistical Thinking Update your probability not change (Learning Rate)
gradient ascent maximize and descent minimize function
Iteratively subtract the derivative of Loss Function wrt weight from the weight with the learning rate and then Cost Function decreases
Repeat until the weight convergence (Local extremum point)
Gradient Descents
Gradient Descent Notion