Minimize the cost explicitly, without relying on an iterative algorithm:
- take the derivative of the cost function with respect to the model coefficients
- find the local extremum, i.e. the point where that derivative is zero
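
As a sketch of these two steps, assume the cost is the squared error of a linear model. The notes do not state this, so the symbols here are illustrative: X is the n × d data matrix, y the target vector, and w the d coefficients.

```latex
J(w) = \tfrac{1}{2}\,\lVert Xw - y \rVert^2
\qquad
\nabla_w J(w) = X^\top (Xw - y)

\nabla_w J(w) = 0
\;\Longrightarrow\;
X^\top X\, w = X^\top y
\;\Longrightarrow\;
w = (X^\top X)^{-1} X^\top y
```

The last step assumes that X^T X is invertible. Because this cost is convex, the extremum is a minimum.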
Normal Equation
The result of step 1 (the derivative) is a d-dimensional vector if X is a vector, where d is the number of coefficients
Elegant, but computing the matrix inverse is very expensive when d is large (standard inversion of a d × d matrix costs roughly O(d³))
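
A minimal NumPy sketch of the closed-form solution, assuming a squared-error linear regression where the coefficients are w = (XᵀX)⁻¹Xᵀy. The function names, the synthetic data, and `w_true` are hypothetical and only for illustration. The second variant solves the linear system instead of forming the inverse, which is usually cheaper and numerically safer.

```python
import numpy as np


def normal_equation_inverse(X, y):
    """Closed-form coefficients using an explicit matrix inverse."""
    return np.linalg.inv(X.T @ X) @ X.T @ y


def normal_equation_solve(X, y):
    """Same coefficients, found by solving (X^T X) w = X^T y directly."""
    return np.linalg.solve(X.T @ X, X.T @ y)


# Synthetic data (assumption for illustration): n samples, d coefficients.
rng = np.random.default_rng(0)
n, d = 100, 3
# First column of ones acts as the intercept term.
X = np.hstack([np.ones((n, 1)), rng.normal(size=(n, d - 1))])
w_true = np.array([2.0, -1.0, 0.5])
y = X @ w_true + 0.01 * rng.normal(size=n)

w_inv = normal_equation_inverse(X, y)
w_solve = normal_equation_solve(X, y)
print(w_inv)
print(w_solve)
```

Both functions should return nearly identical coefficients, each a vector of length d. For large d, an iterative method such as gradient descent may be the more practical choice.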