Given a 2-D function , the gradient descent method finds
so that
.
First consider 1D case. We see that is always in the opposite
direction of the increasing direction of
:
Then consider 2D case where the derivative used in 1D case becomes gradient
Obviously the gradient descent method can be generalized to minimize high
dimensional functions
.