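Previously we approximated the state or action value
functions by a linear function
$\hat{v}(s, \mathbf{w}) = \mathbf{w}^T \mathbf{x}(s)$
or
$\hat{q}(s, a, \mathbf{w}) = \mathbf{w}^T \mathbf{x}(s, a)$,
parameterized by the weight vector $\mathbf{w}$ and based on
a set of features in $\mathbf{x}$. However, these features
need to be hand-picked or designed for the specific
problem to be solved.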
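
To make the role of the hand-designed features concrete, here is a minimal sketch of a linear value approximator in Python with NumPy. Everything specific in it is an assumption made for illustration, not something taken from the text: the scalar state, the polynomial features (bias, s, s squared), the two-action setting, and the names `state_features`, `state_action_features`, `v_hat`, and `q_hat`. The point is only that the estimate is a dot product of a weight vector with a feature vector that someone has to choose by hand.

```python
import numpy as np

N_ACTIONS = 2         # assumed number of discrete actions (illustrative)
N_STATE_FEATURES = 3  # length of the hand-designed state feature vector


def state_features(s):
    """Hypothetical hand-picked features of a scalar state: bias, s, s^2."""
    s = float(s)
    return np.array([1.0, s, s * s])


def state_action_features(s, a):
    """Copy the state features into the block belonging to action a.

    All other blocks stay zero, so each action effectively gets its own
    slice of the weight vector.
    """
    x = np.zeros(N_ACTIONS * N_STATE_FEATURES)
    start = a * N_STATE_FEATURES
    x[start:start + N_STATE_FEATURES] = state_features(s)
    return x


def v_hat(s, w):
    """Linear state-value estimate: dot product of weights and features."""
    return float(np.dot(w, state_features(s)))


def q_hat(s, a, w):
    """Linear action-value estimate: dot product of weights and features."""
    return float(np.dot(w, state_action_features(s, a)))


# Usage with arbitrary example weights (not learned values).
w_v = np.array([0.5, 1.0, -2.0])
w_q = np.arange(6.0)  # [0, 1, 2, 3, 4, 5]
print(v_hat(2.0, w_v))
print(q_hat(2.0, 1, w_q))
```

Changing the problem usually means rewriting `state_features` by hand, which is the limitation described above: the quality of the approximation depends on features chosen in advance rather than learned from data.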
Network!