The main purpose of feature selection is to reduce the computational cost by using
only
features for recognition/classification purposes. These
features can be either directly chosen from the
original ones, or generated as
some linear combinations of the original ones. The features selected should keep as
much separability information as possible.
There are
If the features chosen optimally above do not produce satisfactory separability,
we can try to generate some
new features as the linear combinations of the
original ones by a linear transform:
After a linear transform
, the mean vectors and covariance
matrices of each class become