The KLT is the optimal orthogonal transform among all possible
orthogonal transforms
based on any
orthogonal matrix
satisfying
,
in the following sense:
- KLT completely decorrelates the signal.
- KLT maximally compacts the energy (information) contained in the signal.
KLT Completely Decorrelates the Signal
The mean vector and covariance matrix of the random vector
after the KLT transform can found as
Based on the fact that the covariance matrix
is a
diagnal matrix, we have the following two observations:
KLT Optimally Compacts Signal Energy
While the total energy contained in the signal is conserved by any
orthogonal transform, the distribution of this energy among the
compoents before and after the transform may be very different. We
can show that the KLT is optimal in the sense that it redistributes
the total energy in such a way that it is maximally compacted into a
subset of components of
.
Consider a generic orthorgonal transform
based on an arbitary orthogonal matrix
satisfying
:
![$\displaystyle {\bf y}=\left[ \begin{array}{c} y_1 \\ \vdots \\ y_d \end{array} ...
...{ccc} &{\bf a}_1^T & \\ &\vdots& \\
& {\bf a}_d^T & \end{array} \right]{\bf x}$](img243.svg) |
(63) |
We also have
 |
(64) |
We then consider the energy contained in
out of the
components of
expressed as a function of the transform matrix
:
We can find the optimal orthogonal transform matrix
that maximizes
this partial energy
by solving the following optimization problem
with the constraint that the column vectors of
are normalized with
:
 |
(66) |
The Lagrange function of this constrained optimization problem is
 |
(67) |
To find the optimal
, we set partial derivative with respect to
each
to zero:
The resulting equation
happens to be the eigenequation of the covariance matrix
,
i.e., the column vectors of the optimal transform matrix
must
be the eigenvectors of
satisfying:
i.e. |
(69) |
We therefore see that the optimal transform is indeed the KLT transform
![$\displaystyle {\bf A}=[{\bf a}_1, \cdots,{\bf a}_d]={\bf V}=[{\bf v}_1, \cdots, {\bf v}_d]$](img260.svg) |
(70) |
and
 |
(71) |
This partial energy
is maximized if we choose those
eigenvectors
corresponding to the
greatest eigenvalues
of
:
.