Next: KLT Optimally Compacts the Up: pca Previous: The Principal Component Transform

KLT Completely Decorrelates the Signal

KLT is the optimal orthogonal transform in the following sense:

KLT completely decorrelates the signal
KLT optimally compacts the energy (information) contained in the signal.

The first property is simply due to the definition of KLT, and the second property is due to the fact that KLT redistributes the energy among the components in such a way that most of the energy is contained in a small number of components of $Y=\Phi^T X$ .

To see the first property, consider the mean vector and covariance matrix $\Sigma_Y$ of :

$\begin{displaymath}M_Y = E(Y)=E(\Phi^T X)=\Phi^T E(X)=\Phi^T M_X \end{displaymath}$

$\displaystyle \Sigma_Y$	$\textstyle =$	$\displaystyle E(YY^{T})-M_Y M_Y^T=E[\Phi^{T}X(\Phi^{T}X)^{T}] -\Phi^T M_X (\Phi^T M_X)^T$
	$\textstyle =$	$\displaystyle E[\Phi^{T}(XX^{T})\Phi]-\Phi^T M_X M_X^T \Phi$
	$\textstyle =$	$\displaystyle \Phi^T [ E(XX^{T}) -M_X M_X^T ] \Phi$
	$\textstyle =$	$\displaystyle \Phi^{T}\Sigma_X\Phi=\Lambda=diag[\lambda_0, \lambda_1, \cdots, \lambda_{N-1}]$

Or in matrix form, we have

$\begin{displaymath} \Sigma_Y = \Phi^T \Sigma_X \Phi=\Lambda= \left[ \begin{arra... ...s \\ 0 & 0 & \cdots & \sigma^2_{y_{N-1}} \end{array} \right] \end{displaymath}$

We see that after KLT, the correlation matrix of the signal is diagonalized, i.e., the correlation $\sigma_{ij}=0$ between any two components

and

is always zero. In other words, the signal is completely decorrelated.

Also, we have

$\begin{displaymath}R_Y = \Sigma_Y+M_Y M_Y^T=\Lambda+M_Y M_Y^T \end{displaymath}$

Next: KLT Optimally Compacts the Up: pca Previous: The Principal Component Transform

Ruye Wang 2004-09-29