
Predictive coding

As natural signals are highly correlated, the difference between neighboring samples is usually small. The value of a pixel $x$ can therefore be predicted from its neighbors $a$, $b$, $c$, and $d$ with a small error:

\begin{displaymath}e_1=x-a,\;\;\;\;e_2=x-b,\;\;\;\;e_3=x-\frac{a+b+c+d}{4} \end{displaymath}

In general, the predicted value $\hat{x}_0$ for a pixel $x_0$ is a linear combination of its available neighbors $x_j\;(j=1,\cdots,n)$, with weighting coefficients $a_j$:

\begin{displaymath}\hat{x}_0=\sum_{j=1}^n a_j x_j \end{displaymath}

[Figure: predictive_coding.gif]

The entropy of the error-image histogram hist(e) is much smaller than that of the original-image histogram hist(x); therefore Huffman coding is much more effective on the error image than on the original image.
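As a quick numerical illustration (a minimal sketch in Python, assuming NumPy and using a synthetic ramp-plus-noise image in place of a real photograph), the following forms the error image for the simple left-neighbor predictor $e_1=x-a$ and compares the entropies of hist(x) and hist(e):

import numpy as np

def entropy(a):
    """Shannon entropy (bits/sample) of the histogram of an integer array."""
    _, counts = np.unique(a.ravel(), return_counts=True)
    p = counts / counts.sum()
    return -np.sum(p * np.log2(p))

# Synthetic stand-in for a natural image: a smooth ramp plus mild noise.
rng = np.random.default_rng(0)
x = (np.tile(np.arange(256), (256, 1)) + rng.integers(0, 8, (256, 256))).astype(np.int32)

# Predict each pixel by its left neighbor a = x(i, j-1); error e1 = x - a.
e = x[:, 1:] - x[:, :-1]

print("entropy of hist(x): %.2f bits/pixel" % entropy(x))
print("entropy of hist(e): %.2f bits/pixel" % entropy(e))   # much smaller

The error image typically needs only a few bits per pixel, which is why entropy coding the errors pays off.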

Optimal predictive coding

The mean square prediction error is:

\begin{displaymath}E(e^2)=E[ (x_0-\hat{x}_0)^2 ]=E(x_0^2)-2E(x_0 \hat{x}_0)+E(\hat{x}^2_0) \end{displaymath}

where

\begin{displaymath}\hat{x}_0=\sum_{j=1}^n a_j x_j \end{displaymath}

To find the optimal coefficients $a_i$ that minimize $E(e^2)$, we let

\begin{displaymath}\frac{\partial}{\partial a_i} E(e^2)
=\frac{\partial}{\partial a_i}E(x_0^2)
+\frac{\partial}{\partial a_i}E(\hat{x}^2_0)
-2\frac{\partial}{\partial a_i}E(x_0\hat{x}_0)
=\frac{\partial}{\partial a_i}E(\hat{x}^2_0)-2\frac{\partial}{\partial a_i}E(x_0\hat{x}_0)
=0\end{displaymath}

(the first term vanishes because $E(x_0^2)$ does not depend on $a_i$),

but as

\begin{displaymath}2\frac{\partial}{\partial a_i}E(x_0\hat{x}_0)=
2E(x_0\frac{\partial}{\partial a_i}\sum_{j=1}^n a_j x_j)=2E(x_0 \cdot x_i)
\end{displaymath}

and

\begin{displaymath}\frac{\partial}{\partial a_i}E(\hat{x}^2_0)
=2E\left(\hat{x}_0 \frac{\partial}{\partial a_i}\sum_{j=1}^n a_j x_j\right)
=2E\left(\sum_{j=1}^n a_j x_j \cdot x_i\right)
=2\sum_{j=1}^n a_j E(x_j \cdot x_i)
\end{displaymath}

we have

\begin{displaymath}E(x_0 \cdot x_i)=\sum_{j=1}^n a_j E(x_j \cdot x_i) \end{displaymath}

Here

\begin{displaymath}E(x_i x_j)=r_{ij} \end{displaymath}

is the correlation between $x_i$ and $x_j$, which can be estimated from data obtained over multiple trials:

\begin{displaymath}\hat{r}_{ij}=\hat{E}(x_i x_j)=\frac{1}{K}\sum_{k=1}^K x_i^{(k)} x_j^{(k)} \end{displaymath}

Now the condition for optimal prediction above can be written as

\begin{displaymath}r_{0i}=\sum_{j=1}^n a_j r_{ij},\;\;\;\;\;(i=1,\cdots,n) \end{displaymath}

which can be expressed in vector form:

\begin{displaymath}{\bf r}_0=R {\bf a} \end{displaymath}

where

\begin{displaymath}{\bf r}_0=\left[ \begin{array}{c} r_{01} \\ \vdots \\ r_{0n} \end{array} \right],
\;\;\;\;\;
R=\left[ \begin{array}{ccc}
r_{11} & \cdots & r_{1n} \\
\vdots & r_{ij} & \vdots \\
r_{n1} & \cdots & r_{nn}
\end{array} \right]
\end{displaymath}

Then the coefficients can be found as

\begin{displaymath}{\bf a} = R^{-1}{\bf r}_0 \end{displaymath}
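In practice the correlations are estimated from the image itself, as in the $\hat{r}_{ij}$ estimate above, and the linear system is solved numerically. Below is a minimal Python sketch that builds $R$ and ${\bf r}_0$ from pixel samples and solves for ${\bf a}$; the use of NumPy, the choice of $n=3$ causal neighbors (left, above, and above-left), and the synthetic test image are illustrative assumptions, not part of the original text:

import numpy as np

rng = np.random.default_rng(0)
# Synthetic correlated image standing in for real data.
x = (np.tile(np.arange(256), (256, 1)) + rng.integers(0, 8, (256, 256))).astype(float)

# For every pixel x0 = x(i,j) with a full causal neighborhood, collect n = 3 neighbors:
# x1 = x(i,j-1) (left), x2 = x(i-1,j) (above), x3 = x(i-1,j-1) (above-left).
x0 = x[1:, 1:].ravel()
X  = np.stack([x[1:, :-1].ravel(),      # left
               x[:-1, 1:].ravel(),      # above
               x[:-1, :-1].ravel()])    # above-left   -> shape (n, K)

K  = x0.size
R  = X @ X.T / K            # R[i,j] = estimated E(x_i x_j)
r0 = X @ x0 / K             # r0[i]  = estimated E(x_0 x_i)

a = np.linalg.solve(R, r0)  # optimal coefficients a = R^{-1} r0
e = x0 - a @ X              # resulting prediction error
print("coefficients:", a)
print("mean squared prediction error: %.3f" % np.mean(e**2))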

To prevent the prediction error from accumulating, we require

\begin{displaymath}\sum_{j=1}^n a_j< 1 \end{displaymath}

so that errors do not grow as they propagate.
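To see why such a condition helps, consider a simplified illustration (not part of the original text): if each reconstructed neighbor used in the prediction carries the same error $\delta$, the resulting error in the predicted value is

\begin{displaymath}\sum_{j=1}^n a_j (x_j+\delta) - \sum_{j=1}^n a_j x_j = \Big(\sum_{j=1}^n a_j\Big)\delta \end{displaymath}

which is smaller in magnitude than $\delta$, and thus decays rather than grows as it propagates, only when $\sum_{j=1}^n a_j < 1$ (assuming non-negative coefficients).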

Examples

Some commonly used predictors for $x(i,j)$, all based on previously scanned (causal) neighbors, are listed below:

\begin{displaymath}\hat{x}(i,j)=0.9\;x(i,j-1) \end{displaymath}


\begin{displaymath}\hat{x}(i,j)=0.9\;x(i-1,j) \end{displaymath}


\begin{displaymath}\hat{x}(i,j)=0.7\;x(i,j-1)+0.7\;x(i-1,j)-0.5\;x(i-1,j-1) \end{displaymath}


\begin{displaymath}\hat{x}(i,j)=\left\{ \begin{array}{ll}
0.9 \; x(i,j-1) & \mbox{if } \triangle h \le \triangle v \\
0.9 \; x(i-1,j) & \mbox{else} \end{array} \right. \end{displaymath}

where

\begin{displaymath}\triangle h=\vert x(i-1,j)-x(i-1,j-1)\vert,\;\;\;\;\triangle v=\vert x(i,j-1)-x(i-1,j-1)\vert \end{displaymath}
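The last (adaptive) predictor switches between the horizontal and the vertical predictor depending on the local variation. A minimal Python sketch of it, assuming NumPy and applying the rule to all pixels that have a full causal neighborhood, might look like this:

import numpy as np

def adaptive_predict_error(x):
    """Error image for the adaptive predictor:
    0.9*x(i,j-1) if dh <= dv, else 0.9*x(i-1,j)."""
    x = x.astype(float)
    left       = x[1:, :-1]    # x(i, j-1)
    above      = x[:-1, 1:]    # x(i-1, j)
    above_left = x[:-1, :-1]   # x(i-1, j-1)

    dh = np.abs(above - above_left)   # horizontal variation (along row i-1)
    dv = np.abs(left  - above_left)   # vertical variation   (along column j-1)

    pred = np.where(dh <= dv, 0.9 * left, 0.9 * above)
    return x[1:, 1:] - pred           # prediction error for interior pixels

rng = np.random.default_rng(0)
x = np.tile(np.arange(256), (256, 1)) + rng.integers(0, 8, (256, 256))
e = adaptive_predict_error(x)
print("mean squared prediction error: %.2f" % np.mean(e**2))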


Ruye Wang 2021-03-28