
Digital Convolution -- E186 Handout

The convolution of two continuous signals $x(t)$ and $h(t)$ is defined as

\begin{displaymath}y(t)=h(t)*x(t) \stackrel{\triangle}{=} \int_{-\infty}^{\infty} x(\tau) h(t-\tau) d\tau = \int_{-\infty}^{\infty} h(\tau) x(t-\tau) d\tau = x(t)*h(t) \end{displaymath}

i.e., convolution is commutative. Convolution is also associative:

\begin{displaymath}h*(g*x)=(h*g)*x \end{displaymath}
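The commutativity can be verified with the change of variable $\tau'=t-\tau$ in the defining integral:

\begin{displaymath}\int_{-\infty}^{\infty} x(\tau) h(t-\tau) d\tau = \int_{-\infty}^{\infty} x(t-\tau') h(\tau') d\tau' \end{displaymath}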

Typically, $y(t)$ is the output of a system characterized by its impulse response function $h(t)$ with input $x(t)$.

Convolution in discrete form is

\begin{displaymath}y(n)=\sum_{m=-\infty}^{\infty} x(n-m) \; h(m) =\sum_{m=-\infty}^{\infty} h(n-m) \; x(m) \end{displaymath}
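As a small worked example (with values chosen purely for illustration), convolving $x=\{1,2,3\}$ with $h=\{1,1,1\}$ (both indexed from 0) gives

\begin{displaymath}y(0)=1,\;\;y(1)=1+2=3,\;\;y(2)=1+2+3=6,\;\;y(3)=2+3=5,\;\;y(4)=3 \end{displaymath}

i.e., $3+3-1=5$ nonzero output samples.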

If $h(m)$ has finite support, e.g.,

\begin{displaymath}h(m) = \left\{ \begin{array}{ll} h(m) & \vert m\vert\le k \\ 0 & \vert m\vert>k \end{array} \right. \end{displaymath}

the convolution becomes

\begin{displaymath}y(n)=\sum_{m=-k}^{k} x(n-m) \; h(m) \end{displaymath}
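For example, for a 3-point kernel ($k=1$) the sum expands to

\begin{displaymath}y(n)=x(n+1) \; h(-1)+x(n) \; h(0)+x(n-1) \; h(1) \end{displaymath}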

If the system in question is causal in the time domain, i.e.,

\begin{displaymath}h(n)=0\;\;\;\;\;\;\mbox{ if $n<0$} \end{displaymath}

the above becomes

\begin{displaymath}y(n)=\sum_{m=0}^{k} x(n-m) \; h(m) \end{displaymath}
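For instance, with $k=2$ this is the familiar FIR (finite impulse response) filter form, in which the output depends only on the current and past inputs:

\begin{displaymath}y(n)=h(0) \; x(n)+h(1) \; x(n-1)+h(2) \; x(n-2) \end{displaymath}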

However, in image processing, we often consider convolution in the spatial domain, where causality does not apply.

If $h(m)$ is symmetric (almost always true in image processing), i.e.,

\begin{displaymath}h(-m)=h(m) \end{displaymath}

the convolution becomes the same as the correlation of the two functions:

\begin{displaymath}y(n)=\sum_{m=-k}^{k} x(n+m) \; h(m) \end{displaymath}
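This follows by substituting $m'=-m$ in the convolution sum and then using the symmetry $h(-m')=h(m')$:

\begin{displaymath}\sum_{m=-k}^{k} x(n-m) \; h(m)=\sum_{m'=-k}^{k} x(n+m') \; h(-m')=\sum_{m'=-k}^{k} x(n+m') \; h(m') \end{displaymath}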

The correlation can be considered a computational model for the neural networks in the biological visual system.

If the input $x(m)$ is of finite length (always true in practice), i.e.,

\begin{displaymath}x(m) = \left\{ \begin{array}{ll} x(m) & 0 \le m <N \\ 0 & \mbox{otherwise} \end{array} \right. \end{displaymath}

the argument $n+m$ of $x$ in the convolution has to satisfy the following for $x(n+m)$ to be in the valid non-zero range:

\begin{displaymath}0 \le n+m \le N-1 \end{displaymath}

or correspondingly, the index $n$ of the output $y(n)$ has to satisfy:

\begin{displaymath}-m \le n \le N-m-1 \end{displaymath}

When the variable index $m$ in the convolution is equal to $k$, the index of output $y(n)$ reaches its lower bound $n=-k$; when $m=-k$, the index of $y(n)$ reaches its upper bound $n=N+k-1$. In other words, there are $N+2k$ valid (non-zero) elements in the output:

\begin{displaymath}y(n),\;\;\;\;\;\;(-k \le n \le N+k-1) \end{displaymath}
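For example, an input of $N=5$ samples convolved with a 3-point kernel ($k=1$) produces the $N+2k=7$ nonzero outputs

\begin{displaymath}y(-1),\;y(0),\;\ldots,\;y(5) \end{displaymath}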

Digital convolution is best understood graphically (where the index of $y(n)$ is rearranged).

[Figure: digital_convolution.gif -- graphical illustration of 1D digital convolution]

Assume the dimensionality of the input signal $x$ is $N$ and that of the kernel $h$ is $M=2k+1$ (usually an odd number), then the dimensionality of the resulting convolution $y=x*h$ is $N+M-1$. However, as it is usually desirable for the output $y$ to have the same dimensionality as the input $x$, $k$ components at each end of $y$ are dropped. A code segment for this 1D convolution $y=x*h$ is given below.

k = (M-1)/2;
for (n = 0; n < N; n++) {
    y[n] = 0;
    for (m = -k; m <= k; m++)
        if (n+m >= 0 && n+m < N)    /* skip terms that fall outside x */
            y[n] += x[n+m] * h[m+k];
}
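As a usage sketch, the segment can be wrapped in a complete program; the input values and the 3-point average kernel below are illustrative, not from the handout:

#include <stdio.h>

#define N 8                 /* input length */
#define M 3                 /* kernel length, M = 2k+1 */

int main(void) {
    float x[N] = {1, 2, 4, 8, 8, 4, 2, 1};   /* example input signal */
    float h[M] = {1.0f/3, 1.0f/3, 1.0f/3};   /* 3-point average kernel */
    float y[N] = {0};
    int n, m, k = (M-1)/2;

    for (n = 0; n < N; n++)                  /* same loops as above */
        for (m = -k; m <= k; m++)
            if (n+m >= 0 && n+m < N)
                y[n] += x[n+m] * h[m+k];

    for (n = 0; n < N; n++)
        printf("y[%d] = %.3f\n", n, y[n]);
    return 0;
}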

In particular, if the elements of the kernel are all the same (an average or low-pass filter), then we can speed up the convolution while sliding the kernel over the input signal by updating only the two ends of the window: as the kernel moves one position, subtract the sample that leaves the window and add the sample that enters it.
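A minimal sketch of this running-sum idea, assuming a box (average) kernel and computing only the interior outputs where the window fits entirely inside the signal:

/* Box filter via a running sum: as the window slides one position,
   drop the sample leaving on the left and add the one entering on
   the right, so each output costs O(1) instead of O(M).
   Boundary outputs y[0..k-1] and y[N-k..N-1] are left to the caller. */
void box_filter(const float x[], float y[], int N, int M) {
    int n, m, k = (M-1)/2;
    float sum = 0;
    for (m = 0; m < M; m++)          /* sum of the first full window */
        sum += x[m];
    y[k] = sum / M;
    for (n = k+1; n < N-k; n++) {
        sum += x[n+k] - x[n-k-1];    /* update the two ends only */
        y[n] = sum / M;
    }
}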

In image processing, all of the discussion above for one-dimensional convolution generalizes to two dimensions, where $h$ is called a convolution kernel, or mask:


\begin{displaymath}y(m,n)=\sum_{i=-k}^k \sum_{j=-k}^k x(m+i,n+j) h(i,j) \end{displaymath}

[Figure: digital_convolution_2D.gif -- graphical illustration of 2D digital convolution]
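A direct C sketch of this 2D operation; the image size, the mask half-width, and the zero-padding at the borders are illustrative assumptions:

#define H 256   /* image height (illustrative) */
#define W 256   /* image width  (illustrative) */
#define K 1     /* mask half-width: the mask is (2K+1) x (2K+1) */

/* Direct 2D convolution with a symmetric mask h; pixels outside
   the image are treated as zero (zero padding). */
void conv2d(const float x[H][W], const float h[2*K+1][2*K+1],
            float y[H][W]) {
    int m, n, i, j;
    for (m = 0; m < H; m++)
        for (n = 0; n < W; n++) {
            float s = 0;
            for (i = -K; i <= K; i++)
                for (j = -K; j <= K; j++)
                    if (m+i >= 0 && m+i < H && n+j >= 0 && n+j < W)
                        s += x[m+i][n+j] * h[i+K][j+K];
            y[m][n] = s;
        }
}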




Ruye Wang 2009-09-03