next up previous
Next: FD used for shape Up: fd Previous: fd

Fourier Descriptor

Let $x[m]$ and $y[m]$ be the coordinates of the mth pixel on the boundary of a given 2D shape, a complex number can be formed as $z[m]=x[m]+jy[m]$, and the Fourier Descriptor (FD) of this shape is defined as:

\begin{displaymath}Z[k]=DFT[z[m]]=\frac{1}{N}\sum_{m=0}^{N-1}z[m]e^{-j2\pi mk/N}\;\;\;(k=0,\cdots,N-1) \end{displaymath}

where $N$ is the total number of pixels along the boundary.

FD can be used as a representation of 2D closed shapes independent of its location, scaling, rotation and starting point. For example, the first $L<N$ FD's $\{ Z[k],\;k=0,\cdots,L\}$ can be used to represent the 2D shape as they reconstruct approximately the shape without the details (corresponding to the high frequency components susceptible to noise). However, note that since the Fourier transform is a complex transform, the frequency spectrum has negative frequencies as well as positive frequencies, with the DC component in the middle. Therefore the inverse transform with $L<N$ components needs to contain both positive and negative terms:


\begin{displaymath}\hat{z}[m]=\sum_{k=-L/2}^{L/2} Z[k]e^{j2\pi mk/N}\;\;\;(m=0,\cdots,N-1) \end{displaymath}

The following figures show the reconstruction of a Gumby figure based on different percentages of frequency components (descriptors) after the Fourier transform of the pixels on the boundary of the figure in the image. The percentages used are, from left to right, $0.1\%$, $1\%$, $2\%$, $5\%$, and $100\%$. It can be seen that the reconstructed figures using $5\%$ of the frequency components are very similar to the actual figure (using one hundred percent of the frequency components on the right).

gumby.gif

fd.gif

Fourier discriptor has the following properties:


next up previous
Next: FD used for shape Up: fd Previous: fd
rwang 2007-12-05