All classification algorithms discussed above are supervised in nature
based on the assumped availability of some training data in which each
of the data patterns
is labeled by
, i.e., the class they
each belong is known. However, when such training set is not available,
there is still the need to explore some potential structure of the data
in the form of a set of unknown number
of clusters
,
each composed of a set of similar data points close to each other in the
feature space. This can be accomplished by the unsupervised method of
clustering analysis, based only on the given dataset
, without any additional prior
knowledge, such as the labeling of the data samples. We will now consider
a few algorithms for clustering analysis.