Self-Organizing Maps and Their Applications in Image Processing, Information Organization, and Retrieval - Nonlinear Signal and Image Processing: Theory, Methods and Applications


11.7.2 Splitting Criteria

Let us denote by $C^{(k)}(n-1)$ the cluster that can be tentatively split into two subclusters. We use two statistics from the field of cluster analysis$^{38,63}$ that rely on the sum of squared errors $J_e(g)$, $g = 1, 2$, to test the validity of the following possibilities:

1. Cluster $C^{(k)}(n-1)$ is kept united.

2. Cluster $C^{(k)}(n-1)$ is subdivided into two clusters, say, $C^{(k\zeta)}(n-1)$ and $C^{(k\eta)}(n-1)$.

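First let us define the sum of squared errors in cases (1) and (2) outlined above. We have

\[
J_e(g) =
\begin{cases}
\displaystyle \sum_{x_j \in C^{(k)}(n-1)} \left\| x_j - m^{(k)}(n-1) \right\|^2 & \text{for } g = 1, \\[2ex]
\displaystyle \sum_{\gamma \in \{\zeta, \eta\}} \; \sum_{x_j \in C^{(k\gamma)}(n-1)} \left\| x_j - m^{(k\gamma)}(n-1) \right\|^2 & \text{for } g = 2,
\end{cases}
\tag{11.98}
\]

where $m^{(k\zeta)}(n-1)$ and $m^{(k\eta)}(n-1)$ denote the sample mean vectors of the resulting subclusters. In the sequel, we describe how a tentative splitting is performed.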
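As a rough illustration of Eq. (11.98), the following Python sketch evaluates the sum of squared errors for the united cluster and for a given pair of subclusters. The function names `sum_squared_errors` and `j_e`, the NumPy array layout (one pattern per row), and the toy data are assumptions made for this example and do not come from the text; both subclusters are assumed to be nonempty.

```python
import numpy as np


def sum_squared_errors(patterns):
    """Sum of squared Euclidean distances of the rows to their sample mean."""
    patterns = np.asarray(patterns, dtype=float)
    mean = patterns.mean(axis=0)  # sample mean vector of the (sub)cluster
    return float(np.sum((patterns - mean) ** 2))


def j_e(g, cluster, subclusters=None):
    """J_e(g) as in Eq. (11.98).

    g = 1: the cluster is kept united.
    g = 2: the cluster is replaced by the two given subclusters.
    """
    if g == 1:
        return sum_squared_errors(cluster)
    if g == 2:
        return sum(sum_squared_errors(sub) for sub in subclusters)
    raise ValueError("g must be 1 or 2")


# Hypothetical toy cluster: four 2-D patterns at the corners of a rectangle.
cluster = np.array([[0.0, 0.0], [0.0, 2.0], [4.0, 0.0], [4.0, 2.0]])
zeta, eta = cluster[:2], cluster[2:]  # one possible pair of subclusters

je1 = j_e(1, cluster)
je2 = j_e(2, cluster, (zeta, eta))
print(je1, je2)
```

For this toy data the split lowers the sum of squared errors, because each subcluster is measured against its own sample mean rather than the mean of the whole cluster.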

We determine the direction in which the variation of cluster $C^{(k)}(n-1)$ is greatest. This is equivalent to finding the first principal component of the sample dispersion matrix (i.e., the eigenvector that corresponds to the largest eigenvalue of $S^{(k)}(n-1)$). Let us denote by $e^{(k)}(n-1)$ the first normalized principal eigenvector of $S^{(k)}(n-1)$. Having determined $e^{(k)}(n-1)$, we examine the splitting of cluster $C^{(k)}(n-1)$ with a hyperplane that is perpendicular to the direction of $e^{(k)}(n-1)$ and passes through the sample mean $m^{(k)}(n-1)$. Therefore, all patterns in $C^{(k)}(n-1)$ are sorted into sets $C^{(k\zeta)}(n-1)$ and $C^{(k\eta)}(n-1)$ as follows:

\[
\begin{aligned}
C^{(k\zeta)}(n-1) &= \left\{ x \in C^{(k)}(n-1) : e^{(k)T}(n-1)\, x \le e^{(k)T}(n-1)\, m^{(k)}(n-1) \right\}, \\
C^{(k\eta)}(n-1) &= \left\{ x \in C^{(k)}(n-1) : e^{(k)T}(n-1)\, x > e^{(k)T}(n-1)\, m^{(k)}(n-1) \right\}.
\end{aligned}
\tag{11.99}
\]
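The tentative split of Eq. (11.99) can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the dispersion matrix is taken here to be the unnormalized scatter matrix of the centered patterns (any positive scaling of it yields the same principal eigenvector), and the function name `tentative_split` and the toy data are hypothetical.

```python
import numpy as np


def tentative_split(cluster):
    """Split a cluster with the hyperplane through its sample mean that is
    perpendicular to its first principal eigenvector, as in Eq. (11.99).

    Returns the two subclusters and the unit-norm eigenvector used.
    """
    x = np.asarray(cluster, dtype=float)
    mean = x.mean(axis=0)
    centered = x - mean

    # Assumed definition of the dispersion matrix: unnormalized scatter matrix.
    scatter = centered.T @ centered

    # eigh handles symmetric matrices and returns eigenvalues in ascending
    # order, so the last column is the principal eigenvector (unit norm).
    _, eigvecs = np.linalg.eigh(scatter)
    e = eigvecs[:, -1]

    # e^T x <= e^T m is the same as e^T (x - m) <= 0.
    on_zeta_side = centered @ e <= 0.0
    return x[on_zeta_side], x[~on_zeta_side], e


# Hypothetical toy cluster, elongated along the first coordinate axis.
cluster = np.array([[0.0, 0.0], [0.0, 2.0], [4.0, 0.0], [4.0, 2.0]])
zeta, eta, e = tentative_split(cluster)
print(zeta, eta, e, sep="\n")
```

The sign of an eigenvector is arbitrary, so which half ends up labeled $\zeta$ and which $\eta$ may differ between runs or libraries; the partition itself is unaffected.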

As mentioned earlier, splitting any cluster into two subclusters will result in a lower sum of squared errors, i.e., $J_e(2) < J_e(1)$. We decide to consider as valid any splitting that yields a statistically significant improvement (i.e., decrease) in the sum of squared errors. To this end, a binary hypothesis-testing problem is formulated as follows.$^{63}$

Under the null hypothesis we assume that there is exactly one cluster present. Furthermore, it is assumed that all $z^{(k)}(n-1)$ patterns come from a multivariate distribution with mean $\mu$ and covariance matrix $\sigma^2 I$. In other
