where $n_i^a$ is the number of samples in the cluster $C_i^a$, and $n_{ij}^{ab*}$ denotes the number of samples shared between the clusters $C_i^a \in P^a$ and $C_j^{b*} \in P^{b*}$. Also, $k^{b*}$ is the number of clusters in $P^{b*}$.
Fig. 6. Computing the entropy between the cluster $C_1^a$ from $P^a$ and $P^{b*}$ from $P^b$.
Here, the average APMM (AAPMM) is proposed as a measure of the stability of a primary cluster $C_i^a$ with respect to the partitions in the reference set, as given in equation 4:
$$
\mathrm{AAPMM}(C_i^a) = \frac{1}{M} \sum_{j=1}^{M} \mathrm{APMM}(C_i^a, P^{b_j*}) \qquad (4)
$$
where $P^{b_j*}$ is obtained from the j-th partition of the reference set.
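The averaging in equation 4 can be sketched in Python as follows. This is only an illustration under stated assumptions: clusters are represented as sets of sample indices, a partition is a list of such sets, and the APMM score is passed in as a callable `apmm(cluster, partition)` that is assumed to derive $P^{b*}$ from the reference partition internally. The function names `aapmm` and `max_overlap` are hypothetical, and `max_overlap` is a toy stand-in used only to make the example runnable; it is not the APMM measure.

```python
from typing import Callable, Hashable, List, Sequence, Set

Cluster = Set[Hashable]
Partition = List[Cluster]


def aapmm(
    cluster: Cluster,
    reference_partitions: Sequence[Partition],
    apmm: Callable[[Cluster, Partition], float],
) -> float:
    """Average a per-partition score over the M reference partitions (equation 4)."""
    m = len(reference_partitions)
    if m == 0:
        raise ValueError("the reference set must contain at least one partition")
    return sum(apmm(cluster, p) for p in reference_partitions) / m


def max_overlap(cluster: Cluster, partition: Partition) -> float:
    """Toy placeholder score (NOT APMM): largest fraction of `cluster`
    that falls inside a single cluster of `partition`."""
    return max(len(cluster & c) for c in partition) / len(cluster)


if __name__ == "__main__":
    c = {0, 1, 2, 3}
    refs = [
        [{0, 1, 2, 3}, {4, 5}],   # c is reproduced exactly
        [{0, 1}, {2, 3, 4, 5}],   # c is split in half
    ]
    print(aapmm(c, refs, max_overlap))
```

Passing the score as a callable keeps the averaging step separate from the definition of APMM itself, so the real measure can be substituted without changing `aapmm`.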
3.2 Extended EAC (EEAC)
In this step, the selected clusters are used to construct the co-association matrix. In the EAC method, the m primary results from resampled data are accumulated in an n×n co-association matrix. Each entry in this matrix is computed by equation 5.
$$
C(i, j) = \frac{n_{i,j}}{m_{i,j}} \qquad (5)
$$
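As a rough illustration of equation 5, the sketch below builds the co-association matrix of the common EAC method from a list of label vectors, one per clustering of a resampled data set. It counts, for each pair of objects, how often both are present in a resample (the denominator) and how often they then fall into the same cluster (the numerator). The representation is an assumption made for this example: each labeling has one entry per object, with `None` marking an object that was left out of that resample. The name `co_association` is hypothetical, and the sketch covers plain EAC only, not the extended variant.

```python
from typing import List, Optional, Sequence


def co_association(
    labelings: Sequence[Sequence[Optional[int]]],
    n_objects: int,
) -> List[List[float]]:
    """Return the n x n matrix with entries C(i, j) = n_ij / m_ij (equation 5)."""
    n_shared = [[0] * n_objects for _ in range(n_objects)]   # n_ij
    m_present = [[0] * n_objects for _ in range(n_objects)]  # m_ij

    for labels in labelings:
        if len(labels) != n_objects:
            raise ValueError("each labeling needs one entry per object")
        for i in range(n_objects):
            if labels[i] is None:
                continue  # object i absent from this resample
            for j in range(n_objects):
                if labels[j] is None:
                    continue  # object j absent from this resample
                m_present[i][j] += 1
                if labels[i] == labels[j]:
                    n_shared[i][j] += 1

    # Pairs that never co-occur in a resample get 0.0 (an assumption of this sketch).
    return [
        [
            n_shared[i][j] / m_present[i][j] if m_present[i][j] else 0.0
            for j in range(n_objects)
        ]
        for i in range(n_objects)
    ]


if __name__ == "__main__":
    labelings = [
        [0, 0, 1, None],
        [0, 1, 1, 1],
        [0, 0, None, 0],
    ]
    for row in co_association(labelings, 4):
        print(row)
```

The nested loops keep the correspondence with the two counts explicit; for large n a vectorized implementation would be preferable.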
Here, $n_{i,j}$ counts the number of clusters shared by the objects with indices i and j in the partitions over the B clusterings, and $m_{i,j}$ is the number of partitions in which this pair of objects is simultaneously present. After thresholding, only a fraction of all primary clusters remains available. So, the common EAC method cannot truly recognize