Information Technology Reference
In-Depth Information
By proper transformation, the data matrix A is to be a joint probability distri-
bution matrix p
( S,F )
between two discrete random variables
S,F
.Let K be the
number of disjoint clusters of samples and K
the number of disjoint features. The
B =( S ,F )=( {S k ,
: k =
K } )
set of biclusters is
: k
=
1
, ··· ,
K
}, {F k
1
, ··· ,
.The
mappings of C S ,
C F are objectives to find in this biclustering algorithm such that
C S :
{
S 1 ,
S 2 , ··· ,
S n }→{S 1 , ··· ,S K },
C F :
{
F 1 ,
F 2 , ··· ,
F m }→{F 1 , ··· ,F K }.
is the amount of
information shared between these two variables and is defined as in information
theory
The mutual information I
( S,F )
of two random variables
S,F
n
i = 1
m
j = 1 p ( S i , F j ) log
p
(
S i ,
F j )
I
( S,F )=
F j ) =
D
(
p
(
S
,
F
) ||
p
(
S
)
p
(
F
)) ,
p
(
S i )
p
(
where p
(
S i ,
F j ) ,
p
(
S i ) ,
p
(
F j )
are probabilities from distribution matrix p
( S,F )
, and
log p 1 ( x )
p 2 ( x )
(
||
)= x p 1
(
)
D
p 1
p 2
x
is the relative entropy between two probability dis-
tributions p 1
.
The objective of this biclustering is to find optimal biclusters of A such that the
loss in mutual information is minimized, i.e.,
(
x
)
and p 2
(
x
)
( S ,F ) .
min I
( S,F )
I
x ,
y )
x ,
y )
x )
y )
In order to solve this objective function, q
(
x
,
y
)=
p
(
p
(
p
(
x
|
p
(
y
|
is defined so that the objective function can be written as
( S ,F )=
min I
( S,F )
I
D
(
p
( S,F ) ||
q
( S,F )) .
For proof of this result, we refer to [18]. Then an iterative way is used to solve
by transformed the objective function [18].
6.3.5 Based on Probability
The following two biclustering algorithms (named as BBC and cMonkey) use the
theory of probability.
BBC. Gu and Liu [26] proposed a Bayesian biclustering model (BBC) and im-
plemented a Gibbs sampling [34] procedure for its statistical inference. This model
can also consider an implementation of plain model [50] of biclustering.
Given data matrix A , assume the entry
k = 1 (( μ k + α ik + β jk + ε ijk ) δ ik κ jk )+ e ij 1
k = 1 δ ik κ jk
K
K
a ij =
,
 
Search WWH ::




Custom Search