Recent Advances of Data Biclustering with Application in Computational Neuroscience - Computational Neuroscience

Information Technology Reference

In-Depth Information

where

μ k is the main effect of bicluster k , and

α ik and

β jk are the effects of sample i

and feature j , respectively, in bicluster k ,

ε ijk is the noise term for bicluster k , and e ij

models the data points that do not belong to any bicluster. Here

δ ik , κ jk are binary

variables:

δ ik =

1 indicates that row i belongs to bicluster k , and

δ ik =

0 otherwise;

similarly,

0 otherwise.

In plain model [50], the entry a ij has similar assumption with less factors to be

considered.

In nonoverlapping feature biclustering,

κ jk =

1 indicates that column j is in cluster k , and

κ jk =

κ jk ≤

1, and in nonoverlapping sam-

∑

ple biclustering,

δ jk ≤

1. Here, nonoverlapping sample is discussed. The priors

∑

of the indicators

are set so that a feature can be in multiple biclusters while

sample is at more than one.

In this model, an observation a ij can belong to either one or none of the biclusters,

and the probability distribution of a ij conditional on the bicluster indicators can be

rewritten as

and

ε k

a ij | δ ik =

, κ jk =

∼

( μ k + α ik + β jk , σ

)

if a ij belongs to bicluster k ; otherwise,

a ij | δ ik κ jk =

0 for all k

∼

(

, σ

) .

With Gaussian zero-mean priors on the effect parameters, the marginal distribu-

tion of the a ij conditional on the indicators is

B| δ , κ ∼

(

, Σ ) ,

where

is the covariance of matrix of

and

B = {

B 0 ,

B 1 ,

B 2 , ··· ,

B K }

with B k =

{

1 and B 0 being the vector of data points belonging to no

bicluster. More specifically,

a ij :

δ ik κ jk =

≥

is a sparse matrix of the form

⎛

⎞

e I 0

···

⎝

⎠ ,

Σ 1 ···

Σ =

. . .

··· Σ K

where

Σ k =

Cov

(

B k ,

B k )

is the covariance matrix of all data points belonging to

cluster k .

To make inference form above BBC model, the implemented Gibbs sampling

method is used. Initializing from a set of randomly assigned values of

's and

's,

the column indicators

are sampled by calculating the log-probability ratio

(

V 2 | κ jk =

, σ

β k , σ

, σ

)

( κ jk =

)

log

) ,

(

V 2 | κ jk =

, σ

μ k , σ

α k , σ

k , σ

ε k , σ

)

( κ jk =

where V 1 = {

a il :

δ ik =

0or

κ lk =

}

, the set contains data points not in cluster

k , and V 2 = {

, the set contains data points

that are or can in bicluster k . This notation follows that in [26].

a il :

δ ik =

, κ lk =

}∪{

a ij :

δ ik =

}

Computational Neuroscience

Search WWH ::

Custom Search

Home