Recent Advances of Data Biclustering with Application in Computational Neuroscience - Computational Neuroscience


\[ \mu^{(c)}_{jk} = \frac{1}{|S_k|} \sum_{i \in S_k} a_{ij}, \tag{6.8} \]

and the mean of all the entries in $B_k$ is

\[ \mu_k = \frac{\sum_{i \in S_k} \sum_{j \in F_k} a_{ij}}{|F_k|\,|S_k|}. \tag{6.9} \]

The residue of the entry $a_{ij}$ in bicluster $B_k$ is

\[ r_{ij} = a_{ij} - \mu^{(r)}_{ik} - \mu^{(c)}_{jk} + \mu_k, \tag{6.10} \]

the variance of bicluster $B_k$ is

\[ \operatorname{Var}(B_k) = \sum_{i \in S_k} \sum_{j \in F_k} (a_{ij} - \mu_k)^2, \tag{6.11} \]

and the mean squared residue of the bicluster $B_k$ is

\[ H_k = \frac{\sum_{i \in S_k} \sum_{j \in F_k} r_{ij}^2}{|F_k|\,|S_k|}. \tag{6.12} \]
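The quantities in (6.8) to (6.12) can be sketched in a few lines of plain Python. This is only an illustration: the function name `bicluster_stats` and the list-of-lists matrix layout are hypothetical choices, and the row mean is assumed to be the average of row $i$ over the columns in $F_k$, by analogy with the column mean in (6.8).

```python
def bicluster_stats(a, rows, cols):
    """Means, residues, variance and mean squared residue of one bicluster.

    `a` is a matrix stored as a list of lists; `rows` plays the role of S_k
    and `cols` the role of F_k. All names here are illustrative assumptions.
    """
    n_r, n_c = len(rows), len(cols)
    # Row means: average over the bicluster's columns (assumed definition).
    row_mean = {i: sum(a[i][j] for j in cols) / n_c for i in rows}
    # Column means, Eq. (6.8): average over the bicluster's rows.
    col_mean = {j: sum(a[i][j] for i in rows) / n_r for j in cols}
    # Overall mean, Eq. (6.9).
    mean = sum(a[i][j] for i in rows for j in cols) / (n_r * n_c)
    # Residues, Eq. (6.10).
    residue = {(i, j): a[i][j] - row_mean[i] - col_mean[j] + mean
               for i in rows for j in cols}
    # Variance, Eq. (6.11): a sum of squared deviations, not divided by size.
    variance = sum((a[i][j] - mean) ** 2 for i in rows for j in cols)
    # Mean squared residue, Eq. (6.12).
    msr = sum(r ** 2 for r in residue.values()) / (n_r * n_c)
    return {"row_mean": row_mean, "col_mean": col_mean, "mean": mean,
            "residue": residue, "variance": variance, "msr": msr}


# Rows 0 and 1 differ only by a constant shift, so every residue vanishes
# even though the entries themselves are not constant.
A = [[1, 2, 3],
     [2, 3, 4],
     [9, 0, 5]]
stats = bicluster_stats(A, rows=[0, 1], cols=[0, 1, 2])
```

In this toy matrix the bicluster on rows 0 and 1 has a nonzero variance but a zero mean squared residue, which is the distinction between the constant-bicluster criterion (6.11) and the additive-pattern criterion (6.12).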

The first biclustering approach, due to Hartigan [28], is known as block clustering. Its objective function is

\[ \min \operatorname{Var}(B) = \sum_{k=1}^{K} \operatorname{Var}(B_k) = \sum_{k=1}^{K} \sum_{i \in S_k} \sum_{j \in F_k} (a_{ij} - \mu_k)^2, \]

where the number of biclusters $K$ is given in advance. For each bicluster, the variance $\operatorname{Var}(B_k)$ is 0 if the bicluster is constant.
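To make the block clustering objective concrete, the following minimal sketch evaluates the total variance for a given set of biclusters. It only scores a fixed partition; it does not implement Hartigan's search procedure, which is not described here. The function names and the representation of a bicluster as a `(rows, cols)` pair are assumptions made for illustration.

```python
def block_variance(a, rows, cols):
    """Sum of squared deviations from the block mean, as in Eq. (6.11)."""
    mean = sum(a[i][j] for i in rows for j in cols) / (len(rows) * len(cols))
    return sum((a[i][j] - mean) ** 2 for i in rows for j in cols)


def total_variance(a, biclusters):
    """Block clustering objective: sum of the variances of all biclusters."""
    return sum(block_variance(a, rows, cols) for rows, cols in biclusters)


A = [[1, 1, 5, 5],
     [1, 1, 5, 5],
     [3, 3, 7, 8]]

# Hypothetical partition of A into four blocks.
blocks = [([0, 1], [0, 1]),   # constant block of 1s
          ([0, 1], [2, 3]),   # constant block of 5s
          ([2],    [0, 1]),   # constant block of 3s
          ([2],    [2, 3])]   # entries 7 and 8, not constant

objective = total_variance(A, blocks)
```

The three constant blocks contribute nothing to the objective; only the last block, whose entries differ, adds to the total.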

CC. Cheng and Church's algorithm (CC) [11] defines a bicluster to be a submatrix whose mean squared residue score is below a user-defined threshold $\delta$, i.e., $H_k \le \delta$, where 0 is the minimum possible value of $H_k$. To find the largest bicluster in $A$, they propose a two-phase strategy: first remove rows and columns, then add back some of the removed rows and columns according to certain rules. In the removal phase, the row to be removed is

\[ \arg\max_{i \in S_k} \frac{1}{|F_k|} \sum_{j \in F_k} r_{ij}^2, \]

and the column to be removed is

\[ \arg\max_{j \in F_k} \frac{1}{|S_k|} \sum_{i \in S_k} r_{ij}^2. \]

These removal steps are repeated until a bicluster with $H_k \le \delta$ is obtained. Then some previously removed rows and columns can be added back without violating the requirement $H_k \le \delta$. Yang et al. [58, 59] proposed an improved version of this algorithm, the heuristic flexible overlapped clustering (FLOC) algorithm, which allows missing entries in $A$.
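The removal phase described above can be sketched as follows. This is a simplified illustration rather than Cheng and Church's full procedure: it assumes that each iteration removes a single row or column, whichever has the larger mean squared residue, and it omits the addition phase entirely. The function names `residues` and `cc_deletion_phase` are hypothetical.

```python
def residues(a, rows, cols):
    """Residues r_ij of the submatrix on `rows` x `cols`, as in Eq. (6.10)."""
    n_r, n_c = len(rows), len(cols)
    row_mean = {i: sum(a[i][j] for j in cols) / n_c for i in rows}
    col_mean = {j: sum(a[i][j] for i in rows) / n_r for j in cols}
    mean = sum(row_mean.values()) / n_r
    return {(i, j): a[i][j] - row_mean[i] - col_mean[j] + mean
            for i in rows for j in cols}


def cc_deletion_phase(a, delta):
    """Remove rows/columns one at a time until the MSR drops to `delta`.

    Assumption: at each step the row or column with the largest mean
    squared residue is removed; ties go to the row.
    """
    rows = list(range(len(a)))
    cols = list(range(len(a[0])))
    while True:
        r = residues(a, rows, cols)
        h = sum(v * v for v in r.values()) / (len(rows) * len(cols))
        if h <= delta:
            break
        # Per-row and per-column mean squared residues (the arg max scores).
        row_score = {i: sum(r[i, j] ** 2 for j in cols) / len(cols) for i in rows}
        col_score = {j: sum(r[i, j] ** 2 for i in rows) / len(rows) for j in cols}
        worst_row = max(rows, key=row_score.get)
        worst_col = max(cols, key=col_score.get)
        if len(rows) > 1 and (len(cols) == 1
                              or row_score[worst_row] >= col_score[worst_col]):
            rows.remove(worst_row)
        elif len(cols) > 1:
            cols.remove(worst_col)
        else:
            break  # a 1 x 1 submatrix cannot shrink further
    return rows, cols, h


# Rows 0-2 follow an additive pattern; row 3 does not fit it.
A = [[1, 2, 3],
     [2, 3, 4],
     [3, 4, 5],
     [9, 0, 7]]
kept_rows, kept_cols, score = cc_deletion_phase(A, delta=0.5)
```

On this toy matrix the non-conforming row has by far the largest row score, so it is removed first, and the remaining additive submatrix already satisfies the threshold.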
