Matrix Visualization - Data Visualization

Graphics Reference

In-Depth Information

( ) and Slagle et al. ( ) related the traveling salesman and shortest spanning

pathproblemstotheclusteringofdataarrays.hecolorhistogramofWegman( )

wasthefirstcolormatrixvisualization tobereportedinthestatistical literature.Min-

notte and West ( ) extended the idea of color histograms to a data image package

that was later used for outlier detection (Marchette and Solka, ).

Some matrix visualization techniques were developed to explore only proximity

matrices: Ling ( ) looked for factors of variables by examining relationships us-

ing a shaded correlation matrix; Murdoch and Chow ( ) used elliptical glyphs to

represent large correlation matrices; Friendly ( )proposed corrgrams (similar to

the reorderable matrix method) to analyze multivariate structure among the vari-

ables in correlation and covariance matrices. Chen ( , ,and ) integrated

the visualization of a raw data matrix with two proximity matrices (for variables and

samples) into theframeworkofgeneralized association plots(GAP).heCluster and

TreeView packagesofEisen etal.( )areprobably the mostpopularmatrix visual-

ization packages due to the proliferation of gene expression profiling for microarray

experiments.

he permutation (ordering) of the columns and rows of a data matrix, and prox-

imity matrices for variables and samples, is an essential step in matrix visualization.

Several recent statistical works have touched on the issue of reordering variables and

samples: Chen ( ) proposed the concept of the relativity of a statistical graph;

FriendlyandKwan ( )discussedthe ideaofeffect-ordering ofdata displays;Hur-

ley( )used scatterplot matrices and parallel coordinates plots as examples to ad-

dressthe issueof placing interesting displays inprominent positions. Different terms

(such as the reorderable matrix, the heatmap, the color histogram, the data image

and matrix visualization) have been used in the literature to describe these related

techniques. We use matrix visualization (MV) to refer to them all.

The Basic Principles

of Matrix Visualization

15.3

We use the GAP (Chen, ) approach to illustrate the basic principles of matrix

visualization for continuous data, using the genes and microarray exper-

iments collected in the published yeast expression database for visualization and

data mining (Marc et al., ), which is designated henceforth as Dataset . De-

tailed descriptions of data preprocessing are given for the yeast Microarray Global

Viewer(http://transcriptome.ens.fr/ymgv/). Forthepurposesofillustration, wehave

selected samples and genes across these samples (“Dataset ”), where rows cor-

respond to genes and columns to microarray experiments (arrays). In various gene

expression profile analyses, the roles played by rows and columns are oten inter-

changeable. his interchangeability is well suited to the GAP approach to matrix vi-

sualization, where samples and variables are treated symmetrically and can be inter-

changed directly.

Search WWH ::

Custom Search

Home