Information Technology Reference
In-Depth Information
Ta b l e 8 . 4
MCA equivalents for the CA notation used in Chapter 7.
CA
MCA
X : Two-way contingency table with p
rows and q columns representing two
categorical variables with p and q
categories, respectively.
G : Indicator matrix with n rows and L
columns.
G
, G p ]
G j : n × L j . The indicator matrix for the
j th ( j
=
[ G 1 , G 2 ,
...
, p ) categorical variable
having L j categories.
=
1,
...
n : Total number of samples, i.e. n = 1 X1
n : Total number of samples (number of
rows of G ), i.e. n = 1 G1 / p
R : p
×
p
=
diag
(
X1
)
diag
(
G1
) =
diag
(
p 1
) =
p I : n
×
n
1 X
1 G
1 L
C : q
×
q
=
diag
(
)
diag
(
) =
diag
(
) =
L : L
×
L
Also, L
=
diag
(
diag
(
L 1 )
,diag
(
L 2 )
,
...
,
diag
,where L k is a diagonal
matrix giving the frequencies
( l k 1 , l k 2 , ... , l kL k ) of the k th variable.
(
L p ))
R11 C
p 11 L
11 L
E : p
×
q
=
/
n
n
(A subtle point to note is that the sum of
all elements of the input matrix is
needed here, not the number of
samples.)
/
np
=
/
11 /
11 /
Column-centred X
= (
I
p
)
X
Column-centred G
= (
I
n
)
G
W 1
1
( X E ) C 1 / 2 W 1
2
W 1
1
( p 1 / 2 I )( G 11 L / n ) L 1 / 2 W 1
2
R 1 / 2
= W 1
( p 1 / 2 I )( G 11 G / n ) L 1 / 2 W 1
1
2
= W 1
1
) } W 1
2
where the expression within the braces
denotes the column-centred matrix
p 1 / 2 GL 1 / 2
{ ( I 11 / n )( p 1 / 2 GL 1 / 2
8.9.3 Function CATPCAbipl
CATPCAbipl is our main function for constructing the two-dimensional categorical PCA
biplot discussed in Section 8.8.
Usage
CATPCAbipl uses the same calling conventions as PCAbipl and shares the following
arguments with it:
alpha
line.width
ort.lty
ax
max.num
parplotmar
ax.name.size
offset
pos
c.hull.n
orthog.transx
predict.sample
exp.factor
orthog.transy
select.origin
Search WWH ::




Custom Search