Principal component analysis biplots - Understanding Biplots

Information Technology Reference

In-Depth Information

Ta b l e 3 . 1 0

Means of species O. bullata, O. porosa and O. kenyensis for each of six

variables.

VesD

VesL

FibL

RayH

RayW

NumVes

O.bul

412

1185

375

O.ken

137

401

1568

446

O.por

129

342

1051

398

Ta b l e 3 . 1 1

Overall quality (%) of the Figure 3.23 biplot in each of the six dimensions.

Dim 1

Dim 2

Dim 3

Dim 4

Dim 5

Dim 6

Quality

47.38

64.37

79.93

88.87

96.14

100.00

The call

PCA.predictivities(Ocotea.data[,3:8], scaled.mat = TRUE,

X.new.samples = new.samples)

returns the measures of fit given in Tables 3.11 - 3.14.

From Table 3.11 we see that the Figure 3.23 biplot has an overall quality of 64%.

Adding another dimension would increase the quality to 80% and constructing a three-

dimensional biplot can thus be considered. Inspecting the axis predictivities in Table 3.12

shows that Ve s L has a poor predictivity of 0.29, while FibL and RayW have predic-

tivities exceeding 0.79. Judging from the three-dimensional predictivities it seems that

all variables have satisfactory predictivities in three dimensions. Turning attention to the

individual samples, it is clear from Table 3.13 that some of the predictions for some

sample points could be made very accurately from the Figure 2.23 biplot (e.g. S1 , S2 ,

S3 , S35 ), while others have a very low sample predictivity (e.g. S37 ). It is clear from

Table 3.14 that interpolating the sample mean of O. kenyensis into the biplot results in

a point whose predictions for the six variables can be very accurately inferred from the

two-dimensional PCA biplot. For the two other interpolated new points adding a third

dimension is desirable. In the next chapter we will see how a canonical variate analysis

biplot allows us to represent the means for all three species exactly in two dimensions.

3.5 Adding new axes to a PCA biplot and defining

their predictivities

We saw in Section 2.7 how to use the regression method for obtaining coordinates for

constructing a biplot axis for a new variable. Let x ∗ : n

1 denote a centred (and scaled,

if required) vector of sample values for a new variable. Note that this new variable is

centred and scaled using its own values analogous to the centring and scaling of the

original matrix X . It follows from (2.20) that the coordinates for adding a new biplot

axis representing x ∗ are given by the first r nonzero elements (which we denote by b r )

of J

− 1 U x ∗ =

as defined in Section 3.2. This added variable does not

play any role in providing the scaffolding axes for constructing the PCA biplot but its axis

predictivity can be defined as follows. For predictivity, we have to predict the values

of x ∗ given the r -dimensional PCA coordinates XVJ . The usual orthogonal multiple

Jb with J and

Understanding Biplots

Search WWH ::

Custom Search

Home