Modeling Methodology: Dimension Reduction and Resampling Methods - Neural Networks: Methodology and Applications

Information Technology Reference

In-Depth Information

Fig. 3.2. Change of variables by PCA

Inertia on Centered and Reduced Data

For centered and reduced data, one has Tr X T X = n.

Consider a sub-space of dimension q<n , and denote by V n×q

the matrix

R q ; the scatter diagram projected on

R q is

associated with the projector on

represented by matrix XV , the inertia of which is

I q =Tr V T X T XV .

PCA defines the linear projection that maximizes I q , the value of the inertia

of the points computed in

R q . That problem is solved by finding the first axis

with respect to which the inertia is maximum, then a second axis, orthogonal

to the previous one, to carry on with the maximization of the inertia, and

so on up to the p th axis. The axes obtained are borne by the eigenvectors of

matrix X T X , ranked in order of decreasing eigenvalues λ i . The eigenvalues λ j ,

j =1 ,...,n are positive or zero, since matrix X T X is positive symmetrical.

The transformation to be performed on of the centered data to obtain the

main components is

∈ R n →

V T

n×q

∈ R q<n .

x

Search WWH ::

Custom Search

Home