Chemometric methods application in pharmaceutical products and processes analysis and control - Computer-Aided Applications in Pharmaceutical Technology

Information Technology Reference

In-Depth Information

analyzed data set and represents it in visually comprehensible lower-

dimensional space. Reduction of the original variables number results

from generation of their linear combinations - new latent variables (LV),

referred to as loadings.

Note that bold uppercase (e.g. X ) represents matrices, bold lowercase

(e.g. y ) represents vectors, and italic characters represent scalars. X T

indicates vector transpose, if:

[4.1]

[4.2]

The transpose of a column vector is a row vector and vice versa.

Inverse matrix of X is:

[4.3]

In the case of a data set of M samples and N variables (Figure 4.1, a

data set of 4 samples and 2 variables), each sample represents row vector

x i T . Each variable j is described by its values for all samples with column

vector x j . If sample vectors are plotted in variable space, then the number

of axes is equal to the number of variables N . This way, all the information

in X , regarding the relationships (similarities or differences) between

samples, can be displayed. The same stands for representation of variable

vectors in sample space. The number of axes is equal to the number of

samples and relationships (correlations or co-variances) between variables

(Rajalahti and Kvalheim, 2011). This common variation (i.e. correlation)

between variables is used to determine new LVs. An increase in the

number of variables studied complicates the determination of LVs, such

as dimensionality reduction. This task is usually achieved by projecting

onto LVs. One of the most common algorithms used for computation of

LVs is the non-linear iterative PALS (NIPALS) algorithm (Geladi and

Kowalski, 1986). The idea behind the NIPALS algorithm is successive

orthogonal projection of LVs. Prior to LV projection, weight vector w a is

defi ned, on a different basis for different multivariate methods. The

weight vector describes LVs in both variable and sample space. Therefore,

the score vector t a and loading vector p a are different presentations of the

same LV, carrying information about samples in variable space and

variables in sample space, respectively (Rajalahti and Kvalheim, 2011).

Search WWH ::

Custom Search

Home