Environmental Engineering Reference
In-Depth Information
Blatman (2009, Appendix D)) as follows. The predicted residual in Equation 6.38 even-
tually reads:
MM MM
()
x
()
i
PC
(
x
()
i
()
i
PC
\
i
( )
i
i
=
()
x
()
x
=
,
(6.42)
1
h
where h i is the i ith diagonal term of matrix a(a T a) −1 a T . The LOO error estimate eventually
reads
2
n
1
MM
()
x
()
i
PC
() ,
x
()
i
·
Err
=
(6.43)
LOO
n
1
h
i
i
=
1
where M PC has been built up from the full ED. As a conclusion, from a single resolution
of a least-square problem using the ED X, a fair error estimate of the mean-square error is
available a posteriori using Equation 6.43 . Note that in practice, a normalized version of
the LOO error is obtained by dividing Err LO · by the sample variance Var [ Y ]. A correction
factor that accounts for the limit size of the ED is also added (Chapelle et al., 2002) that
eventually leads to
2
n
1
+
tr C
Var
1
1
1
MM
()
x
()
i
PC
()
x
()
i
n
emp
ˆ
LOO
=
,
(6.44)
[]
nP
1
h
Y
i
i
=
1
T
where tr(⋅) is the trace, P = card A , and C
= ()
1 ΨΨ
/
n
.
emp
6.3.6 Curse of dimensionality
Common engineering problems are solved with computational models having typically
M = 10-50 input parameters. Even when using a low-order PC expansion, this leads to a
large number of unknown coefficients (size of the truncation set A M,p ), which is equal to, for
example, 286 − 23,426 terms when choosing p = 3. As explained already, the suitable size
of the ED shall be 2-3 times those numbers, which may reveal as unaffordable when the
computational model M is, for example, a finite-element model.
On the other hand, most of the terms in this large truncation set A M,p correspond to
polynomials representing interactions between input variables. Yet, it has been observed in
many practical applications that only the low-interaction terms have coefficients that are
significantly nonzero. Taking again the example { M = 50, p = 3}, the number of univari-
ate polynomials in the input variables (i.e., depending only on U 1 , on U 2 , etc.) is equal to
p M = 150, that is, <1% of the total number of terms in A 50,3 . As a conclusion, the com-
mon truncation scheme in Equation 6.18 leads to compute a large number of coefficients,
whereas most of them may reveal negligible once the computation has been carried out.
Owing to this sparsity-of-effect principle, Blatman (2009) has proposed to use truncation
schemes that favor the low-interaction (also called low-rank) polynomials. Let us define the
rank r of a multi-index α by the number of nonzero integers in α, that is,
M
def α
r
=
=
1 {
} .
(6.45)
α
>
0
0
i
i
=
1
 
 
Search WWH ::




Custom Search