Constructing multivariate distributions for soil parameters - Risk and Reliability in Geotechnical Engineering

Environmental Engineering Reference

In-Depth Information

It is essential to verify whether the problem of “insufficient coverage” exists for the boot-

strap confidence intervals of δ 12 . The coverage probability refers to the probability for the

bootstrap confidence interval to cover the actual value of δ 12 . To investigate the sample size

under which the problem of insufficient coverage will be minimal, the probability for the

95% bootstrap confidence interval of δ 12 to cover the actual value of δ 12 is simulated. This

is done by the following steps:

1. Simulate (X 1 , X 2 ) data of sample size = n from a bivariate standard normal distribution

with a chosen δ 12 .

2. Construct the 95% bootstrap confidence interval of δ 12 based on the simulated (X 1 ,

X 2 ) data.

3. See whether the 95% bootstrap confidence interval covers the actual of δ 12 .

Do this for 1000 independent realizations of (X 1 , X 2 ) data. The coverage probability is

simply the ratio of successful coverage among the 1000 realizations. The bootstrap confi-

dence interval works properly if the coverage probability is close to 95%.

Table 1.7 shows the coverage probabilities for the 95% bootstrap confidence intervals of

δ 12 for three chosen values of δ 12 (0, 0.5, and 0.9). The numbers in the parentheses are for the

BCa method. It is clear that the coverage probability for δ 12 is close to 0.95 when n ≥ 50. The

improvement brought about by the BCa method for the 95% bootstrap confidence intervals

of δ 12 is insignificant. It is therefore recommended that the percentile method is sufficient and

that the sample size n should be ≥50 for the bootstrap confidence intervals to work properly.

1.3.3.5 Goodness-of-fit test (the line test)

The normality of a column of data can be checked using the K-S test (see Equations 1.23

through 1.26 ). However, the bivariate normality of two columns of data requires a separate

check. The line test (Hald 1952; Kowalski 1970) is a reasonably simple test for bivariate

normality. The line test is based on the fact that if (X 1 , X 2 ) are bivariate standard normal,

the variable Q

−

Q =

(1.46)

−

1 2

is χ-square distribution with 2 DOF. In reality, the value of μ, σ, and δ is not available.

Only the sample versions are available. Hence, (μ 1 , μ 2 ) are replaced by ( m 1 , m 2 ), (σ 1 , σ 2 ) are

replaced by ( s 1 , s 2 ), and δ 12 is replaced by its sample estimate (say using Equation 1.40 ). The

CDF for the 2-DOF χ-square distribution is

Table 1.7 Coverage probabilities for the 95% bootstrap confidence intervals of δ 12

(Numbers in parentheses are for the BCa method)

δ 12 = 0

δ 12 = 0.5

δ 12 = 0.9

n = 10

0.91 (0.94)

0.92 (0.94)

0.90 (0.93)

n = 20

0.92 (0.93)

0.92 (0.92)

0.94 (0.95)

0.94 (0.94)

0.95 (0.95)

n = 50

0.95 (0.95)

0.94 (0.94)

0.95 (0.95)

n = 100

n = 1000

0.95 (0.94)

0.94 (0.94)

Risk and Reliability in Geotechnical Engineering

Search WWH ::

Custom Search

Home