Environmental Engineering Reference
In-Depth Information
1.3.3.4 Statistical uncertainties in the
δ
12
estimate
The bootstrapping can be used to assess the sampling distribution of δ
12
. The steps for boot-
strapping are as follows:
1. Resampling the sample index with replacement. For the data in
Equation 1.43
, there
are in total five data points (
n
= 5). As a result, the sample
index
includes 1, 2, 3,
4, 5. Resampling these indices with replacement can be done by implementing the
MATLAB function randsample(
n
,
n
, 1). Note that after the resampling, there may be
repetitive indices.
2. Suppose the resampled indices are (5, 3, 1, 1, 2). The resampled samples are therefore
′
=
()
5
()
3
()
1
()
1
()
2
XXXXX
XXXXX
1
1
1
1
1
X
T
()
5
()
3
()
1
()
1
()
2
2
2
2
2
2
(1.45)
168174
.
.
120120
.
.
067
.
=
−
051027
.
.
−
128128
.
−
.
−
094
.
3. Estimate the sample Pearson correlation between the resampled (X
1
, X
2
) using one of
the methods presented above. This is a resampled δ
12
value.
4. Repeat steps 1 and 2 to obtain B resampled δ
12
value. Note that B is not the same as
n
.
The B resampled δ
12
values can be viewed as approximate realizations of the sampling
distribution of δ
12
.
δ
12
is 0.912. However, these are point estimates of δ
12
, and it is not clear how large the
ues for
n
= 50 and
n
= 200 based on the bootstrapping procedure (bootstrap sample size
B = 10,000). The 95% bootstrap confidence interval of δ
12
can be estimated as the interval
bounded by the 0.025 and 0.975 sample percentiles of the resampled δ
12
values. This is
called the percentile method (Efron 1981). It is clear that the confidence interval is narrower
when the number of data points gets larger.
2000
2000
Point estimate
= 0.912
n
= 50
Point estimate
= 0.937
n
= 200
1500
1500
1000
1000
500
500
0
0
0.8
0.85 0.9
Bootstrap samples of δ
12
0.95
1
0.8
0.85 0.9
Bootstrap samples of δ
12
0.95
1
Figure 1.13
Bootstrap samples of
δ
12
. The arrows indicate the point estimates.
Search WWH ::
Custom Search