Agriculture Reference
In-Depth Information
1.3 The Predictive Approach: The Concept
of Superpopulations
The traditional approach to survey sampling has several limitations, which have
been discussed in the literature over the last 40 years. Among others, Godambe
(
1966
) analyzed the singular effect of likelihood considerations on this approach to
survey sampling.
Design-based inference, as already noted, only considers
s
as the stochastic
element, and treats
y
as a constant. An alternative strategy assumes that the actual
finite population
y
i
is a realization of a random vector. In particular y¼ (
y
1
y
2
...
y
N
)
t
represents a certain observation vector of an
N
-dimensional distribution that
defines a stochastic model
ʾ
. This is called the superpopulation model. For example,
we may be interested in estimating the population total,
t
¼
X
U
y
k
. Using the
superpopulation approach, the population total represents the sum of only one
realization. If other realizations were generated, they would have different values
for
t
¼
X
U
y
k
. Under this strategy, the sample values are also random variables.
The population total is a sum of random variables, so it is also a random variable. In
this case, the source of randomness in a sample is only derived from the stochastic
model,
.
The superpopulation approach can be interpreted in terms of a long series
of realizations of the random process, for a fixed sample
s
. In reality, there is only
one finite population, so for the sake of simplicity, we will use
y
k
to represent both
the random variable
Y
k
and the observed value.
Now, consider a generic drawn sample
s 2 S
, and its complement
s 2 U s
. The
sample value
y
k
is only observed for
k 2 s
. An estimator of
t
is defined as a function
of these observed values,
ʾ
t
¼
^
t
k
f ð Þ
. Estimating
t
is equivalent to
predicting the population total using the available data. So, the predictor
t
of
t
is
said to be model unbiased if, given
s
:
k 2 s
E
ʾ
t t
½
ð
Þ j
¼ 0
:
ð
1
:
31
Þ
In this case, the model mean square error of
t
is
h
i
2
j
MSE
ʾ
ðÞ
¼
E
ʾ
t t
ð
Þ
:
ð
1
:
32
Þ
The difference between the two definitions of
MSE
in Eqs. (
1.18
) and (
1.32
)isonly
due to the source of randomness. In the design-based approach, the uncertainty is
ensured by
p
(
s
), while in the superpopulation approach, the randomness is provided
from the model
ʾ
. For this reason, this approach is also called model-based survey
sampling.
Using the superpopulation approach, we estimate the population total as follows.
The population total,
t,
can be decomposed into
Search WWH ::
Custom Search