Spatial Survey Data Modeling - Sampling Spatial Units for Agricultural Surveys

Agriculture Reference

In-Depth Information

expenditure on capital equipment, the total salary costs, or the total production of a

field. Alternatively, practitioners may use the mean per unit as a descriptive

statistic, which is often a total divided by an estimate of the number of units that

contribute to the total.

A superpopulation model (see Sect. 1.3 ) can be used to formalize the relation-

ship between a target variable y and auxiliary data X. For example, in a survey of

farms, the yield of a crop in a particular period may be related to the geographical

coordinates of the field, to the elevation (obtained through a digital elevation

model), and to the terrain. The main assumption is that the quantities of interest

are modeled as realizations of random variables with a particular joint probability

distribution. For example, in this case, the model can be defined as

Y k ¼ ʲ 0 þ ʲ 1 x 1 k þ ʲ 2 x 2 k þ ʲ 3 x 3 k þ ʵ k

...

, N

;

where Y k 1 is the yield of a crop, x 1 k , x 2 k , x 3 k , are the covariates, and the ʵ k s are

uncorrelated random errors with mean 0 and variance

2 x k . This is a simple

specification, but more complicated models can be used. In fact, we can add

different or additional covariates to the model, or use a non-linear relationship

between the variables.

Now, consider the population vector y

t that is treated as the

y 1

y 2

...

y N

t , and the general linear

realization of a random vector Y

Y 1 Y 2

...

Y N

model ξ

E ξ Y

ðÞ¼

Var ξ Y

ðÞ¼

;

where X is an N

q matrix of covariates,

is a q

1 vector of unknown param-

eters, and V is a positive definite covariance matrix.

Under Model ( 12.2 ), we can define the population total estimate, and derive the

best linear unbiased predictor (BLUP) estimator (Valliant 2009 ).

Generally speaking, our objective is to estimate a linear combination of y,

namely

t is a vector of constants of size N .If

we want to estimate the population total, then

t y, where

ʳ¼ ʳ 1

ʳ 2

... ʳ N

ʳ k ¼

1. Conversely, if we want to

1/ N .

We select a sample s of size n from the population of N units , and observe the

y values of the sample units. The non-sample units are denoted as s . Without loss of

generality, for any sample s , we can arrange the population vector y so that the first

n units are in the sample, and the last N

ʳ k ¼

estimate the population mean, then

n are not in the sample. In this way, we

t , where y s is the vector of the observed values of

can redefine the vector y ¼ y s ; y s

the sampled n units, and y s

is the vector of the unobserved values of the

1 To avoid confusion, note that in this section the uppercase Y indicates a random vector, while the

lowercase y describes the realization of Y.

Sampling Spatial Units for Agricultural Surveys

Search WWH ::

Custom Search

Home