|
Although
much has been publish on the selection of grid size, cell
size selection is seldom based on the inherent spatial
variability of the data. In fact, in most GIS projects,
grid resolution
is selected without any scientific justification. In ArcGIS
package, for example, the default output cell size is suggested
by the system using some trivial rule: in the case the
point data is being interpolated in spatial analyst, the
system
will take the shortest side of the study area and divide
by 250 to estimate the cell size. Obviously, such pragmatic
rules do not have a sound scientific background in cartography
nor in geoinformation science. This motivated me to produce
methodological guides to select a suitable grid resolution
for output maps and based on the inherent properties of
the input data. I tried to relate the choice of grid resolution
to the concrete cartographic and statistical concepts,
namely scale, processing power, positional accuracy, inspection
density, spatial correlation and complexity of terrain.
Here you can obtain an R script with processing steps explained
in detail.
For
more details see:
Hengl T., 2006. Finding the right pixel size. Computers and Geosciences, 32(9): 1283-1298.
SEE ALSO:
- Atkinson, P.M. and Aplin, P., 2004. Spatial variation in land cover and choice of spatial resolution for remote sensing. Photogrammetric Engineering and Remote Sensing, 25(18): 3687-3702.
- Bishop, T.F.A., McBratney, A.B. and Whelan, B.M., 2001. Measuring the quality of digital soil maps using information criteria. Geoderma, 103(1-2): 95-111.
- Dungan, J.L. et al., 2002. A balanced view of scale in spatial statistical analysis. Ecography, 25: 626-640.
- Florinsky, I.V. and Kuryakova, G.A., 2000. Determination of grid size for digital terrain modelling in landscape investigations – exemplified by soil moisture distribution at a micro-scale. International Journal of Geographical Information Science, 14(8): 815–832.
- Heuvelink, G.B.M. and Pebesma, E.J., 1999. Spatial aggregation and soil process modelling. Geoderma, 89(1-2): 47-65.
- McBratney, A.B., 1998. Some considerations on methods for spatially aggregating and disaggregating soil information. Nutrient Cycling in Agroecosystems, 50: 51-62.
Table: Summary equations to select the grid resolution: SN is the scale factor, r_E is the positioning error, r_E is the average positioning error, a is the average size of delineations, a_MLD is the area of the minimum legible delineation, w_MLD is the width of the narrowest legible delineation, A is the surface of the study area, N is the number of sampled points in the study area, h_{ij} is the spacing between the closest point pairs, h_R is the range of spatial dependence, m is the number of point pairs within the range and l is the total length of countours.
Aspect |
Coarsest legible resolution |
Finest legible resolution |
Recommended compromise |
Working scale |
|
|
|
GPS positioning error |
|
|
|
Size of reference objects |
|
|
|
Inspection density |
|
|
|
Distance between the points |
|
|
|
Range of spatial dependence |
|
|
|
Complexity of terrain |
|
|
|

EXAMPLES:
Example
1: Selection of the cell size
based on the GPS positioning
accuracy -- GPS100.txt
In
this
example
100
positioning
fixes
were
recorded
using
the
single-fix
GPS
positioning
method
at
the
control
point
with
a
known
location
(Xt=6535950,
Yt=5066581.48).
The
fluctuation
of
the
GPS
readings
can
be
seen
in
figure.
The
errors
ranged
from
0.7
to
23.9
m,
average
error
was
8.5
m
with
a
standard
distribution
of
5.2
m.
The
error
vectors
seems
to
follow
the
log-normal
distribution.
The
theoretical
distribution
gave
the
95%
probability
radius
of
19.1
m,
while
the
experimental
distribution
shows
somewhat
higher
value
(20.4).
A
suitable
grid
resolution
for
this
case
is
34.4
m.
If
this
grid
resolution
is
selected,
most
(95%)
of
GPS
fixes
will
fall
within
the
right
pixels.
This
number
for
example
corresponds
to
the
resolution
of
the
Landsat
imagery.
More
accurate
positioning
method
would
be
needed
to
locate
points
within
the
finer
grid
resolutions.
If
we
would
like
to
use
a
GPS
positioning
with
grid
resolutions
of
about
15
m,
then
we
would
need
to
use
GPS
positioning
with
averaging
(5
minutes
per
point).
Higher
positional
accuracy
(5-20
times)
can
be
achieved
by
using
differential
correction
or
WAAS
(Wide
Area
Augmentation
System),
which
can
improve
accuracy
to
less
than
three
meters
on
average.
Such
accuracy
would
be
compatible
with
grid
resolutions
within
the
range
2-10
m
(SPOT
or
IKONOS
imagery).

Figure: Selecting the grid resolution based on the confidence radius of the positioning method: (a) 100 single-fix GPS measurements around the true location of the point; (b) histogram of the error vectors, average error vector and the 95% probability confidence radius.
Example
2: Selection of the cell size
based on the size of agricultural
plots -- plots.zip
In this example I used an existing polygon map of agricultural fields (figure). The map consists of 121 polygons in total. The smallest polygon is 0.005 ha, the biggest is 6.903 ha, average size of polygons is 0.824 ha with standard deviation of 1.005 ha. The polygons were then separated into two groups according to the shape complexity index. In this case only 6 polygons classified for narrow polygons. For each of these, an average width has been estimated by taking regular measurements (10 per polygon). I further on derived the 5% inverse cumulative distribution value assuming the log-normal distribution. I got 0.046 ha, which means that the pixel size should be about 20 m. The coarsest legible grid size for this data set (P=50%) would be 70 m (A=0.5 ha). If resolutions coarser than 70 m are used to monitor agriculture for this area, then in more than 50% of the areas there will be less than four pixels per polygon. Note that in this case we are not using the true smallest polygon size but somewhat bigger figure (0.046 ha) because the smallest value (0.005 ha) is not representative. The further inspection of the widths showed that the average width of the narrow polygons is about 16 m, which gives a somewhat more strict grid resolution of about 8 m. However, the narrow polygons occupy only 0.9% percents of the the total study area, so we do not have to be as strict. Finally, I would recommend that the satellite imagery in range from 10 to 70 m can be used to monitor agriculture for this study area.

Figure: Selection of the grid resolution based on the average size of the objects observed: (a) agricultural plots; (b) distribution of surfaces for compact plots and related grid resolution.
Example
3:
Selection
of
the
cell
size
for
interpolation
purposes -- wesepe_c.eas
In this example I will use the Wesepe point data previously used in numerous soil mapping applications (De Gruijter et al., 1997). The dataset consist of 552 profile observations where various soil variables have been described. The target variable is the membership value to enk earth soil type. The values range from 0 to 1, with an average of 0.232 and a standard deviation of 0.322. The total size of the area is 12.1 km^2, which gives a sampling density of about 45 observations per km^2, which corresponds to the scale of 1:25K, i.e. grid
resolution of 12 m. If we inspect the spreading of the points, we see that the average spacing between the closest point pairs is about 120 m, which is fairly close to the regular point sampling (for this data set --- 148 m). The cumulative distribution showed that 95% of points are at distances of 5 and more meters. This means that the legible grid resolutions are between 5 and 150 m. Automated fitting of the variogram using a global model further gave a nugget parameter (C0) of 0.042, a sill (C0 + C1) of 0.097 and a range parameter (R) of 175.2 m, which means that the variable is correlated up to the distance of about 525 m (h_R). There are 11807 point pairs within this range, which finally means that the optimal lag/grid size would be about 23 m. The pattern analysis of the point data set further shows that there is clear regularity in the point geometry: most of the distances are grouped at 180 m. The final interpolated map (punctual estimates) in resolution of 10 m can be seen in Figure below. 
Figure: Selection of grid resolution based on the point pattern analysis: (a) a set of 552 soil profile observations; (b) probability of finding one point in the neighborhood and graph of distances to the closest point; (c) variogram and parameters fitted automatically in gstat, (d) interpolated map using ordinary kriging at grid resolution of 10 m.
Example
4:
Selection
of
cell
size
for
geomorphometric
analysis -- |