Autocorrelation - Semivariance

Functionality

AutoCorrelation calculates autocorrelation and semi-variogram values for a raster map. The autocorrelation of a raster map is generated by calculating the correlation between pixel values of a raster map and pixel values of the same raster map for different shifts (lags) in horizontal or vertical directions. The semi-variogram values which are a measure for the spatial variability of a raster map are calculated for the same shifts. The operation is a tool to obtain information on horizontal and vertical patterns in a raster map.

Method:

The user has to specify a maximum pixel shift, i.e. the maximum distance for which values of pixels should be compared to each other.

  1. For each pixel, the value of that pixel (at position P) is compared with the value of the pixel one column to the right (position Q: row + 1) and one row below (position R: col + 1). Figure 1 below shows the collection of horizontal pairs of pixel values (PQ) for the first row of a map for shift 1;
  2. Subsequently, for each pixel, the value of that pixel (at position P) is compared with the values of the pixel two rows to the right (position Q: row + 2) and two columns below (position R: col + 2). Figure 2 below shows the collection of horizontal pairs of pixel values for the first row of the map for shift 2;
  3. This continues until the specified maximum pixel shift is reached.
  4.   

    Fig. 1: Collection of horizontal pairs of pixel values PQ for the first row in a map for shift 1.

    Fig 2.: Collection of horizontal pairs of pixel values PQ for the first row in the map for shift 2.

Autocorrelation and correlogram:

The method is based on the assumption that a regional variable becomes random at a great distance. The autocorrelation is a value between -1 and +1. A horizontal correlation value of 1 means a perfect correlation between pixels with a certain shift, i.e. the pixel values at P (column position P) and Q (column position P + shift) increase and decrease exactly proportionally. Variation is however rarely as regular as this. In most cases autocorrelation is large at short shifts, where the values of P and Q are similar. It decreases to 0 at larger shifts, where there is little relation. A plot of the autocorrelation against the shift is known as a correlogram.

Semi-variogram values and semi-variogram:

Semi-variogram values are a measure of variance. It is a straightforward way of measuring how a value changes between P and Q. In the calculation, it is assumed that the variance of differences only depends on the distance between P and Q. A plot of the calculated semi-variogram values against the shifts (lags or distance) is known as a semi-variogram.

The figure below shows a typical semi-variogram:

It can be seen from the figure that the curve rises from a small semi-variogram value to higher levels at larger shifts, where it levels off. This horizontal part is known as the sill. At these large values of the lag there is no spatial dependence between the data points. The estimates of variances of differences will be invariant with distance. The distance at which the semi-variogram values approach the sill is the range. The range is the important part of the variogram because it describes how inter-site differences are spatially dependent. A long range indicates high correlation, a small range indicates low correlation. Often the semi-variogram values do not approach zero at the origin; semi-variogram values may seem to intersect the positive y-axis. This can be seen as the residual, spatially uncorrelated noise, which is also knows as the nugget.

Example:

The program can be used to detect correlation between pixels at regular horizontal or vertical distance intervals. In this way repetitions, or repeated patterns in a map, can be detected. An autocorrelation graph with regular peaks at certain interval distances indicates a cyclicity in the map. For instance, a map depicting the wave height at sea in a certain area will show an increased autocorrelation at a distance equal to the wave length, when looking in a direction parallel to the propagation direction of the waves.

Input map requirements:

The input raster map needs to have the Image domain or a value domain.

Output table:

An output table with domain None is created. The table has the same number of records as the pixel shift that you specified. The record numbers in the table represent the pixel shifts. The first record in the table represents a pixel shift of 1, the second record represents a pixel shift of 2, etc., and the last record thus represents the maximum pixel shift as specified. For your convenience, a distance column has been added which shows the pixel shift values as distance values (distance = record number * pixel size).

The table contains 5 columns:

Tips:

  1. For a better understanding it is useful to visualize autocorrelation and semi-variogram values in a graph. To visualize for instance semi-variogram values:
  2. For more information on semi-variograms, see also Spatial correlation : functionality.

  3. To visualize and determine a possible anisotropy axis, you can perform the Variogram surface operation.
  4. When you have points, you can perform the Spatial correlation operation: you can for instance calculate semi-variogram values for all or certain directions. To create points from a raster map, you can use the Raster to Points operation. Although Spatial correlation has more possibilities for points, the algorithm may become slow when there are very many points; the Autocorrelation-Semivariance operation is rather fast for large (raster) maps.

See also: