Correlation matrix

Functionality

The operation calculates correlation coefficients of raster maps in a map list. Correlation coefficients characterize the distribution of pixel values in two raster maps. Furthermore, the mean and standard deviation of each individual raster map is calculated.

Analyzing satellite data the images often show a degree of correlation. This means that when spectral values in one band are high the values in another band are expected to be high as well. Plotting values from highly correlated bands in a feature space will result in an ellipsoid denoting that the two bands contain dependent information. From a set of highly correlated bands only one adds real value whilst the other ones may be derived or estimated. Calculating a correlation matrix helps to detect the redundancy and identifies possible reductions in the number of bands, e.g., to be used in a color composite.

Correlation coefficients are normalized covariance values. A correlation coefficient ranges from -1 to +1. Diagonal elements are always 1. A correlation close to +1 indicates a direct relationship between two bands. This suggests that if the reflectance (DN) of a pixel in one band is known, the reflectance of that pixel in the other band can be derived or estimated. A correlation close to -1 indicates an inverse relationship between the reflectance values of one band and the reflectance values in the other one.

Example:

The following example shows a correlation matrix for 4 input bands. Bands 1 and 2 are highly correlated with a correlation coefficient of 0.97. Bands 1 and 4 seem independent with a correlation coefficient of -0.10.

  

band 1

band 2

band 3

band 4

band 1

1.00

0.97

0.50

-0.10

band 2

0.97

1.00

0.43

-0.21

band 3

0.50

0.43

1.00

0.83

band 4

-0.10

-0.21

0.83

1.00

The high correlation between band 1 and band 2 suggests the removal of one of these bands; in a color composite use band 1 or band 2, and band 3 and band 4.

Input map requirements:

The operation requires a map list in which all raster maps use the Image domain or the same value domain and the same georeference. Pixels in the input maps that have the undefined value are ignored in the calculation.

Output matrix:

After performing the operation, the correlation matrix, and the mean and standard deviation of each individual raster map are shown on the screen. To show the matrix once more after closing this window, you have to perform the operation again. The values of the correlation matrix are stored by the object definition file of the map list (.MPL).

Furthermore, after calculating a correlation matrix or a covariance matrix, the Properties dialog box of the input map list shows you the Optimum Index Factors, i.e. the combinations of three input maps with largest sum of standard deviations and smallest correlation. This can give you an indication which raster maps to use to create a color composite. For more information, see Optimum Index Factor : functionality / algorithm.

Note:

As the output correlation matrix is not stored as a separate object, this operation cannot be executed from the command line.

See also: