Pattern analysis

Functionality

Data in a point map represent spatial occurrence of a particular phenomenon. To acquire knowledge about the occurrence of the phenomenon, the spatial distribution of the points in the map can be examined. Point pattern analysis is a technique that is used to obtain information about the arrangement of point data in space, to be able to make a statement about the occurrence of certain patterns.

Three types of patterns can be distinguished:

There are basically two different techniques; the location of points can be studied:

Measures of arrangement:

Measures of arrangement are techniques that examine characteristics of the locations of points relative to other points in the pattern. In this technique measured frequencies of occurrences of reflexive nearest neighbours (RNN) are compared with expected frequencies of occurrences in a situation of CSR. The CSR is simulated for the same area and the same number of points. Two points are considered first order RNN if they are each other's nearest neighbour. This definition can be extended to higher orders; second order RNNs are points that are each others second-nearest neighbours etc. The frequencies are calculated for RNNs of first to sixth order.

Boots & Getis (1988, p. 69) remark that 'most researchers suggest that more higher order values in excess of CSR expectations indicate measure of regularity in arrangement of points, whereas lower empirical values imply elements of grouping in the pattern'. If more (first order) RNNs occur than is expected in CSR, it can be concluded that isolated and relatively uniformly arranged couples exist.

Measures of dispersion:

Measures of dispersion are techniques that examine characteristics of the locations of points with respect to the area e.g. the pattern is analyzed by calculating distances between individual points, and comparing these with the distances that would be found in a CSR. In this technique the mean distance between RNNs are tested against the expected distances in CSR. If the individual points are closer than they would be for CSR, this indicates a clustered pattern. If, on the other hand, individual points are further apart than they would be in CSR, a more regular pattern is assumed.

Input map requirements:

An input point map is required with at least two points which do not have the undefined value.

Output table:

An output table with domain None is created.

The output table contains the following information:

Measures of dispersion

In the output table, in which the mean distance between RNNs are tested against the expected distances in a (simulated) CRS, eight columns are stored:

The probabilities listed in the output table are in fact the relative frequencies drawn from the input point map, and are thus empirical values. If for a given distance, say 3.6m, the entry in column Prob3Pnt is 0.43, it means that for 43% of all points in the input point map, 3 nearest neighbours can be found within a radius of 3.6m.

Tips:

Measures of arrangement

The Additional Info tab in the Properties sheet of the output table contains additional information displayed in two tables, both with columns Order, Observed value, and Assumed with CSR.

The first table lists the order of the reflexive nearest neighbours (nearest, second nearest, etc.), the observed number of times (frequency) that such a pair is found in the dataset, and the number of times this would happen in the situation of CSR (expected frequency).

The second table lists the mean distance to the reflexive nearest neighbour for every order, and the mean distance in a CSR.

Advantages/disadvantages of dispersion vs. arrangement:

The advantages of measurements of arrangement are:

  1. The technique is density free, i.e. no estimates have to be made, for example the expected number of points per quadrant;
  2. The technique is free of edge effects, because the arrangement of points with respect to each other is studied rather than with respect to the area.

The advantages of the dispersion technique are:

  1. It is more rigorous than the arrangement technique, because it is more sensitive to certain differences in some pattern characteristics. For arrangement, identical values may sometimes be expected for patterns that are different in some way;
  2. The statistical theory for dispersion is better developed, so that this method is less subjective.

Note that higher order RNN analysis (refined analysis) might give more information than first order analysis alone; take the example of couples on a dance floor; the distance to the first order RNN will be significantly less than in CSR, while higher order RNN distances will be higher.

Reference:

See also: