The Final Solution

This is the 8th and final post on on my paper “A Method for Determining and Improving the Horizontal Accuracy of Geospatial Features” Other posts on this topic:

7. Determining the Spatial Accuracy of Polygons Using Buffer Overlay

In the last post on the subject we described how Buffer Overlay can be used to determine the horizontal accuracy of polygon features. We ran Buffer Overlay on a single Township/Range where we had 259 permits to compare against parcels. This resulted in 259 cumulative probability (CP) curves one for each permit. In the sample below 21 curves are shown for each of these when the CP is greater than one the buffer distance is equal to the accuracy of the permit. If the curve does not cross the 1 CP then its accuracy could not be determined.

Here is our initial horizontal accuracy distribution for the 259 permits. In the graph below we have two peaks: < 2 feet where we have about 50 records and >= 60 where we have about 35 these are those feature for which accuracies could not be determined. The middle part of the curve shows a random distribution of accuracies.

Here is the cumulative curve. If the data was perfect all 259 permits would have a horizontal accuracy of .5 feet. In which case the cumulative curve (shown below) would be a straight line across at a cumulative count of 259.

This information is visualized below where you can see that there are many features at accuracy levels from  2 – 60 feet. This type of visualization is useful for manual correction of features but we were looking to automate the process.

In most cases the last buffer and clip operation will extract a line segment from control that forms a complete ring. This means that using standard topology tools the extracted control lines can be built into polygons and used to replace less accurate test (permit) features.

The solution that we implemented consisted of two processes:

  • If the clipped line work formed a complete ring the ring could be easily built into a polygon. This was the Phase 1 Correction.
  • If the clipped line work had gaps, we implemented a more complex algorithm that did a node-to-node comparison and then extended lines to form closed rings. In this case, it was possible to create multiple rings and once those rings were built into polygons their area was compared to the original permit (test feature) to assure that the new feature was not significantly larger or smaller than the original. This was the Phase 2 Correction.

In this video the first build is the trivial solution or Phase 1 Correction, the second build is the more complex Phase 2 Correction.

After the Phase 1 Correction we we generated statistics and found that we had corrected 104 permits or 40% of our 259 features to parcel. The horizontal accuracy distribution before and after Phase 1 correction is shown below.

In this close up we see that the Phase 1 Corrections curve spikes at 0.5 feet above the initial conditions and that the rest of the Phase 1 Correction curve is under the Initial curve as is expected.

The same improvements can be seen in the cumulative curve.

After the Phase 1 correction we also calculated the RMSE error and found a significant reduction in the horizontal error when compared to parcel (see below).

Lastly, a visual comparison was done showing the obvious improvements at the 0.5 feet horizontal accuracy (shown in yellow below).

The phase two correction made a 6% (16 record) improvement to our data. Here are some images of the features corrected. Red is the original permit the green area is the corrected feature following parcel.

The image below shows a common problem with our algorithm on the west side of the polygon where a corner has been clipped. However, in general the new feature is a better representation than the original in red.

The final horizontal accuracy measures after Phase 2 correction are provided below. Phase 2 is under Phase 1 is under Initial conditions as expected.

In conclusion, we have found that Buffer Overlay analysis is an excellent tool for determining and improving the horizontal accuracy of our permit features to parcels. The initial accuracy assessment is easy to perform and provides data that is easy to interpret. Visualizing the data by these generated accuracies provides an avenue for manual correction of the data. However, with a little more effort complete rings can be extracted from the last buffer and clip operation that can be built into more horizontally accurate polygons. This is the low hanging fruit but also provides the biggest bang for the buck.

Our additional efforts in Phase 2 where we connected nodes to close gaps and create rings only improved our data by 6%, however, we feel confident that we can tweak our algorithm to yielding at least a 10-15% gain in the future.


Determining the Spatial Accuracy of Polygons Using Buffer Overlay

This is the 7th post on on my paper “A Method for Determining and Improving the Horizontal Accuracy of Geospatial Features” Other posts on this topic:

6. Five Methods for Determining the Spatial Accuracy of Lines
8. The Final Solution

Buffer Overlay is a geospatial analysis technique that creates a buffer around a control line, the buffer is then used to clip a test line, the lengths of both lines are compared and if the test line is significantly shorter than the control a larger buffer is used. When test and control lengths are equal the test line can be said to be the size of the last buffer applied.

In our case the control is parcel line work and the test are permits. However, it is not practical to buffer the line work of an individual parcel and then clip the permits because not all parcels have permits. So we will have to reverse the procedure and buffer the test line and find the length of control within the buffer. A more complete description of the procedure is as follows:

  • Convert parcel polygons to parcel lines
  • For each permit
    • Buffer from .5 ft to 60 ft @ .5 ft intervals
      • Clip the parcel lines (control) using buffer
      • Drop dangling nodes (where length = buffer)
      • Calculate the Cumulative Probability (CP)
      • CP = (Length of Clipped Parcel Line/Perimeter of Permit)
      • If CP >= 1
        • Horizontal accuracy is +/- buffer distance
      • Else If CP <= 0.999 and buffer < 60
        • Next buffer

The results of this procedure is the creation of individual probability curves for each permit as shown in the image and video below.

Five Methods for Determining the Spatial Accuracy of Lines

This is the 6th post on on my paper “A Method for Determining and Improving the Horizontal Accuracy of Geospatial Features” Other posts on this topic:

7. Determining the Spatial Accuracy of Polygons Using Buffer Overlay
5. Spatial Accuracy Assessments Using an Excel Spreadsheet

Positional accuracy or spatial accuracy refers to the accuracy of a sample feature when compared to a control feature. The positional accuracy of a point feature is usually provided by the Euclidean distance between the sample point and a control point. The error can be reported as errors in x, errors in y, and errors in z. For lines and polygons determination of positional accuracy is more complex since these are composed of multiple sample points.

The following is a brief summary of the most common methods for accessing the spatial accuracy of lines taken from (Atkinson and Ariza, 2002).

  • Epsilon Bands (Skidmore and Turner, 1992). These methods depend on measuring the offset of the test from control along a cross-section perpendicular to the control. There are various ways of generating the control line.
  • Hausdorff Distance (Abbas, Grussenmeyer, and Hottier, 1995). This is simply the smallest and largest offset of test measured perpendicular to the control.
    Hausdorf Distance
  • Maximum Proportion Standard/Maximum Distortion Standard (Veregin, 2000). Examines the areas created by a test line repeatedly crossing a control. Then by creating different ratios of the areas on one or the other side of the control measures of proportion/distortion are derived.
  • Buffer Overlay Statistics (Tveite and Langaas, 1999). Overlays buffers created around both test and control and then examines the ratio of the different resulting areas. In our case, the most interesting of these is the ratio of the areas that intersect both control and test compared to the size of the control buffer. As this approaches one the control buffer more and more accurately measures the positional accuracy.
  • Buffer Overlay (Goodchild and Hunter, 1997). Creates a buffer around a control line and measures the amount of test within the control buffer. The buffer is increased until a buffer size is found that includes all of test. The size of the last buffer is said to be the accuracy of test.

Of all these methods Buffer Overlay is the most commonly applied since it can readily be calculated using off the shelf GIS functions, the results are simple to interpret, and it is statistically based. In my next post we will use buffer overlay to determine the spatial accuracy of polygons.