spatial analysis iiblogs.ubc.ca/advancedgis/files/2019/10/lecture08saii.pdfspatial analysis a method...

Spatial Analysis IISpatial data analysisSpatial analysis and inference

Roadmap Spatial Analysis IOutline:

What is spatial analysis?Spatial JoinsStep 1: Analysis of attributesStep 2: Preparing for analyses: working with distanceStep 3: Spatial patterns analysisStep 4: Kernel density analysisSummary

Roadmap Spatial Analysis IIOutline:

What is spatial analysis?TransformationsIntroduction to spatial interpolationClassification of spatial interpolation methodsInterpolation methodsAreal interpolationKrigingSummary

https://dhsprogram.com/pubs/pdf/SAR9/SAR9.pdf

https://dhsprogram.com/pubs/pdf/SAR9/SAR9.pdf

Spatial analysisA method of analysis is spatial if the results depend on the locations of the objects being analyzed

move the objects and the results changeresults are not invariant under relocation

Spatial analysis requires both the attributes and locations of objects

Spatial analysis is the crux of GIS Attribute linkages

Spatial data

P,L,A,Px

Attribute data

NOIR

Spatial analysisMuch of what we do in spatial analysis is about transformations—transforming the data from one form of presentation (e.g., points, lines, areas) into another form of presentation (e.g., density [such as a density of crimes per CT], surfaces [such as creating a TIN or a DEM from mass points], buffers, lines [such as creating contour lines from a points, a TIN or a DEM]). (Also, normalizing vars. in MCE.)

Transformations are used for simple cartographic purposes (e.g., overlays), to aid in inductive reasoning, and for more complex purposes (e.g., hot spot analyses), as part of deductive reasoning.

Transforming spatial data from one form (point, line or area) into another form (area, line or point) is a

common and important precursor to much of spatial analysis.

Why and how are some of the ways in which we transform spatial data?

TransformationsCreate a contour map

Filled contours Or a

3-D surface

Given a set of points

ArcMap’s Hot Spot Analysis

Also look at Spatial Analyst: Kernel or Point Density

http://desktop.arcgis.com/en/arcmap/latest/tools/spatial-statistics-toolbox/optimized-hot-spot-analysis.htm

http://desktop.arcgis.com/en/arcmap/latest/tools/spatial-analyst-toolbox/kernel-density.htm

http://desktop.arcgis.com/en/arcmap/latest/tools/spatial-analyst-toolbox/point-density.htm

TransformationsOne of the most important means of transforming data is through spatial interpolation.

Can you describe / define interpolation?

What does interpolation provide for us (as geospatial scientists)? That is, why do we interpolate (field) data?

Uses of spatial interpolationTo create isolines (or other graphics) for visualization

To calculate some property of the surface at a location ( ) where measurements weren’t taken

To change the unit of comparison when using different data structures (e.g., census tracts to planning districts)

Both physical and social applications

Field representationslattice random points regular grid

areas TIN (Voronoi) contours

What are the ways we can represent field data in a GIS?

Hexagons

https://citygeographics.org/luminocity/luminocity3d-spatial-analysis/

https://citygeographics.org/luminocity/luminocity3d-spatial-analysis/

Basic forms of interpolationPoint to points (e.g., random points to a regular lattice)

Points to lines (e.g., random points to contour lines)

Lines to points (e.g., digitized contours to a regular grid)

Areas to areas (e.g., census tracts to planning districts)

Points

LinesAreas

Transformations

Point to Continuous

Point to Continuous

Elevation

Rainfall

Point to Continuous

RoadmapOutline:


ConsiderationsGiven the wide variety of spatial interpolation methods, you need to consider:

Which method best fits the data you have available.Which method best fits the process associated with the data.Which method will produce the result you need.

Know your data

A classification of interpolation methods

Global versus Local

Exact versus Approximate

Stochastic versus Deterministic

Abrupt versus Smooth

Global LocalAspatial

Could you describe what each of these dichotomies might encompass?

Windows

Exact versus Approximate

Exact interpolators honour the input data points (which doesn't mean that the surface is an exact replicate of the ‘true’ surface from which the points were collected, just that the surface falls exactly on the points)

Approximate interpolators allow for uncertainty in the input data points, which allows for smoothing

Stochastic versus DeterministicStochastic methods incorporate the concept of randomness (similar to a linear regression model—a surface of ‘best fit’)

Deterministic methods do not use probability theory (they exclude randomness).

Abrupt versus Smooth

Abrupt interpolators allow for barriers (e.g., faults, fronts)

Smooth interpolators produce a smooth surface

Issues to consider

What happens to trends?

What about minima / maxima?

How does the data distribution affect the results?How are the parameters selected?

In previous lectures we’ve talked about a few of the problematic issues

that might arise in spatial analysis, although at the time spatial interpolation

wasn’t specifically mentioned.

Rule uncertainty

Roadmap

Outline:What is spatial analysis?TransformationsIntroduction to spatial interpolationClassification of spatial interpolation methodsInterpolation methodsAreal interpolationKrigingSummary

Know your data?

20 30

13

?

What is the expected value?

Process?

Interpolation methods

Exact methods of point-based interpolation

Proximal: local, exact, abrupt, deterministicbest for nominal data (aka Thiessen polygons)

Thiessen polygons are the ‘dual’ of a Delaunay triangulation

Note: The point patterns are identical.

http://desktop.arcgis.com/en/arcmap/latest/tools/analysis-toolbox/create-thiessen-polygons.htm

Splines

B-Splines: “french curves” piecewise polynomials, local, exact, can be smooth, not min/max bound

http://desktop.arcgis.com/en/arcmap/latest/tools/3d-analyst-toolbox/how-spline-works.htm

Manual methods

Manual interpolation: knowledge-based, local, abrupt, tend to be exact, subjective

Often associated with geological mapping

Interpolation methods

Approximate methods of point-based interpolation

Trend surface analysis: similar to regression, global, smooth, deterministic

The simplest surface:z = a + bx + cy

What is this called?

http://desktop.arcgis.com/en/arcmap/latest/tools/3d-analyst-toolbox/how-trend-works.htm

Trend surface analysis

The graph illustrates a quadratic or second-order surface: z = a + bx + cy + dx2 + exy + fy2

Fourier methods

Fourier Series: assumes that the surface can be approximated by overlaying a series of sine and cosine waves

global, smooth, deterministic

The first four partial sums of the Fourier series

for a square wave

Distance-weighted methods

Moving Average / Distance-Weighted Average:can be exact or, more typically, approximate, local to global, smooth or abrupt the most widely used spatial interpolation method in Geography

an almost unlimited number of modifications or variations are available, including:

variations on the distance function imposing constraints on the point selection process (e.g., by direction, limiting the number of points, limiting the distance).

http://desktop.arcgis.com/en/arcmap/latest/extensions/geostatistical-analyst/how-inverse-distance-weighted-interpolation-works.htm

Moving averages

Moving averages are widely used in time series analysesThe smoother curves (dark blue on left, red on right) represent the moving average of the original time series. Note that the smoothed curve can never be higher nor lower than any of the data points.

Moving averages cannot extendacross the entire range of the data.

Why?

point iknown value zilocation xiweight wi distance di

unknown value (z to be interpolated)location x

∑∑=i

ii

ii wzwz )(x

βii dw 1=

The estimate is a weighted average

Weights decline with distance,

β is usually given a value of 2, but it can be any value > 1

Distance-weighted formulaSpatial Moving Averages (SMA)

Effects of changing parametersIDW: Changing the exponent from 2 to 4

21 ii dw = 41 ii dw =

TINs

Triangulated Irregular Networks (TINs)Not really a form of interpolation, per se, although using TINs one can create contours or regions.

Exact, local, abrupt, deterministic.

http://desktop.arcgis.com/en/arcmap/latest/manage-data/tin/fundamentals-of-tin-surfaces.htm

Contour placement

Although the process of 'stringing' contours between the vertices of the TIN triangles (or between two grid points) may seem unproblematic, there are some interesting issues (issues which should be addressed in any contouring exercise, but typically are never explicitly addressed).

Basically: what assumption should be made with respect to how the surface should be modeled between the known points?

http://desktop.arcgis.com/en/arcmap/latest/tools/3d-analyst-toolbox/contour.htm

Contour placement

The default for most programs, although this is likely more realistic.

RoadmapOutline:


What of areal interpolation?If the areas can be represented by a single point, and the data can be considered to be a field, you can use a point-based interpolation method (e.g., pop density of CTs).

Areal interpolation is actually a complex process. Polygon overlays , and using the proportional areas as weights, is a typical

approach (but one that is not reversible nor volume preserving—total pop after ≠ total pop before).Pycnophylactic interpolation is a reversible, volume preserving method (see

lecture notes for a link to a video that describes this method in detail).

http://desktop.arcgis.com/en/arcmap/latest/extensions/geostatistical-analyst/what-is-areal-interpolation.htm

Polygon overlays

The problem with (nonreversible) polygon overlays:

The fundamental assumption is that the field has a homogeneous distribution throughout the zones.

Note that the totals assigned to the two polygons in the bottom row

do not equal the values within each polygon in the top row.

Each area is split into 2 equal parts,

assuming equal pop in each part.

Pycnophylactic interpolation

ArcGIS’s Areal Interpolation Method

Arealinterpolation



RoadmapOutline:


Geostatistical methods: KrigingKriging

stochastic, exact, smooth or abrupt, global or local

Natural data are difficult to model using smooth functions because naturally-occurring random fluctuations and measurement error combine to cause irregularities in sampled data values.

Kriging was developed to model those stochastic concepts.

It is based on the concept of a regionalized variable that has three components:

http://desktop.arcgis.com/en/arcmap/latest/extensions/geostatistical-analyst/kriging-in-geostatistical-analyst.htm

Components of a Regionalized Variable

data

STRUCTURAL – This may be represented by the

mean or a constant trend.

SPATIALLY CORRELATED – Data often exhibit positive spatial

correlations.

RANDOM NOISE – Measurement errors, other errors,

random fluctuations.

Topography is a reflection of many processesoperating at different scales; with Kriging we hope to develop models of some of those processes.

Components of a Regionalized Variable

The random noise component (non-fitted)

The spatially correlated component

The structural component (e.g., a linear trend)

This is what we areattempting to model.

Kriging

Kriging is implemented using a semi-variogram

There are many different varieties of kriging (e.g, ordinary, universal, simple, indicator), and selecting the appropriate one requires careful consideration of the data.

ArcGIS's help file--look up the term kriging—provides a lot of information on the various types of kriging (and co-kriging) that are commonly used in spatial analysis.

ArcGIS’s tutorial for the Geostatistical Analyst is also very informative (in particular consider the Geostatistical Wizard)

http://desktop.arcgis.com/en/arcmap/latest/extensions/geostatistical-analyst/introduction-to-the-arcgis-geostatistical-analyst-tutorial.htm

http://resources.arcgis.com/en/help/main/10.2/index.html#//003100000004000000

Kriging

The semivariogram is based on modeling the (squared) differences in the z-values as a function of the distances between the known points.

h

http://desktop.arcgis.com/en/arcmap/latest/extensions/geostatistical-analyst/empirical-semivariogram-and-covariance-functions.htm

KrigingThe first step in ordinary kriging is to construct a semivariogram from the points being interpolated.

A semivariogram consists of two parts: an experimental semivariogram (the data) a model semivariogram (the math).

The experimental semivariogram is found by calculating the variance (g) of each point in the set with respect to each of the other points and plotting the (semi)variances versus distance (h) between the points.

Semivariogram

Variance between points is represented over space. The above shows how variance between point values tends to increase as distance between points increases.

The values of the points could be rainfall, elevation, demographic continuous data, ozone levels, percentage of gold in a sample…

N= number of pairsf1 = Headf2 = Tail

h = distance (0 -> range)

SemivariogramThe semivariogram is used to explore a data set visually and to estimate at what distance the spatial autocorrelation becomes insignificant (i.e., the range).

SemivariogramNugget represents subgrid scale variation that cannot be estimated by reasons of the sampling grid spacing or measurement error.

Range is a measure of the spatial autocorrelation between data points, represented in terms of the lag distance. It represents the distance at which the spatial autocorrelation is at its global value.

SemivariogramSill is the value of the semi-variance as the lag tends towards infinity – in non-standardized data it is equal to the total variance of the dataset.

Lag refers to distances (e.g., 1250m) that typically have ‘bands’ associated with them (e.g., +/- 250m). Distance may be a direct measure or a transformation of distance (e.g., log(1000 m)).

N= number of pairsf1 = Headf2 = Tail

h = distance (0 -> range)

Kriging

This is an example of a semivariogram produced using ArcGIS's Geostatistical Analyst.

KrigingOne of the very useful outputs from a kriging

analysis is the uncertainty surface that can be generated--we can answer the question: "How good are the predictions?"

Using some of the data that we used in lab 2 (where you created a TIN), I created an ordinary kriged map and a map showing the standard error of the predictions (and a TIN for comparison).

Kriging Prediction standard error mapshowing data points

TINOrdinary krigedprediction map

IDW versus kriging

SummaryInterpolation is a very important process in GIS, and in particular in spatial analysis. ArcGIS's help files provide a lot of useful information.

It is important to know, first, about your data (how was it collected, the spatial distribution of the collection points, the process responsible for creating the 'field' you are mapping) and, second, about the spatial interpolation method (its assumptions, faults and fine points).

Know your data

spatial analysis iiblogs.ubc.ca/advancedgis/files/2019/10/lecture08saii.pdfspatial analysis a method...

Documents