exploratory spatial data analysis - techniques and examples

47
Exploratory Spatial Data Analysis - Exploratory Spatial Data Analysis - Techniques and Examples Techniques and Examples Jürgen Jürgen Symanzik*, Symanzik*, Utah State University, Logan, UT Utah State University, Logan, UT *e-mail: *e-mail: symanzik symanzik@sunfs sunfs .math. .math.usu usu. edu edu WWW: http://www.math. WWW: http://www.math. usu usu. edu edu/~ /~ symanzik symanzik with Louise Griffith, Rob with Louise Griffith, Rob Gillies Gillies, , Di Di Cook, Jim Cook, Jim Majure Majure , Nicholas , Nicholas Lewin Lewin-Koh Koh, Noel Noel Cressie Cressie, , Sigbert Klinke Sigbert Klinke, Ed , Ed Wegman Wegman, , Adi Wilhelm Adi Wilhelm

Upload: others

Post on 03-Feb-2022

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Exploratory Spatial Data Analysis - Techniques and Examples

Exploratory Spatial Data Analysis -Exploratory Spatial Data Analysis -Techniques and ExamplesTechniques and Examples

JürgenJürgen Symanzik*, Symanzik*,

Utah State University, Logan, UTUtah State University, Logan, UT

*e-mail:*e-mail: symanzik symanzik@@sunfssunfs.math..math.usuusu..eduedu

WWW: http://www.math.WWW: http://www.math.usuusu..eduedu/~/~symanziksymanzik

with Louise Griffith, Rob with Louise Griffith, Rob GilliesGillies, , Di Di Cook, Jim Cook, Jim MajureMajure, Nicholas , Nicholas LewinLewin--KohKoh,,

Noel Noel CressieCressie, , Sigbert KlinkeSigbert Klinke, Ed , Ed WegmanWegman, , Adi WilhelmAdi Wilhelm

Page 2: Exploratory Spatial Data Analysis - Techniques and Examples

ContentsContents

nn ESDAESDA–– DefinitionDefinition

–– SoftwareSoftware

–– TechniquesTechniques

nn ExamplesExamples

nn Live DemoLive Demo

Page 3: Exploratory Spatial Data Analysis - Techniques and Examples

ESDAESDA

nn Working Definition:Working Definition:–– Find structure (cluster, unusual observations) inFind structure (cluster, unusual observations) in

large and not necessarily homogeneouslarge and not necessarily homogeneous

spatial data sets spatial data sets based on human perceptionbased on human perceptionusing graphical methods and user interactionusing graphical methods and user interaction

–– Goal or expected outcome of explorationGoal or expected outcome of exploration

usually unknown in advanceusually unknown in advance

Page 4: Exploratory Spatial Data Analysis - Techniques and Examples

Software: Software: XGobiXGobi//GGobiGGobiSwayneSwayne, Cook and , Cook and Buja Buja

•• Interactive environment for exploring multivariate dataInteractive environment for exploring multivariate data**Linked views allow “Linked views allow “linked brushing”linked brushing”**UnivariateUnivariate, , Bivariate Bivariate and Multivariate views of the dataand Multivariate views of the data**Grand tourGrand tour**Wide variety of methodsWide variety of methods**Open sourceOpen source**FreeFree

•• CaveatsCaveats**XGobiXGobi only on UNIX and only on UNIX and Linux Linux platformsplatforms**GGobiGGobi also available for PCs but not yet fully developed also available for PCs but not yet fully developed

Page 5: Exploratory Spatial Data Analysis - Techniques and Examples

Software:Software: ArcView ArcViewESRIESRITMTM

•• Desktop Desktop GIS GIS with wide Range of Viewingwith wide Range of Viewingand Data Manipulation Functions and Data Manipulation Functions

–– Editing Features Editing Features–– Query Operations Query Operations–– Map Display Map Display–– Interactive Interface Interactive Interface–– High Level Internal Scripting Language High Level Internal Scripting Language

Page 6: Exploratory Spatial Data Analysis - Techniques and Examples

Features ofFeatures of ArcView ArcView//XGobiXGobi-Link-Link

nn Multivariate LinkMultivariate Link

nn CDF LinkCDF Link

nn VariogramVariogram-cloud Link-cloud Link

nn Spatially LaggedSpatially Lagged Scatterplot Scatterplot Link Link

nn MultivariateMultivariate Variogram Variogram-Cloud Link-Cloud Link

Page 7: Exploratory Spatial Data Analysis - Techniques and Examples

Multivariate LinkMultivariate Link

XGobi

ArcView

Page 8: Exploratory Spatial Data Analysis - Techniques and Examples

CDF LinkCDF Link

XGobi

ArcView

Page 9: Exploratory Spatial Data Analysis - Techniques and Examples

VariogramVariogram-Cloud Link-Cloud Link

XGobi

ArcView

Page 10: Exploratory Spatial Data Analysis - Techniques and Examples

Spatially LaggedSpatially Lagged Scatterplot Scatterplot Link Link

XGobi

Page 11: Exploratory Spatial Data Analysis - Techniques and Examples

MultivariateMultivariate Variogram Variogram-Cloud Link-Cloud Link

XGobi

Page 12: Exploratory Spatial Data Analysis - Techniques and Examples

Software: Software: ExplorNExplorN//CrystalVisionCrystalVisionWegmanWegman, , LuoLuo, Carr , Carr

•• Interactive environment for exploring Interactive environment for exploring multivariate data (similar to multivariate data (similar to XGobiXGobi//GGobiGGobi))

**Advanced Parallel Advanced Parallel CoordinatesCoordinates Displays Displays**3D Surfaces3D Surfaces**Stereoscopic DisplaysStereoscopic Displays

•• CaveatsCaveats**ExplorN ExplorN only on SGI platformsonly on SGI platforms**CrystalVision CrystalVision available for PCs but not yet fully developedavailable for PCs but not yet fully developed**No interface to No interface to GIS GIS softwaresoftware

Page 13: Exploratory Spatial Data Analysis - Techniques and Examples

Tools: Parallel Coordinate PlotsTools: Parallel Coordinate Plots

ExplorN

Page 14: Exploratory Spatial Data Analysis - Techniques and Examples

Tools: Tools: Scatterplot Scatterplot Matrix & Density PlotMatrix & Density Plot

ExplorN

Page 15: Exploratory Spatial Data Analysis - Techniques and Examples

Tools: Grand TourTools: Grand Tour

–– Continuous random sequence ofContinuous random sequence ofprojections from n dimensions into 2 (orprojections from n dimensions into 2 (ormore) dimensions.more) dimensions.

Page 16: Exploratory Spatial Data Analysis - Techniques and Examples

ExamplesExamples

–– Remote Sensing: Vermont/New HampshireRemote Sensing: Vermont/New Hampshire»» Quarry, Water, CloudsQuarry, Water, Clouds

–– Remote Sensing: AtlantaRemote Sensing: Atlanta»» City, ForestCity, Forest

–– Archaeological DataArchaeological Data»» Sand ParticlesSand Particles

Page 17: Exploratory Spatial Data Analysis - Techniques and Examples

The Vermont/New Hampshire DataThe Vermont/New Hampshire Data

nn Landsat Landsat Thematic Thematic Mapper Mapper (TM) data(TM) data

nn 6 Spectral Bands6 Spectral Bands

nn Outstanding Water Body is Connecticut RiverOutstanding Water Body is Connecticut River

Page 18: Exploratory Spatial Data Analysis - Techniques and Examples

VT/NH: Field/Quarry/WaterVT/NH: Field/Quarry/Water

Page 19: Exploratory Spatial Data Analysis - Techniques and Examples

Field/Quarry/Clouds ??Field/Quarry/Clouds ??

Page 20: Exploratory Spatial Data Analysis - Techniques and Examples

Water = Water ??Water = Water ??

Page 21: Exploratory Spatial Data Analysis - Techniques and Examples

The Atlanta DataThe Atlanta Data

nn NOAA-14 Satellite NOAA-14 Satellite (National Oceanic and Atmospheric Administration)(National Oceanic and Atmospheric Administration)

nn AVHRR Sensor AVHRR Sensor (Advanced Very High Resolution Radiometer)(Advanced Very High Resolution Radiometer)::–– Band 1: RedBand 1: Red

–– Band 2: Near InfraredBand 2: Near Infrared

–– Band 3: Mid InfraredBand 3: Mid Infrared

–– Band 4: Long InfraredBand 4: Long Infrared

–– Band 5: (Very) Long InfraredBand 5: (Very) Long Infrared

nn Data from “NASA’s Project Atlanta”Data from “NASA’s Project Atlanta”

nn 18 Days from Jan 1997 to Dec 199718 Days from Jan 1997 to Dec 1997

nn Resolution: 1 km x 1 km per PixelResolution: 1 km x 1 km per Pixel

nn Main Study Area: 70 km x 46 kmMain Study Area: 70 km x 46 km

Page 22: Exploratory Spatial Data Analysis - Techniques and Examples

Some DefinitionsSome Definitions

nn Normalized Difference Vegetation Index:Normalized Difference Vegetation Index:

n NDVI ~ 0.8 for Highly Vegetated Surfaces ~ 0.8 for Highly Vegetated Surfaces

n NDVI ~ 0.1 for Bare Soil ~ 0.1 for Bare Soil

nn Surface Radiant Temperature Surface Radiant Temperature T0: Band 4: Band 4

nn Surface Moisture Availability Surface Moisture Availability M0

1212

BandBandBandBand

NDVI+−

=

Page 23: Exploratory Spatial Data Analysis - Techniques and Examples

An ExampleAn Example

NS001-TMS derived To-NDVI scatterplot (gray spectral scaling) at a 5 meter spatial resolution for a7 x 3 km area of the Mahantango Watershed, Pennsylvania. 18 July 1990, 1145 LST. Isoplethsrepresenting moisture availability index, Mo are overlaid with the legend, ο = 0.0 (‘warm’ edge), ◊ =0.2, o = 0.4, ∆ = 0.6, ∇ = 0.8, and × = 1.0 (cold edge).

Page 24: Exploratory Spatial Data Analysis - Techniques and Examples

The Main Study AreaThe Main Study Area

Page 25: Exploratory Spatial Data Analysis - Techniques and Examples

NDVI NDVI vs vs Surface TemperatureSurface Temperature

Page 26: Exploratory Spatial Data Analysis - Techniques and Examples

Two MonthsTwo Months

December

August

Page 27: Exploratory Spatial Data Analysis - Techniques and Examples

The CityThe CityDecember

August

Page 28: Exploratory Spatial Data Analysis - Techniques and Examples

Clouds in AugustClouds in August

August

Page 29: Exploratory Spatial Data Analysis - Techniques and Examples

ReclassificationReclassification

December

Linked

August

December

Page 30: Exploratory Spatial Data Analysis - Techniques and Examples

2 Pixels of Interest2 Pixels of Interest

Linked

December

August

December

Page 31: Exploratory Spatial Data Analysis - Techniques and Examples

Example: Archaeological DataExample: Archaeological Data

Published as:Published as:

WilhelmWilhelm, A. F. X.,, A. F. X., Wegman Wegman, E. J., Symanzik, J. (1999):, E. J., Symanzik, J. (1999):Visual Clustering and Classification: The Visual Clustering and Classification: The OronsayOronsay Particle ParticleSize Data Set Revisited, Computational Statistics: SpecialSize Data Set Revisited, Computational Statistics: SpecialIssue on Interactive Graphical Data Analysis,Issue on Interactive Graphical Data Analysis, Vol Vol. 14, No.. 14, No.1, 109-146.1, 109-146.

Page 32: Exploratory Spatial Data Analysis - Techniques and Examples

Oronsay Oronsay Sand ParticlesSand Particles

“The mesolithic shell middens on the islandof Oronsay are one of the most importantarcheological sites in Britain. It is ofconsiderable interest to determine theirposition with respect to the mesolithiccoastline. If the sand below the midden werebeach sand and the sand from the upperlayers dune sand, this would indicate aseaward shift of the beach-dune interface.”

Flenley and Olbricht, 1993

Page 33: Exploratory Spatial Data Analysis - Techniques and Examples

Objective of StudyObjective of Study

nn Cluster samples of modern sand intoCluster samples of modern sand into“beach-like” or “dune-like” sand.“beach-like” or “dune-like” sand.

nn Classify archeological sand samples as toClassify archeological sand samples as towhether they are beach sand or dune sand.whether they are beach sand or dune sand.

Page 34: Exploratory Spatial Data Analysis - Techniques and Examples

Oronsay Oronsay - Geography- Geography

Page 35: Exploratory Spatial Data Analysis - Techniques and Examples

Oronsay Oronsay - Data Problems- Data Problems

Page 36: Exploratory Spatial Data Analysis - Techniques and Examples

OronsayOronsay - Parametric Analysis - Parametric Analysis

nn Historical strategy is to fit parametricHistorical strategy is to fit parametricdistributions and compare modern anddistributions and compare modern andarcheological sands based on parameters.archeological sands based on parameters.

nn WeibullWeibull, 1933; , 1933; lognormal lognormal (breakage(breakagemodels), log-hyperbolic, log-skew-models), log-hyperbolic, log-skew-LaplaceLaplace,,1937, 1937, BarndorffBarndorff-Nielsen, 1977.-Nielsen, 1977.

nn Models 2 to 4 parameters, theory developed,Models 2 to 4 parameters, theory developed,practice problematic.practice problematic.

Page 37: Exploratory Spatial Data Analysis - Techniques and Examples

Oronsay Oronsay - Visual Approach- Visual Approach

nn Multidimensional Parallel CoordinateMultidimensional Parallel CoordinateDisplay Combined with Grand Tour.Display Combined with Grand Tour.

nn BRUSH-TOUR strategyBRUSH-TOUR strategy–– Clusters recognized by gaps in any horizontal axis.Clusters recognized by gaps in any horizontal axis.

–– Brush existing clusters with colors.Brush existing clusters with colors.

–– Execute grand tour until new clusters appear, brushExecute grand tour until new clusters appear, brushagain.again.

–– Continue until clusters are exhaustedContinue until clusters are exhausted..

Page 38: Exploratory Spatial Data Analysis - Techniques and Examples

Beach & Dune SandBeach & Dune SandBeach & Dune Sand

Page 39: Exploratory Spatial Data Analysis - Techniques and Examples

Separation of ClustersSeparation of ClustersSeparation of Clusters

Page 40: Exploratory Spatial Data Analysis - Techniques and Examples

Final ClusteringFinal ClusteringFinal Clustering

Page 41: Exploratory Spatial Data Analysis - Techniques and Examples

OronsayOronsay - Conclusions (1) - Conclusions (1)

nn Sands from the CC site and the CNG siteSands from the CC site and the CNG sitehave considerably different particle sizehave considerably different particle sizedistributions and cannot be effectivelydistributions and cannot be effectivelyaggregated.aggregated.

nn Data at small and at large particleData at small and at large particledimensions is too dimensions is too quantized quantized to be usedto be usedeffectively.effectively.

nn The visual based BRUSH-TOUR strategy isThe visual based BRUSH-TOUR strategy isextremely effective at clustering.extremely effective at clustering.

Page 42: Exploratory Spatial Data Analysis - Techniques and Examples

OronsayOronsay Conclusions (2) Conclusions (2)

nn Midden Midden sands are neither modern beachsands are neither modern beachsands nor modern dune sands.sands nor modern dune sands.

nn Midden Midden sands are more similar to modernsands are more similar to moderndune sands.dune sands.

nn This result does not support the seaward-This result does not support the seaward-shift-of-the-beach-dune-interface hypothesis,shift-of-the-beach-dune-interface hypothesis,but suggests the but suggests the middens middens were always in thewere always in thedunesdunes

Page 43: Exploratory Spatial Data Analysis - Techniques and Examples

Live DemoLive Demo

GGobi

Page 44: Exploratory Spatial Data Analysis - Techniques and Examples

Overall ConclusionOverall Conclusion

nn Visual approach effective to see unexpectedVisual approach effective to see unexpectedstructure in data.structure in data.

nn Combination of different techniques mostCombination of different techniques mosteffective.effective.

nn Can be used for almost all types of spatial data.Can be used for almost all types of spatial data.

Page 45: Exploratory Spatial Data Analysis - Techniques and Examples

Future Work (1)Future Work (1)

nn Enhance new software (e.g., Enhance new software (e.g., GGobiGGobi) to) tooperate in a linked environment with operate in a linked environment with GISGISsoftware.software.

nn Allow access to data bases.Allow access to data bases.

Page 46: Exploratory Spatial Data Analysis - Techniques and Examples

Future Work (2)Future Work (2)

nn Use 3D environment (CAVE,Use 3D environment (CAVE, MiniCAVE MiniCAVE) for) forvisualization and ESDA.visualization and ESDA.

Page 47: Exploratory Spatial Data Analysis - Techniques and Examples

ContactContact

nn JürgenJürgen Symanzik Symanzik–– symanziksymanzik@@sunfssunfs.math..math.usuusu..eduedu

nn WebsiteWebsite–– www.math.www.math.usuusu..eduedu/~/~symanziksymanzik