1 data capture in production of geostatistics estp course on geographic information systems (gis):...

20
1 Data capture in production of geostatistics course on Geographic Information Systems (GIS): of GIS for making statistics in a production environment istics Norway, Oslo, 26th to 30th of March 2012 Attribution (by) Licensees may copy, distribute, display and perform the work and make derivative works based on it only if they give the author or licensor the credits in the manner specified by these. http://creativecommons.org/licenses/by/3.0/ Mrs Diana Makarenko-Piirsalu MSc in Landscape Ecology and Environmental Protection Geolytics OÜ Mere tee 15, Saviranna, Jõelähtme vald, Harjumaa, ESTONIA [email protected] Mob. +372 556 19 636

Upload: melvin-ramsey

Post on 13-Jan-2016

254 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

1

Data capture in production of geostatistics

ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environmentStatistics Norway, Oslo, 26th to 30th of March 2012

Attribution (by) Licensees may copy, distribute, display and perform the work and make derivative works based on it only if they give the author or licensor the credits in the manner specified by these. http://creativecommons.org/licenses/by/3.0/

Mrs Diana Makarenko-PiirsaluMSc in Landscape Ecology and Environmental ProtectionGeolytics OÜMere tee 15, Saviranna, Jõelähtme vald, Harjumaa, [email protected]. +372 556 19 636

Page 2: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

You will obtain…

• an overview of the basics of data sources for producing grid based statistics

• examples from register and combined method census countries

• an overview of sources of digital geodata for further analyses

2

Page 3: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

What is Geostatistics?

• A class of statistics used to analyze and predict the values associated with spatial or spatio-temporal phenomena. Geostatistics provides a means of exploring spatial data and generating continuous surfaces from selected sampled data points.

(Source: http://wiki.gis.com/wiki/index.php/GIS_Glossary/G )

• statistics liked with the location (x;y) on the Earth

• methods for production of georeferenced statistics

• spatial analyses of georeferenced statistics

• focus has been on gridded statistics

3Source: Öber,S , Springfeldt, P 1991

Page 4: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

What is Geostatistics? • Statistics can be geocoded at different scales and geometries:

• Most common and traditional way is to geocode statistics based on:

– administrative units (polygons) - NUTS, LAU, Enumeration areas, postal areas

– statistical units can be represented also by points (centroids of the Statistical

units) or lines (streets, rivers)

• the thematic delineation as localities, UA, UMZ…

• Since 1970 linking statistics with grids has been developing – pioneered by the Statistics of Finland

4

Page 5: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Methods for producing gridded statistics

• Aggregation– Detailed scale point based data are aggregated into grid cell based on

the spatial join

– Most accurate results can be produced

• Disaggregation

– downscaling polygon based data (enumeration area, LAU, post code districs) into grids using Corine Land Cover and/or Soil Sealing data

– as any modelling it contains many assumptions, depends on the scale of data used, data resolution, algorithms

– Tend not to reflect the reality accurately

5

Page 6: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

The main decisions before the production of aggregated grid based data

• Strategic decisions with long – term influence should be made:

• What is the smallest geographical unit in production of grid based statistics?

• Coordinate reference system and scale:– Data interoperability issue :

Interoperability with national geodata - national reference system-

Pan – European interoperability – INSPIRE Directive ETRS89 – LAEA (Source:http://inspire.jrc.ec.europa.eu/documents/Data_Specifications/INSPIRE_DataSpecification_SU_v2.0.pdf )

• What datasets are needed ? What data exists on particular country?

6

Page 7: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Producing aggregated grid statistics…

• … main task is to find the best available solution for geocoding statistics with the most detailed spatial feature available for the country

• Issues to be considered:– Coverage – Accuracy– Frequency of data updating

• It involves linking different datasets (attribute data; spatial data) often based on addresses or in some cases other ID may be in use (building _ID, personal_ID, cadastral number etc)

• Link (unique key ) between different data sets should be generated and kept

7

Page 8: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Data sources for aggregated grid statistics

• Enumeration area centroid - What is the area of the biggest EA?

In cities the EA centroid might be adequate for producing 1 km grid. Rural areas EA tend to be bigger.

• Existing national registers (x;y) as primary data sources:

• National Mapping Agency

– Cadastral - parcel centroid

– Digital topographic database: Source for the building geometries

– Building centroid– Part of the building centroid

Source for street network - Geocoding based on street segments

– Building register (x;y):building centroid or part of the building centroid

– Address Register

8

Page 9: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Address registers are primary data source

• Despite of the myriad of projects at European or Global level aimed to harmonize, standardise , enable data interoperability etc address registers differ in each country remarkably

• At the broad conceptual (from the georeference point of view) level the difference is:

Object related addresses – Each building, parcel, street, settlement has exact

location (x:y) Digital street network - interpolated address locations

based on the street segment Mixed approach

• All in all – statistics having addresses are possible to link with location in Europe – at different spatial accuracy

9

Page 10: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Example of Data capture from Estonia

10

• Mixed census method – partly using registers, e-survey and traditional enumeration

• AKP – address updating project improved the address coverage but not at full coverage

• Unique ID linking data at building level was generated for data capture process by Stat EE

• E-survey : geocoded address data were used + e-survey mapping application was provided - respondents had the possibility to geocode their permanent place of living

• Field enumeration – enumerator task is to geocode the addresses : two opportunities: Using mapping application or GPS

Page 11: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Example of Data capture from Finland

11

• Register based census• 1970 – decision was made to add coordinates for all building centroids• Buildng and dwelling register is georeferenced • Since 1995 – standardized grid based products have been disseminated

Source: Production and dissemination of grid data since 1970 Census in Finland, 2010 - http://www.efgs.info/presentations/Production%20and%20dissemination%20of%20grid%20data-Finland-e.pdf/view

Page 12: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Different data sources can be used for urban or rural areas

12

• Urban areas :

– Building centroid

– Part of the building – entrance

– Street network

– EA centroid

• Rural areas :

– Parcel centroid

Page 13: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Pan – European Grid

13

• Data can be downloaded from www.efgs.info

• http://www.efgs.info/data/eurogrid

• CRS ETRS 89 – LAEA

• Pan - European statistical grid is specified in INSPIRE D2.8.III.1 Data Specification on Statistical Units

• Source:http://inspire.jrc.ec.europa.eu/documents/Data_Specifications/INSPIRE_DataSpecification_SU_v2.0.pdf

Page 14: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Data for Netherlands• http://www.efgs.info/data

14

Page 15: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Data for Norway• http://www.efgs.info/data

15

Page 16: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Data for Portugal and France• http://www.efgs.info/data

16

Page 17: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

ESA – open data • ESA - Observing the Earth - ESA opens Landsat

archives

• www.esa.int

17

Page 18: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

EEA – open data • Environmental data , Urban morpholical zones etc…

18

Page 19: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

Data capture• data capture

Any operation that converts GIS data into computer-readable form. Geographic data can be captured by being downloaded directly into a GIS from sources such as remote-sensing or GPS data, or it can be digitized, scanned, or keyed in manually from paper maps or photographs

Source: http://wiki.gis.com/wiki/index.php/GIS_Glossary/D

• Things to consider: Data should be FIT for purpose – Scale– CRS– Data models– Formats– Coverage and quality issues – Primary or secondary – Processing steps of the data

19

Page 20: 1 Data capture in production of geostatistics ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environment

THANK YOU!

20

ESTP course on Geographic Information Systems (GIS): Use of GIS for making statistics in a production environmentStatistics Norway, Oslo, 26th to 30th of March 2012

Mrs Diana Makarenko-PiirsaluMSc in Landscape Ecology and Environmental ProtectionGeolytics OÜMere tee 15, Saviranna, Jõelähtme vald, Harjumaa, [email protected]. +372 556 19 636