using free and open source gis to automatically create standards-based spatial metadata in academia

19
Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia Claire Ellul, Nart Tamash, Feng Xian, John Stuiver, Patrick Rickles

Upload: patrick-rickles

Post on 08-Jul-2015

187 views

Category:

Technology


2 download

DESCRIPTION

This presentation was given at the Free and Open Source for Geospatial (FOSS4G) 2013 conference in Nottingham, UK on work undertaken by Dr. Claire Ellul, Nart Tamash, Feng Xian, John Stuiver, and Patrick Rickles in the hopes of automating as many of the INSPIRE metadata as possible.

TRANSCRIPT

Page 1: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Using Free and Open Source GIS to Automatically Create

Standards-Based Spatial Metadata in Academia

Claire Ellul, Nart Tamash, Feng Xian, John Stuiver, Patrick Rickles

Page 2: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

DISCLAIMER

Page 3: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Metadata

Page 4: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Data about the data

Importance

Page 5: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Boring

Lots of Fields

Maintenance

Page 6: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Automation

Page 7: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Many Easily Done

Keywords and Language

Open Source Software

Page 8: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Metadata Spatial

Datasets

Tight Coupling (Triggers)

PostGIS 2.0 Database

PostreSQL / PostGIS Database (Nart Tamash)

Page 9: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

QGIS Plug-in (Feng Xian)

Page 10: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Identifying Language (QGIS python Plug-in using the ‘langid’ library)

For metadata: concatenate TITLE,

ABSTRACT, LINEAGE

Pass the resulting text into the ‘langid’ code

Store result in the metadata table in the

database

For dataset: take the first 10,000 characters of text

Page 11: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Identifying Keywords (PL/pgSQL Trigger)

Identify Text Fields in the Data Set

Split any text into single words using the space character as delimiter

Create a single column list of all the words using

the SQL UNION query

Use an SQL GROUP BY query to identify the 10 most frequently used

words

Page 12: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Various Datasets

Different Countries

Page 13: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Results

Page 14: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Produced Metadata Dataset

Page 15: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Dataset Metadata Information

Page 16: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Common Words

Character Set

Page 17: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

18 out of 20 Mandatory INSPIRE Fields

Metadata Language Successfully Detected

Mixed Results with Keywords and Dataset Language

Page 18: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Further Work

Page 19: Using Free and Open Source GIS to Automatically Create Standards-Based Spatial Metadata in Academia

Multiplatform

Web Accessible

Understandable To All