philadelphia, may 2–4, 2005 effective use of spatial databases for enterprise data integration dr...

28
Philadelphia, May 2– 4, 2005 www.locationintellige Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Upload: adelia-dayna-strickland

Post on 18-Jan-2018

218 views

Category:

Documents


0 download

DESCRIPTION

Philadelphia, May 2–4, Drivers for Data Integration Agile Competitors and Virtual Organisations - Goldman, Nagel and Preiss

TRANSCRIPT

Page 1: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Effective use of spatial databases for

Enterprise data integrationDr Paul WatsonLaser-Scan

Page 2: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Overview• Motivation - review key drivers for Data Integration• Obstacles to Data Integration• Data Quality Management Principles

– Data Stewardship– Knowledge Management– Enterprise metadata– Rules based processing

• Methdology – Data Quality Improvement cycle• Spatial Business Rules• Spatial Knowledge Management – Radius Studio

– Rules browser/ rules repository– Conformance check/ report– Reconciliation– Certification

Page 3: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Agile Principle Business Needs Information NeedsEnrich theCustomer

Adapt products &Services

Integrate, extend

MasterChange

ReactForecast

TransformModel, test, extrapolate

MobiliseResources

Plan, EmpowerBuild Virtual teams

Analyse, Build knowledgeCollaboration platforms

Co-operate toCompete

Focus, partner Transform, publish, exchange

Drivers for Data Integration

Agile Competitors and Virtual Organisations - Goldman, Nagel and Preiss

Page 4: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Obstacles to Data Integration

• Proprietary Data Silos - data cul de sac

• Monolithic Information Systems – embedded logic

• Built-in, Private Data Models – structure/lifecycle

• Unknown/Unproven Data Quality – KR/KM

Page 5: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Enablers For Enterprise Information Architectures

• Electronic Data Interchange • Straight-through Processing• Service Oriented Architectures• Rules-based Processing• Knowledge Representation Standards• Data & service integration is the goal

Page 6: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Technological Building Blocks

• Open data access standards

• Extensible, interoperable platforms

• Metadata publication/retrieval

• Data stewardship

Page 7: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Data Quality ImpactOne third of companies have been forcedto delay or scrap new systems because of faulty data,and a full 75% have experienced significant problemsresulting from data quality issues - PwC

55-70% of CRM and 70% of Data Warehouseproject failures are due to data quality issues - Gartner, Meta

US business loses $600 billion each yeardue to data quality problems - Data Warehousing Institute

When is data “fit for purpose”?

Page 8: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Principles, Tools & Method for DQ Improvement• Knowledge Management

– Describe the domain/problem independently (from data and systems)

• Rules-based paradigm– Decouple the problem definition from problem

discovery & resolution (what to do about it)• Data Quality Improvement Cycle

– Employ a process of continuous monitoring & improvement - sustainable interoperability™

Page 9: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Knowledge Management

• Where is knowledge now?– Embedded in data– Hidden in point applications– Inside people’s heads

• Store the knowledge/expertise of the organisation where everyone can contribute to it and share it - as enterprise metadata in a database

Page 10: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Benefits of a knowledge management approach

• Explicit/unambiguous – not embedded• Inclusive - accommodates domain experts• Non-technical – not just for developers• Open, distributable – location/applications• Auditable – regulatory/governance issues• Evolutionary – incremental acquisition, knowledge

is refined/grows over time • Structured – machine-readable

Page 11: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Knowledge Representation

•RDF•OWL•SWRL

Page 12: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Data Quality as Knowledge• Express data requirements as rules (e.g. SWRL)• Data quality rules – enterprise metadata - DB• Rules metadata can be shared (interpreted and

enforced) by many different applications• Rules can be used to measure Data Quality - %

conforming data instances• Rules guide data reconciliation - prioritise• Rules can be used to measure quality improvement

reliably• “Fit for Purpose” = satisfies the DQ rules

Page 13: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Rules-based Processing Paradigm

• Fact – Pattern – Action• Given some facts, if they meet any of the patterns/rules,

perform the defined action– Declarative – rule separated from processing– Pluggable actions – reporting/ reconciliation

Data Rule A

Report Action A

Reconcile Action A

Page 14: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Philadelphia, May 2–4, 2005

www.locationintelligence.net

make baselineassessment

refine rulemetadata

check dataconformance

perform datareconciliation

datacertification

datapublication

define rulemetadata

define qualitymission

Data Quality Improvement Cycle

0

10

20

30

40

50

60

70

80

90

Rule A Rule B

SP

0

10

20

30

40

50

60

70

80

90

Rule A Rule B

01020304050

60708090

100

Rule A Rule B

e.g. address cleaning

Page 15: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Enterprise Integration Gapfor Location

• Spatial data - complex, brittle and surprising• Maintenance - manual, expensive, error-prone• Adherence to a spatial model is often business critical

(e.g. land & property management, utilities)• Tools - significant bespoke development, inflexible, built

by developers not domain experts• Location data - key for BI, data quality chasm, mining• Rudimentary IT standards for spatial data• Spatial semantics never explicit – KR/KM

Page 16: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Spatial Knowledge Management

• Same methodology– Spatial business rules as metadata– Conformance checking/reporting– Data Reconciliation – rules driven– Certification/publication– Only detailed data operations change

• Business Rule - Percentage: 0 ≤ x ≤ 100• Location Rule Spatial (a,b) : coveredBy, contains, withinDistance

etc.• Empower the domain expert – short “dev.” cycles, stay agile

Page 17: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

SpatialData Quality

RulesIntra-featureConstraints

physicalgeometric

DataSpecifications

Ad hocRules

Inter-featureConstraints

proximity topologicaldirectional

Spatial Rules Discovery

Page 18: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Spatial Rule Types

Page 19: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

SpatialRulesAuthoring

curbline

building

Page 20: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Spatial Rule Builder

Page 21: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Add Rule Clauses

building

zoning

Page 22: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Spatial & Non-spatial Conditions

Fire stationbuilding

Street centreline

“There’s no such thing as a spatial rule”

Page 23: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

<ruleml:imp> <ruleml:_body> <swrlx:classAtom> <owlx:Class owlx:name="Road"/> <ruleml:var>r</ruleml:var> </swrlx:classAtom> <swrlx:individualPropertyAtom swrlx:property="hasGeom"> <ruleml:var>r</ruleml:var> <ruleml:var>g</ruleml:var> </swrlx:individualPropertyAtom> <swrlx:datavaluedPropertyAtom swrlx:property="hasName"> <ruleml:var>r</ruleml:var> <ruleml:var>n</ruleml:var> </swrlx:datavaluedPropertyAtom> <swrlx:classAtom> <owlx:IntersectionOf> <owlx:Class owlx:name="Geometry"/> <owlx:ObjectRestriction owlx:property="hasFeature"> <owlx:SomeValuesFrom> <owlx:IntersectionOf> <owlx:Class owlx:name="RoadSegment"/> <owlx:DataRestriction owlx:property="hasName"> <owlx:hasValue>n</owlx:hasValue> </owlx:DataRestriction> </owlx:IntersectionOf> </owlx:SomeValuesFrom> </owlx:ObjectRestriction> </owlx:IntersectionOf> <ruleml:var>j</ruleml:var> </swrlx:classAtom>

<swrlx:individualPropertyAtom swrlx:property="hasAggregate"> <ruleml:var>j</ruleml:var> <ruleml:var>a</ruleml:var> </swrlx:individualPropertyAtom> </ruleml:_body> <ruleml:_head> <swrlx:individualPropertyAtom swrlx:property="equals"> <ruleml:var>g</ruleml:var> <ruleml:var>a</ruleml:var> </swrlx:individualPropertyAtom> </ruleml:_head> </ruleml:imp>

High St.

High St.

Road

Road Segment

For each Road, the set of Road Segments having the same name as the Road must have the same aggregate geometry as the Road

Rul

es a

s K

now

ledg

e

Page 24: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Conformance Checking

SpatialRulesEngine

EnterpriseMessaging

EnterpriseSpatial DB

EnterpriseMetadata -Spatial Rules

Web ServicesClients

AsynchronousMessagingClient

BrowserClient

ReportingSolution

Page 25: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Data Reconciliation

SpatialRulesEngine

EnterpriseSpatial DB

EnterpriseMetadata -Spatial Rules

Web ServicesClients

UpdateA-B-A

CloneA-B-A’

Schema MapA-B-C

Page 26: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Warehouse Certification

SpatialRulesEngine

EnterpriseSpatial DB

EnterpriseMetadata -Spatial Rules

Web ServicesClients

0

10

20

30

40

50

60

70

80

90

Rule A Rule B

01020304050

60708090

100

Rule A Rule B

Page 27: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net

Summary

• Data quality issues in integration are best addressed using knowledge management/rules-based approaches

• Spatial data quality is no different to any other data quality – standard, interoperable rules are key

• Enterprise spatial rules engines form a secure base from which to develop open, distributable location enabled applications

Page 28: Philadelphia, May 2–4, 2005  Effective use of spatial databases for Enterprise data integration Dr Paul Watson Laser-Scan

Philadelphia, May 2–4, 2005

www.locationintelligence.net