pragmatic approaches to the semantic web

40
Michael K. Bergman Pragmatic Approaches to the Semantic Web or, Why Aren’t We in Hyperland Yet?

Upload: mike-bergman

Post on 03-Jul-2015

5.427 views

Category:

Technology


2 download

DESCRIPTION

Mike Bergman offers his take on what approaches to the semantic Web are working, what are not, and what all of this might say about the semantic Web moving forward. Informed by Structured Dynamics' open source frameworks and client experiences, the main thesis is that the pragmatic contribution of semantic technologies resides more in mindsets, information models and architectures than in 'linked data' as currently practiced.

TRANSCRIPT

Page 1: Pragmatic Approaches to the Semantic Web

Michael K. Bergman

Pragmatic Approaches to the Semantic Webor, Why Aren’t We in Hyperland Yet?

Page 2: Pragmatic Approaches to the Semantic Web

2

Outline

Intro to SD and Me

Summary of Main Thesis

A Wee Bit of History

What is Not Working?

Problems with Linked Data

What is Working?

Some Pragmatic Lessons

SD’s Pragmatic Approach

Conclusion and Q & A

Page 3: Pragmatic Approaches to the Semantic Web

3

Structured Dynamics

Founded 2008; predecessor Zitgist LLC; two principals

Privately held, revenue funded

Boutique semantic technology shop

Services and consulting: Semantic enterprise adoption Ontology development and mapping Tech transfer and training

Development and software: Open source OSF stack

Data conversion and migration

Client-specific development

Page 4: Pragmatic Approaches to the Semantic Web

4

Current Products and OSF Stack

the pivotal product; Web services middleware that provides distributed data access and federation

Drupal-based structured data linkage to structWSF

spreadsheet, JSON and XML authoring and conversion framework

reference set of linking subjects and basis for domain vocabularies

an ontology- and entity-driven information extraction and tagging system

Page 5: Pragmatic Approaches to the Semantic Web

5

SD Locations

Page 6: Pragmatic Approaches to the Semantic Web

6

Michael Bergman

Page 7: Pragmatic Approaches to the Semantic Web

Summary of Main Thesis

Page 8: Pragmatic Approaches to the Semantic Web

8

Main Arguments

Not against linked data Proponent and explicator since 2006

But, linked data burdensome, not pivotal to interoperability

Interoperability requires: Structured data (from any source) Canonical data model (RDF) (Relatively simple) ontologies for world views, schema Curation

Page 9: Pragmatic Approaches to the Semantic Web

A Wee Bit of History

Page 10: Pragmatic Approaches to the Semantic Web

10

Key Historical Milestones

1945: Memex

1963: Hypertext

1990: Hyperland

2001: Semantic Web Lack of uptake

2006: Linked Data

2010: Revisionist Linked Data

Page 11: Pragmatic Approaches to the Semantic Web

11

Hyperland

Page 12: Pragmatic Approaches to the Semantic Web

12

Linked Data

“Linked Data is a set of best practices for publishing

and deploying instance and class data using the RDF

data model, naming the data objects using uniform

resource identifiers (URIs), thereby exposing the data

for access via the HTTP protocol, while emphasizing

data interconnections, interrelationships and context

useful to both humans and machine agents.”

Page 13: Pragmatic Approaches to the Semantic Web

What is Not Working?

Page 14: Pragmatic Approaches to the Semantic Web

14

Some Disappointments to Date

Full semantic Web vision

Widescale adoption of the semantic Web, linked data

Lack of intelligent agents

Many aspects of the practice of linked data

Page 15: Pragmatic Approaches to the Semantic Web

Problems with Linked Data

Page 16: Pragmatic Approaches to the Semantic Web

16

Problems with Linked Data

Burdensome on publishers

Naïve linkages: Overuse of sameAs Lack of accurate alignments

(Often) poor data quality

Wrong focus

Page 17: Pragmatic Approaches to the Semantic Web

17

Some Conditions for Interoperability

<Interoperability> <needsMapping> <Predicates>

<Interoperability> <needsReference> <Nouns>

Page 18: Pragmatic Approaches to the Semantic Web

18

Many Mappings Should be Approximate skos:broadMatch skos:related ore:similarTo umbel:isAbout vmf:isInVocabulary skos:closeMatch lvont:nearlySameAs umbel:isLike umbel:hasCharacteristic lvont:somewhatSameAs rdfs:seeAlso ore:describes map:narrowerThan skos:narrower map:broaderThan skos:broader dc:subject link:uri foaf:isPrimaryTopicOf

Page 19: Pragmatic Approaches to the Semantic Web

What is Working?

Page 20: Pragmatic Approaches to the Semantic Web

20

Successes

Siri

Bing (Powerset)

Google + schema.org

(Some) linked data

Page 21: Pragmatic Approaches to the Semantic Web

21

Siri

Page 22: Pragmatic Approaches to the Semantic Web

22

Bing (Powerset)

Page 23: Pragmatic Approaches to the Semantic Web

23

Google

Statistical NLP

Structured results

Initial schema (Metaweb)

schema.org (with Yahoo, Bing and Yandex)

Page 24: Pragmatic Approaches to the Semantic Web

24

Some Linked Data

Some selected knowledge bases: DBpedia GeoNames Freebase (Google)

Biomedical community

LOD-LAM community

Page 25: Pragmatic Approaches to the Semantic Web

Some Pragmatic Lessons

Page 26: Pragmatic Approaches to the Semantic Web

26

Some Lessons Learned

Structure is good in any form

Keep semantic technology in the background

Open Web (FYN) likely to be disappointing

Ontologies essential for alignments

NLP an essential contributor to structure

Metadata an essential contributor to characterization, use

Linked data is a burden to publishers, places semantic emphasis on wrong part of chain

Page 27: Pragmatic Approaches to the Semantic Web

27

Seven Pillars

Page 28: Pragmatic Approaches to the Semantic Web

28

Preserving Existing Assets

Relational databases (RDBMs)

Distributed structured assets spreadsheets lightweight datastores

Web pages and Web sites

Existing documents and text

Web databases and APIs

Other databases (RDF, OO, etc.)

Page 29: Pragmatic Approaches to the Semantic Web

29

irON Dataset Exchange Framework

Simple authoring and dataset creation

irON includes an abstract notation and vocabulary for instance records

Notations for: Instance records

Schema

Datasets and metadata

Linkages to other schema

Serializations available for: XML (irXML)

JSON (irJSON)

CSV/spreadsheets (commON)

Page 30: Pragmatic Approaches to the Semantic Web

30

Three irON SerializationsirXML irJSON

commON

Page 31: Pragmatic Approaches to the Semantic Web

31

Spreadsheet Correspondence to Triples

Page 32: Pragmatic Approaches to the Semantic Web

32

More-or-less Interchangeable Formats

Page 33: Pragmatic Approaches to the Semantic Web

SD’s Pragmatic Approach

Page 34: Pragmatic Approaches to the Semantic Web

34

A Layered Approach

Page 35: Pragmatic Approaches to the Semantic Web

35

OSF Stack

Page 36: Pragmatic Approaches to the Semantic Web

Conclusion

Page 37: Pragmatic Approaches to the Semantic Web

37

Summary

If you can, do linked data; it is a GOOD THING

In any event, expose your data: Structured (use NLP for unstructured) Metadata Definitions Relations (simple) “Semsets” (synonyms, acronyms, spelling variants)

Build vocabulary and ontology consortia

Build trust and curation communities

Semantics essential at the interoperability level, not necessarily publication or data transfer

Page 38: Pragmatic Approaches to the Semantic Web

38

Take Aways

James Hendler:

“A little bit of semantics goes a long way”

Leverage linked data, but broaden focus

Consider adopting the semantic enterprise as the broader focus

Page 39: Pragmatic Approaches to the Semantic Web

Further Information

Page 40: Pragmatic Approaches to the Semantic Web

40

More Info and Links

Open Semantic Framework (OSF) stack: http://openstructs.org

TechWiki (400 detailed OSF how-to articles): http://techwiki.openstructs.org

Key ontologies: UMBEL: http://umbel.org

BIBO: http://bibliontology.org

Blogs: Mike Bergman: http://mkbergman.com

Fred Giasson: http://fgiasson.com/blog

Structured Dynamics: http://structureddynamics.com

http://citizen-dan.org (community indicator systems)