introduction tothe semantic web and linked data

148
Introduction to the Semantic Web and Linking Data Eric Axel Franzon Vice President Semantic Universe/ Wilshire Conferences

Upload: eric-franzon

Post on 01-Nov-2014

2.472 views

Category:

Education


1 download

DESCRIPTION

This was presented to the San Francisco chapter of DAMA International on June 9, 2010 at SAP in Palo Alto, California.

TRANSCRIPT

Page 1: Introduction tothe Semantic Web and Linked Data

Introduction to theSemantic Web and Linking Data

Eric Axel FranzonVice PresidentSemantic Universe/Wilshire Conferences

Page 2: Introduction tothe Semantic Web and Linked Data

About Me• Professional

• Wilshire Conferences• Semantic Universe• W3C• Guidewire Group

• Coach / Consultant / Trainer• Geek

Page 3: Introduction tothe Semantic Web and Linked Data

Today we will talk about:

• Semantic Technologies

• Semantic Web & Web 3.0

• Linked Data– Linked Open Data

– Linked Enterprise Data

• Use cases

• That harmonica on the first slide

Page 4: Introduction tothe Semantic Web and Linked Data

SemanticTechnologies

SemanticWeb

Page 5: Introduction tothe Semantic Web and Linked Data

WebTechnologies

WorldWideWeb

Page 6: Introduction tothe Semantic Web and Linked Data

Semantic Web = Web 3.0Semantic Web

= Web of Data

Page 7: Introduction tothe Semantic Web and Linked Data

ww

w.g

eeka

ndpo

ke.c

om

Page 8: Introduction tothe Semantic Web and Linked Data

What is the Web of Data Not?

• A software package• Something that will ever

“be complete”• A replacement for the

current Web• A pipe dream• A silver bullet

Page 9: Introduction tothe Semantic Web and Linked Data

It’s also not…

• HAL 9000

Page 10: Introduction tothe Semantic Web and Linked Data

• Skynet

It’s also not…

Page 11: Introduction tothe Semantic Web and Linked Data

What is the Web of Data?

• A Web-scale architecture• A metadata technology• A layer of meaning on the

existing Web• In use TODAY!

Page 12: Introduction tothe Semantic Web and Linked Data

Web of Data

Page 13: Introduction tothe Semantic Web and Linked Data
Page 14: Introduction tothe Semantic Web and Linked Data

Q: What does Linked Data have to do with the Semantic Web?

Page 15: Introduction tothe Semantic Web and Linked Data

Web 1.0 – Linking Documents

Page 16: Introduction tothe Semantic Web and Linked Data

Web 1.0

Page 17: Introduction tothe Semantic Web and Linked Data

Web 1.0

“I see: characters + formatting + images”--my Computer

Page 18: Introduction tothe Semantic Web and Linked Data

Web 1.0 – Linking DocumentsWeb 2.0 – Linking People

Page 19: Introduction tothe Semantic Web and Linked Data

Web 2.0

Page 20: Introduction tothe Semantic Web and Linked Data

Web 2.0

“I see: characters + formatting + images”--my Computer

Page 21: Introduction tothe Semantic Web and Linked Data

Web 1.0 – Linking DocumentsWeb 2.0 – Linking PeopleWeb 3.0 – Linking Data

Page 22: Introduction tothe Semantic Web and Linked Data

Web 3.0 – Linking Data

Title Publisher

Price

Format

Cover

Author

Page 23: Introduction tothe Semantic Web and Linked Data

Web 3.0 – Linking Data

Title Publisher

Price

Format

Cover

Author

“I see: things + relationships. This informationis about a book.”

Page 24: Introduction tothe Semantic Web and Linked Data
Page 25: Introduction tothe Semantic Web and Linked Data

SemanticTechnologies

SemanticWeb

LinkedOpenData

Page 26: Introduction tothe Semantic Web and Linked Data

Linking Open Data ProjectMay, 2007

Page 27: Introduction tothe Semantic Web and Linked Data

March 2009

Page 28: Introduction tothe Semantic Web and Linked Data
Page 29: Introduction tothe Semantic Web and Linked Data
Page 30: Introduction tothe Semantic Web and Linked Data

Data from these trusted sources is available for you

to use in your applications TODAY.

Data you can LINK to.

And not just data…

Page 31: Introduction tothe Semantic Web and Linked Data

Semantic Data that is not onlymachine READABLE.

It is machine UNDERSTANDABLE!

Page 32: Introduction tothe Semantic Web and Linked Data

Disambiguation

Page 33: Introduction tothe Semantic Web and Linked Data

Disambiguation

mole, n.

Page 34: Introduction tothe Semantic Web and Linked Data
Page 35: Introduction tothe Semantic Web and Linked Data

But…

Page 36: Introduction tothe Semantic Web and Linked Data

MetadataDoctorow’s Criticisms LOD/LED Response

“People lie” Allow users to choose a social trust model

“People are lazy”Automate where possible and encourage

authoring where needed

“People are stupid”Automate where possible, check where

possible

“Mission Impossible: know thyself” Allow multiple sources of metadata

“Schemas aren’t neutral” Allow multiple schemas

“Metrics influence results” Allow multiple metrics

“There’s more than one way to describe something”

Allow multiple descriptions

Page 37: Introduction tothe Semantic Web and Linked Data

LOD/LED is flexible

Page 38: Introduction tothe Semantic Web and Linked Data

1. By uniquely identifying THINGS2. By uniquely identifying RELATIONSHIPS3. By using TRIPLES

How does LOD/LED work?

Page 39: Introduction tothe Semantic Web and Linked Data

So, what’s a THING?

1. By uniquely identifying THINGS

How does LOD/LED work?

Page 40: Introduction tothe Semantic Web and Linked Data

A THING is anything that can be uniquely identified by a URI or a literal (string)

Me

My postal code

The White House

L.A. County’s sales tax rate

http://twitter.com/ericaxel

http://www.city-data.com/zips/90043.html

Lat: 38.89859 Long: -77.035971

9.750 %

http://ericfranzon.com/operator.jpg

Page 41: Introduction tothe Semantic Web and Linked Data

This is a collection of THINGS:

t_peopleName City State Post codeDavid Fredericksburg VA 22408Eric Culver City CA 90230

Page 42: Introduction tothe Semantic Web and Linked Data

Trees and Tables

t_people

Name City State Post code

David Fredericksburg VA 22408

Eric Culver City CA 90230

people

EricDavid

Fredericksburg VA 22408

City

State Postcode

Culver City CA 90230

City

State Postcode

Page 43: Introduction tothe Semantic Web and Linked Data

Trees and Tables – Problem 1

t_people

Name City State Post code flag

David Fredericksburg VA 22408 1

Eric Culver City CA 90230

people

EricDavid

Fredericksburg VA 22408

City

State Postcode

Culver City CA 90230

City

State Postcode

flag1

Adding partial data totables leads to sparseness

Page 44: Introduction tothe Semantic Web and Linked Data

Trees and Tables – Problem 2

t_people

Name City State Post code

David Culver City CA 90230

Eric Culver City CA 90230

people

EricDavid

Culver City CA 90230

City

State Postcode

Culver City CA 90230

City

State Postcode

Common data leads to (lots!) of duplication

Page 45: Introduction tothe Semantic Web and Linked Data

Graphs

people

EricDavidCity

State

Postcode

Culver City

CA

90230

City

State

Postcode

flag1

Page 46: Introduction tothe Semantic Web and Linked Data

Who’s your daddy?

1. By uniquely identifying THINGS2. By uniquely identifying RELATIONSHIPS

How does LOD/LED work?

Page 47: Introduction tothe Semantic Web and Linked Data

Is Father of

<owl:ObjectProperty rdf:ID="isFather"><rdfs:domain rdf:resource="#Person"/><rdfs:range rdf:resource="#Person"/>

</owl:ObjectProperty>

mailto:[email protected]

Page 48: Introduction tothe Semantic Web and Linked Data

1. By uniquely identifying THINGS2. By uniquely identifying RELATIONSHIPS3. By using TRIPLES

What’s a triple?

Page 49: Introduction tothe Semantic Web and Linked Data

Predicate

Triples? It’s Elementary! (School)

book has title.

RelationshipThat is a Triple!

Page 50: Introduction tothe Semantic Web and Linked Data

“This book has a title.”

“Eric wrote this Web page.”

“This article is about moles.”

“I like blues.”

“I like B.L.U.E.S.”

“This image can be used non-commercially.”

“My email address is [email protected].”

Triples? It’s Elementary!

Page 51: Introduction tothe Semantic Web and Linked Data

Book Has Title “Title”

Eric Created Webpage

Image Has License CC Non-Commercial

TriplesSu

bjec

ts

Obj

ects

Predicates

Page 52: Introduction tothe Semantic Web and Linked Data

Book

Author Title

PublisherISBN

Page 53: Introduction tothe Semantic Web and Linked Data

The Trouble with Triples

Page 54: Introduction tothe Semantic Web and Linked Data

Cytoscape.org

Page 55: Introduction tothe Semantic Web and Linked Data

Review of the Review

Our Data are Multiplying.

Page 56: Introduction tothe Semantic Web and Linked Data

Trends in data growth

• Vast amounts of digital data being produced daily.

–Wal-Mart produces 1 million transactions every hour. DBs estimated at > 2.5 petabytes

• US National Archives creating > 10 million digital assets annually

Page 57: Introduction tothe Semantic Web and Linked Data

Data Inflation

• Megabyte (MB) = 220

• Gigabyte (GB) = 230

• Terabyte (TB) = 240

• Petabyte (PB) = 250 or 1000TB

• Exabyte (EB) = 260 or 1,000PB

• Zettabyte (ZB) = 270 or 1,000EB

• Yottabyte (YB) = 280 or 1,000ZB

Page 58: Introduction tothe Semantic Web and Linked Data

Acceleration

–Decoding human genome involves analyzing 3 billion base pairs

• what took 10 years to process in 2003, takes a week today

Page 59: Introduction tothe Semantic Web and Linked Data

A brand new professional has emerged ....

The data scientist, who combines the skills of

software programmer, statistician and storyteller/artist to extract the

nuggets of gold hidden under mountains of data.

- The Economist, “Data, data everywhere”, Feb 27th 2010

Page 60: Introduction tothe Semantic Web and Linked Data

When we come back…

Page 61: Introduction tothe Semantic Web and Linked Data

S – T – R – E – T – C - HBreak!

Page 62: Introduction tothe Semantic Web and Linked Data

Linked Data is like a harmonica

• It’s easy to play

Page 63: Introduction tothe Semantic Web and Linked Data

Facebook• Unique Visitors*: 540,000,000• Page Views: 570,000,000,000

* Per month

Source: Google - The 1000 most-visited sites on the web

Page 64: Introduction tothe Semantic Web and Linked Data

Facebook

Page 65: Introduction tothe Semantic Web and Linked Data

Facebook

Page 66: Introduction tothe Semantic Web and Linked Data

FOAF: Friend-Of-A-Friend

http://www.foaf-project.org/

Page 67: Introduction tothe Semantic Web and Linked Data

FOAF-a-Matichttp://www.ldodds.com/foaf/foaf-a-matic

Page 68: Introduction tothe Semantic Web and Linked Data

semantictweet.com

Page 69: Introduction tothe Semantic Web and Linked Data

semantictweet.com

Page 70: Introduction tothe Semantic Web and Linked Data

semantictweet.com

Can create four FOAF files: • Friends (who I follow)• Followers• All• Just Me

Page 71: Introduction tothe Semantic Web and Linked Data

Linked Data is like a harmonica

• It’s easy to play• It’s a “real” instrument

Page 72: Introduction tothe Semantic Web and Linked Data

The Technologies of RDBMS

• Data• Schemas• Query Language

Page 73: Introduction tothe Semantic Web and Linked Data

RDBMS Datat_people

Name City State Post codeDavid Fredericksburg VA 22408Eric Culver City CA 90230

Page 74: Introduction tothe Semantic Web and Linked Data

RDBMS Schema

Page 75: Introduction tothe Semantic Web and Linked Data

RDBMS Query Language: SQL

SELECT isbn, title, price, price * 0.06 AS

sales_taxFROM Book WHERE price > 100.00 ORDER BY title;

Page 76: Introduction tothe Semantic Web and Linked Data

The Technologies of LOD/LED

• Data• Schemas• Query Language

Page 77: Introduction tothe Semantic Web and Linked Data

The Data Language

ResourceDescriptionFramework

Page 78: Introduction tothe Semantic Web and Linked Data

RDF TriplesSubject Predicate Object

http://plushbeautybar.com dc: creator http://www.ericaxel.com/foaf.rdf

http://www.geonames.org/maps/google_34.021_-118.396.html

dc: location N 34° 1' 16''W 118° 23' 47''

http://twitter.com/ericaxel foaf: knows “Brian Sletten”

Page 79: Introduction tothe Semantic Web and Linked Data

RDF Triple ComponentsSubject Predicate Object

http://plushbeautybar.com dc: creator http://www.ericaxel.com/foaf.rdf

http://www.geonames.org/maps/google_34.021_-118.396.html

dc: location N 34° 1' 16''W 118° 23' 47''

http://twitter.com/ericaxel foaf: knows “Brian Sletten”

URI URI URI orString Literal

http://twitter.com/bsletten

Page 80: Introduction tothe Semantic Web and Linked Data

“RDF is good for distributing dataacross the Web and pretendingit’s in one place.”

-Dean Allemang, TopQuadrant

Page 81: Introduction tothe Semantic Web and Linked Data

Just so you know…There are many ways of representing RDF:

• RDF/XML• N3• JSON

• N-Triples • Turtle• RDFa

Each serialization has pros and cons, but they all are used to connect THINGS and RELATIONSHIPS into TRIPLES

Page 82: Introduction tothe Semantic Web and Linked Data

The Schemata

Linked Data schemas consist of:

Your RDF relationships (predicates)+

Relationship descriptions

Page 83: Introduction tothe Semantic Web and Linked Data

LOD/LED Schemata

id First Name Last Name

1 Tony Shaw

Schema

Data

Initial Schema

hasID

hasFirstName hasLastName

Tony Shaw1

owl:sameAs

hasSurname

Relationshipdescription

Page 84: Introduction tothe Semantic Web and Linked Data

Choosing Relationships

• Reuse popular vocabularies

–FOAF (Friend-of-a-friend)

–Dublin Core (library/publisher metadata)

–SIOC (Semantically-Interlinked Online Communities)

• ...or make up your own!

Page 85: Introduction tothe Semantic Web and Linked Data

RDF TriplesSubject Predicate Object

http://plushbeautybar.com dc: creator http://www.ericaxel.com/foaf.rdf

http://www.geonames.org/maps/google_34.021_-118.396.html

dc: location N 34° 1' 16''W 118° 23' 47''

http://twitter.com/ericaxel foaf: knows “David Wood”

Page 86: Introduction tothe Semantic Web and Linked Data

1. Resource Description Framework Schema (RDFS): Simple, hierarchical classes

2. Simple Knowledge Organization System (SKOS): Port taxonomies to the Semantic Web

3. Web Ontology Language (OWL): Complex logical relationships

Relationship Descriptions

Page 87: Introduction tothe Semantic Web and Linked Data

Combine vocabularies and descriptions

Page 88: Introduction tothe Semantic Web and Linked Data

LOD/LED Schemata

• Put as much work into creating your LED schema as you put into creating your relational schemas

• ... maybe even a bit more (due to links between your data and others’).

Page 89: Introduction tothe Semantic Web and Linked Data

New York Times -SKOS

Page 90: Introduction tothe Semantic Web and Linked Data

New York Times -SKOS

Page 91: Introduction tothe Semantic Web and Linked Data

New York Times -SKOS

SKOS STUFF

Page 92: Introduction tothe Semantic Web and Linked Data

The query language

SPARQLProtocolAndRDFQueryLanguage

SPARQL

Page 93: Introduction tothe Semantic Web and Linked Data

SPARQL Example #1FOAF (some people that Eric Franzon knows)

PREFIX foaf: <http://xmlns.com/foaf/0.1/>SELECT ?nameFROM <http://ericaxel.com/eric.rdf>WHERE {

?knower foaf:knows ?known .?known foaf:name ?name .

}

Page 94: Introduction tothe Semantic Web and Linked Data

SPARQL Example #1

Page 95: Introduction tothe Semantic Web and Linked Data

Example #1 - Results

Page 96: Introduction tothe Semantic Web and Linked Data

SPARQL Example #2Querying two FOAF Profiles

PREFIX foaf: <http://xmlns.com/foaf/0.1/>PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>SELECT ?nameFROM NAMED <http://ericaxel.com/eric.rdf>FROM NAMED <http://zepheira.com/team/dave/dave.rdf>WHERE {GRAPH <http://ericaxel.com/eric.rdf> {?x rdf:type foaf:Person .?x foaf:name ?name .

} .GRAPH <http://zepheira.com/team/dave/dave.rdf> {?y rdf:type foaf:Person .?y foaf:name ?name .

} .}

Page 97: Introduction tothe Semantic Web and Linked Data

Example #2 - Results

Page 98: Introduction tothe Semantic Web and Linked Data

SPARQL Example #3Bart Simpson's chalkboard gags (DBPedia)

SELECT ?episode,?chalkboard_gagWHERE { ?episode skos:subject ?season .

?season rdfs:label ?season_title . ?episode dbpedia2:blackboard ?chalkboard_gag .

FILTER (regex(?season_title, "The Simpsons episodes, season")) . } ORDER BY ?season

Page 99: Introduction tothe Semantic Web and Linked Data

Example #3 - Results

Page 100: Introduction tothe Semantic Web and Linked Data

http://www.milinkito.com/swf/bart.php

Page 101: Introduction tothe Semantic Web and Linked Data

Are *real* companies using Linked Data?

Page 102: Introduction tothe Semantic Web and Linked Data

Easy to play; takes work to master.

Page 104: Introduction tothe Semantic Web and Linked Data

E-Commerce

A vocabulary to describe products, services, and other e-commerce terms.

Page 105: Introduction tothe Semantic Web and Linked Data

Who is using GoodRelations?

1100+ Best Buy stores

Page 106: Introduction tothe Semantic Web and Linked Data

Phase 2

~640,000 “next-gen” product detail pages

Page 107: Introduction tothe Semantic Web and Linked Data

21 Open Box Productslisted at this store!

Page 108: Introduction tothe Semantic Web and Linked Data
Page 109: Introduction tothe Semantic Web and Linked Data

Who is using GoodRelations?

Page 110: Introduction tothe Semantic Web and Linked Data
Page 111: Introduction tothe Semantic Web and Linked Data
Page 112: Introduction tothe Semantic Web and Linked Data
Page 113: Introduction tothe Semantic Web and Linked Data

With RDFa + GoodRelations, but no additional SEO work, PlushBeautyBar.com was indexed by Google within one week.

Page 114: Introduction tothe Semantic Web and Linked Data

Semantic (Web) Technologies

SemanticWeb

LinkedEnterpriseData

RDBMS

CRM

Calendars

LinkedOpenData

Page 115: Introduction tothe Semantic Web and Linked Data

MIXING private and public data?

Absolutely! And it’s really useful to do so!

Page 116: Introduction tothe Semantic Web and Linked Data

Example:

iConcertCal

Page 117: Introduction tothe Semantic Web and Linked Data

Public + Private Data: iConcertCal

Page 118: Introduction tothe Semantic Web and Linked Data

Public + Private Data: iConcertCal

Page 119: Introduction tothe Semantic Web and Linked Data

Example:

Siri

Page 120: Introduction tothe Semantic Web and Linked Data

Siri.com

Siri is a Virtual Assistant.

I ask it to do things for me.

It does, by mixing data,by disambiguating, andby reasoning.

Page 121: Introduction tothe Semantic Web and Linked Data

Siri.com

Siri is a Virtual Assistant.

I ask it to do things for me.

It does, by mixing data,by disambiguating, andby reasoning.

Page 122: Introduction tothe Semantic Web and Linked Data

Siri.com

Siri is a Virtual Assistant.

I ask it to do things for me.

It does, by mixing data,by disambiguating, andby reasoning.

Page 123: Introduction tothe Semantic Web and Linked Data

Siri.com

Siri is a Virtual Assistant.

I ask it to do things for me.

It does, by mixing data,by disambiguating, andby reasoning.

Page 124: Introduction tothe Semantic Web and Linked Data

Siri.com

Siri is a Virtual Assistant.

I ask it to do things for me.

It does, by mixing data,by disambiguating, andby reasoning.

Page 125: Introduction tothe Semantic Web and Linked Data

Siri.com

Siri is a Virtual Assistant.

I ask it to do things for me.

It does, by mixing data,by disambiguating, andby reasoning.

Page 126: Introduction tothe Semantic Web and Linked Data

Example:

Page 127: Introduction tothe Semantic Web and Linked Data

• Largest broadcasting corp. in the world

• 8 national TV channels

• 10 national radio stations

• 40 local radio stations

• An extensive website, bbc.co.uk

Page 128: Introduction tothe Semantic Web and Linked Data

• Broadcasts 1,000-1,500 programs per day.

• Publishes information in several formats: audio, video, textual.

• Needed to relate information across media for both users and third-party developers

Page 129: Introduction tothe Semantic Web and Linked Data

• Approach: Create a Web presence for each

• Broadcast

• Artist

• Species (and other biological ranks), habitat and adaptation

–that the BBC has an interest in.

Page 130: Introduction tothe Semantic Web and Linked Data
Page 131: Introduction tothe Semantic Web and Linked Data
Page 132: Introduction tothe Semantic Web and Linked Data
Page 133: Introduction tothe Semantic Web and Linked Data

"Creating web identifiers for every item the BBC has an interest in, and considering those as aggregations of BBC content about that item, allows us to enable very rich cross-domain user journeys."-- Yves Raimond

Page 134: Introduction tothe Semantic Web and Linked Data

• BBC Music is underpinned by the Musicbrainz music database and Wikipedia.

• “BBC Music takes the approach that the Web itself is its content management system. [BBC] editors directly contribute to Musicbrainz and Wikipedia.”

Page 135: Introduction tothe Semantic Web and Linked Data
Page 136: Introduction tothe Semantic Web and Linked Data

BBC

• Wildlife Finder links existing LOD data with BBC content to make pages about each species, habitat and adaptation:

• Wildlife programmes (clips and episodes) are identified by tagging the clip or episode with the appropriate dbpedia URI.

Page 137: Introduction tothe Semantic Web and Linked Data
Page 138: Introduction tothe Semantic Web and Linked Data

"The RDF representations of these web identifiers allow developers to use our data to build applications."-- Yves Raimond

Page 139: Introduction tothe Semantic Web and Linked Data

A few final thoughts

Page 140: Introduction tothe Semantic Web and Linked Data

A little bit can be very powerful!

Page 141: Introduction tothe Semantic Web and Linked Data
Page 142: Introduction tothe Semantic Web and Linked Data

Web 3.0 = Semantic Web

tripleOWLRDF

SPARQL

Linked Data

RDFs

SKOS

RDFa

Page 143: Introduction tothe Semantic Web and Linked Data

Web 3.0 = Semantic Web

Dublin Core

tripleOWLRDF RDFa PURLs

ontology

NLP

OWL-DLOWL-FullRDFs

entity extraction

OWL2OWL-lite

subject objectpredicate

folksonomy

microformats GRDDL

URI

triplestore

SPARQLArtificial Intelligence cloud computing open world reasoning

reasoning engine

Linked Data

taxonomy

data portability

LOD LED

REST

vocabulary

SKOS

microdata

Page 145: Introduction tothe Semantic Web and Linked Data

Questions? Operators are standing by.

[email protected]

THANK YOU!

Page 146: Introduction tothe Semantic Web and Linked Data

Semantic UniverseFree Informational Resourcewww.SemanticUniverse.com

Semantic Technology Conferencewww.Semantic-Conference.com

June 21-25, 2010

Page 147: Introduction tothe Semantic Web and Linked Data

Resourceshttp://geekandpoke.typepad.com/

http://richard.cyganiak.de/2007/10/lod/

http://iconcertcal.com

http://siri.com

http://data.nytimes.com

http://freedigitalphotos.com

http://aldobucchi.com

http://www.milinkito.com/swf/bart.php

Page 148: Introduction tothe Semantic Web and Linked Data

Resourceshttp://www.flickr.com/photos/kellyhogaboom/4369774518/

http://www.flickr.com/photos/zenera/56677048/

http://www.flickr.com/photos/97964364@N00/59780745/

http://www.flickr.com/photos/starwarsblog/793008715/

http://www.flickr.com/photos/peterpearson/871254091/

http://www.flickr.com/photos/birdfarm/60946474/

http://www.flickr.com/photos/entropy1138/173847148/

http://www.flickr.com/photos/wainwright/351684037/

http://data.nytimes.com/50891932523096258603.rdf