linked data on the web

105
Linked Data on the Web Olaf Hartig http://olafhartig.de/foaf.rdf#olaf Database and Information Systems Research Group Humboldt-Universität zu Berlin

Upload: olaf-hartig

Post on 10-May-2015

2.948 views

Category:

Technology


2 download

DESCRIPTION

An comprehensive overview on Linked Data, including an introduction, technical foundations, consuming LD, and open research issues.

TRANSCRIPT

Page 1: Linked Data on the Web

Linked Data on the Web

Olaf Hartighttp://olafhartig.de/foaf.rdf#olaf

Database and Information Systems Research GroupHumboldt-Universität zu Berlin

Page 2: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Outline

From a Web of Documentsto a Web of Data

Technical Foundations of Linked Data

Consuming Linked Data

Current Research Issues

Page 3: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Traditional Web = Internet + Docs + Links

The Traditional Web

Page 4: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Traditional Web = Internet + Docs + Links

● HTML as shared content format

● HTTP to access documents on the Web

● URLs● Globally unique identifiers for documents● Retrieval mechanism

● Hyperlinks

● Single global information space

The Traditional Web

Page 5: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

So what is the problem?

The Traditional Web

Page 6: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

So what is the problem?

● Web content is only loosely structured

● Difficult for applications to do smart things

The Traditional Web

Page 7: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

So what is the problem?

● Web content is only loosely structured

● Difficult for applications to do smart things

Solution:● Increase the structure of Web content● Publish data

The Traditional Web

Page 8: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

So what is the problem?

● Web content is only loosely structured

● Difficult for applications to do smart things

Solution:● Increase the structure of Web content● Publish data

The Traditional Web

But wait…don't we do that already?

Page 9: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

The Traditional Web

● Content providers offer access via Web APIs

Web APIWeb API

Web APIWeb API

Page 10: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

The Traditional Web

● Content providers offer access via Web APIs

● Mashups combine this data

Web APIWeb API

Web APIWeb API

Page 11: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

The Traditional Web

● Content providers offer access via Web APIs

● Mashups combine this data

Web APIWeb API

Web APIWeb API

Shortcomings:

● APIs are proprietary

● Mashups are based on a fixed set of data sources

● You can not set hyperlinks between data object

Page 12: Linked Data on the Web

Olaf Hartig - Executing SPARQL Queries over the Web of Linked Data

● Use URIs as names for things● Use HTTP URIs so that people

can look up those names.● When someone looks up a

URI, provide useful information.

● Include links to other URIs so that they can discover more things.

Tim Berners-Lee, July 2006

My Movie DB

Page 13: Linked Data on the Web

Olaf Hartig - Executing SPARQL Queries over the Web of Linked Data

http://mymovie.db/movie0362

● Use URIs as names for things● Use HTTP URIs so that people

can look up those names.● When someone looks up a

URI, provide useful information.

● Include links to other URIs so that they can discover more things.

Tim Berners-Lee, July 2006

http://mymovie.db/movie2449

http://mymovie.db/movie5112

http://mym

ovi e.db

/movie

13 42

My Movie DB

Page 14: Linked Data on the Web

Olaf Hartig - Executing SPARQL Queries over the Web of Linked Data

http://mymovie.db/movie0362

● Use URIs as names for things● Use HTTP URIs so that people

can look up those names.● When someone looks up a

URI, provide useful information.

● Include links to other URIs so that they can discover more things.

Tim Berners-Lee, July 2006

http://mymovie.db/movie2449

http://mymovie.db/movie5112

http://mym

ovi e.db

/movie

13 42

My Movie DB

http://mym

ovie.db/movie2449

?

Page 15: Linked Data on the Web

Olaf Hartig - Executing SPARQL Queries over the Web of Linked Data

http://mymovie.db/movie0362

● Use URIs as names for things● Use HTTP URIs so that people

can look up those names.● When someone looks up a

URI, provide useful information.

● Include links to other URIs so that they can discover more things.

Tim Berners-Lee, July 2006

http://mymovie.db/movie2449

http://mymovie.db/movie5112

http://mym

ovi e.db

/movie

13 42

My Movie DB

http://mym

ovie.db/movie2449

?

Page 16: Linked Data on the Web

Olaf Hartig - Executing SPARQL Queries over the Web of Linked Data

http://mymovie.db/movie0362

● Use URIs as names for things● Use HTTP URIs so that people

can look up those names.● When someone looks up a

URI, provide useful information.

● Include links to other URIs so that they can discover more things.

Tim Berners-Lee, July 2006

http://mymovie.db/movie2449

http://mymovie.db/movie5112

http://mym

ovi e.db

/movie

13 42

My Movie DB

http://mym

ovie.db/movie2449

?

Page 17: Linked Data on the Web

Olaf Hartig - Executing SPARQL Queries over the Web of Linked Data

http://mymovie.db/movie0362

● Use URIs as names for things● Use HTTP URIs so that people

can look up those names.● When someone looks up a

URI, provide useful information.

● Include links to other URIs so that they can discover more things.

Tim Berners-Lee, July 2006

http://mymovie.db/movie2449

http://mymovie.db/movie5112

http://mym

ovi e.db

/movie

13 42

My Movie DB http://geo.db/cityCJ

http

://ge

o.d

b/c

ou

ntry

7

http://geo.db/country21

http://geo.db/cityXA

http://mym

ovie.db/movie2449

?

Page 18: Linked Data on the Web

Olaf Hartig - Executing SPARQL Queries over the Web of Linked Data

http://mymovie.db/movie0362

● Use URIs as names for things● Use HTTP URIs so that people

can look up those names.● When someone looks up a

URI, provide useful information.

● Include links to other URIs so that they can discover more things.

Tim Berners-Lee, July 2006

http://mymovie.db/movie2449

http://mymovie.db/movie5112

http://mym

ovi e.db

/movie

13 42

My Movie DB http://geo.db/cityCJ

http

://ge

o.d

b/c

ou

ntry

7

http://geo.db/country21

http://geo.db/cityXA

http://mym

ovie.db/movie2449

?

Page 19: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

http://www4.wi … /0743424425

movie:relatedBook

foaf:based_near

dc:title

rdf:type

Linked Data – An Example

http://data.linkedmdb.org/.../2014

The Shining

http://data.linkedmdb.org/.../film

http://sws.geonames.org/2635167/

Page 20: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

http://www4.wi … /0743424425

movie:relatedBook

foaf:based_near

dc:title

rdf:type

Linked Data – An Example

http://data.linkedmdb.org/.../2014

The Shining

http://data.linkedmdb.org/.../film

gn:population

60943000 rdfs

:labe

l

United Kingdom

http://sws.geonames.org/2635167/

Page 21: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

rdfs

:labe

l

United Kingdom

gn:population

movie:relatedBook

foaf:based_near

dc:title

rdf:type

Linked Data – An Example

http://data.linkedmdb.org/.../2014

The Shining

http://sws.geonames.org/2635167/

http://data.linkedmdb.org/.../film

60943000

dc:title

skos:subject

The Shining

http://www4.wi … /0743424425

http://www4.wi … /Fiction

Page 22: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

rdfs

:labe

l

skos:subject

skos:subject

dc:title

gn:population

movie:relatedBook

foaf:based_near

dc:title

rdf:type

Linked Data – An Example

http://data.linkedmdb.org/.../2014

The Shining

http://sws.geonames.org/2635167/

http://data.linkedmdb.org/.../film

60943000

United Kingdom

http://www4.wi … /0743424425

The Shining

http://www4.wi … /1571884029

http://www4.wi … /Fiction

Page 23: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Properties of Linked Data

● Anyone can publish data to the Web of data

● Entities are connected by links● Giant global data graph that spans data sources

● Data is self-describing● Vocabulary terms are identified by URIs, too● Look-up yields their RDFS or OWL definition

● The Web of data is open● Applications can discover new data sources at run-time

Page 24: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Properties of Linked Data

● Anyone can publish data to the Web of data

● Entities are connected by links● Giant global data graph that spans data sources

● Data is self-describing● Vocabulary terms are identified by URIs, too● Look-up yields their RDFS or OWL definition

● The Web of data is open● Applications can discover new data sources at run-time

Is this real?

Page 25: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

W3C Linking Open Data Project

● Grassroots community effort

● Publish existing, open license datasets as Linked Data

● Interlink things between different data sources

Page 26: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

As of July 2007

> 500M triples ca. 120,000 links

W3C Linking Open Data Project

Page 27: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

ca. 6.7B triples ca. 150M links

W3C Linking Open Data Project

Page 28: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

ca. 6.7B triples ca. 150M links

W3C Linking Open Data Project

Media User generatedcontent Publications

Life Sciences

Geographic

Cross-domain

Page 29: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Linked Data Publishers

● UK government

● US government

● Thomson Reuters (Open Calais)

● MetaWeb (Freebase)

● BBC

● NY Times

● Best Buy

● CNET

etc.

Page 30: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Linked Data Publishers

● UK government

● US government

● Thomson Reuters (Open Calais)

● MetaWeb (Freebase)

● BBC

● NY Times

● Best Buy

● CNET

etc. Can I become part?

Page 31: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Linked Data Publishing Tools

● Use HTTP URIs in your FOAF profile

● Legacy data in relational databases● D2R Server, Triplify, Virtuoso, Ultrawrap, ...

● CMS● Drupal

● Native RDF stores● Sesame, AllegroGraph, Virtuoso● Talis platform (Linked Data in the cloud)

● HTML with RDFa

Page 32: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Integrating the Traditional Web

● Annotate Web documents with Linked Data URIs

● Annotation services using named entity recognition● Open Calais (Thomson Reuters) for news● Zemanta for blog posts● Epiphany

dc:subject

http://data.semanticweb.org/ … /eswc/2007/paper-69

http://dbpedia.org/resource/Machine_Learning

Page 33: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Outline

From a Web of Documentsto a Web of Data

Technical Foundations of Linked Data

Consuming Linked Data

Current Research Issues

Page 34: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Technical Foundations

● HTTP

● URI

● RDF

● RDFS / OWL

There is no magic – Linked Data is based on well-established

(Semantic) Web technologies.

Page 35: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

URIs

● Hash URIs

http://olafhartig.de/foaf.rdf#olaf

● Slash URIs

http://data.linkedmdb.org/resource/film/2014

Page 36: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Looking up URIs

GET /foaf.rdf HTTP/1.1User-Agent: curl/7.19.6 (i686-pc-linux-gnu) libcurl/7.19.6 OpenSSL/0.9.8l zlib/1.2.3Host: olafhartig.deAccept: */*

HTTP Request for http://olafhartig.de/foaf.rdf

Give me data abouthttp://olafhartig.de/foaf.rdf#olaf

Page 37: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Looking up URIs

HTTP/1.1 200 OKDate: Thu, 11 Mar 2010 08:47:53 GMTServer: Apache/2.2.6 (Unix) mod_ssl/2.2.6 OpenSSL/0.9.8gLast-Modified: Fri, 05 Mar 2010 18:01:07 GMTETag: "72a16-1946-7fe53ec0"Accept-Ranges: bytesContent-Length: 6470Content-Type: application/rdf+xmlContent-Language: de

<?xml version="1.0" encoding="UTF-8"?><rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:foaf="http://xmlns.com/foaf/0.1/">

<foaf:PersonalProfileDocument rdf:about=""> <foaf:maker rdf:resource="http://olafhartig.de/foaf.rdf#olaf"/> ...

HTTP Response:

Page 38: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

HTTP Content Negotiation

GET /data/dbprofs HTTP/1.1Host: researchersmap.informatik.hu-berlin.deAccept: text/rdf+n3

● Request the resource in a specific format (representation)

● Use the HTTP header Accept to specify a media type

Example:

Page 39: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

HTTP Content Negotiation

HTTP/1.1 200 OKDate: Thu, 11 Mar 2010 09:02:22 GMTServer: Apache/2.2.13 (Linux/SUSE)Content-Location: dbprofs.n3Vary: negotiate,acceptTCN: choiceLast-Modified: Tue, 05 Jan 2010 14:46:17 GMTETag: "40e4d-2250-47c6be683f0e1;47c6be69482f5"Accept-Ranges: bytesContent-Length: 8784Content-Type: text/rdf+n3

@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .@prefix foaf: <http://xmlns.com/foaf/0.1/> .@prefix contact: <http://www.w3.org/2000/10/swap/pim/contact#> . <> a foaf:Document ; foaf:maker <http://www.informatik.hu-berlin.de/~hartig/foaf.rdf#olaf> . ...

HTTP Response:

Page 40: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

URIs

● Hash URIs

http://olafhartig.de/foaf.rdf#olaf

● Slash URIs

http://data.linkedmdb.org/resource/film/2014

Page 41: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Redirections

GET /resource/film/2014 HTTP/1.1User-Agent: curl/7.19.6 (i686-pc-linux-gnu) libcurl/7.19.6Host: data.linkedmdb.orgAccept: application/rdf+xml

HTTP Request for http://data.linkedmdb.org/resource/film/2014

Page 42: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

GET /resource/film/2014 HTTP/1.1User-Agent: curl/7.19.6 (i686-pc-linux-gnu) libcurl/7.19.6Host: data.linkedmdb.orgAccept: application/rdf+xml

HTTP/1.1 303 See OtherDate: Thu, 11 Mar 2010 08:15:50 GMTServer: Jetty(6.1.4)Location: http://data.linkedmdb.org/data/film/2014Content-Length: 0Via: 1.1 data.linkedmdb.orgContent-Type: text/plain

HTTP Request for http://data.linkedmdb.org/resource/film/2014

Response:

Redirections

Page 43: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Redirections

GET /resource/film/2014 HTTP/1.1User-Agent: curl/7.19.6 (i686-pc-linux-gnu) libcurl/7.19.6Host: data.linkedmdb.orgAccept: text/html

HTTP Request for http://data.linkedmdb.org/resource/film/2014

Page 44: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

GET /resource/film/2014 HTTP/1.1User-Agent: curl/7.19.6 (i686-pc-linux-gnu) libcurl/7.19.6Host: data.linkedmdb.orgAccept: text/html

HTTP/1.1 303 See OtherDate: Thu, 11 Mar 2010 08:15:50 GMTServer: Jetty(6.1.4)Location: http://data.linkedmdb.org/page/film/2014Content-Length: 0Via: 1.1 data.linkedmdb.orgContent-Type: text/plain

HTTP Request for http://data.linkedmdb.org/resource/film/2014

Response:

Redirections

Page 45: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Vocabularies and Ontologies

● Defined using RDFS or OWL

● A plenty of vocabularies exist:● People● Social media● Commerce● Events● Radio and TV programmes● Music

etc.

Page 46: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

owl:sameAs

http://sws.geonames.org/2635167/

=

http://dbpedia.org/resource/United_Kingdom

=

http://rdf.freebase.com/ns/guid.9202a8c04000641f800000000003e30b

=

http://www4.wiwiss.fu-berlin.de/factbook/resource/United_Kingdom

=

http://www4.wiwiss.fu-berlin.de/eurostat/resource/countries/United_Kingdom

Page 47: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

http://www4.wi … /0743424425

movie:relatedBook

foaf:based_near

dc:title

rdf:typehttp://data.linkedmdb.org/.../2014

The Shining

http://data.linkedmdb.org/.../film

gn:population

owl:sameAs

60943000

owl:sameAs

rdfs

:labe

l

United Kingdom

http://sws.geonames.org/2635167/

http://dbpedia.org/resource/United_Kingdom

Page 48: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

owl:sameAs

gn:population

rdfs

:labe

l

60943000

United Kingdom

http://sws.geonames.org/2635167/

http://www4.wi … /0743424425

movie:relatedBook

foaf:based_near

dc:title

rdf:typehttp://data.linkedmdb.org/.../2014

The Shining

http://data.linkedmdb.org/.../film

owl:sameAs

44

db:callingCode dbp:le

ader

http://dbpedia.org/resource/Gordon_Brown

http://dbpedia.org/resource/United_Kingdom

Page 49: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Outline

From a Web of Documentsto a Web of Data

Technical Foundations of Linked Data

Consuming Linked Data

Current Research Issues

Page 50: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Consuming Linked Data

… by Humans

● Linked Data browsers

● Faceted browsers

● On-the-fly Linked Data Mashups

● Linked Data based applications

Page 51: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Linked Data Browsers

● Provide a tabular view on retrieved RDF data

● Some integrate data from multiple sources

● Allow to follow RDF links

● Multiple options:● Tabulator● Disco● OpenLink Data Explorer● Zitgist Data Viewer● Marbles

etc.

Page 52: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Faceted Browsers

http://dbpedia.neofonie.de

Page 53: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

On-the-fly Mashups

http://sig.ma

Page 54: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Linked Data based Applications

[SFSW'09]

Page 55: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

New Kind of Applications

● Users retain full control over their data

● Users manage and publish data on their own

● All that is needed for the application is a URI

<http://www.dbis.informatik.hu-berlin.de/ … /freytag.rdf#me> rdf:type :DBProfessor . …

http://researchersmap.informatik.hu-berlin.de/data/dbprofs

Page 56: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Users Really Own their Data

<http://www.dbis.informatik.hu-berlin.de/ … /freytag.rdf#me> contact:fullName "Prof. Johann-Christoph Freytag, Ph.D." ; contact:office [ contact:address [ contact:street "Rudower Chaussee 25" ; contact:city "Berlin"^^xsd:string ; contact:postalCode "12489"^^xsd:string ] ] ; foaf:topic_interest <http://dbpedia.org/resource/Query_optimization> , <http://dbpedia.org/resource/Privacy> , <http://dbpedia.org/resource/Data_quality> , <http://dbpedia.org/resource/Data_warehouse> ; owl:sameAs <http://dblp.l3s.de/d2r/resource/authors/Johann_Christoph_Freytag> . …

http://www.dbis.informatik.hu-berlin.de/ ... /freytag.rdf

Page 57: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Consuming Linked Data

… in Applications

● Look up URIs and process the retrieved data

● Query with SPARQL

Page 58: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

● Query language for RDF data

● Main idea: pattern matching● Describe subgraphs of the queried RDF graph● Subgraphs that match your description yield a result● Mean: graph patterns (i.e. RDF graphs with variables)

?vhttp://.../Volcano

rdf:type

Brief Introduction to SPARQL

Page 59: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Queriedgraph:

?vhttp://.../Volcano

rdf:type

http://.../Mount_Baker http://.../Volcanordf:type

"1880"

p:lastEruption

htp://.../Mount_Etna

rdf:type

?v

http://.../Mount_Bakerhttp://.../Mount_Etna

Results:

Brief Introduction to SPARQL

Page 60: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Querying Linked Data with SPARQL

● Linked Data sources usually provide a SPARQL service

● Send your query, receive the result

Data Source Endpoint Address

DBpedia http://dbpedia.org/sparql

Musicbrainz http://dbtune.org/musicbrainz/sparql

U.S. Census http://www.rdfabout.com/sparql

Semantic Crunchbase http://cb.semsol.org/sparql

More complete list: http://esw.w3.org/topic/SparqlEndpoints

Page 61: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Querying Linked Data with SPARQL

Querying a single dataset is quite boring

compared to:

Issuing SPARQL queries over multiple datasets

How can you do this?

● Issue follow-up queries to different endpoints

● Query a central collection of datasets

● Build store with copies of relevant datasets

● (Use query federation system)

● Use a link traversal based query system

Page 62: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Traditional approach 1: data centralization

● Querying a collection ofcopies from all relevantdatasets

Querying Linked Data with SPARQL

Page 63: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Traditional approach 2: federated query processing

● Querying a mediator whichdistributes subqueries torelevant sources andintegrates the results

Querying Linked Data with SPARQL

???

?

Page 64: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main drawback:

You have to know the relevantdata sources in advance.

You restrict yourself tothe selected sources.

You do not tap thefull potential of

the Web !

Page 65: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

A novel approach:

Link Traversal Based Query Execution[ISWC'09]

Page 66: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

Queried data

Page 67: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

Queried data

filmingLocation

http://.../movie2449

statistics

?loc

Query unemp_rate?ur

?stat

Page 68: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

Queried data

filmingLocation

http://.../movie2449

statistics

?loc

Query unemp_rate?ur

?stat

http://.../movie2449

?

Page 69: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

Queried data

filmingLocation

http://.../movie2449

statistics

?loc

Query unemp_rate?ur

?stat

http://.../movie2449

?

Page 70: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

Queried data

filmingLocation

http://.../movie2449

statistics

?loc

Query unemp_rate?ur

?stat

http://.../movie2449

?

Page 71: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

Queried data

filmingLocation

http://.../movie2449

statistics unemp_rate?ur

?stat

?loc

Query

Page 72: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

Queried data

filmingLocationhttp://geo.../Italyhttp://.../movie2449

filmingLocation

http://.../movie2449

statistics unemp_rate?ur

?stat

?loc

Query

Page 73: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

Queried data

http://geo.../Italy

?loc

filmingLocationhttp://.../movie2449

filmingLocation

http://.../movie2449

statistics unemp_rate?ur

?stat

?loc

Query

http://geo.../Italy

Page 74: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

Queried data

http

://ge

o.../

Italy

?

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

http://geo.../Italy

?loc

filmingLocation

http://.../movie2449

statistics unemp_rate?ur

?stat

?loc

Query

Page 75: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

Queried data

http

://ge

o.../

Italy

?

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

http://geo.../Italy

?loc

filmingLocation

http://.../movie2449

statistics unemp_rate?ur

?stat

?loc

Query

Page 76: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

Queried data

http

://ge

o.../

Italy

?

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

http://geo.../Italy

?loc

filmingLocation

http://.../movie2449

statistics unemp_rate?ur

?stat

?loc

Query

Page 77: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

http://geo.../Italy

?loc

Queried data

filmingLocation

http://.../movie2449

statistics unemp_rate?ur

?stat

?loc

Query

Page 78: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

http://geo.../Italy

?loc

Queried data

filmingLocation

http://.../movie2449 unemp_rate?ur

statistics ?stat

?loc

Query

Page 79: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

http://geo.../Italy

?loc

Queried data

filmingLocation

http://.../movie2449 unemp_rate?ur

statistics ?stat

?loc

Query

statistics http://stat.db/.../it

http://geo.../Italy

Page 80: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

http://geo.../Italy

?loc

Queried data

http://geo.../Italy http://stats.db/../it

?stat?loc

statistics http://stat.db/.../it

http://geo.../Italy

filmingLocation

http://.../movie2449 unemp_rate?ur

statistics ?stat

?loc

Query

Page 81: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Main Idea

● Intertwine query evaluation with traversal of RDF links

● Alternately:● Evaluate parts of the query on a

continuously augmented set of data● Look up URIs in intermediate

solutions and add retrieved datato the queried data set

http://geo.../Italy

?loc

Queried data

http://geo.../Italy http://stats.db/../it

?stat?loc

filmingLocation

http://.../movie2449 unemp_rate?ur

statistics ?stat

?loc

Query

Page 82: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

In a Nutshell

● Link traversal based query execution:● Evaluation on a continuously augmented dataset● Discovery of potentially relevant data during execution● Discovery driven by intermediate solutions

● Main advantage:● No need to know all data sources in advance

Page 83: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Real-World Example

SELECT DISTINCT ?author ?phone WHERE {

?pub swc:isPartOf <http://data.semanticweb.org/conference/eswc/2009/proceedings> .

?pub swc:hasTopic ?topic . ?topic rdfs:label ?topicLabel .

FILTER regex( str(?topicLabel), "ontology engineering", "i" ) .

?pub swrc:author ?author .

{ ?author owl:sameAs ?authorAlt }

UNION

{ ?authorAlt owl:sameAs ?author }

?authorAlt foaf:phone ?phone

}

Return phone numbers ofauthors of ontology engineering papers

at ESWC'09.

2

297

161min 30sec

# of query results

# of retrieved graphs

# of accessed servers

avg. execution time

Page 84: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Application

SELECT DISTINCT ?i ?labelWHERE {

?prof rdf:type <http://res ... data/dbprofs#DBProfessor> ; foaf:topic_interest ?i .

OPTIONAL { ?i rdfs:label ?label FILTER( LANG(?label)="en" || LANG(?label)="") }}ORDER BY ?label ?

SQUINSemWeb

ClientLib

● Researchers Map implemented with SQUIN● Query interface to the whole Web of Data

Page 85: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

SELECT DISTINCT ?i ?labelWHERE {

?prof rdf:type <http://res ... data/dbprofs#DBProfessor> . ?prof foaf:topic_interest ?i .

OPTIONAL {?i rdfs:label ?labelFILTER( LANG(?label)="en" || LANG(?label)="")

}}ORDER BY ?label

Application

Page 86: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

● Implementation of Researchers Map was very easy due to:● SQUIN / SemWeb Client Lib

● Approx. 700 LOC JavaScript (incl. 100 for the queries)

● Approx. 50 LOC PHP (Mainly to set up server side proxy due to same origin policy)

● Convenient access to SQUIN with SQUIN PHP tools

● Try it: http://squin.org

$s = 'http:// …'; // address of the SQUIN service$q = new SparqlQuerySock( $s, '… SELECT ...' );$res = $q->getJsonResult(); // or getXmlResult()

Application

Page 87: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Consuming Linked Data

… getting started

Issues people have when they want to start:

● Finding URIs

● Finding additional data

● Finding SPARQL endpoints

Page 88: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Finding URIs

Problem:

Two options:

● Data source specific solutions● Some Linked Data sources provide a keyword based search

for things in their dataset(s)

● Search Engines for the Web of data

What URIs exist that identifythe thing I'm interested in?

Page 89: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Page 90: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Page 91: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Finding URIs

You may try a SPARQL query:

What if there is no search possibility?

SELECT DISTINCT ?s WHERE { ?s rdfs:label ?label . FILTER regex( str(?label), "Berlin", "i" ) .}

Page 92: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

● Search engines for the Web of data provide keyword based search for things in different datasets)● Falcons http://iws.seu.edu.cn/services/falcons/● Sindice http://sindice.com● SWSE http://www.swse.org● Watson http://watson.kmi.open.ac.uk

● They have also APIs

Finding URIs

Page 93: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Page 94: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Page 95: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Page 96: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Finding Additional Data

Problem:

Three options:

● Follow links (e.g. rdfs:seeAlso, owl:sameAs)

● Use a search engine for the Web of data

● Use a co-reference service● Co-reference services find different

URIs that refer to the same thing● They may also provide an API

Given a URIs, where do I find more data as what is available by looking it up?

Page 97: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Page 98: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Page 99: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Finding SPARQL Endpoints

Problem:

● Look at: http://esw.w3.org/topic/SparqlEndpoints

● Still an open issue

What relevant endpoints exist?Where is the SPARQL endpointfor a dataset?What is the data provided via a SPARQL endpoint about?

Page 100: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Outline

From a Web of Documentsto a Web of Data

Technical Foundations of Linked Data

Consuming Linked Data

Current Research Issues

Page 101: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Linked Data Fusion

Requirements:

● Schema mapping: map data into a single schema

● Identity resolution: smush data from all sources

● Conflict resolution: resolve inconsistencies in the data

Applications want an integrated view onall data that is available about a thing

Page 102: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

User Interfaces and Interaction

● How do we build interfaces that operate over sucha large amount of data?

● What will be their interaction paradigm?

● How to explain data provenance and data fusion?

Page 103: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Provenance, Quality, and Trust

● There are no facts on the Web – everything is a claim

● Increasing amount of research in this area

● W3C provenance incubator group

● Our contributions so far:● A provenance model for the Web of data [LDOW'09]● A provenance based Information Quality assessment method

[SWPM'09]● tSPARQL – a trust aware extension for SPARQL [ESWC'09]

Page 104: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

Take-away Summary

● Entities are connected by data links

● Data is self-describing

● Anyone can publish data to the Web of data

● Linked Data holds an enormous potential: users may benefit from a virtually unbound set of data sources

● Learn more about Linked Data:● “Linked Data – The Story So Far”

by C. Bizer, T. Heath, T. Berners-Lee● On consuming Linked Data: http://consuminglinkeddata.org

The traditional Web of documents evolves into a Web of data.

Page 105: Linked Data on the Web

Olaf Hartig - Linked Data on the Web

These slides have been created byOlaf Hartig

http://olafhartig.de

Some slides are based on slide sets provided by● Christian Bizer● Juan Sequeda

This work is licensed under aCreative Commons Attribution-Share Alike 3.0 License

(http://creativecommons.org/licenses/by-sa/3.0/)