wtf is the semantic web and linked data

121
WTF is the Semantic Web and Linked Data Juan F. Sequeda Department of Computer Science University of Texas at Austin Nov 17, 2011

Upload: juan-sequeda

Post on 18-Nov-2014

1.721 views

Category:

Technology


4 download

DESCRIPTION

Talk given at UT ISchool on Nov 17, 2011

TRANSCRIPT

Page 1: WTF is the Semantic Web and Linked Data

WTF is the Semantic Web and Linked Data

Juan F. SequedaDepartment of Computer Science

University of Texas at AustinNov 17, 2011

Page 2: WTF is the Semantic Web and Linked Data

Semantic Web? Linked Data?

WTF?

Page 3: WTF is the Semantic Web and Linked Data

WTF is the Semantic Web?

Page 4: WTF is the Semantic Web and Linked Data

WTF is the Semantic Web?

Page 5: WTF is the Semantic Web and Linked Data

Internet != Web

Page 6: WTF is the Semantic Web and Linked Data
Page 7: WTF is the Semantic Web and Linked Data

What is the Web?

“… the Web, is a system of interlinked hypertext documents accessed via the Internet. With a web browser, one can view web pages that may contain text, images […] and navigate between them via hyperlinks”

http://en.wikipedia.org/wiki/World_Wide_Web

Page 8: WTF is the Semantic Web and Linked Data

Current Web = internet + links + docs

Page 9: WTF is the Semantic Web and Linked Data

History of the Web

• Created by Tim Berners-Lee at CERN in 1989• Mosaic browser in 1993• W3C created in 1994• Exponential growth mid 90s• Amazon, Ebay – 1995• Search engines – Google 1998• Dot-com boom 1997 – 2001• Web 2.0 – blogs, Facebook, Twitter, etc

Page 10: WTF is the Semantic Web and Linked Data

What is the problem?

Page 11: WTF is the Semantic Web and Linked Data

WHAT’S THE WEATHER IN

AUSTIN TODAY?

http://www.flickr.com/photos/jamieca/31631256/

Page 12: WTF is the Semantic Web and Linked Data
Page 13: WTF is the Semantic Web and Linked Data
Page 14: WTF is the Semantic Web and Linked Data
Page 15: WTF is the Semantic Web and Linked Data

What is the problem?

• The web is full of documents• We aren’t always interested in documents

– We are interested in THINGS– These THINGS might be in documents

• We can read a HTML document rendered in a browser and find what we are searching for– This is hard for computers. – Computers have to guess (even though they are

pretty good at it)

Page 16: WTF is the Semantic Web and Linked Data

The Web of Documents

Search

Crawler

Search Engine

Page 17: WTF is the Semantic Web and Linked Data

The Web is a Data Shredder

Structured Data

Unstructured Data

Thanks Martin Hepp

Page 18: WTF is the Semantic Web and Linked Data

What would we like?

• Make it easy for computers/software to find THINGS

Do you SEARCH or do you FIND?

Page 19: WTF is the Semantic Web and Linked Data

Search for

Football Players who went to the University of Texas at Austin, played for

the Dallas Cowboys as Cornerback

Page 20: WTF is the Semantic Web and Linked Data
Page 21: WTF is the Semantic Web and Linked Data
Page 22: WTF is the Semantic Web and Linked Data
Page 23: WTF is the Semantic Web and Linked Data

Why can’t we just FIND it…

Page 24: WTF is the Semantic Web and Linked Data
Page 25: WTF is the Semantic Web and Linked Data
Page 26: WTF is the Semantic Web and Linked Data

Guess how I FOUND out?

Page 27: WTF is the Semantic Web and Linked Data

On a Semantic Web

• Besides publishing documents on the web– which computers can’t understand easily

• Let’s publish on the web something that computers can understand

DATA

Page 28: WTF is the Semantic Web and Linked Data

The Semantic Web is a web of data

The current web is a web of documents

Page 29: WTF is the Semantic Web and Linked Data

But wait… doesn’t the web already have data?

Page 30: WTF is the Semantic Web and Linked Data

Current Data on the Web

• Relational Databases• APIs• XML• CSV• XLS• …• Can’t computers and applications already

consume that data on the web?

Page 31: WTF is the Semantic Web and Linked Data

Yes! But it is all in different formats and data models!

Page 32: WTF is the Semantic Web and Linked Data

This makes it hard to integrate data

Page 33: WTF is the Semantic Web and Linked Data

The data in different data sources aren’t linked

Page 34: WTF is the Semantic Web and Linked Data

For example, how do I know that the Juan Sequeda in Facebook is the same as Juan

Sequeda in Twitter

Page 35: WTF is the Semantic Web and Linked Data

Or if I create a mashup from different services, I have to learn different APIs and I get different

formats of data back

Page 36: WTF is the Semantic Web and Linked Data

Data is Siloed

Page 37: WTF is the Semantic Web and Linked Data

Wouldn’t it be great if we had a standard way of publishing data on the Web?

Page 38: WTF is the Semantic Web and Linked Data

We have a standardized way of publishing documents on the web, right?

HTML

Page 39: WTF is the Semantic Web and Linked Data

Then why can’t we have a standard way of publishing data on the Web?

Page 40: WTF is the Semantic Web and Linked Data

Good question! And the answer is YES. There is!

RDF

Page 41: WTF is the Semantic Web and Linked Data

Resource Description Framework (RDF)

• Data Model = a way to model data– i.e. Relational databases use relational data model

• RDF is a graph data model

Page 42: WTF is the Semantic Web and Linked Data

Key Value vs Graph

• Key Values– firstName Juan– lastName Sequeda– livesIn Austin– knows Stephane Corlosquet

• But what are these key/values describing?– ME!

Page 43: WTF is the Semantic Web and Linked Data

RDF is a Graph

• Let’s group the Key/Values together– <JuanSequeda> <firstName> “Juan”– <JuanSequeda> <lastName> “Sequeda”– <JuanSequeda> <livesIn> “Austin”– <JuanSequeda> <knows> <StephaneCorlosquet>– ..– <StephaneCorlosquet> <firstName> “Stephane”– <StephaneCorlosquet> <lastName> “Corlosquet”– <StephaneCorlosquet> <livesIn> “Boston”

Page 44: WTF is the Semantic Web and Linked Data

RDF is a Graph

• Let’s group the Key/Values together– <JuanSequeda> <firstName> “Juan”– <JuanSequeda> <lastName> “Sequeda”– <JuanSequeda> <livesIn> “Austin”– <JuanSequeda> <knows> <StephaneCorlosquet>– ..– <StephaneCorlosquet> <firstName> “Stephane”– <StephaneCorlosquet> <lastName> “Corlosquet”– <StephaneCorlosquet> <livesIn> “Boston”

Key/ValueIdentifier for the “group”

Page 45: WTF is the Semantic Web and Linked Data

RDF can be serialized in different ways

• RDF/XML• RDFa (RDF in HTML)• N3• Turtle• JSON

Page 46: WTF is the Semantic Web and Linked Data
Page 47: WTF is the Semantic Web and Linked Data

RDFa

Page 48: WTF is the Semantic Web and Linked Data

RDF/XML

Page 49: WTF is the Semantic Web and Linked Data

RDF/N-triples

Page 50: WTF is the Semantic Web and Linked Data

RDF/Turtle

Page 51: WTF is the Semantic Web and Linked Data

So does that mean that I have to publish my data in RDF now?

Page 52: WTF is the Semantic Web and Linked Data

You don’t have to… but we would like you to

Schema.orgRich Snippets

Page 53: WTF is the Semantic Web and Linked Data

An example

Page 54: WTF is the Semantic Web and Linked Data

Document on the Web

Page 55: WTF is the Semantic Web and Linked Data

Databases back up documents

Isbn Title Author PublisherID ReleasedData

978-0-596-15381-6

Programming the Semantic Web

Toby Segaran 1 July 2009

… … … … …

PublisherID PublisherName

1 O’Reilly Media

… …

This is a THING:A book title “Programming the Semantic Web” by Toby Segaran, …

THINGS have PROPERTIES:A Book as a Title, an author, …

Page 56: WTF is the Semantic Web and Linked Data

Lets represent the data in RDF

book

Programming the Semantic Web

978-0-596-15381-6

Toby Segaran

Publisher O’Reilly

title

name

author

publisher

isbn

Isbn Title Author PublisherID ReleasedData

978-0-596-15381-6

Programming the Semantic Web

Toby Segaran

1 July 2009

PublisherID PublisherName

1 O’Reilly Media

Page 57: WTF is the Semantic Web and Linked Data

Remember that we are on the web

Everything on the web is identified by a URI

Page 58: WTF is the Semantic Web and Linked Data

And now let’s link the data to other data

http://…/isbn978

Programming the Semantic Web

978-0-596-15381-6

Toby Segaran

http://…/publisher1 O’Reilly

title

name

author

publisher

isbn

Page 59: WTF is the Semantic Web and Linked Data

And now consider the data from Revyu.com

http://…/isbn978

http://…/

review1

Awesome Book

http://…/

reviewer

Juan Sequeda

hasReview

reviewer

description

name

Page 60: WTF is the Semantic Web and Linked Data

Let’s start to link data

http://…/isbn978

Programming the Semantic Web

978-0-596-15381-6

Toby Segaran

http://…/publisher1 O’Reilly

title

name

author

publisher

isbn

http://…/isbn978

owl:sameAs

http://…/

review1

Awesome Book

http://…/

reviewer

Juan Sequeda

hasReview

hasReviewer

description

name

Page 61: WTF is the Semantic Web and Linked Data

Juan Sequeda publishes data too

http://juansequeda.

com/id

livesIn

Juan Sequedaname

http://dbpedia.org/Austin

Page 62: WTF is the Semantic Web and Linked Data

Let’s link more datahttp://…/isbn978

http://…/

review1

Awesome Book

http://…/

reviewer

Juan Sequeda

http://juansequeda.

com/id

hasReview

hasReviewer

description

name

sameAs

livesIn

Juan Sequedaname

http://dbpedia.org/Austin

Page 63: WTF is the Semantic Web and Linked Data

And more

http://…/isbn978

Programming the Semantic Web

978-0-596-15381-6

Toby Segaran

http://…/publisher1

O’Reilly

title

name

author

publisher

isbn

http://…/isbn978

owl:sameAs

http://…/

review1

Awesome Book

http://…/

reviewer

Juan Sequeda

http://juansequeda.

com/id

hasReview

hasReviewer

description

name

owl:sameAs

livesIn

Juan Sequedaname

http://dbpedia.org/Austin

Page 64: WTF is the Semantic Web and Linked Data

Data on the Web that is in RDF and is linked to other RDF data is

LINKED DATA

Page 65: WTF is the Semantic Web and Linked Data

Linked Data Principles1. Use URIs as names for

things2. Use HTTP URIs so that

people can look up (dereference) those names.

3. When someone looks up a URI, provide useful information.

4. Include links to other URIs so that they can discover more things.

Page 66: WTF is the Semantic Web and Linked Data

Linked Data makes the web appear as ONE

GIANTHUGE

GLOBAL

DATABASE!

Page 67: WTF is the Semantic Web and Linked Data

I can query a database with SQL. Is there a way to query Linked Data with a query language?

Page 68: WTF is the Semantic Web and Linked Data

Yes! There is actually a standardize language for that

SPARQL

Page 69: WTF is the Semantic Web and Linked Data

FIND all the reviews on the book “Programming the Semantic Web” by people who live in Austin

Page 70: WTF is the Semantic Web and Linked Data

SELECT ?review ?commentWHERE { isbn:978 ex:hasReview ?review . ?review ex:description ?comment . ?review ex:hasReviewer ?person . ?person ex:lives dbpedia:Austin .}

SPARQL

Page 71: WTF is the Semantic Web and Linked Data

http://…/isbn978

Programming the Semantic Web

978-0-596-15381-6

Toby Segaran

http://…/publisher1 O’Reilly

title

name

author

publisher

isbn

http://…/isbn978

sameAs

http://…/

review1

Awesome Book

http://…/

reviewer

Juan Sequeda

http://juansequeda.

com

hasReview

hasReviewer

description

name

sameAs

livesIn

Juan Sequedaname

http://dbpedia.org/Austin

SELECT ?review ?commentWHERE {isbn:978 ex:hasReview ?review .?review ex:description ?comment .?review ex:hasReviewer ?person .?person ex:lives dbpedia:Austin .}

Page 72: WTF is the Semantic Web and Linked Data

OWL

• Here is where the real semantics shows up• Web Ontology Language• Define schema/vocabulary• Classes, Properties, Inheritance, etc• Subclasses, Subproperties• …• You can get more complicated with rules…

Page 73: WTF is the Semantic Web and Linked Data

dexa:TirmiziSM08

auth: <http://dblp.l3s.de/d2r/page/authors/>dexa: <http://dblp.l3s.de/d2r/page/publications/conf/dexa/>dc: <http://purl.org/dc/elements/1.1/>sw: <http://data.semanticweb.org/person/>swrc: <http://swrc.ontoware.org/ontology#>owl: <http://www.w3.org/2002/07/owl#>rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>rdfs: <http://www.w3.org/2000/01/rdf-schema#>

auth:Juan_Sequeda

auth:Daniel_P._Miranker

auth:Syed_Hamid_Tirmizi

dc:creator

dc:creatordc:creator

“Translating SQL Applications to the

Semantic Web"

dc:title

sw:juan-f-sequeda

sw:daniel-miranker

sw:syed-tirmizi

foaf:Personswrc:InProceedings

swrc:Publication

dc:creator

rdf:type

rdf:type

rdfs:subClassOf

owl:sameAs

owl:sameAs

owl:sameAs

OWL

RDF

Page 74: WTF is the Semantic Web and Linked Data

RDB and the Semantic Web

74

RELATIONAL MODEL

TABLE DEFINITION

CONSTRAINTS

TRIGGERS

RDF

RDFS

OWL

RIF

TIM

E

Page 75: WTF is the Semantic Web and Linked Data

This looks cool, but let’s be realistic. What is the incentive to publish Linked Data?

Page 76: WTF is the Semantic Web and Linked Data

What was your incentive to publish an HTML page in 1990?

Page 77: WTF is the Semantic Web and Linked Data

1) Share data in documents2) Because you neighbor was doing it

… later on …3) Marketing, Advertising, …, SEO

Page 78: WTF is the Semantic Web and Linked Data

So why should we publish Linked Data in 2011?

Page 79: WTF is the Semantic Web and Linked Data

1) Share data as data2) Because you neighbor is doing it

…3) Marketing, Advertising, SEO ++

Page 80: WTF is the Semantic Web and Linked Data

Linked Data Publishers• UK Government• US Government• BBC• Open Calais – Thomson Reuters• Freebase/Google• NY Times• Best Buy• Sears• Kmart• Overstock.com• CNET• Dbpedia• O’Reilly Media• …

Page 81: WTF is the Semantic Web and Linked Data

May 2007

Page 82: WTF is the Semantic Web and Linked Data

Oct 2007

Page 83: WTF is the Semantic Web and Linked Data

Nov 2007

Page 84: WTF is the Semantic Web and Linked Data

Feb 2008

Page 85: WTF is the Semantic Web and Linked Data

Mar 2008

Page 86: WTF is the Semantic Web and Linked Data

Sept 2008

Page 87: WTF is the Semantic Web and Linked Data

Mar 2009 (1)

Page 88: WTF is the Semantic Web and Linked Data

Mar 2009 (2)

Page 89: WTF is the Semantic Web and Linked Data

July 2009

Page 90: WTF is the Semantic Web and Linked Data

September 2010

Page 91: WTF is the Semantic Web and Linked Data

September 2011

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/

Page 92: WTF is the Semantic Web and Linked Data

YOU GET THE PICTURE

ITS BIG and getting

BIGGER and

BIGGER

Page 93: WTF is the Semantic Web and Linked Data

What is the Web

• Web of Documents HTML• Web of Data RDF• Global Unique IDs HTTP URIs• Schema/Ontologies OWL• Query RDF SPARQL

Page 94: WTF is the Semantic Web and Linked Data

Now what can we do with this data?

Page 95: WTF is the Semantic Web and Linked Data

Generic Applications

Page 96: WTF is the Semantic Web and Linked Data

Linked Data Browsers

Page 97: WTF is the Semantic Web and Linked Data

Linked Data Browsers

• Not actually separate browsers. Run inside of HTML browsers

• View the data that is returned after looking up a URI in tabular form

• User can navigate between data sources by following RDF Links

• (IMO) No usability

Page 98: WTF is the Semantic Web and Linked Data
Page 99: WTF is the Semantic Web and Linked Data

Linked Data Browsers

• http://browse.semanticweb.org/• Tabulator• OpenLink Dataexplorer• Zitgist• Marbles• Explorator• Disco• LinkSailor

Page 100: WTF is the Semantic Web and Linked Data

Linked Data (Semantic Web) Search Engines

Page 101: WTF is the Semantic Web and Linked Data

Linked Data (Semantic Web) Search Engines

• Just like conventional search engines (Google, Bing, Yahoo), crawl RDF documents and follow RDF links.– Current search engines don’t crawl data, unless it’s RDFa

• Human focus Search– Falcons - Keyword– SWSE – Keyworkd– VisiNav – Complex Queries

• Machine focus Search– Sindice – data instances– Swoogle - ontologies– Watson - ontologies– Uberblic – curated integrated data instances

Page 102: WTF is the Semantic Web and Linked Data

(Semantic) SEO ++

• Markup your HTML with RDFa• Use standard vocabularies (ontologies)

– Google Vocabulary– Good Relations– Dublin Core

• Google and Yahoo will crawl this data and use it for better rendering

Page 103: WTF is the Semantic Web and Linked Data
Page 104: WTF is the Semantic Web and Linked Data

On-the-fly Mashups

Page 105: WTF is the Semantic Web and Linked Data

http://sig.ma

Page 106: WTF is the Semantic Web and Linked Data

Domain Specific Applications

Page 107: WTF is the Semantic Web and Linked Data

Domain Specific Applications

• Government– Data.gov– Data.gov.uk– http://data-gov.tw.rpi.edu/wiki/Demos

• Music– Seevl.net

• Dbpedia Mobile• Life Science

– LinkedLifeData• Sports

– BBC World Cup

Page 108: WTF is the Semantic Web and Linked Data

Faceted Browsers

Page 109: WTF is the Semantic Web and Linked Data

http://dbpedia.neofonie.de/browse/

Page 110: WTF is the Semantic Web and Linked Data

Query your data

Page 111: WTF is the Semantic Web and Linked Data

Find all the locations of all the original paintings of Modigliani

Page 112: WTF is the Semantic Web and Linked Data

Select all proteins that are linked to a curated interaction from the literature and to inflammatory response

http://linkedlifedata.com/

Page 113: WTF is the Semantic Web and Linked Data

http://tata.csres.utexas.edu:8080/specify/data/taxon51807

Page 114: WTF is the Semantic Web and Linked Data

http://tata.csres.utexas.edu:8080/specify/data/taxon51807

Links to other Data Sources

Page 115: WTF is the Semantic Web and Linked Data

Linked Data is Data Integration

Specify

Morphbank

Morphster

SPARQLQuery

Diamond

Ultrawrap

Ultrawrap

Ultrawrap

Page 116: WTF is the Semantic Web and Linked Data

Example 1 (Specify – DBpedia)

• Get full name and guid from taxon with id http://tata.csres.utexas.edu:8080/specify/data/taxon51807#thing

• AND fin any subjects it may have “skos:subject”

Page 117: WTF is the Semantic Web and Linked Data

Result Example 1• Note that

http://dbpedia.org/resource/Category:Fish_of_Australia comes from a different data source (dbpedia.org)

Page 118: WTF is the Semantic Web and Linked Data

Example 2 (Specify-Morphbank)

• Get full name and guid from taxon with id http://tata.csres.utexas.edu:8080/specify/data/taxon42947#thing

• AND the rank and kingdom from Morphbank

Page 119: WTF is the Semantic Web and Linked Data

Result Example 2• Note that full name and guid come

from Specify http://tata.csres.utexas.edu:8080/specify/data/taxon42947

• AND rank and kingdom come from Morphbank http://tata.csres.utexas.edu:8080/morphbank/data/taxa398354

Page 120: WTF is the Semantic Web and Linked Data

The killer app for Semantic Technology is YOUR life (online)

– Tom Gruber

A little semantics goes a long way

- Jim HendlerKnowledge is Power

- Jim Hendler

The novel part of the Semantic Web is not the Semantics, but the Web

- Frank van Harmelen

Occupy Your Data- Tim Finin

RAW DATA NOW- Tim Berners-Lee

Linked Data is the (Semantic) Web done right

- Tim Berners-Lee

Page 121: WTF is the Semantic Web and Linked Data

QUESTIONS?