introduction to the semantic web and linked open data

47
Introduction to the Semantic Web and Linked Open Data

Upload: erling

Post on 12-Jan-2016

42 views

Category:

Documents


4 download

DESCRIPTION

Introduction to the Semantic Web and Linked Open Data. Dramatis Personae. Christopher Gutteridge. Nick Gibbins (in spirit). Goals. Overview of issues relating to the publication and use of linked data in HEIs The lessons that we’ve learned! Pragmatism rather than perfection - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Introduction to the  Semantic Web and  Linked Open Data

Introduction to the Semantic Web and Linked Open Data

Page 2: Introduction to the  Semantic Web and  Linked Open Data

Dramatis Personae

Nick Gibbins(in spirit)

Christopher Gutteridge

Page 3: Introduction to the  Semantic Web and  Linked Open Data

• Overview of issues relating to the publication and use of linked data in HEIs

• The lessons that we’ve learned!• Pragmatism rather than perfection• General guidelines rather than detailed

specifications• Coining cool URIs• Publication alongside existing resources• Licensing

Goals

Page 4: Introduction to the  Semantic Web and  Linked Open Data
Page 5: Introduction to the  Semantic Web and  Linked Open Data
Page 6: Introduction to the  Semantic Web and  Linked Open Data

http://is.gd/dqiJc(The only URL you need to write down)

Page 7: Introduction to the  Semantic Web and  Linked Open Data

• Detailed tutorial on the finer points of:• RDF• RDFa• RDF Schema• OWL• SPARQL• …

(an hour and a half isn’t enough for this – and there are good tutorials available online)

Non-Goals

Page 8: Introduction to the  Semantic Web and  Linked Open Data

“If HP knew what HP knows, we’d be three times more profitable”

Lew PlattHewlett-Packard Chairman and CEO

Page 9: Introduction to the  Semantic Web and  Linked Open Data

Linked Data in a NutshellLinked Data in a Nutshell

http://www.flickr.com/photos/arielarielariel/322301228/http://www.flickr.com/photos/arielarielariel/322301228/

Page 10: Introduction to the  Semantic Web and  Linked Open Data

• Linked Data is about providing structured data on the Web

• Doesn’t necessarily require RDF (though it usually uses it)

Page 11: Introduction to the  Semantic Web and  Linked Open Data

•Underlying model of triples used to describe the relations between entities in linked data

• This is the basis of the RDF data model

• (subject, predicate, object)• e.g. “The Hobbit”, “created by”, “JRR Tolkien”

The triple

The Hobbit JRR Tolkiencreated by

subject predicate object

Page 12: Introduction to the  Semantic Web and  Linked Open Data

• Take a citation:• Tim Berners-Lee, James Hendler and Ora Lassila. The

Semantic Web. Scientific American, May 2001

•We can identify a number of distinct statements in this citation:• There is an article titled “The Semantic Web”• One of its authors is a person named “Tim Berners-Lee”

(etc)• It appeared in a publication titled “Scientific American”• It was published in May 2001

Example

Page 13: Introduction to the  Semantic Web and  Linked Open Data

• We can represent these statements graphically:

Example

Tim Berners-Lee

James Hendler

Ora Lassila

The Semantic Web

Scientific American

name

name

name

title

title

creator

publishedIn creator

creator

2001-05

date

Page 14: Introduction to the  Semantic Web and  Linked Open Data

Example

• There are two types of node in this graph:• Literals, which have a value but no identity

(a string, a number, a date)

• Resources, which represent objects with identity(a web page, a person, a journal)

Scientific American

Page 15: Introduction to the  Semantic Web and  Linked Open Data

• Resources are identified by URIs• Property labels are also identified by URIs, and are

drawn from a vocabulary or ontology

Example

http://purl.org/dc/elements/1.1/title

http://www.sciam.com/ Scientific American

subject predicate object

Page 16: Introduction to the  Semantic Web and  Linked Open Data

• The triple-based graph model makes it possible to mix terms from different vocabularies in the same graph

• Simplifies the task of information integration

Mixing Vocabularies

Tim Berners-Lee

James Hendler

Ora Lassila

The Semantic Web

Scientific American

name

name

name

title

title

creator

publishedIn creator

creator

2001-05

date

foaf

dc

bibo

Page 17: Introduction to the  Semantic Web and  Linked Open Data

Set of publishing practices for SW data:

1. Use URIs as names for things2. Use HTTP URIs so that people can look up those

names3. When someone looks up a URI, provide useful

information4. Include links to other URIs. so that they can discover

more things

Effectively, putting the hypertext back into the Semantic Web

Simplifies integration between datasets while maintaining loose coupling

Linked Data Principles

Page 18: Introduction to the  Semantic Web and  Linked Open Data

Example

graph describing ‘sw’

sciam

tbl

jh

ora

sw

The Semantic Web title

creator

publishedIn creator

creator

2001-05

date

graph describing ‘tbl’

Tim Berners-Lee

nametbl

graph describing ‘jh’

James Hendlername

jh

graph describing ‘ora’

Ora Lassilanameoragraph describing ‘sciam’

Scientific American

titlesciam

Page 19: Introduction to the  Semantic Web and  Linked Open Data
Page 20: Introduction to the  Semantic Web and  Linked Open Data

• URIs must only identify one concept. Ever.• I am not my homepage.

Person Document

Page 21: Introduction to the  Semantic Web and  Linked Open Data

• URI represents a person.

• Requesting URI via web gets a “See Other” response.

• Requester redirected to most appropriate document URL. usually HTML or RDF+XML

Publishing Example

Page 22: Introduction to the  Semantic Web and  Linked Open Data

<<>><<><>><>>><>><>><>><>><><>>>><<><><<<<<><><><><><><><><><><><><><<<<>>><><<><><>><>

• DON’T worry about understanding the XML. It’s the equivalent of “view-source” in a webpage!

• Use a tool to covert it to something less icky! (http:/graphite.ecs.soton.ac.uk/browser/ for example)

Publishing RDF

Page 23: Introduction to the  Semantic Web and  Linked Open Data

• Worry about it later!

• Start with data you can make freely available

Access Control

Page 24: Introduction to the  Semantic Web and  Linked Open Data

• You want your data to be used & reused, right?• Don’t prevent commercial use.• Don’t prevent derivative works (prevents people

using it at all!)• If there are any things which your data should not

be used for why are you publishing it?

Licensing

Page 25: Introduction to the  Semantic Web and  Linked Open Data

• Must-Attribute license• Public Domain license

(your info still can’t be used in illegal ways, of course)

• Procrastinate and worry about it later(much better than not publishing your data)

Licensing Options

Page 26: Introduction to the  Semantic Web and  Linked Open Data

Breakout

Page 27: Introduction to the  Semantic Web and  Linked Open Data

• What datasets does your organisation already maintain?

• What is the business case for making them available?• in a machine readable form• to all members • without bureaucracy or restriction.

• What are the barriers to putting them online and maintaining them?

• What are the benefits to the wider community?

• What are the risks?

Task

Page 28: Introduction to the  Semantic Web and  Linked Open Data

• List your 3 easiest wins - the lowest hanging fruit.

• Starting suggestion: Every building & campus in your organisation with:• Number • Building Name• Site (Campus)• Lat & Long This data changes very slowly and also made freely available

already.

Task

Page 29: Introduction to the  Semantic Web and  Linked Open Data

ECS Demo

Page 30: Introduction to the  Semantic Web and  Linked Open Data

• http://id.ecs.soton.ac.uk/docs/

• http://rdf.ecs.soton.ac.uk/person/1248

• http://rdf.ecs.soton.ac.uk/project/42

Page 31: Introduction to the  Semantic Web and  Linked Open Data

Cool URIs

Page 32: Introduction to the  Semantic Web and  Linked Open Data

Beauty

• http://domain/classOfThing/scheme/identifier• http://domain/classOfThing/scheme/identifier.rdf• http://domain/classOfThing/scheme/identifier.html

• http://mysite.org/person/username/t23• http://mysite.org/person/username/t23.rdf• http://mysite.org/person/username/t23.html

Scheme is optional but futureproofs you against next time the university reorganises everything.

Page 33: Introduction to the  Semantic Web and  Linked Open Data

And The Beast

http://www.diy.com/diy/jsp/bq/nav.jsp?action=detail&fh_oneslice=true&fh_view_size=10&fh_reffacet=styleStyle&fh_location=%2f%2fcatalog01%2fen_GB%2fcategories%3C{9372014}%2fcategories%3C{9372039}%2fcategories%3C{9372150}%2fspecificationsProductType%3done_hole_taps%2fstyleStyle%3E{adelaide}&fh_refview=summary&fh_refpath=facet_159017215&fh_secondid=10507747&fh_eds=%C3%9F&ts=1279018688652

Page 34: Introduction to the  Semantic Web and  Linked Open Data

Further ReadingFurther Reading

http://www.flickr.com/photos/markhillary/337685031/http://www.flickr.com/photos/markhillary/337685031/

Page 35: Introduction to the  Semantic Web and  Linked Open Data

• http://www.w3.org/standards/semanticweb/ • http://www.w3.org/standards/techs/rdf • http://www.w3.org/standards/techs/owl• http://www.w3.org/TR/swbp-vocab-pub/

W3C Specifications

Page 36: Introduction to the  Semantic Web and  Linked Open Data

Tools

•Graphite Browser• http://graphite.ecs.soton.ac.uk/browser/

• Tabulator• http://www.w3.org/2005/ajar/tab

Page 37: Introduction to the  Semantic Web and  Linked Open Data

Linked Data Help

• Linked Data Website• http://linkeddata.org/

• The Patterns Book• http://patterns.dataincubator.org/book/

• Semantic Overflow• http://www.semanticoverflow.com/

Page 38: Introduction to the  Semantic Web and  Linked Open Data

• SKOS (Simple Knowledge Organisation Scheme)• Taxonomies and thesauri

• SIOC (Semantically Interlinked Online Communities)• Web forums, mailing lists, etc

• FOAF (Friend of a Friend)• People, social networks

• DC (Dublin Core)• Basic bibliographic information

• BIBO (Bibliographic Ontology)• Advanced bibliographic information

• GEO• Simple geolocation (lat/long) ontology

Common Namespaces

Page 39: Introduction to the  Semantic Web and  Linked Open Data

Cool URIs

• Cool URIs don't change (by TimBL)• http://www.w3.org/Provider/Style/URI

• Cool URIs for the Semantic Web• http://www.w3.org/TR/cooluris/

• ECS URI scheme documentation• http://id.ecs.soton.ac.uk/docs/

Page 40: Introduction to the  Semantic Web and  Linked Open Data

Infrastructure Namespaces

• RDF & RDFS• These describe classes & predicates which are used to tie

everything together. rdf:type is used to give a URI a class <http://id.ecs.soton.ac.uk/person/1248> rdf:type

foaf:Person .

•OWL• Used to describe the meaning of predicates & classes in

machine-readable form.• Start with a human readable documents, OWL is not widely

consumed (yet?)

• XSD• Describes datatypes like String, Positve Integer etc.

Page 41: Introduction to the  Semantic Web and  Linked Open Data

Take Home MessagesTake Home Messages

http://www.flickr.com/photos/71894657@N00/2696793132/http://www.flickr.com/photos/71894657@N00/2696793132/

Page 42: Introduction to the  Semantic Web and  Linked Open Data

• ‘Cool URIs don’t change’ – once you’ve chosen a URI convention for your organisation, it’s a pain to change it

• Getting this right is key to having your linked data used more widely

We think that we got this one mostly right……but we still had too many anonymous nodes around

Good URI Selection

Page 43: Introduction to the  Semantic Web and  Linked Open Data

• Go for an incremental approach• …but keep an eye on possible avenues for future

expansion

• RDFa is not for beginners!

• Don’t do as we did: we tried to build linked data for all of our internal data in one go

Start with the easy stuff

Page 44: Introduction to the  Semantic Web and  Linked Open Data

• Regardless of your application domain, there is probably already an ontology that does some of what you want

• …but don’t be afraid to invent relationships and classes if you can’t find any suitable

• Don’t do as we did! we wrote a new ontology from scratch, rather than reusing FOAF+DC)

Don’t reinvent the wheel

Page 45: Introduction to the  Semantic Web and  Linked Open Data

• Build linked data for your own consumption first• You know what your use cases are – better to support

these than to second guess those of unknown future users

• Don’t do as we did: we overcomplicated our data by trying to support all of the plausible scenarios that we could think of, rather than concentrating on what mattered to us

(be glad I couldn't find any clip art for this slide)

Eat your own dogfood

Page 46: Introduction to the  Semantic Web and  Linked Open Data

• You should aim to publish as RDF• Publishing as CSV may get your data out there

faster as an interim measure

We used CSV as a ‘glue’ data format between different systems, but chose not to expose data until we could do so as RDF.

Don’t underestimate CSV

Page 47: Introduction to the  Semantic Web and  Linked Open Data

Thanks

[email protected]

•@cgutteridge

• http://blogs.ecs.soton.ac.uk/webteam/

http://is.gd/dqiJc