sharing data on the web (2013)

29
Sharing Data on The Web A Semantic Web Tutorial James Leigh @jamesrdf [email protected] February 2013

Upload: 3-round-stones

Post on 07-May-2015

270 views

Category:

Documents


5 download

DESCRIPTION

Toronto Semantic Web Meetup

TRANSCRIPT

Page 1: Sharing data on the web (2013)

Sharing Data on The Web

A Semantic Web Tutorial

James Leigh@jamesrdf

[email protected] 2013

Page 2: Sharing data on the web (2013)

Towards Linked Open Data

● On the Web with an open license

“It's great to have the data accessible on the Web under an open license, however, the data is locked-up in a document. Other than writing a custom scraper, it's hard to get the data out of the document.”

Page 3: Sharing data on the web (2013)

★★

Towards Linked Open Data

● On the Web with an open license● Machine-readable data

“Splendid! The data is accessible on the Web in a structured way (that is, machine-readable), however, the data is still locked-up in a document. To get the data out of the document you depend on proprietary software.”

Page 4: Sharing data on the web (2013)

★★★

Towards Linked Open Data

● On the Web with an open license● Machine-readable data● Non-proprietary format

“Excellent! The data is not only available via the Web but now everyone can use the data easily. On the other hand, it's still data on the Web and not data in the Web.”

Page 5: Sharing data on the web (2013)

★★★★

Towards Linked Open Data

● On the Web with an open license● Machine-readable data● Non-proprietary format● RDF standards

“Wonderful! Now it's data in the Web. The (most important) data items have a URI and can be shared on the Web. A native way to represent the data is using RDF, however other formats such as Atom can be converted/mapped, if required.”

Page 6: Sharing data on the web (2013)

★★★★★

Towards Linked Open Data

● On the Web with an open license● Machine-readable data● Non-proprietary format● RDF standards● Linked RDF

“Brilliant! Now it's data, in the Web linked to other data. Both the consumer and the publisher benefit from the network effect. ”

Page 7: Sharing data on the web (2013)

Path to Success

● Start easy– Well curated– Small dataset

● Reach out to developers● Get others involved early● Ensure internal benefit● Maximize productivity● Integrate related dataset

Page 8: Sharing data on the web (2013)

Choosing a License

● Allow modification?● Require attribution?● Permit commercial uses?● Must others share alike?● What jurisdiction?

Page 9: Sharing data on the web (2013)

Address Data Quality

● Completeness● Conformity● Consistency● Accuracy● Integrity● Eliminate duplication

Page 10: Sharing data on the web (2013)

Put It On The Web

● Upload it and share● Document what is available● Document how to use it

– Solve a consumer need● Encourage feedback

– Continues improvement

Page 11: Sharing data on the web (2013)

★★

Make It Machine-readable

● Simplify data structure● Don't include unstable identifiers● Automate publishing

Page 12: Sharing data on the web (2013)

★★★

Use a Non-proprietary Format

● Standardized formats– CSV instead of excel– XML instead of MDB

● Benefits:– Accessibility– Interoperability– Reduce risk of

● Confidential info● Software viruses

Page 13: Sharing data on the web (2013)

★★★

API Service

● Data access layer● Atom Publishing Protocol● Google Data Protocol● Open Data Protocol● Facebook Graph API● Linked Data

Page 14: Sharing data on the web (2013)

★★★

The Web

● Universal unidirectional links using URLs– Graph of nodes identified by URL

● Cooperation without coordination● It's simple nodes and Links

Page 15: Sharing data on the web (2013)

★★★

The Semantic Web

● A universal scalable interconnection bus● Share and reuse data across

– Community boundaries– Application– Enterprise

Page 16: Sharing data on the web (2013)

★★★

(re)Model the Data

● Identify existing model● Reuse common vocabularies● Model data relationships

– Put aside particular needs of any application● Diagram relationships● Test assumptions with subject matter experts

Page 17: Sharing data on the web (2013)

★★★

Universal Identifiers

● Others can reference things● The Web uses URL for

downloadable things● Uses URI for real world things and

abstract concepts● Two references with the same URI

are the same thing● Quick, easy and scalable look-up

Page 18: Sharing data on the web (2013)

★★★★

Publish RDF

● RDF connects better than any other model● Cooperation without coordination● Data can be combined safely● Mix of vocabularies

Page 19: Sharing data on the web (2013)

★★★★

VoID Description

● De facto standard for describing Linked Data● Used in linked data catalogues● Include provenance and attributions● Associate consumers' license

Page 20: Sharing data on the web (2013)

★★★★

RDF Schema

● Understand structure of the graph– Self-describing

● Define terms and relationships

● Classes of things

Page 21: Sharing data on the web (2013)

★★★★

Simple Knowledge Organization

● Controlled vocabularies● Taxonomies● Thesauri

Page 22: Sharing data on the web (2013)

★★★★

Controlled Vocabularies

● Strongly agree● Agree● Somewhat agree● Neither agree nor

disagree● Somewhat disagree● Disagree● Strongly disagree

● Mr.● Ms.● Mrs.● Miss.● Dr.

● AB● BC● MB● NB● NL● NS● ON● PE● QC● SK● NT● NU● YT

Page 23: Sharing data on the web (2013)

★★★★

Taxonomies

Page 24: Sharing data on the web (2013)

★★★★

Thesauri

Page 25: Sharing data on the web (2013)

★★★★

Web Ontology Language

● Property restrictions● Equivalences across

ontologies● Constraints● Automatic reasoners● Annotations

Page 26: Sharing data on the web (2013)

★★★★★

Linked RDF

● Linked your data to others● Adds more related data● Add value to the data

– By joining it to other data

Page 27: Sharing data on the web (2013)

★★★★★

Generating Data Websites

● Give ordinary users direct access the data● Templates to generate HTML

– Generate static HTML offline– Produce HTML on-demand

Page 28: Sharing data on the web (2013)

Social Responsibility

● Responsibility to maintain published data● Publish frequency of data updates● Have a persistence strategy● Ensure data is accurate as possible● Respond to reports of problematic data

Page 29: Sharing data on the web (2013)

Comments or Questions?