datalift: a catalyser for the web of data - francois scharffe

18
Webscience meetup 5/02/2011 1 With the help of the Datalift team And the support of the French National Research Agency Datalift: A Catalyser for the Web of Data François Scharffe University of Montpellier, LIRMM, INRIA [email protected] @lechatpito

Upload: webscience-montpellier

Post on 04-Jul-2015

542 views

Category:

Education


0 download

DESCRIPTION

Talk at Web Science Montpellier Meetup - 13th May 2011

TRANSCRIPT

Page 1: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Webscience meetup 5/02/2011 1

With the help of the Datalift teamAnd the support of the French National Research Agency

Datalift: A Catalyser for the Web of Data

François ScharffeUniversity of Montpellier, LIRMM, [email protected]@lechatpito

Page 2: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Datalift and Web-Science

Page 3: Datalift: A Catalyser for the Web of Data - Francois Scharffe

3

Datalift

A large scale Web data publication experiment.

Objectives:

- Publish reference datasets

- Automate the data publication process

- Show the interest of publishing linked data

Page 4: Datalift: A Catalyser for the Web of Data - Francois Scharffe

4

Datalift

Motivation:

- Two phenomena:

- Society – Open Data

- Technology – Semantic Web

Data revolution going on : the web of data is explosing as the web of documents exploded in the 90'

Page 5: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Datalift

Datasets publication

R&D to automate the publication process

A modular architecture to assist data publication

Training, tutorials, data publication camps

Page 6: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Welcome aboard the data lift

Published and interlinked data on the Web

Applications

Interconnexion

Publication infrastructure

Data convertion

Vocabulary selection

Raw data

Page 7: Datalift: A Catalyser for the Web of Data - Francois Scharffe

SemWebPro 18/01/2011 7

1st floor - Selection

Page 8: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Vocabulary selection

Vocabularies for linked-data● Are meant to describe resources in RDF● Are based on one of the standard W3C language RDFS

and OWL

Ø What makes a good vocabulary ?● A good vocabulary is a used vocabulary● Other usability criterias : Simplicity, visibility,

documentation, flexibility, semantic integration, social integration

Ø Types of vocabularies● Metadata, reference, domain, general

Page 9: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Vocabulary of a Friend

Øhttp://www.mondeca.com/foaf/voaf

ØA simple vocabulary...

ØTo represent interconnexions between vocabularies

ØA unique entry point to vocabularies and Datasets of the linked-data cloud Linked Data Cloud

ØOngoing work in Datalift

Page 10: Datalift: A Catalyser for the Web of Data - Francois Scharffe

SemWebPro 18/01/2011 10

2nd floor - Conversion

Page 11: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Reference datasets, URI design

● Providing reference datasets for the French ecosystem: geographical, topological, statistical, political. Ex: http://parisemantique.fr

● Providing URI design guidelines● Opaque or transparent URIs ?● Usage of accents in URIs

Page 12: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Convertion tools to RDF

ØHow is the raw data to be converted ?

§ Relational Database ?

§ (Semi-)structured formats ?

§ Programmatic acces (API) ?

ØThere are solutions for all cases

Page 13: Datalift: A Catalyser for the Web of Data - Francois Scharffe

SemWebPro 18/01/2011 13

3rd floor - Publication

Page 14: Datalift: A Catalyser for the Web of Data - Francois Scharffe
Page 15: Datalift: A Catalyser for the Web of Data - Francois Scharffe

SemWebPro 18/01/2011 15

4th floor - Interconnexion

Page 16: Datalift: A Catalyser for the Web of Data - Francois Scharffe

Towards automated interconnexion services

ØRecord linkage, entity reconciliation, instance, ontology, schema matching

§ Using alignments between vocabularies

§ Detection of discriminating properties

§ Indicating comparison methods by attaching metadata to ontologies

ØWork in progress in Datalift

Page 17: Datalift: A Catalyser for the Web of Data - Francois Scharffe

SemWebPro 18/01/2011 17

5th floor - Applications

Page 18: Datalift: A Catalyser for the Web of Data - Francois Scharffe

“It is a time when, even if nets were to guide all consciousness that had been converted to photons

and electrons toward coalescing, standalone individuals have not yet been converted into data to the extent that they can form unique components of

a larger complex”

Mamoru Oshii, Ghost in the Shell