ontologies and thesauri. how to answer complex questions using interoperability?

47
Ontologies and thesauri How to answer complex questions using interoperability ? Stefanie Gehrke, Pauline Charbonnier, Eduard Frunzeanu, Anita Mazur (Équipe Données)

Upload: equipex-biblissima

Post on 09-May-2015

505 views

Category:

Technology


1 download

DESCRIPTION

Présentation sur les ontologies et thesauri dans le cadre de la Training School COST-IRHT "La transmission des textes : nouveaux outils, nouvelles approches" (Paris), par Stefanie Gehrke

TRANSCRIPT

Page 1: Ontologies and thesauri. How to answer complex questions using interoperability?

Ontologies and thesauri

How to answer complex questions using interoperability ?

Stefanie Gehrke, Pauline Charbonnier, Eduard Frunzeanu, Anita Mazur (Équipe Données)

Page 2: Ontologies and thesauri. How to answer complex questions using interoperability?

How to answer complex questions using current

(open access) web facilities?

etc.

Page 3: Ontologies and thesauri. How to answer complex questions using interoperability?

Access point I:

Medium Trame

Page 4: Ontologies and thesauri. How to answer complex questions using interoperability?
Page 5: Ontologies and thesauri. How to answer complex questions using interoperability?

Access point II:

●  The Union Catalog of France (Catalogue Collectif de France / CCFr)

●  BnF archives et manuscrits (BAM) ●  Catalogue Général de la BnF

Page 6: Ontologies and thesauri. How to answer complex questions using interoperability?

Title ? Person ? Subject ?

Shelfmark ? Date ?

...

Page 7: Ontologies and thesauri. How to answer complex questions using interoperability?

Access point III:

BVH BVMM Gallica

Page 8: Ontologies and thesauri. How to answer complex questions using interoperability?
Page 9: Ontologies and thesauri. How to answer complex questions using interoperability?

Access point IV:

MESA

Page 10: Ontologies and thesauri. How to answer complex questions using interoperability?
Page 11: Ontologies and thesauri. How to answer complex questions using interoperability?

Query in MESA

Page 12: Ontologies and thesauri. How to answer complex questions using interoperability?

Dump not available

Page 13: Ontologies and thesauri. How to answer complex questions using interoperability?

Access point V:

Manuscripta Medievalia

Europeana / The European Library

Manuscriptorium

Page 14: Ontologies and thesauri. How to answer complex questions using interoperability?

Query in Europeana I

Page 15: Ontologies and thesauri. How to answer complex questions using interoperability?

Query in Europeana II

Dump available at: http://data.europeana.eu/download/2.0/

Page 16: Ontologies and thesauri. How to answer complex questions using interoperability?

Access point VI:

Isidore

Page 17: Ontologies and thesauri. How to answer complex questions using interoperability?

http://www.rechercheisidore.fr/data/ark:/12148/btv1b84473026.rdf

Page 18: Ontologies and thesauri. How to answer complex questions using interoperability?

Access point VII:

All specialised databases

Page 19: Ontologies and thesauri. How to answer complex questions using interoperability?

Found in the Web Bibliotheca bibliothecarum manuscriptorum nova... du Mauriste Dom Bernard de Montfaucon, 1739

Version PDF:

http://www.libraria.fr/fr/editions/la-bibliotheca-bibliothecarum-de-montfaucon-est-t%C3%A9l%C3%A9chargeable-en-mode-texte-sur-libraria

Page 20: Ontologies and thesauri. How to answer complex questions using interoperability?

Conclusion I: There is no single

access point specialised on the written heritage from the Medieval and Renaissance Europe

Page 21: Ontologies and thesauri. How to answer complex questions using interoperability?

Conclusion II: Search possibilities

regarding the history of collections and texts are

limited

Page 22: Ontologies and thesauri. How to answer complex questions using interoperability?

Vision Biblissima

How to answer complex questions using interoperability ?

Page 23: Ontologies and thesauri. How to answer complex questions using interoperability?

Federate databases ... … by establishing a common RDF framework:

●  Ontology ●  Data in RDF ●  Enable (complex) queries ●  Thesaurus

Page 24: Ontologies and thesauri. How to answer complex questions using interoperability?

Ontology Definition I :

“A conceptual map of the terms and relationships that constitute a domain. For example, a library ontology would likely include "books," and would note that books often have "authors," that authors have "birth_years," and that books also often have "titles," "subtitles," and "subjects." The terms and relationships in an ontology can then be used to determine the metadata to be recorded, as well as the ways in which data can be retrieved from a dataset. Although ontologies pre-date the Semantic Web, they have been brought to prominence by it.”

Source : Getting Started Guide (Harvard University)

Page 25: Ontologies and thesauri. How to answer complex questions using interoperability?

Ontology Definition II :

“A formal model that allows knowledge to be represented for a specific domain. An ontology describes the types of things that exist (classes), the relationships between them (properties) and the logical ways those classes and properties can be used together (axioms).”

Source : Linked Data Glossary (w3c)

Page 26: Ontologies and thesauri. How to answer complex questions using interoperability?

Ontology

Individual Individual

Class

Class

S

SS

S

S

S

S S

S O O

O

O

O

O

O

O

On t o l o g y

predicate

(object-)property

Data

Page 27: Ontologies and thesauri. How to answer complex questions using interoperability?

Mapping

S

S

S

S

S

S

O

O

O

O

O

O

O

Structured data + URIs + Links to external resources Triples

Page 28: Ontologies and thesauri. How to answer complex questions using interoperability?

Data alignment OpenRefine:

Page 29: Ontologies and thesauri. How to answer complex questions using interoperability?

Datasets in a triplestore

S

SS

S

S

S

S

S

S

O

O

O

O

O

Op1

p1

p2

p3 p2

p3

p1

p1 O

O

O

O

p3

p3

Page 30: Ontologies and thesauri. How to answer complex questions using interoperability?

Query via SPARQL PREFIX Bbissima: <http://purl.org/NET/Bbissima#> SELECT ?subject WHERE { ?subject p2 Bbissima:Manuscript.

?subject p1 }

S

SS

S

S

S

S

S

S

O

O

O

O

O

O

O

p1

p1

p3 p2

p1

p1 O

O

O

O

p3 p3

p2

p1

p1

p2

p2

p2

p3

Page 31: Ontologies and thesauri. How to answer complex questions using interoperability?

Biblissima SPARQL

endpoint (test)

Page 32: Ontologies and thesauri. How to answer complex questions using interoperability?

RDF/XML data (simplified) <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:Bbissima="http://purl.org/NET/Bbissima#">

<Bbissima:Expression> <!-- subject -->

<Bbissima:is_carried_by> <!-- predicate -->

<Bbissima:Manuscript rdf:about="http://data.biblissima.fr/ark:/00000/BnF_Mss_Grec1432"> <!-- object / subject -->

<Bbissima:consists_of> <!-- predicate -->

<Bbissima:Material rdf:about="http://data.biblissima.fr/thesaurus/resource/ark:/00000/support_bombycin"/>

<!-- object --> </Bbissima:consists_of>

</Bbissima:Manuscript>

</Bbissima:is_carried_by>

</Bbissima:Expression>

</rdf:RDF>

Page 33: Ontologies and thesauri. How to answer complex questions using interoperability?

…. + sample data thesaurus <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:Bbissima="http://purl.org/NET/Bbissima#"

xmlns:skos="http://www.w3.org/2004/02/skos/core#"> <Bbissima:Expression> <Bbissima:is_carried_by> <Bbissima:Manuscript rdf:about="http://data.biblissima.fr/ark:/00000/BnF_Mss_Grec1432"> <Bbissima:consists_of> <Bbissima:Material rdf:about="http://data.biblissima.fr/thesaurus/resource/ark:/00000/support_bombycin"/> </Bbissima:consists_of>

</Bbissima:Manuscript> </Bbissima:is_carried_by> </Bbissima:Expression>

<skos:Concept rdf:about="http://data.biblissima.fr/thesaurus/resource/ark:/00000/support_bombycin"> <skos:prefLabel xml:lang="fr">Bombycin</skos:prefLabel> <skos:broader rdf:resource="http://data.biblissima.fr/thesaurus/resource/ark:/00000/support_papier"/> </skos:Concept> </rdf:RDF>

Page 34: Ontologies and thesauri. How to answer complex questions using interoperability?

All the manuscripts (in the triplestore) that carry a text in greek on paper: PREFIX skos:<http://www.w3.org/2004/02/skos/core#> PREFIX Bbissima:<http://purl.org/NET/Bbissima#>

SELECT ?manuscript WHERE { ?manuscript Bbissima:has_language <http://id.loc.gov/vocabulary/iso639-5/grk> . ?manuscript Bbissima:consists_of <http://data.biblissima.fr/thesaurus/resource/ark:/00000/support_papier>.}

UNION {?manuscript Bbissima:consists_of ?material . <http://data.biblissima.fr/thesaurus/resource/ark:/00000/support_papier> skos:narrower ?material.}

}

Complex query

Page 35: Ontologies and thesauri. How to answer complex questions using interoperability?

Build on the outcomes of the Europeana Regia

project (2010 - 2012) (BnF, BSB, BHUV, HAB, KBR)

(3 historical collections)

Page 36: Ontologies and thesauri. How to answer complex questions using interoperability?

●  Crosswalk EAD, TEI, Marc21, ESE 3.4, EDM

●  Reference Transformation

●  Data ingestion

●  5 partners (BnF, BSB, BHUV, HAB, KBR)

Page 37: Ontologies and thesauri. How to answer complex questions using interoperability?

Data Sources for 33 Europeana Display Fields

Page 38: Ontologies and thesauri. How to answer complex questions using interoperability?

Medieval Manuscripts in Europe(ana)

Transformation published at Sourceforge: http://sourceforge.net/projects/eregia2eseedm

Page 39: Ontologies and thesauri. How to answer complex questions using interoperability?

Medieval Manuscripts in Europe(ana)

One XSLT module contains controlled multilingual vocabulary for the Europeana display fields

Page 40: Ontologies and thesauri. How to answer complex questions using interoperability?

Medieval Manuscripts in Europe(ana)

Lessons learned:

●  build reliable relations ●  share transformation code ●  modelisation experience

Page 41: Ontologies and thesauri. How to answer complex questions using interoperability?

Ontology and thesaurus Biblissima

●  Build on CIDOC-CRM and FRBRoo

●  Ontology: Publication of v0.1 May 2014

Page 42: Ontologies and thesauri. How to answer complex questions using interoperability?

Ontology Biblissima

See also: http://doc.biblissima-condorcet.fr/sites/doc.biblissima-condorcet.fr/files/interoperabilite/schemaontologieamflorence.pdf

Page 43: Ontologies and thesauri. How to answer complex questions using interoperability?

Ontology Biblissima

Modelisation of the events “Acquisition”, “Transfer of Custody” and “Move” in CIDOC CRM and Biblissima

Page 44: Ontologies and thesauri. How to answer complex questions using interoperability?

Ontology Biblissima

Modelisation of the event “Expression Creation” related to a manuscript in CIDOC CRM and Biblissima

Page 45: Ontologies and thesauri. How to answer complex questions using interoperability?

Thesaurus Biblissima

●  to build a thesaurus in the fields of the transmission of books and texts

●  focus on the vocabularies needed for the scientific description of manuscripts and early prints: codicology, paleography, textual criticism etc.

Page 46: Ontologies and thesauri. How to answer complex questions using interoperability?

Tools used by Biblissima

4store GINCO

OpenRefine Protégé

Sesame XXE

Virtuoso

Page 47: Ontologies and thesauri. How to answer complex questions using interoperability?

Merci !

See you on Wednesday 02/04/2014 at the Workshop about RDF data

Campus Condorcet, IUT, 3-7 rue de la Croix Faron, Saint-Denis, 14:30-18:00