when tei verse becomes linked data

30
When TEI Verse becomes linked data: using TEI tags as a model to build a linked poetry system Elena González-Blanco, UNED-LINHD [email protected] Gimena del Río Riande, SECRIT-CONICET [email protected]

Upload: elena-gonzalez-blanco

Post on 29-Jun-2015

384 views

Category:

Education


1 download

DESCRIPTION

The Text Encoding initiative is conceived as a standard system designed to let interoperability between different projects working on analogous topics. However, interoperability is sometimes not easy to reach due to the different interpretation of tags made by each project or group. Given the variety of tags, there are several concepts that can be easily identified for many projects, such as or , which referent is pretty clear and recognizable. There are some other tags, however, which can experience multiple interpretations, limited by the content of the attributes that modify them. The case of Verse module is : first of all, there are not many projects using the full set of Verse tags (most projects use and and perhaps but do not add many more metrical and rhythmic information), and some of them complete the information related to specific kind of rhythms and stanzas using attributes and new customized tags. This situation makes it even more difficult to make poetry projects interoperable, which becomes a dream, especially after taking into account that many of the projects working in this area -such as metrical repertoires as controversial, especially in its parts related to metrics, due to several issues The Cantigas de Santa Maria Database: http://csm.mml.ox.ac.uk/, Analecta Hymnica Digitalia: database on Medieval Latin poetry: http://webserver.erwin-rauner.de/crophius/Analecta_conspectus.htm, Bibliografia Elettronica dei Trovatori: http://w3.uniroma1.it/bedt/BEdT_03_20/, Le Nouveau Naetebus: database on French narrative Medieval poetry : http://www.nouveaunaetebus.elte.hu/ Répertoire de la Poésie Hongroise Ancienne (RPHA) : Repertoire on Medieval Hungarian poetry: http://rpha.elte.hu/ , MedDB: Lírica Profana Galego-Portuguesa http://www.cirp.es/pls/bdo2/f?p=MEDDB2, Corpus rhythmorum musicum (IV-IX secolo): database on Latin Medieval poetry accompanied with music http://www.corimu.unisi.it/, Skaldic poetry of Scandinavian Middle Ages https://www.abdn.ac.uk/skaldic/db.php , or Dimev: Digital index of medieval English verse: http://www.cddc.vt.edu/host/imev/record.php?recID=6768, use a variety of technological systems, mostly consisting of databases which do not tag their texts with XML. This paper presents a model to link poetry projects based on TEI tags. It consists of a theoretical schema of linked data, whose entities are extracted from the different tags used by the project REMETCA www.remetca.uned.es, the Spanish Digital Repertoire on Medieval Poetry,  conceived and created on a TEI basis. This contribution wants to show how each tag of the xsd schema can be translated and transposed into RDF (following standards like Dublin Core for elements of the such as or ) and published later via linked data in order to create a macro-schema in which other poetical repertoires can see many o

TRANSCRIPT

Page 1: When TEI Verse becomes Linked Data

When TEI Verse becomes linked data: using TEI tags as a model to build a linked poetry system

Elena González-Blanco, [email protected]

Gimena del Río Riande, [email protected]

Page 2: When TEI Verse becomes Linked Data
Page 3: When TEI Verse becomes Linked Data

Technical structure

RELATIONALDATABASE

Page 4: When TEI Verse becomes Linked Data
Page 5: When TEI Verse becomes Linked Data
Page 6: When TEI Verse becomes Linked Data

What is a metrical repertoire?

Page 7: When TEI Verse becomes Linked Data

• Gaston Raynaud, Bibliographie des chansonniers francais des XIIIe et XIVe siecles, 1884.

• Gotthold Naetebus, Die nicht lyrischen Strophenformen des Altfranzosischen, 1891.

• Pillet & Carstens, Bibliographie des Troubadours, 1933

Print repertoires

Page 8: When TEI Verse becomes Linked Data
Page 9: When TEI Verse becomes Linked Data
Page 10: When TEI Verse becomes Linked Data

Linking repertoires…

Page 11: When TEI Verse becomes Linked Data

• Issues to consider– Different structure and level of complexity– Different content and time– Different technological systems• SQL databases• Propietary RDBMS systems• XML tagged text

– Different levels of access

Interoperability – or how to link

repertoires…

Page 12: When TEI Verse becomes Linked Data

Interoperability & TEI

Page 13: When TEI Verse becomes Linked Data

Linking Data ChallengeThe Linked Data Challenge

Page 14: When TEI Verse becomes Linked Data

Why a linked data model

• Based on metadata added to existing databases• Does not alter the internal structure of each

repertoire• Code is published as linked data available on the

web • Extensible and flexible model

Why a linked data model

Page 15: When TEI Verse becomes Linked Data

ReMetCa’s Ontology

Available at: http://www.purl.org/net/remetca and http://datahub.io/dataset/remetca-ontology

Page 16: When TEI Verse becomes Linked Data

Some issues

• No previous ontology on poetry existed• Need of standardization of some fields (eg:

metrical and rhythmical scheme)• Need to create controlled vocabularies to

organize information

Some Issues

Page 17: When TEI Verse becomes Linked Data

Classes

Page 18: When TEI Verse becomes Linked Data

POEMAuthorTitleIncipitManuscriptPost quem Ante quemLanguageTopicsEditionOnline editionWork

POEM

Isometrism

Isostrophism

Metrical scheme

Rhyme scheme

Rhyme

Musical notation

Number of stanzas

Number of lines

Poetic form

ClasesClasses

<origDate><author>

<msItem>

<notAfter><notBefore>

<l><edition>

<lg><lg type=“poem”>

<syll>

<listBibl>

Page 19: When TEI Verse becomes Linked Data

POEMAuthorTitleIncipitManuscriptPost quemAnte quemLanguageTopicsEditionOnline editionWork

POEM

Isometrism

Isostrophism

Metrical scheme

Rhyme scheme

Rhyme

Musical notation

Number of stanzas

Number of lines

Poetic form

PropiedadesProperties

Page 20: When TEI Verse becomes Linked Data

POEMAuthorTitleIncipitManuscriptPost quemAnte quemLanguageTopicsEditionOnline editionWork

POEM

Isometrism

Isostrophism

Metrical scheme

Rhyme scheme

Rhyme

Musical notation

Number of stanzas

Number of lines

Poetic form

AtributosData properties

Page 21: When TEI Verse becomes Linked Data

From SQL to RDFFrom SQL to RDF using KARMA

Page 22: When TEI Verse becomes Linked Data

From SQL to RDFFrom SQL to RDF using KARMA

Page 23: When TEI Verse becomes Linked Data

LOD principles

Information is available on the Web (any format) under an open license

Information is available as structured data (e.g. Excel instead of an image scan of a table)

Non-proprietary formats are used (e.g. CSV instead of Excel)

URI identification is used so that people can point at individual data

Data is linked to other data to provide context

Berners Lee’s Star rating system

Page 24: When TEI Verse becomes Linked Data

Metadata universe is big…

image from http://www.dlib.indiana.edu/~jenlrile/metadatamap/

Metadata universe is big…

Page 25: When TEI Verse becomes Linked Data

ReMetCa Ontology

Available at: http://www.purl.org/net/remetca and http://datahub.io/dataset/remetca-ontology

Page 26: When TEI Verse becomes Linked Data

Me quedo aquí…

REMETCA TEI CIDOC

Author <author> E21 Person

Our decision for mapping

REMETCA TEI DCBibliographical references <listBibl> BibliographicCitation

REMETCA TEI CIDOCDate <creation> E50

REMETCA TEI CIDOCData post quem <origDate notBefore="1450"> E63 Beginning of existence

REMETCA TEI CIDOC

Data ante quem < origDate notAfter="1450"> E64 End of existence

REMETCA CIDOCcomposedBy P94 has created (was created

by)

REMETCA CIDOChasDate P92 was brought into

existence

OBJECT PROPERTIES

Page 27: When TEI Verse becomes Linked Data

http://vocabularios.caicyt.gov.ar/pmc/

Page 28: When TEI Verse becomes Linked Data

http://vocabularios.caicyt.gov.ar/pmc/

Page 29: When TEI Verse becomes Linked Data

http://vocabularios.caicyt.gov.ar/pmc/

Page 30: When TEI Verse becomes Linked Data

Library of Congress, Rosenwald 4, 5r, ca. 1410.Drawing of man, called Microcosmus

Conclusions

…As dwarfs standing on the shoulders of giants (nanos gigantum humeris insidentes)

www.remetca.uned.es