Europeana and schema.org
Antoine Isaac
Dublin Core Conference, Schema.org special session
5 September 2013
Europeana Data Model: an example
For a general presentation on Europeana and EDM rationale see http://pro.europeana.eu/edm-documentation
Provided Cultural Heritage Object (CHO) and descriptive metadata
Web Resources – digital representations
Aggregations – Bundling it all together
Why use schema.org?
Europeana tries to disseminate data to reach out to as many users as possible
Search engines
• Customization of result lists – rich snippets
• Knowledge Graph
• Search Engine Optimization
Developers more comfortable with parsing web pages
In fact: schema.org and RDFa
Europeana has been publishing structured metadata via its portal for some time
One application case: customization of public domain pages by Creative Commons, with details on the work and Europeana usage guidelines for public domain works
Europeana, Creative Commons and RDFa
http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html
The Creative Commons Public Domain page triggers a script that harvests mark-up on the Europeana object page
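A harvesting script of this kind can be sketched in a few lines. The example below is only an illustration, not the Creative Commons script: the class name `PropertyHarvester`, the helper `harvest` and the sample page fragment are hypothetical, and the sketch only reads `property` attributes that carry their value in `content`, `href` or `src`, whereas a real RDFa processor also resolves prefixes and subjects.

```python
from html.parser import HTMLParser


class PropertyHarvester(HTMLParser):
    """Collect (property, value) pairs from RDFa-style attributes.

    Simplification: only elements with a `property` attribute and a value
    in `content`, `href` or `src` are handled. Prefix resolution, subject
    chaining and element text are out of scope for this sketch.
    """

    def __init__(self):
        super().__init__()
        self.pairs = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        prop = attributes.get("property")
        if prop is None:
            return
        # Take the first attribute that carries a value, in this order.
        for key in ("content", "href", "src"):
            if attributes.get(key) is not None:
                self.pairs.append((prop, attributes[key]))
                break


# Hypothetical fragment of an object page (not copied from Europeana).
SAMPLE_PAGE = """
<html><head>
<meta property="dc:title" content="Cofre de base rectangular" />
<meta property="dc:creator" content="Desconhecido" />
<link property="edm:rights" href="http://creativecommons.org/publicdomain/mark/1.0/" />
</head><body></body></html>
"""


def harvest(html_text):
    """Return all (property, value) pairs found in the given HTML text."""
    parser = PropertyHarvester()
    parser.feed(html_text)
    parser.close()
    return parser.pairs


pairs = harvest(SAMPLE_PAGE)
for prop, value in pairs:
    print(prop, "=", value)
```

With only a handful of agreed fields, the consuming page can pick the properties it needs from `pairs` and ignore the rest.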
Going further
Creative Commons uses 5-6 fields, agreed upon with the developer(s) there
What to publish further?
Schema.org as a more standardized form of web page-based data exchange
And it has a use case – several, in fact
Still quite prototype-ish
Current EDM – schema.org mapping
EDM element Schema.org mapping
ProvidedCHO & Proxy schema:CreativeWork
dc:contributor schema:contributor
dc:coverage
dc:creator schema:creator
dc:date
dc:description schema:description
dc:format
dc:identifier
dc:language schema:inLanguage
dc:publisher schema:publisher
dc:relation
dc:rights
dc:source
dc:subject schema:about
dc:title schema:name
dc:type
dcterms:alternative schema:alternativeHeadline
dcterms:conformsTo
dcterms:created schema:dateCreated
dcterms:extent
dcterms:hasFormat
dcterms:hasPart
dcterms:hasVersion
dcterms:isFormatOf
dcterms:isPartOf
dcterms:isReferencedBy
dcterms:isReplacedBy
dcterms:isRequiredBy
dcterms:issued schema:datePublished
dcterms:isVersionOf
dcterms:medium
dcterms:provenance
dcterms:references schema:mentions
dcterms:replaces
dcterms:requires
dcterms:spatial
dcterms:tableOfContents
dcterms:temporal
edm:currentLocation schema:contentLocation
edm:hasMet
edm:hasType schema:additionalType
edm:incorporates
edm:isDerivativeOf
edm:isNextInSequence
edm:isRelatedTo
edm:isRepresentationOf schema:about
edm:isSimilarTo
edm:isSuccessorOf
edm:realizes
edm:type N/A
edm:unstored
edm:wasPresentAt
edm:europeanaProxy N/A
edm:userTag schema:keywords or schema:comment
edm:year
ore:proxyFor N/A
ore:proxyIn N/A
owl:sameAs
rdf:type
Aggregation and EuropeanaAggregation
ore:aggregates N/A
edm:aggregatedCHO N/A
edm:country schema:addressCountry
edm:dataProvider schema:provider
edm:hasView schema:url
edm:isShownAt schema:url
edm:isShownBy schema:contentUrl
edm:landingPage schema:url
edm:language
edm:object schema:image, if preview opt-out is NOT activated
edm:preview schema:thumbnailUrl, if preview opt-out is NOT activated
edm:provider schema:provider
dc:rights
edm:rights
edm:ugc
edm:unstored N/A
WebResource schema:WebPage or schema:MediaObject
schema:url
dc:description schema:description
dc:format
dc:rights
dc:source schema:encodesCreativeWork
dcterms:conformsTo
dcterms:created schema:dateCreated
dcterms:extent
dcterms:hasPart
dcterms:isFormatOf
dcterms:issued schema:datePublished
edm:isNextInSequence
edm:rights
Agent schema:Person or schema:Organization
skos:prefLabel schema:name
skos:altLabel skos:hiddenLabel schema:additionalName
skos:note schema:description
dc:date
dc:identifier
edm:begin
edm:end schema:deathDate
edm:hasMet schema:knows
edm:isRelatedTo
edm:wasPresentAt
foaf:name schema:name
rdaGr2:biographicalInformation schema:description
rdaGr2:dateOfBirth schema:birthDate
rdaGr2:dateOfDeath schema:deathDate
rdaGr2:dateOfEstablishment schema:foundingDate
rdaGr2:dateOfTermination
rdaGr2:gender schema:gender
rdaGr2:professionOrOccupation schema:jobTitle
owl:sameAs
Place schema:Place
wgs84_pos:lat
wgs84_pos:long
wgs84_pos:alt
wgs84_pos:lat_long
skos:prefLabel schema:name
skos:altLabel skos:hiddenLabel schema:additionalName
skos:note schema:description
dcterms:hasPart
dcterms:isPartOf schema:containedIn
owl:sameAs
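To make the table concrete, here is a minimal sketch of how such a mapping could be applied in code. It assumes a hypothetical flat record (element name to list of values) and covers only a subset of the ProvidedCHO & Proxy rows; the names `EDM_TO_SCHEMA` and `to_schema_org` are illustrative and not part of any Europeana software. Elements without a schema.org counterpart are kept aside rather than silently dropped.

```python
# Subset of the ProvidedCHO & Proxy rows from the mapping table.
# None marks an EDM element with no schema.org counterpart in the table.
EDM_TO_SCHEMA = {
    "dc:contributor": "schema:contributor",
    "dc:creator": "schema:creator",
    "dc:description": "schema:description",
    "dc:language": "schema:inLanguage",
    "dc:publisher": "schema:publisher",
    "dc:subject": "schema:about",
    "dc:title": "schema:name",
    "dcterms:created": "schema:dateCreated",
    "dcterms:issued": "schema:datePublished",
    "dc:date": None,
    "dc:format": None,
    "dc:identifier": None,
    "dc:type": None,
}


def to_schema_org(edm_record):
    """Split an EDM-style record into mapped and unmapped parts.

    `edm_record` is a hypothetical flat dict: element name -> list of values.
    Returns (mapped, unmapped), where `mapped` is keyed by schema.org
    property and `unmapped` keeps the original EDM element names.
    """
    mapped, unmapped = {}, {}
    for element, values in edm_record.items():
        target = EDM_TO_SCHEMA.get(element)
        if target is None:
            unmapped[element] = list(values)
        else:
            mapped.setdefault(target, []).extend(values)
    return mapped, unmapped


# Values taken from the example object used in this presentation.
record = {
    "dc:title": ["Cofre de base rectangular com tampa de quatro face..."],
    "dc:creator": ["Desconhecido"],
    "dc:subject": ["Cofre-relicário"],
    "dc:date": ["XIII"],
    "dc:identifier": ["MAS O 37"],
    "dc:type": ["Ourivesaria"],
}
mapped, unmapped = to_schema_org(record)
print(sorted(mapped))
print(sorted(unmapped))
```

In this sketch the date, identifier and type end up in `unmapped`, which is the kind of loss the table's empty cells stand for.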
Current implementation
A glimpse of an object’s full data
http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html?format=labels
Anatomy of results from an RDFa parser
http://www.w3.org/2012/pyRdfa/
Several flavors of data in it…
<http://www.europeana.eu/resolve/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B> a schema:CreativeWork;
    schema:name "Cofre de base rectangular com tampa de quatro face...";
    schema:about "Cofre-relicário";
    schema:addressCountry "Portugal";
    schema:contentUrl <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=2>;
    schema:creator "Desconhecido";
    schema:description "Cofre de base rectangular com tampa de quatro faces. A urna é decorada com um friso perlado que rodeia representações da Virgem com o Menino, do Calvário e da Virgem em Glória, nas quatro faces. As arestas da tampa são emolduradas por pequenas caneluras; nas duas faces principais um entrelaçado encerra uma arcaria polilobada.";
    schema:image <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1>;
    schema:provider "Instituto dos Museus e da Conservação", "Museu de Alberto Sampaio";
    schema:url <http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html>,
        <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
Schema.org data
    dc11:creator "Desconhecido";
    dc11:date "XIII";
    dc11:description "Cofre de base rectangular com tampa de quatro faces. […].";
    dc11:format "Altura: 9,5 cm; Profundidade: 10 cm; Comprimento: 19,5 cm";
    dc11:identifier "MAS O 37";
    dc11:rights "Copyright © Instituto dos Museus e da Conservação";
    dc11:subject "Cofre-relicário";
    dc11:title "Cofre de base rectangular com tampa de quatro face...";
    dc11:type "Ourivesaria";
    edm:country "Portugal";
    edm:rights <http://creativecommons.org/publicdomain/mark/1.0/>;
    edm:dataProvider "Museu de Alberto Sampaio";
    edm:provider "Instituto dos Museus e da Conservação";
    edm:isShownAt <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
    edm:isShownBy <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=2>;
    edm:landingPage <http://www.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html>;
    edm:object <http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1>;
DC / EDM data
    og:description "Cofre de base rectangular com tampa de quatro faces. A urna é decorada […]";
    og:image "http://europeanastatic.eu/api/image?type=IMAGE&uri=http://www.matriznet.imc-ip.pt/MatrizNet/CommonServices/ThumbnailDownloader.axd?IdReg=5113&TipoReg=1&ThumbnailType=1&size=FULL_DOC";
    og:site_name "Europeana";
    og:title "Cofre de base rectangular com tampa de quatro face... | Desconhecido";
    og:type "website";
    og:url "http://preview.europeana.eu/portal/record/02001/492A0518CA2BDF09B1642B11FA7317F5FE43B96B.html" .
FB OpenGraph data
    xhv:license <http://creativecommons.org/publicdomain/mark/1.0/>;
    cc:attributionURL <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
    cc:morePermissions <http://www.matriznet.imc-ip.pt/MatrizNet/Objectos/ObjectosConsultar.aspx?IdReg=5113>;
    cc:useGuidelines <http://www.europeana.eu/rights/pd-usage-guide/>;
Creative Commons data
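The four flavors in the parser output can be told apart by vocabulary prefix alone. The sketch below illustrates that idea on a few statements taken from the output; the prefix-to-label table `FLAVORS` and the function `group_by_flavor` are assumptions made for this example, with labels following the captions used in these slides.

```python
from collections import defaultdict

# Assumed prefix-to-flavor table; labels follow the slide captions.
FLAVORS = {
    "schema": "Schema.org data",
    "dc11": "DC / EDM data",
    "edm": "DC / EDM data",
    "og": "FB OpenGraph data",
    "xhv": "Creative Commons data",
    "cc": "Creative Commons data",
}


def group_by_flavor(statements):
    """Group (predicate, value) pairs by the vocabulary of the predicate.

    Predicates are prefixed names such as "schema:creator"; anything with
    an unknown prefix is collected under "Other".
    """
    groups = defaultdict(list)
    for predicate, value in statements:
        prefix = predicate.split(":", 1)[0]
        groups[FLAVORS.get(prefix, "Other")].append((predicate, value))
    return dict(groups)


# A few statements from the parser output, as (predicate, value) pairs.
statements = [
    ("schema:creator", "Desconhecido"),
    ("dc11:creator", "Desconhecido"),
    ("edm:country", "Portugal"),
    ("og:site_name", "Europeana"),
    ("cc:useGuidelines", "http://www.europeana.eu/rights/pd-usage-guide/"),
]
groups = group_by_flavor(statements)
for flavor, items in groups.items():
    print(flavor, len(items))
```

A consumer interested in only one flavor, such as a search engine reading schema.org, can then ignore the other groups.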
Observations
• Schema.org is simple
• Not everything can be mapped
• We’re losing granularity, including some of the core benefits of Europeana moving to the richer EDM!
• But it’s ok, because it matches needs
• And in fact it’s not entirely because of Schema.org
• And we can publish different flavors of the data in RDFa
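One way to publish several flavors at once is to put more than one property on the same element, since the RDFa `property` attribute accepts a space-separated list. The helper below is a hypothetical sketch of that idea, not the portal's actual templating code; it assumes the `schema:` and `dc:` prefixes are declared elsewhere on the page.

```python
from html import escape


def rdfa_span(properties, text):
    """Render one value as a <span> carrying several RDFa properties.

    `properties` is a list of prefixed names; the prefixes are assumed to
    be declared elsewhere in the page (for example via a prefix attribute).
    """
    prop_attr = " ".join(properties)
    return '<span property="{}">{}</span>'.format(
        escape(prop_attr, quote=True), escape(text)
    )


snippet = rdfa_span(["schema:creator", "dc:creator"], "Desconhecido")
print(snippet)
```

The same value is then exposed once in the page but picked up under both vocabularies by an RDFa parser.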