extending dcam for metadata provenance

23
Extending DCAM for Metadata Extending DCAM for Metadata Provenance Provenance DCMI Metadata Provenance Task Group Kai Eckert Mannheim University Library Mannheim, Germany [email protected] Daniel Garijo Universidad Politécnica de Madrid Madrid, Spain [email protected] Michael Panzer OCLC Online Computer Library Center, Inc., Dublin, Ohio, USA [email protected] DC-2011 September 22 nd 2011 The Hague, Netherlands

Upload: kai-eckert

Post on 30-Jun-2015

1.031 views

Category:

Education


3 download

DESCRIPTION

The Metadata Provenance Task Group aims to define a data model that allows for making assertions about description sets. Creating a shared model of the data elements required to describe an aggregation of metadata statements allows to collectively import, access, use and publish facts about the quality, rights, timeliness, data source type, trust situation, etc. of the described statements. In this paper we describe the preliminary model created by the task group, together with first examples that demonstrate how the model is to be used.

TRANSCRIPT

Page 1: Extending DCAM for Metadata Provenance

Extending DCAM for Metadata Extending DCAM for Metadata ProvenanceProvenance

DCMIMetadataProvenanceTask Group

Kai EckertMannheim University LibraryMannheim, [email protected]

Daniel GarijoUniversidad Politécnica de MadridMadrid, [email protected]

Michael PanzerOCLC Online Computer Library Center, Inc., Dublin, Ohio, [email protected]

DC-2011September 22nd 2011

The Hague, Netherlands

Page 2: Extending DCAM for Metadata Provenance

● Members:

● Kai Eckert (Mannheim University Library, Germany)● Daniel Garijo (Universidad Politécnica de Madrid, Spain)● Michael Panzer (OCLC, USA)

● Established: June 2010 (duration: 1 year, or so...)

● Goal: A Dublin Core conformant way to represent metadata provenance.

● Further Information:

● http://www.dublincore.org/groups/provenance/● http://wiki.bib.uni-mannheim.de/dc-provenance/

DCMIMetadata ProvenanceTask Group

Page 3: Extending DCAM for Metadata Provenance

Our motivation

● Back in 2009, we wanted Metametadata...● ... to add debugging information to metadata

statements created with crosswalks,● ... to add source information to metadata

statements merged from different sources,● ... to add additional information like confidence

values to automatic indexing results,which is missing in ALL bibliographic metadata formats.

● Those were the days in Seoul... back in 2009.

Page 4: Extending DCAM for Metadata Provenance

Provenance

● Today, we talk about provenance.● “Provenance of a resource is a record that describes entities and

processes involved in producing and delivering or otherwise influencing that resource.” W3C Provenance Incubator Group

● Provenance is everywhere and provenance can be described using Dublin Core.

● But there are many different ways to integrate the provenance information.

Page 5: Extending DCAM for Metadata Provenance

http://www.europeana.eu

Page 6: Extending DCAM for Metadata Provenance

http://www.europeana.eu

Page 7: Extending DCAM for Metadata Provenance

Metadata„The Provenanceof Mona Lisa“

Metametadata„The Provenanceof the Metadata“

„Data“The Mona Lisa

http://www.europeana.eu

Page 8: Extending DCAM for Metadata Provenance
Page 9: Extending DCAM for Metadata Provenance
Page 10: Extending DCAM for Metadata Provenance

Description Set

„Annotations“

Page 11: Extending DCAM for Metadata Provenance

UML Class Model

DCAM

ProposedDCPROVExtension

Page 12: Extending DCAM for Metadata Provenance

The Element Set

● What elements do we propose to use to describe the metadata provenance?

Page 13: Extending DCAM for Metadata Provenance

The Element Set

● What elements do we propose to use to describe the metadata provenance?

● Guess what:

Dublin Core

Page 14: Extending DCAM for Metadata Provenance

The Element Set

● What elements do we propose to use to describe the metadata provenance?

● Guess what:

Dublin Core● We are discussing additional elements:

● dcprov:sourceModified as subproperty of dc:source● dcprov:creationType, dcprov:rank● ...

Page 15: Extending DCAM for Metadata Provenance

RDF ImplementationUsing Named Graphs

DescriptionSet

AnnotationSet

Page 16: Extending DCAM for Metadata Provenance

Newspaper travel guides example● Travel guides from many different places around the

world.

● 1st level provenance information regarding the guides: author, date of publication...

● 2nd level provenance information regarding the metadata of the guides: publisher, author, license...

● 3rd level provenance infor-mation about the RDF serialization displayed to the user: query executed, time of execution...

Page 17: Extending DCAM for Metadata Provenance

El Viajero travel guides example: Modeling

● 1st level: description set (metadata about a guide)

● 2nd level: annotation set (metadata provenance)

Page 18: Extending DCAM for Metadata Provenance

El Viajero travel guides example: Modeling (2)

● 3rd level: Another annotation set, describing the creation process of the data delivered to the user.

Page 19: Extending DCAM for Metadata Provenance

OAI-PMH example: Modeling

Page 20: Extending DCAM for Metadata Provenance

Discoverability

● Use of SPARQL to retrieve annotation and description sets

● Example:

SELECT ?ds ?p ?o WHERE {GRAPH ?ds { :MonaLisa dc:creator :LeonardoDaVinci . }GRAPH ?as { ?ds ?p ?o . ?as rdf:type dcprov:AnnotationSet . }

}

Page 21: Extending DCAM for Metadata Provenance

Provenance Workshop

● We have a half-day workshop tomorrow afternoon: Friday, 2 PM, Room B/C

● Open agenda:● Discuss questions regarding our work.● What is needed for an application profile?● Connection to W3C Provenance Working Group.● Connection to W3C RDF Working Group.● What is the semantic of a description set?● ...

Page 22: Extending DCAM for Metadata Provenance

Remaining steps

● We want to finish the task group shortly after DC-2011.

● Now is the time for feedback that can be incorporated into the application profile.

● Any help regarding the formulation of the application profile or other kinds of deliverables is highly appreciated.

● Please sign up to the low traffic mailing list: http://www.jiscmail.ac.uk/lists/dc-provenance.html

Page 23: Extending DCAM for Metadata Provenance

To summarize:

Make description sets identifable.

Create descriptions of description sets.

And Dublin Core is readyfor Metadata Provenance.

Thank you.

Credits: Photography "Chain" by Zsuzsanna Kilian