www.medra.org piero attanasio managing persistent identifiers and digitisation rights for europe...

19
www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

Upload: godfrey-dixon

Post on 18-Jan-2018

212 views

Category:

Documents


0 download

DESCRIPTION

mEDRA - multilingual European DOI Registration Agency  mEDRA is a joint venture between  AIE - Associazione Italiana Editori (Italian Publishers Association)  Cineca - Technological consortium of Italian universities  Thus a public-private-partnership  Born in 2004 after a EU co-funded project with the same title  mEDRA is a DOI (Digital Object Identifier) Registration Agency  Active mainly in Italy and Germany (partnership with MVB)  82% of turnover outside Italy  Other field of activities  mEDRA provides technology services to Office of Publication of European Europe (DOI registration infrastructure) Italian ISBN agency  mEDRA main asset: know how on standard for cultural content  Designed by the parent companies as a center for R&D in this field

TRANSCRIPT

Page 1: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Piero Attanasio

Managing persistent identifiers and digitisation rights for Europe

Bologna, 27 May 2011EuroCRIS meeting

Page 2: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Summary

mEDRA A general approach Our experience in right information: Arrow Lesson learned for other applications

Page 3: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

mEDRA - multilingual European DOI Registration Agency

mEDRA is a joint venture between AIE - Associazione Italiana Editori (Italian Publishers Association) Cineca - Technological consortium of Italian universities Thus a public-private-partnership

Born in 2004 after a EU co-funded project with the same title mEDRA is a DOI (Digital Object Identifier) Registration Agency

Active mainly in Italy and Germany (partnership with MVB) 82% of turnover outside Italy

Other field of activities mEDRA provides technology services to

• Office of Publication of European Europe (DOI registration infrastructure)• Italian ISBN agency

mEDRA main asset: know how on standard for cultural content Designed by the parent companies as a center for R&D in this field

Page 4: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

The standard field: a labyrinth of acronyms

ERMI ISBNXRLM PIID

ISRCONIX-LT DOIPLUSONIX

ISSN

M-PIID

RFID

DC

ISMNR

DD

MPE

G-2

1R

EL

ISPI

XSLT

MA

RC

ISTC

AC

AP

LOM

Everybody calls for Ariadne!

The current situation

But Ariadne is an acronym of a project in the educational field

Page 5: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

A scheme to exit the labyrinth

• We can describe any use of any IP entity as

People make deals with stuffs(Norman Paskin, director of IDF, Towards a data dictionary. Identifiers and semantics at work on

the net, Electronic Publishing Services, June 2002, www.doi.org/topics/020522IMI.pdf)

People / use / objects

• We have to identify and describe people, deals, and stuffs (people, uses, and objects)

Page 6: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

A scheme to exit the labyrinth

• Each acronym belong to one cell of the table

Identification

DEALS STUFFPEOPLE

Description

• ISNI *• IPI• VIAF

• ISBN * • ISSN *• ISMN *• DOI *• ISTC *

• ONIX-LT• ACAP

• ONIX• MARC• DC

* ISO standards

Developed, still not usedUnder developmentWell established

• We still have empty cells!

Page 7: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Current and forthcoming trends

Identification

DEALS STUFFPEOPLE

Description

• ISNI *• IPI• VIAF

• ISBN * • ISSN *• ISMN *• DOI *• ISTC *

• ONIX-LT• ACAP

• ONIX• MARC• DC

Current trends

Forthcoming

Page 8: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Standard network resolution

An additional layer of complexity (sorry for this!) From identification to resolution:

I.e: reaching resources about the identified resource in a network environment

This is the core idea of the DOI Standardising also this aspect (which goes beyond identification) is a

value

Page 9: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Resolution vs identification

“What a PI identifies” and “what a PI resolves to” are two different concepts

What the DOI The DOI® (Digital Object Identifier) is a standard for identifying any object of intellectual property. A DOI provides a means of persistently identifying a piece of intellectual property on a digital network and associating it with related current data.

On digital networks, all intellectual property is simply a string of bits; a DOI can apply to any form of intellectual property in any digital environment. DOIs have been called "the bar code for intellectual property": like the physical bar code, they are enabling tools for use all through the supply chain to add value and save cost.

A DOI differs from commonly used internet pointers to material such as the URL – Uniform Resource Locator, the usual means of referring to World Wide Web material – because it identifies an object as a first-class entity, not simply the place where the object is located.

A DOI is also different from commonly used identifiers of intellectual property like standard bibliographic and related identifiers (ISBN, ISSN, ISRC, etc) because it is associated with defined services and is immediately "actionable" on a network. However, the DOI does not compete with these standards since it allows them to be integrated as suffixes in DOI strings.

A DOI is an implementation of the Internet concepts of Uniform Resource Name and Universal Resource Identifier. A DOI is different from abstract naming specifications such as URN in that it is a defined

identification

Identified entity

Info (metadata)

DOI

Resolution 1Resolution 2

Resolution 3

Rightsinfo

How to buy

Resolution 4

Page 10: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Resolution vs identification

What the DOI The DOI® (Digital Object Identifier) is a standard for identifying any object of intellectual property. A DOI provides a means of persistently identifying a piece of intellectual property on a digital network and associating it with related current data.

On digital networks, all intellectual property is simply a string of bits; a DOI can apply to any form of intellectual property in any digital environment. DOIs have been called "the bar code for intellectual property": like the physical bar code, they are enabling tools for use all through the supply chain to add value and save cost.

A DOI differs from commonly used internet pointers to material such as the URL – Uniform Resource Locator, the usual means of referring to World Wide Web material – because it identifies an object as a first-class entity, not simply the place where the object is located.

A DOI is also different from commonly used identifiers of intellectual property like standard bibliographic and related identifiers (ISBN, ISSN, ISRC, etc) because it is associated with defined services and is immediately "actionable" on a network. However, the DOI does not compete with these standards since it allows them to be integrated as suffixes in DOI strings.

A DOI is an implementation of the Internet concepts of Uniform Resource Name and Universal Resource Identifier. A DOI is different from abstract naming specifications such as URN in that it is a defined

identification

Identified entity (e.g. a book)

Info(metadata)

DOI

Resolution 1

Resolution 2

Rights info

How to buy.

Resolution 3

It is also possible that the PI does not resolve to the identified entity

Page 11: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Arrow: connecting book record to right information

The issue: High transaction costs in managing rights in digital library

programmes• Aka as “the orphan work problem”, but it is more than this

The need to connect stuff information (a bibliographic record) to people information (rightholder contact) and possibly with license information

• E.g. offered by a Collecting Management Organisation We started from a number of information resources created for

different purposes We set up a network making those resources interoperable

In use in four countries: Germany, France, UK and Spain We are working to expand the network to many other European

countries (“Arrow Plus”)

Page 12: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Some key characteristics of Arrow

A distribution system made interoperable through use of standard Sometimes cited as a “registry” or a “database”, which is not

Separation between right information management and right clearance This makes the system neutral to legal frameworks and business

models Use of different types of bibliographic resources

National library catalogues, VIAF, Books in print, CMO repertoires (which often use different standards!)

Page 13: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Lessons learned: an identity issue

Connecting stuffs, people and information associated to both is a general problem

This is first a problem of identity (and identification) Which entity is relevant in the “stuff” domain?

• Unambiguous identification of the book concerned• Unambiguous identification of the work(s) contained in that book

Which entity is relevant in the “people” domain?• Unambiguous identification of the public names• Unambiguous identification of people

Page 14: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Lessons learned: connecting entities

We need information associated to works when all the resources are based on books (manifestations) Need to connect <Book> data to <Work> data

We need information about people and often have information about names Personal information are delicate:

• moving from <Name> to <Person> • often is from <Name> to <Resource maintaining person information>

Definition of relevant entities is worth spending large efforts Stakeholders awareness and then agreement needed

Page 15: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Lessons learned: managing errors

We live in a world with imperfect information ISBNs (created in 70ies) are not always there and not always used

properly Work identifier (ISTC) created very recently and at the initial phase

of deployment Right information resident only in proprietary resources

Connecting entities (matching, clustering, relation tracking) is always a probabilistic process

Need to manage errors Never promising the true Combining automatic processes and human intervention Being transparent on this matter and allow users to check the results

Page 16: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Can our experience be relevant for other application?

One example: bibliometric data Indexes rely on data sources, which by definition are imperfect Need to connect:

<Manifestation> to <Expression> to <Work>• Different versions (e.g. pre-print, publisher version, etc.)• Insufficient data about monographs

<Work> to <Citations>• Facilitated by the use of the DOI, but still problematic in many fields

Again: managing errors Need to know errors data Balancing automatic processes and human intervention

Page 17: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

How to build on

Once information has been discovered (and possibly assessed), it is crucial that it is re-usable

Again: proper use of standards: Please, don’t call your solution “a standard” A standard requires broad stakeholders consensus

Registering data in the appropriate standard repositories E.g.: Arrow routine to register the ISTC for every <Work> discovered

in the process One further step: making information available through use of

standard resolution systems E.g.: your URN resolver is not enough. Use something that is

accepted by vast communities (such as Handle)

Page 18: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Identification –> Resolution –> Access

Precise and persistent identification is a prerequisite for resolution Resolution facilitates access to precisely identified resource Get a look at “The Answer to the Machine is in the Machine”, one

of the Big Ideas for the Digital Agenda launched by the European Commission The concept: creating stable resolution system between IP entities

and information about IP rights IP right information is not stable, so it cannot be embedded in the

manifestation Through persistent identifiers supported by resolution mechanism, it

is possible to connect the entity with IP information

Page 19: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting

ww

w.m

edra

.org

Thank you - Grazie

Piero [email protected]

Further information on ARROW www.arrow-net.eu