www.medra.org piero attanasio managing persistent identifiers and digitisation rights for europe...
DESCRIPTION
mEDRA - multilingual European DOI Registration Agency mEDRA is a joint venture between AIE - Associazione Italiana Editori (Italian Publishers Association) Cineca - Technological consortium of Italian universities Thus a public-private-partnership Born in 2004 after a EU co-funded project with the same title mEDRA is a DOI (Digital Object Identifier) Registration Agency Active mainly in Italy and Germany (partnership with MVB) 82% of turnover outside Italy Other field of activities mEDRA provides technology services to Office of Publication of European Europe (DOI registration infrastructure) Italian ISBN agency mEDRA main asset: know how on standard for cultural content Designed by the parent companies as a center for R&D in this fieldTRANSCRIPT
![Page 1: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/1.jpg)
ww
w.m
edra
.org
Piero Attanasio
Managing persistent identifiers and digitisation rights for Europe
Bologna, 27 May 2011EuroCRIS meeting
![Page 2: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/2.jpg)
ww
w.m
edra
.org
Summary
mEDRA A general approach Our experience in right information: Arrow Lesson learned for other applications
![Page 3: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/3.jpg)
ww
w.m
edra
.org
mEDRA - multilingual European DOI Registration Agency
mEDRA is a joint venture between AIE - Associazione Italiana Editori (Italian Publishers Association) Cineca - Technological consortium of Italian universities Thus a public-private-partnership
Born in 2004 after a EU co-funded project with the same title mEDRA is a DOI (Digital Object Identifier) Registration Agency
Active mainly in Italy and Germany (partnership with MVB) 82% of turnover outside Italy
Other field of activities mEDRA provides technology services to
• Office of Publication of European Europe (DOI registration infrastructure)• Italian ISBN agency
mEDRA main asset: know how on standard for cultural content Designed by the parent companies as a center for R&D in this field
![Page 4: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/4.jpg)
ww
w.m
edra
.org
The standard field: a labyrinth of acronyms
ERMI ISBNXRLM PIID
ISRCONIX-LT DOIPLUSONIX
ISSN
M-PIID
RFID
DC
ISMNR
DD
MPE
G-2
1R
EL
ISPI
XSLT
MA
RC
ISTC
AC
AP
LOM
Everybody calls for Ariadne!
The current situation
But Ariadne is an acronym of a project in the educational field
![Page 5: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/5.jpg)
ww
w.m
edra
.org
A scheme to exit the labyrinth
• We can describe any use of any IP entity as
People make deals with stuffs(Norman Paskin, director of IDF, Towards a data dictionary. Identifiers and semantics at work on
the net, Electronic Publishing Services, June 2002, www.doi.org/topics/020522IMI.pdf)
People / use / objects
• We have to identify and describe people, deals, and stuffs (people, uses, and objects)
![Page 6: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/6.jpg)
ww
w.m
edra
.org
A scheme to exit the labyrinth
• Each acronym belong to one cell of the table
Identification
DEALS STUFFPEOPLE
Description
• ISNI *• IPI• VIAF
• ISBN * • ISSN *• ISMN *• DOI *• ISTC *
• ONIX-LT• ACAP
• ONIX• MARC• DC
* ISO standards
Developed, still not usedUnder developmentWell established
• We still have empty cells!
![Page 7: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/7.jpg)
ww
w.m
edra
.org
Current and forthcoming trends
Identification
DEALS STUFFPEOPLE
Description
• ISNI *• IPI• VIAF
• ISBN * • ISSN *• ISMN *• DOI *• ISTC *
• ONIX-LT• ACAP
• ONIX• MARC• DC
Current trends
Forthcoming
![Page 8: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/8.jpg)
ww
w.m
edra
.org
Standard network resolution
An additional layer of complexity (sorry for this!) From identification to resolution:
I.e: reaching resources about the identified resource in a network environment
This is the core idea of the DOI Standardising also this aspect (which goes beyond identification) is a
value
![Page 9: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/9.jpg)
ww
w.m
edra
.org
Resolution vs identification
“What a PI identifies” and “what a PI resolves to” are two different concepts
What the DOI The DOI® (Digital Object Identifier) is a standard for identifying any object of intellectual property. A DOI provides a means of persistently identifying a piece of intellectual property on a digital network and associating it with related current data.
On digital networks, all intellectual property is simply a string of bits; a DOI can apply to any form of intellectual property in any digital environment. DOIs have been called "the bar code for intellectual property": like the physical bar code, they are enabling tools for use all through the supply chain to add value and save cost.
A DOI differs from commonly used internet pointers to material such as the URL – Uniform Resource Locator, the usual means of referring to World Wide Web material – because it identifies an object as a first-class entity, not simply the place where the object is located.
A DOI is also different from commonly used identifiers of intellectual property like standard bibliographic and related identifiers (ISBN, ISSN, ISRC, etc) because it is associated with defined services and is immediately "actionable" on a network. However, the DOI does not compete with these standards since it allows them to be integrated as suffixes in DOI strings.
A DOI is an implementation of the Internet concepts of Uniform Resource Name and Universal Resource Identifier. A DOI is different from abstract naming specifications such as URN in that it is a defined
identification
Identified entity
Info (metadata)
DOI
Resolution 1Resolution 2
Resolution 3
Rightsinfo
How to buy
Resolution 4
![Page 10: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/10.jpg)
ww
w.m
edra
.org
Resolution vs identification
What the DOI The DOI® (Digital Object Identifier) is a standard for identifying any object of intellectual property. A DOI provides a means of persistently identifying a piece of intellectual property on a digital network and associating it with related current data.
On digital networks, all intellectual property is simply a string of bits; a DOI can apply to any form of intellectual property in any digital environment. DOIs have been called "the bar code for intellectual property": like the physical bar code, they are enabling tools for use all through the supply chain to add value and save cost.
A DOI differs from commonly used internet pointers to material such as the URL – Uniform Resource Locator, the usual means of referring to World Wide Web material – because it identifies an object as a first-class entity, not simply the place where the object is located.
A DOI is also different from commonly used identifiers of intellectual property like standard bibliographic and related identifiers (ISBN, ISSN, ISRC, etc) because it is associated with defined services and is immediately "actionable" on a network. However, the DOI does not compete with these standards since it allows them to be integrated as suffixes in DOI strings.
A DOI is an implementation of the Internet concepts of Uniform Resource Name and Universal Resource Identifier. A DOI is different from abstract naming specifications such as URN in that it is a defined
identification
Identified entity (e.g. a book)
Info(metadata)
DOI
Resolution 1
Resolution 2
Rights info
How to buy.
Resolution 3
It is also possible that the PI does not resolve to the identified entity
![Page 11: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/11.jpg)
ww
w.m
edra
.org
Arrow: connecting book record to right information
The issue: High transaction costs in managing rights in digital library
programmes• Aka as “the orphan work problem”, but it is more than this
The need to connect stuff information (a bibliographic record) to people information (rightholder contact) and possibly with license information
• E.g. offered by a Collecting Management Organisation We started from a number of information resources created for
different purposes We set up a network making those resources interoperable
In use in four countries: Germany, France, UK and Spain We are working to expand the network to many other European
countries (“Arrow Plus”)
![Page 12: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/12.jpg)
ww
w.m
edra
.org
Some key characteristics of Arrow
A distribution system made interoperable through use of standard Sometimes cited as a “registry” or a “database”, which is not
Separation between right information management and right clearance This makes the system neutral to legal frameworks and business
models Use of different types of bibliographic resources
National library catalogues, VIAF, Books in print, CMO repertoires (which often use different standards!)
![Page 13: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/13.jpg)
ww
w.m
edra
.org
Lessons learned: an identity issue
Connecting stuffs, people and information associated to both is a general problem
This is first a problem of identity (and identification) Which entity is relevant in the “stuff” domain?
• Unambiguous identification of the book concerned• Unambiguous identification of the work(s) contained in that book
Which entity is relevant in the “people” domain?• Unambiguous identification of the public names• Unambiguous identification of people
![Page 14: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/14.jpg)
ww
w.m
edra
.org
Lessons learned: connecting entities
We need information associated to works when all the resources are based on books (manifestations) Need to connect <Book> data to <Work> data
We need information about people and often have information about names Personal information are delicate:
• moving from <Name> to <Person> • often is from <Name> to <Resource maintaining person information>
Definition of relevant entities is worth spending large efforts Stakeholders awareness and then agreement needed
![Page 15: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/15.jpg)
ww
w.m
edra
.org
Lessons learned: managing errors
We live in a world with imperfect information ISBNs (created in 70ies) are not always there and not always used
properly Work identifier (ISTC) created very recently and at the initial phase
of deployment Right information resident only in proprietary resources
Connecting entities (matching, clustering, relation tracking) is always a probabilistic process
Need to manage errors Never promising the true Combining automatic processes and human intervention Being transparent on this matter and allow users to check the results
![Page 16: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/16.jpg)
ww
w.m
edra
.org
Can our experience be relevant for other application?
One example: bibliometric data Indexes rely on data sources, which by definition are imperfect Need to connect:
<Manifestation> to <Expression> to <Work>• Different versions (e.g. pre-print, publisher version, etc.)• Insufficient data about monographs
<Work> to <Citations>• Facilitated by the use of the DOI, but still problematic in many fields
Again: managing errors Need to know errors data Balancing automatic processes and human intervention
![Page 17: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/17.jpg)
ww
w.m
edra
.org
How to build on
Once information has been discovered (and possibly assessed), it is crucial that it is re-usable
Again: proper use of standards: Please, don’t call your solution “a standard” A standard requires broad stakeholders consensus
Registering data in the appropriate standard repositories E.g.: Arrow routine to register the ISTC for every <Work> discovered
in the process One further step: making information available through use of
standard resolution systems E.g.: your URN resolver is not enough. Use something that is
accepted by vast communities (such as Handle)
![Page 18: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/18.jpg)
ww
w.m
edra
.org
Identification –> Resolution –> Access
Precise and persistent identification is a prerequisite for resolution Resolution facilitates access to precisely identified resource Get a look at “The Answer to the Machine is in the Machine”, one
of the Big Ideas for the Digital Agenda launched by the European Commission The concept: creating stable resolution system between IP entities
and information about IP rights IP right information is not stable, so it cannot be embedded in the
manifestation Through persistent identifiers supported by resolution mechanism, it
is possible to connect the entity with IP information
![Page 19: Www.medra.org Piero Attanasio Managing persistent identifiers and digitisation rights for Europe Bologna, 27 May 2011 EuroCRIS meeting](https://reader036.vdocuments.net/reader036/viewer/2022083120/5a4d1b417f8b9ab0599a0f2f/html5/thumbnails/19.jpg)
ww
w.m
edra
.org
Thank you - Grazie
Piero [email protected]
Further information on ARROW www.arrow-net.eu