ezid: easy persistent identifiers and data citation

24
EZID: Easy Persistent Identifiers and Data Citation 31 October 2011 John Kunze and Joan Starr California Digital Library

Upload: university-of-california-curation-center

Post on 11-Nov-2014

2.880 views

Category:

Technology


4 download

DESCRIPTION

DLF Forum Presentation on EZID, Persistent Identifier Service

TRANSCRIPT

Page 1: EZID: Easy Persistent Identifiers and Data Citation

 EZID: Easy Persistent Identifiers and Data Citation

31 October 2011

John Kunze and Joan StarrCalifornia Digital Library

Page 2: EZID: Easy Persistent Identifiers and Data Citation

EZID: Easy Persistent Identifiers & Data Citation

IntroductionCitation, DataCite and EZID

Who? Why? What?

EZID’s next steps: tech talkNew stuff, use cases, feedback

Feedback

Page 3: EZID: Easy Persistent Identifiers and Data Citation

California Digital Library (CDL)

Page 4: EZID: Easy Persistent Identifiers and Data Citation
Page 5: EZID: Easy Persistent Identifiers and Data Citation

The research data problem

an article about data, but no data

Page 6: EZID: Easy Persistent Identifiers and Data Citation

What citation offers

• To aid scientific reproducibility• To provide fair credit• To ensure scientific transparency and reasonable accountability

• To aid in tracking the impact, including– helping data authors verify use of their data and– helping future data users identify how others have used the data

Page 7: EZID: Easy Persistent Identifiers and Data Citation
Page 8: EZID: Easy Persistent Identifiers and Data Citation

German National Library of Economics (ZBW)

German National Library of Science and Technology (TIB)

German National Library of Medicine (ZB MED)

GESIS - Leibniz Institute for the Social Sciences, Germany

Australian National Data Service (ANDS)

ETH Zurich, Switzerland

Canada Institute for Scientific and Technical Information

(CISTI)

Technical Information Center of Denmark

Institute for Scientific & Technical Information (INIST-

CNRS), France

TU Delft Library, The Netherlands

The Swedish National Data Service (SNDS)

The British Library , UK

California Digital Library (CDL), USA

Office of Scientific & Technical Information (OSTI), USA

Purdue University Library

DataCite

Page 9: EZID: Easy Persistent Identifiers and Data Citation

EZID: long-term identifiers made easy

take control of the management

and distribution of your research,

share and get credit for it, and

build your reputation through its

collection and documentation

Primary Functions1. Create persistent identifiers2. Manage identifiers over time3. Manage associated metadata over time

Page 10: EZID: Easy Persistent Identifiers and Data Citation

http://n2t.net/ezid

Page 11: EZID: Easy Persistent Identifiers and Data Citation

Current EZID Clients

UC Berkeley Library (on behalf of the UC Berkeley campus)  Sponsored accounts:

The Digital Archaeological Record (tDAR)

Open Context Dryad Digital Repository

CRCNS.org

UC San Diego Library (on behalf of the UC San Diego campus)

Fred Hutchinson Cancer Research Center

American Astronomical Society (AAS) LabArchives

Centre national de documentation pédagogique (CNDP)

National Center for Atmospheric Research (NCAR)

Cornell Institute for Social & Economic Research

USGS/Earth Sciences Data Clearinghouse (formerly National Biological Info. Infrastructure)

A partial list

Page 12: EZID: Easy Persistent Identifiers and Data Citation

New features in trial or active development

• Service replicas:  manager and resolver• URN (Uniform Resource Name) support (urn:uuid:)• Suffix pass-thru: do NT and get N/ST/S for free• Tombstone/incubation/... surrogate pages, id status (reserved or public), and multiple targets

• Identifier status:  reserved or public• Content negotiation and inflections:  ?    ??    /    .• ARK community and governance, eg, registries

Page 13: EZID: Easy Persistent Identifiers and Data Citation

Service replicas

• EZID is an id manager that populates N2T– It tolerates down time– Other id manager services might one day populate N2T

• N2T (Name-to-Thing) is an id resolver that ...– It is very intolerant of down time, since it services all 

access requests for locations and metadata– N2T was designed with global replication in mind

Page 14: EZID: Easy Persistent Identifiers and Data Citation

URN support

• N2T and EZID are agnostic about kinds of things, names, and metadata– Digital, physical, abstract, living, fictional, groups, etc.– Any metadata & known profiles (DataCite, Dublin Kernel)– ARK, DOI, URN, Handle, IVOA, LSID, PMID, etc., requiring 

namespace “write” permission, eg, via DataCite

• In test:  Uniform Resource Names (URNs)– urn:uuid namespace

Page 15: EZID: Easy Persistent Identifiers and Data Citation

Under the hood keysmithing terms: bows, shoulders, blades, tips, covers

Page 16: EZID: Easy Persistent Identifiers and Data Citation

Suffix pass-thru: NT gets N/ST/S for free

Idea: if name N points to target T, then requests for N extended by any suffix N/S can take you to T/S

• For dataset doi:10.5072/Big4 with 10,000 nameable components,– Register and manage 10,001 names or 1 name?– Eg, http://x.y.z/foo/Big4/db/table/cell/45-8.txt could be reached with doi:1.5072/Big4/table/cell/45-8.txt

• In test with ARKs. Conflict with other resolvers?

Page 17: EZID: Easy Persistent Identifiers and Data Citation

Tombstone and other surrogate pages

Tombstone, incubation, and other surrogate pages (probation?) auto-generated from metadata, eg,

      http://n2t.net/ezid/tombstone/id/ark:/20775/bb3243444z

Page 18: EZID: Easy Persistent Identifiers and Data Citation

Reserved identifiers and multiple targets

• Some ids must be created and managed (reserved) before going public, eg, for manuscript preparation

• In test: infrastructure for multiple targets and multiple instances of any metadata element

• What should user experience be for multiple targets?  – Present a menu of targets (burden of choice)?– One target chosen for them (burden of inflexibility)?

Page 19: EZID: Easy Persistent Identifiers and Data Citation

Identifier (ARK) inflections:  ?   ??   /   .

• Inflect: change endings w.o. creating new words– Terminal ? means “I want metadata”, which is similar to linked data content negotiation (also in EZID test)

– Terminal ?? means “I also want support metadata”– Drawing board: / could mean “I want a landing page” and  .  could mean “I want the usual computable thing”

• Allow inflections beyond ARKs to DOIs/URNs?

Page 20: EZID: Easy Persistent Identifiers and Data Citation

Example: http://n2t.net/ark:/13030/qt0349g1rh?

erc:who: Renninger, Heidi,; Phillips, Nathan,; Hodel, Donald,what: Comparative hydraulic and anatomic properties in palm trees (Washingtonia robusta) of varying heightswhen: 2009-04-29where: ark:/13030/qt0349g1rh

Renninger, Heidi; Phillips, Nathan; Hodel, Donald. “Comparative hydraulic and anatomic properties in palm trees (Washingtonia robusta) of varying heights”. 2009-04-29.  ark:/13030/qt0349g1rh

HTML content with embedded comments in ANVL/ERC and RDF

Page 21: EZID: Easy Persistent Identifiers and Data Citation

ARK community and governance

• ARKs soon to have a mailing list• Topics: governance, community, standardization• Registry maintenance:  shoulders and NAANs• N2T consortium with alternative EZID-like services

Page 22: EZID: Easy Persistent Identifiers and Data Citation

For information

• http://www.cdlib.org/services/uc3/ezid• Understanding ids and conventions (shoulders, etc)• Choosing the right identifier (ARK vs DOI? ARK and DOI?)• EZID FAQs and N2T vision• EZID Service Guidelines• EZID Handout/brochure• EZID webinars & slides

Contact Joan Starr at [email protected]

Page 23: EZID: Easy Persistent Identifiers and Data Citation

For (even) more informationEZID http://n2t.net/ezid/  http://www.cdlib.org/services/uc3/ezid/

UC Curation Center http://www.cdlib.org/uc3 [email protected]

UC3 webinar series http://www.cdlib.org/uc3/uc3webinars.html 

UC3/CDLStephen Abrams  David LoyLisa Colvin  Mark Reyes Patricia Cruse  Abhishek SalveScott Fisher  Tracy Seneca Erik Hetzner  Carly Strasser Greg Janée Joan StarrJohn Kunze  Marisa StrongMargaret Low  Perry Willett

Page 24: EZID: Easy Persistent Identifiers and Data Citation

Questions?

by Horia Varlan http://www.flickr.com/photos/horiavarlan/4273168957/in/photostream/