Creating Citable Data Identifiers
![Page 1: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/1.jpg)
Creating Citable Data Identifiers
Ryan Scherle, Mark Diggory
![Page 2: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/2.jpg)
Mimosa house 807 South Virginia Dare Trail Kill Devil Hills, NC USA 27948
![Page 3: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/3.jpg)
1903-12-17 36.019705 N, 75.668769 W
![Page 4: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/4.jpg)
79330-S84-A41 WP0ZZZ99ZTS392124
![Page 5: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/5.jpg)
Loxosceles reclusa
![Page 6: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/6.jpg)
Citing identifiers
- Mimosa house
- 807 South Virginia Dare Trail
- 1903-12-17
- 27948
- Loxosceles reclusa
- 36.019705 N, 75.668769 W
- 79330-S84-A41
- WP0ZZZ99ZTS392124
![Page 7: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/7.jpg)
Identifiers matter
- Some identifiers are machine-friendly, some are human-friendly
- For citations, you need to strike a balance
- Good identifiers are a critical selling point for a repository
![Page 8: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/8.jpg)
![Page 9: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/9.jpg)
http://purl.dlib.indiana.edu/iudl/lilly/slocum/LL-SLO-009276
![Page 10: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/10.jpg)
![Page 11: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/11.jpg)
![Page 12: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/12.jpg)
Principles of citable identifiers
![Page 13: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/13.jpg)
![Page 14: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/14.jpg)
![Page 15: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/15.jpg)
1. Use DOIs
http://dx.doi.org/10.5061/dryad.123ab
- Scientists are familiar with DOIs
- DOIs are supported by many tools and services

Current support:
- EPrints: No
- DSpace: No
- Fedora: With work
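
As a rough illustration of why a DOI works well as a citable identifier, the sketch below normalizes the forms a reader might paste (a bare DOI, a `doi:` label, or a resolver URL) into one canonical string and back into a link. The function names and the loose validation pattern are assumptions made for this example, not part of any repository platform.

```python
import re

# Loose shape check (an assumption for this sketch): "10.", a numeric
# registrant code, a slash, then a non-empty suffix without whitespace.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")

# Prefixes people commonly paste in front of a DOI.
KNOWN_PREFIXES = (
    "http://dx.doi.org/",
    "https://dx.doi.org/",
    "http://doi.org/",
    "https://doi.org/",
    "doi:",
)


def normalize_doi(raw: str) -> str:
    """Return the bare DOI, e.g. '10.5061/dryad.123ab'."""
    doi = raw.strip()
    for prefix in KNOWN_PREFIXES:
        if doi.lower().startswith(prefix):
            doi = doi[len(prefix):]
            break
    if not DOI_PATTERN.match(doi):
        raise ValueError(f"not a DOI: {raw!r}")
    return doi


def doi_to_url(raw: str, resolver: str = "http://dx.doi.org/") -> str:
    """Build a resolvable link from any of the accepted forms."""
    return resolver + normalize_doi(raw)
```

For example, `doi_to_url("doi:10.5061/dryad.bb7m4")` yields a link in the same form as the URLs shown on the slides.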
![Page 16: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/16.jpg)
2. Keep identifiers simple
http://dx.doi.org/10.5061/dryad.123ab
- Complex identifiers are fine for machines, but they’re bad for humans.
- Despite best intentions, humans sometimes need to work with identifiers manually.

http://dx.doi.org/10.1179/1743131X11Y.0000000009
http://dx.doi.org/10.1016/B978-0-12-220851-5.00003-4
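
One way to keep suffixes short and easy to retype is to encode a running counter in a small alphabet that leaves out look-alike characters. The sketch below is only an illustration of that idea; the alphabet, the minimum length, and the function name are assumptions, and the slides do not say how the example suffix `123ab` was generated.

```python
# Hypothetical alphabet: digits and lowercase letters, minus characters
# that are easily confused when read aloud or typed (0/o, 1/l/i).
ALPHABET = "23456789abcdefghjkmnpqrstuvwxyz"


def short_suffix(n: int, min_length: int = 5) -> str:
    """Encode a non-negative counter as a short, human-friendly suffix."""
    if n < 0:
        raise ValueError("counter must be non-negative")
    base = len(ALPHABET)
    digits = []
    while True:
        n, remainder = divmod(n, base)
        digits.append(ALPHABET[remainder])
        if n == 0:
            break
    # Pad on the left so every suffix has the same minimum length.
    return "".join(reversed(digits)).rjust(min_length, ALPHABET[0])
```

With 31 symbols, five characters cover roughly 28 million distinct suffixes, which keeps identifiers far shorter than the two publisher DOIs above.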
![Page 17: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/17.jpg)
2. Keep identifiers simple

Current support:
- EPrints: Yes
- DSpace: Yes
- Fedora: Yes
![Page 18: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/18.jpg)
3. Use syntax to illustrate relationships
http://dx.doi.org/10.5061/dryad.123ab/3
- Adding a tiny bit of semantics to an identifier is incredibly useful

http://files.eprints.org/691/
http://files.eprints.org/447/
http://files.eprints.org/556/
- Useful for various human “hacks”
- Useful for statistics
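
A minimal sketch of how such syntax can be exploited, assuming (the slide only shows the identifier) that a trailing `/3` names the third component of the deposit `10.5061/dryad.123ab`. The type and function names are hypothetical.

```python
from collections import Counter
from typing import Iterable, NamedTuple, Optional


class DataDoi(NamedTuple):
    package: str          # DOI of the whole deposit
    part: Optional[int]   # numbered component, or None for the deposit itself


def split_data_doi(doi: str) -> DataDoi:
    """Split '10.5061/dryad.123ab/3' into its deposit and component number."""
    prefix, _, suffix = doi.partition("/")
    base, sep, tail = suffix.rpartition("/")
    if sep and tail.isdigit():
        return DataDoi(f"{prefix}/{base}", int(tail))
    return DataDoi(doi, None)


def downloads_per_package(downloaded: Iterable[str]) -> Counter:
    """Roll per-file download events up to their parent deposit."""
    return Counter(split_data_doi(d).package for d in downloaded)
```

The same split supports the human "hack" of trimming the last path segment to move from a file to its parent deposit, and the roll-up shows how statistics can be aggregated without a database lookup.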
![Page 19: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/19.jpg)
3. Use syntax to illustrate relationships

Current support:
- EPrints: No
- DSpace: No
- Fedora: With work
![Page 20: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/20.jpg)
4. When “meaning-bearing” content changes, create a versioned identifier
- Scientists want data to be invariant to enable reuse by machines
- Even a single bit makes a difference
- Watch out for implicit abstractions…

http://dx.doi.org/10.5061/dryad.123ab/thumbnail
- What about DOI conventions?
![Page 21: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/21.jpg)
5. When “meaningless” content changes, retain the current identifier
- Descriptive metadata must be editable without creating a new identifier.
- Humans rarely care about metadata changes, especially for citation purposes!
- Caveat: machine-oriented systems may consider the “metadata” to be data, which requires identifier changes
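
Principles 4 and 5 can be read as a single decision rule: compare the data bytes, and mint a new versioned identifier only when they differ. The sketch below assumes a SHA-256 checksum as the test for "even a single bit" and a made-up `.vN` suffix for versions; the slides leave the actual version syntax open, so both are illustrative choices.

```python
import hashlib
from dataclasses import dataclass, field


def checksum(data: bytes) -> str:
    """Fingerprint of the meaning-bearing content."""
    return hashlib.sha256(data).hexdigest()


@dataclass
class Record:
    base_doi: str
    version: int
    data_sha256: str
    metadata: dict = field(default_factory=dict)

    @property
    def identifier(self) -> str:
        # Hypothetical version syntax, chosen only for this sketch.
        return self.base_doi if self.version == 1 else f"{self.base_doi}.v{self.version}"


def apply_update(current: Record, new_data: bytes, new_metadata: dict) -> Record:
    """Return the record that should exist after an edit."""
    new_sum = checksum(new_data)
    if new_sum != current.data_sha256:
        # Principle 4: the data changed, so create a versioned identifier.
        return Record(current.base_doi, current.version + 1, new_sum, dict(new_metadata))
    # Principle 5: only descriptive metadata changed, so keep the identifier.
    return Record(current.base_doi, current.version, current.data_sha256, dict(new_metadata))
```

A machine-oriented system that treats metadata as data (the caveat above) would include the metadata in the checksum as well.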
![Page 22: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/22.jpg)
Current versioning support
- EPrints: Support for flexible versioning/relationships, but no support for expressing these relationships in identifiers.
- DSpace: None.
- Fedora: Implicit versioning of all data and metadata. This is highly useful, but it is too granular for citation purposes.
![Page 23: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/23.jpg)
Principles of citable identifiers
1. Use DOIs
2. Keep identifiers simple
3. Use syntax to illustrate relationships
4. When “meaning-bearing” content changes, create a versioned identifier
5. When “meaningless” content changes, retain the current identifier
![Page 24: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/24.jpg)
Hacking DSpace to support…
- DOI identifier registration
- Semantics in identifiers
- Citation publication
- Versioning
![Page 25: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/25.jpg)
DSpace identifier services
- Handle system independence
  - More future identifier systems will come.
- Granular control
  - Separate reservation from registration
- Citation
  - Registration of metadata with external services
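
To make the reservation/registration split concrete, here is a small in-memory sketch. It is not the DSpace API: the class, its method names, the `example.N` suffix scheme, and the rule that registration needs a title are all assumptions made for illustration.

```python
from enum import Enum
from typing import Dict, Tuple


class State(Enum):
    RESERVED = "reserved"      # identifier chosen, nothing sent out yet
    REGISTERED = "registered"  # metadata handed to the external registry


class IdentifierService:
    """Hypothetical identifier service, independent of any one scheme."""

    def __init__(self, prefix: str) -> None:
        self._prefix = prefix
        self._records: Dict[str, Tuple[int, State]] = {}
        self._counter = 0

    def reserve(self, item_id: int) -> str:
        """Pick an identifier early, e.g. so it can appear in a manuscript."""
        self._counter += 1
        doi = f"{self._prefix}/example.{self._counter}"
        self._records[doi] = (item_id, State.RESERVED)
        return doi

    def register(self, doi: str, metadata: dict) -> None:
        """Publish the identifier; a real service would call a registry here."""
        item_id, _state = self._records[doi]  # KeyError if never reserved
        if not metadata.get("title"):
            raise ValueError("registration needs at least a title")
        self._records[doi] = (item_id, State.REGISTERED)

    def state(self, doi: str) -> State:
        return self._records[doi][1]
```

Keeping the two steps apart lets a submitter cite the identifier before the item is approved, while the registry only ever sees items that were actually published.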
![Page 26: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/26.jpg)
DSpace identifier services
![Page 27: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/27.jpg)
DataCite content service
![Page 28: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/28.jpg)
Promoting accurate citations
- Added suggested citation formats up front
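
A suggested citation can be generated from fields the repository already holds. The layout below is a hypothetical format chosen for this sketch, and the author names and title in the usage note are placeholders; the slides do not specify the format that was actually displayed.

```python
from typing import Sequence


def suggested_citation(
    authors: Sequence[str],
    year: int,
    title: str,
    repository: str,
    doi: str,
) -> str:
    """Assemble a copy-and-paste citation string that ends with the DOI."""
    names = ", ".join(authors)
    return f"{names} ({year}) {title}. {repository}. doi:{doi}"
```

Calling `suggested_citation(["Doe J", "Roe R"], 2011, "An example dataset", "Example Repository", "10.5061/dryad.123ab")` returns `Doe J, Roe R (2011) An example dataset. Example Repository. doi:10.5061/dryad.123ab`.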
![Page 29: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/29.jpg)
Versioning
- Versioning is item “editioning”
- Creation of new versions is a “user mediated” process (submitter or reviewer)
- Versioning does not alter the original item
- Version relationships are maintained independent of the item’s metadata
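
The last two points suggest a data model in which each version is a separate, unmodified item and the ordering of versions lives in its own structure. The sketch below illustrates that separation under assumed names (`Item`, `VersionHistory`, `create_new_version`); it does not reproduce the DSpace implementation.

```python
from dataclasses import dataclass, field, replace
from typing import Dict, List


@dataclass(frozen=True)
class Item:
    """An archived item; frozen so a new version cannot alter the original."""
    item_id: int
    title: str


@dataclass
class VersionHistory:
    """Ordered item ids, kept apart from the items' own metadata."""
    item_ids: List[int] = field(default_factory=list)

    def latest(self) -> int:
        return self.item_ids[-1]

    def version_of(self, item_id: int) -> int:
        return self.item_ids.index(item_id) + 1


def create_new_version(
    items: Dict[int, Item],
    history: VersionHistory,
    original_id: int,
    new_id: int,
    **changes: str,
) -> Item:
    """Copy the original into a new item and append it to the history."""
    new_item = replace(items[original_id], item_id=new_id, **changes)
    items[new_id] = new_item
    history.item_ids.append(new_id)
    return new_item
```

Because the history holds only item ids, editing a title or abstract never disturbs the version chain.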
![Page 30: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/30.jpg)
Submission-based revisions
![Page 31: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/31.jpg)
![Page 32: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/32.jpg)
![Page 33: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/33.jpg)
Result: Citable data versions
doi:10.5061/dryad.bb7m4
![Page 34: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/34.jpg)
Future technical directions
- Add metadata versioning under the hood; this may require rethinking some of the current system
- Integrate our changes into core DSpace
  - Moving these features into the core requires further discussion with the DSpace user community
![Page 35: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/35.jpg)
How are we doing?
For 186 articles associated with Dryad deposits:
- 77% had “good” citations to the data
- 2% had “bad” citations to the data
- 21% had no data citations
Standards for data citation are still evolving. Journals have yet to agree on where to place data citations, and authors are just starting to become familiar with the concept.
![Page 36: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/36.jpg)
![Page 37: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/37.jpg)
![Page 38: Creating Citable Data Identifiers](https://reader035.vdocuments.net/reader035/viewer/2022062501/56816448550346895dd60f46/html5/thumbnails/38.jpg)
What should you do now?
- Analyze how data is used and cited outside the repository
- Determine whether use is more machine-oriented or more human-oriented
- Design identifiers and identifier management to facilitate the observed uses