data journals and repositories: getting academic credit for data sharing
TRANSCRIPT
DATA JOURNALS AND REPOSITORIES:
Getting academic credit for data sharing
Gareth Knight
LSHTM Research Data Management Service
Open Access Week
27 October 2016
Agenda
1. Publish data as supplemental material
2. Submit data to a digital repository
3. Publish a Data Paper
Motivations
Enable new research
Higher rate of citation More outputs that can be cited
Ensure validity & replicability
Piwowar HA, Vision TJ. (2013) Data reuse and the open data citation
advantage https://doi.org/10.7717/peerj.175
Data Deposit Process
1Prepare your data
o Select data that underpins
research findings
o Remove personal,
sensitive, and superfluous details
o Prepare documentation
and other support files
2Create metadata record
o Create a metadata record
and enter title, description, names, and other details.
o Specify access status and licence for each file
o Identify corresponding
author
3Update article
o Obtain Digital Object
Identifier
o Add to appropriate location in article
OPEN DATA
Anonymised data that can be made available for open access
• Journals encourage use of licences that allow widest access and use
SAFEGUARDED DATA
Data with personal & sensitive information that can’t be removed requires extra protection
• Provide method of contact
• Access request must be reviewed by multiple
people, not just creator
• Use Data Transfer Agreement which establishes permitted & non-permitted uses
Data access and use
Data should be as open as possible, but as closed as necessary
Data Citation Practices
Reference List Data Access Statement
Some journals require datasets to be included in the reference list:Mandatory:
• Author/creator
• Title,
• Publication date
• Publisher: a repository, institution
• DOI/URL or other access information
Optional:
• Resource Type (dataset, survey)
• Version
A section on 1st page of a journal article, typically next to funding information
If data is openly available:
• the repository name
• the DOI/URL of the data record
• Any licence conditions (e.g. CC-BY)
If data is restricted:
• a DOI/URL that describes data
• reason for access constraints
• how it may be requested
• Brief details of access conditions
Data Access Statements
Open data:
“All data supporting this paper are openly available from the LSHTM data repository at http://dx.doi.org/10.17037/DATA.1234”
Safeguarded data:
“Due to the (ethically/politically/commercially) sensitive nature of the research, interview transcripts will be made available to bona fide researchers, on condition that they provide evidence of ethics approval and sign a Data Transfer Agreement. Further information on the data and access conditions can be found at http://dx.doi.org/10.17037/DATA.120”
Third party data:
"This study used secondary data obtained from third parties. Quarterly labour force data can be accessed by visiting http://dx.doi.org/10.5255/UKDA-SN-7037-5 and registering with the UK Data Service. Traffic pollution information was provided by Brighton County Council and may be requested by contacting their traffic department at [email address]. Additional details on how these datasets were processed are openly available at LSHTM data repository at http://dx.doi.org/10.17037/DATA.120”
Publish a Data Paper
Purpose:
1. Provides a citable journal publication that provides scholarly credit to data creator and others involved in process
2. Ensures data is documented to a minimum standard that considers data
reuse
3. It improve awareness of the data within the scholarly community
a peer reviewed publication which contains facts about one or more datasets
Content:
• Describes the data itself, not hypotheses or arguments built on the data
• Published in XML & PDF format (with additional metadata in ISA-Tab)
• Authors retain copyright and grant first publication rightto journal – encourage
Publication process
Image source: Open Health Datahttp://openhealthdata.metajnl.com/about/submissions/
Pure & hybrid data journals
https://www.wiki.ed.ac.uk/display/datashare/Sources+of+dataset+peer+review
Pure & hybrid data journals
https://www.wiki.ed.ac.uk/display/datashare/Sources+of+dataset+peer+review
Peer review Criteria
• Rigour of procedures described in paper
• Level of detail provided in data description
Data must be:
• hosted in a DOI-capable repository
• publicly available without restriction, excepting reasonable controls related to human privacy issues or public safety
• Ideally made available in an open format and open licence (e.g.CC-BY)
Open Health Data: Data Papers
1. Overview:
• Title, paper author details, abstract, keywords, introduction/Study Description
2. Context:
• Spatial coverage, temporal coverage, species
3. Methods:
• Steps, sampling strategy, quality control, constraints, privacy, ethics
4. Dataset description:
• Dataset creators, object name, data type, ontologies, file format, creation/publication dates, dataset creators, language, licence, repository location
5. Reuse potential
• Funding statement, references
APC:
£100http://openhealthdata.metajnl.com/about/submissions/
Scientific Data: Data Descriptor
• Title
• Authors
• Affiliations
• Abstract
• Background & Summary
• Methods (capture, processing, etc.)
• Code availability
• Data Records - description
• Technical Validation
• Usage Notes
• Author contributions
• References
• Data Citationshttp://www.nature.com/sdata/publish/submission-guidelines#templates
APC:
£890 + optional data storage
costs
Email us at
Read more at http://www.lshtm.ac.uk/research/researchdataman/hello
Tweet us @LSHTMrdm
Get in touch