Data Archiving and Networked Services
DANS is an institute of KNAW and NWO
Towards sustainable data sharing
Peter Doorn
Open Access Week, University of Groningen, 26 October 2012
Contents
• Data is hot!
• Open Data policies
• Commission Schuyt
• DANS and digital archiving
• Connecting community & content
• Enhanced publications or data in context
Netherlands: Renowned psychologist confesses to falsifications
Data is hot!
• Article on “trends for 2012”: “Keeping your research data secret until they are finally printed in a scientific journal is so 2011”
• Neelie Kroes (Vice-President of the European Commission responsible for the Digital Agenda): “Data is the new gold”
Open Access and Open Data Policies
• NWO: NWO-funded data collection has to be publicly available
– for the humanities (GW) and social science (MaGW) departments of NWO, data contracts have to be signed with DANS
– long-term preservation and access will be accepted as a budget line in project proposals
• KNAW: All Academy research data is stored sustainably and made publicly accessible unless there are serious reasons for not doing so (those reasons are set out in an Academy institute’s data memorandum).
• European Commission: Data Management Plan to become part of proposals for Horizon 2020
NWO and DANS want to ensure that new research data are archived in a sustainable manner and made available for further research. After having been granted the subsidy, researchers are obliged to enter into a data contract with DANS. The data contract is intended to guarantee accessibility to the data as well as their digital sustainability for additional scientific research. It is based on the General Grant Conditions of NWO, available in the recently updated Regeling Subsidieverlening NWO.
Outline of KNAW Open Data Policy
What to preserve digitally?
All research data, unless the data memorandum stipulates that this is not necessary.
Where to preserve digitally?
In a data repository that complies with generally accepted standards, for example the Data Seal of Approval.
When should material be uploaded to the repository?
Preferably immediately after the dataset is complete, but no later than when the research is completed.
What material should be openly accessible?
All research data, unless the data memorandum stipulates that this is not necessary.
When should material be made openly accessible?
Preferably immediately after it is deposited. An embargo of at most eighteen months after conclusion of the research is possible (if stipulated in the data memorandum).
Metadata
All metadata is made accessible, unless the data memorandum stipulates otherwise.
Block on access
Data whose public nature is disputed can quickly be made non-accessible by the repository administrator.
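The embargo rule in the outline above can be read as a small date calculation. The following is a minimal sketch, not part of the KNAW policy text: the function names are hypothetical, and it assumes that "eighteen months" means calendar months and that, without an embargo in the data memorandum, the latest date for open access coincides with the conclusion of the research.

```python
import calendar
from datetime import date

MAX_EMBARGO_MONTHS = 18  # "at most eighteen months" in the policy outline


def add_months(start: date, months: int) -> date:
    """Return the date `months` calendar months after `start`.

    The day is clamped to the last day of the target month
    (for example 31 August + 18 months gives 28 February).
    """
    month_index = start.year * 12 + (start.month - 1) + months
    year, month0 = divmod(month_index, 12)
    month = month0 + 1
    last_day = calendar.monthrange(year, month)[1]
    return date(year, month, min(start.day, last_day))


def latest_open_date(research_concluded: date, embargo_in_memorandum: bool) -> date:
    """Latest date by which the data should be openly accessible.

    Assumption: an embargo only applies if the data memorandum stipulates it.
    """
    if not embargo_in_memorandum:
        return research_concluded
    return add_months(research_concluded, MAX_EMBARGO_MONTHS)


if __name__ == "__main__":
    concluded = date(2012, 10, 26)
    print(latest_open_date(concluded, embargo_in_memorandum=True))
```

The policy itself prefers immediate access after deposit; the sketch only computes the outer limit.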
Data Management Plans
1. Preparation of the data collection project
2. Implementation of the data collection project
3. Archiving data and making data available
http://www.dans.knaw.nl/content/categorieen/diensten/data-management-plan
Data Management Plan in more detail
1. Preparation of the data collection project
1.1 General information about the data collection
1.2 Overview of previously collected data
1.3 Choice of software and hardware to be used
1.4 Determination of intellectual property and legal requirements
1.5 User information
1.6 Interoperability
2. Implementation of the data collection project
2.1 Data Management
2.2 Choice of metadata to be used
3. Archiving data and making data available
3.1 Depositing data in a data archive or repository
Report of Commission Schuyt
• Access to research data improves transparency
• Make integrity rules a vital element of scientific conduct
• Many differences between disciplines
• Small-scale research more risky than big science
• Peer pressure
• Data management in SEP
What is DANS?
• Institute of Dutch Academy and Research Funding Organisation (KNAW & NWO) since 2005
• First predecessor dates back to 1964 (Steinmetz Foundation), Historical Data Archive 1989
• Mission: promote and provide permanent access to digital research information (started with digital archives in the humanities and social sciences)
Our main activities and services
• Encourage researchers to self-archive and reuse data by means of our Electronic Archiving SYstem EASY
• Our largest digital collections are in archaeology, social sciences and history (moving into other domains)
• Provide access, through Narcis.nl, to thousands of scientific datasets, e-publications and other research information in the Netherlands
• Data projects in collaboration with research communities and partner organisations
• Advice, training and support (Data Seal of Approval, Persistent Identifier Infrastructure)
• R&D into archiving of and access to digital information
Why is digital preservation of data important?
• Checks on mistakes (and fraud!) by data creators and replication research
• Data re-use in comparative studies
Cultures of data sharing differ over disciplines, but also change over time
Six common objections to data sharing… and how to overcome them (1-4)
Adapted from Stephen H. Koslow (2000)
Six common objections to data sharing… and how to overcome them (5-6)
Adapted from Stephen H. Koslow (2000), ‘Should the neuroscience community make a paradigm shift to sharing primary data?’, Nature America Inc., 3:9 (September), p. 863-865.
Access control & licenses
• Open (after registration)
• Restricted (depositor is the access authority)
• Other (DANS as security backup)
Therefore:
- Data at DANS is not “up for grabs”!
- Access management is facilitated by archive system, easy and fast
- Embargo for limited time period is possible
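The access categories above can be illustrated with a small decision function. This is only a sketch under stated assumptions, not the way EASY is implemented: all names (`AccessLevel`, `Dataset`, `may_download`) are hypothetical, and it assumes that "Other" datasets are not served by the archive at all and that an embargo blocks every download until it expires.

```python
from dataclasses import dataclass, field
from datetime import date
from enum import Enum
from typing import Optional, Set


class AccessLevel(Enum):
    OPEN = "open"              # open after registration
    RESTRICTED = "restricted"  # depositor is the access authority
    OTHER = "other"            # archive only holds a security backup


@dataclass
class Dataset:
    level: AccessLevel
    embargo_until: Optional[date] = None
    # Users the depositor has granted access to (restricted datasets only).
    approved_users: Set[str] = field(default_factory=set)


def may_download(dataset: Dataset, user: Optional[str], today: date) -> bool:
    """Decide whether `user` (None = not registered) may download today."""
    if dataset.embargo_until is not None and today < dataset.embargo_until:
        return False  # still under embargo
    if dataset.level is AccessLevel.OPEN:
        return user is not None  # registration is required
    if dataset.level is AccessLevel.RESTRICTED:
        return user in dataset.approved_users  # depositor decides
    return False  # assumption: backup-only data is not served


if __name__ == "__main__":
    thesis_data = Dataset(AccessLevel.RESTRICTED, approved_users={"alice"})
    print(may_download(thesis_data, "alice", date(2012, 10, 26)))
```

Keeping the depositor's approvals on the dataset mirrors the point made on the slide: the depositor stays the access authority, while the archive system handles the bookkeeping.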
Self-archiving system free of charge for datasets < 1 GB
Community reviews of data sets
5 Criteria
16 Guidelines
The research data:
• can be found on the Internet
• are accessible (clear rights and licenses)
• are in a usable format
• are reliable
• can be referred to (persistent identifier)
Data Seal of Approval
www.datasealofapproval.org
Partnership with ISO and DIN standards of Trustworthy Archives
Partner in national and international infrastructures
[Diagram: infrastructures and projects in which DANS is a partner. Group labels: National infrastructure, Community projects NL, Community projects EU, ESFRI Roadmap, European, International, Certification. Names shown: CLARIAH, NeDiMAH, DASISH, APA, BiGGrid, EGI, e-Humanities, Health data, Europeana, RD-Alliance, OpenAire+, Geodata, CLARIN, CESSDA, ESS, ARIADNE, EHRI, LifeWatch, Univ. Libraries/3TU DC, NCDD, SURF, CLIO-INFRA, EUDAT, E-IRG, DCCD, DARIAH, DwB, CBS]
Challenge for the next five years
Why connect data to publications?
1. Data as background/additional information
2. Publication as background to data
3. Different views on the data
4. Check author’s assertions
5. User can find all information in one place
NARCIS.nl: Access to Research Information, e-Publications, Data Sets and more
Doctoral Theses (Dissertations)
Archaeological excavations
Publications by Tilburg University researchers
Enhanced Scientific Communication by Aggregated Publication Environments (ESCAPE)
Gallows in Late Medieval Frisia
Research Data
Researchers
Report
Organizations involved: Funder and research institute
Topics linking to related information
Aggregation: the enhanced publication
Research Data
Publication
Researchers
Enhanced publication
Research organization
Funder
Related subjects
Persistent Identifier
Links directly to data in DANS archive
All data types: other examples with video, audio, still images…
Project
PhD Thesis University
Persons involved: PhD candidate & Professors
Project information
Digital Author Identifier
Researcher information
Researcher network
Motto
Open if possible, restricted if necessary!
Thank you for your attention
and visit us at:
www.dans.knaw.nl
www.narcis.nl