Research data management: from policy to practice with DMP Online
Martin Donnelly and Sarah Jones
Future Perfect 2012: Digital Preservation by Design
Te Papa Tongarewa, Wellington, New Zealand
26 – 27 March 2012
Research data management: from policy to practice with DMP Online
Martin Donnelly, Digital Curation Centre, University of Edinburgh
Sarah Jones, Digital Curation Centre, University of Glasgow
Running order (c. 25 mins)
1. Introduction to the DCC & research data management
2. Data-related policies in the UK
3. The DCC & data management planning
4. DMP Online v3.0
5. Connections and collaborations
6. Putting it into practice (UMF work and other things)
7. Summary / conclusion
Sarah
Martin
1. The Digital Curation Centre
- Founded in 2004
- Three partners: Edinburgh, Glasgow and Bath
- Primary funder is JISC
Helping to build capacity, capability and skills in data management and curation across the UK’s higher education research community
- DCC Phase 3 Business Plan
What does the DCC do?
• Develop tools – CARDIO, DAF, DRAMBORA, DMP Online
• Offer guidance – helpdesk, briefing papers, how-to guides
• Run training & events – DC101, roadshow, RDMF, IDCC
• Support the JISC – esp. the Managing Research Data programmes
What is Research Data Management?
“the active management and appraisal of data over the lifecycle of scholarly and scientific interest”
Data management is part of good research practice
Manage
Share
How does RDM affect preservation?
The costs of ingest – receiving data, preparing it for long-term storage, and incorporating it into the digital archive – receives the largest allocation of resources.
- Keeping Research Data Safe 2
2. Data-related policies in the UK
http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies
RCUK Common Principles
• Publicly funded research data are a public good, produced in the public interest, which should be made openly available with as few restrictions as possible in a timely and responsible manner that does not harm intellectual property.
• Institutional and project specific data management policies and plans should be in accordance with relevant standards and community best practice. Data with acknowledged long-term value should be preserved and remain accessible and usable for future research.
• To enable research data to be discoverable and effectively re-used by others, sufficient metadata should be recorded and made openly available ....
7 principles agreed by all the UK research councils in May 2011
http://www.rcuk.ac.uk/research/Pages/DataPolicy.aspx
UK research funder expectations
• timely release of data
  – once patents are filed or on (acceptance for) publication
• open data sharing
  – minimal or no restrictions
  – deposit in data centres, structured databases, data enclave
• preservation of data
  – most funders expect 5-10+ years
• submission of data management and sharing plans…
3. The DCC and DMP
Links to all DMP resources via http://www.dcc.ac.uk/resources/data-management-plans
We’ve responded to requirements by offering support
Analysed requirements
Developed a Checklist
Provided tools & guidance
What is a DMP?
UK research funders typically ask for:
• A short statement/plan submitted in grant applications
• An outline of what you will create/collect, methods, standards, data management and long-term plans
• How and why – justify your decisions and any limits
Common DMP questions
• What data will be created (format, types) and how?
• How will the data be documented and described?
• How will you manage ethics and Intellectual Property?
• What are the plans for data sharing and access?
• What is the strategy for long-term preservation?
DCC Checklist Coverage
§1: Introduction and Context
§2: Data Types, Formats, Standards and Capture Methods
§3: Ethics and Intellectual Property
§4: Access, Data Sharing and Re-use
§5: Short-Term Storage and Data Management
§6: Deposit and Long-Term Preservation
§7: Resourcing
§8: Adherence and Review
§9: Agreement/Ratification by Stakeholders
§10: Annexes
Checklist for a Data Management Plan v3.0 (Donnelly and Jones, March 2011)
http://www.dcc.ac.uk/resources/data-management-plans
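As a rough illustration of how the checklist's section headings could drive a completeness check, here is a minimal Python sketch. The section names come from the checklist coverage list on this slide. The `missing_sections` helper, the dict-based plan format and the sample answers are assumptions made up for the example, not part of the Checklist or of DMP Online.

```python
# Section headings taken from the DCC Checklist coverage list.
CHECKLIST_SECTIONS = [
    "Introduction and Context",
    "Data Types, Formats, Standards and Capture Methods",
    "Ethics and Intellectual Property",
    "Access, Data Sharing and Re-use",
    "Short-Term Storage and Data Management",
    "Deposit and Long-Term Preservation",
    "Resourcing",
    "Adherence and Review",
    "Agreement/Ratification by Stakeholders",
    "Annexes",
]


def missing_sections(plan):
    """Return the checklist sections that are absent or blank in `plan`.

    `plan` is a hypothetical dict mapping section heading -> free-text answer.
    """
    return [s for s in CHECKLIST_SECTIONS if not plan.get(s, "").strip()]


# Invented draft answers, purely for illustration.
draft = {
    "Introduction and Context": "Two-year interview study.",
    "Ethics and Intellectual Property": "   ",  # whitespace only counts as blank
    "Resourcing": "Part-time data manager.",
}
gaps = missing_sections(draft)
print(len(gaps), "sections still to complete")
```

Keeping the headings in a single ordered list means the same list can be reused for display, validation and export.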
DMP-related resources
– “Dealing with Data” (Lyon, 2008)
– Analysis of Funder Policies (Jones, 2009)
– Checklist for a Data Management Plan (Donnelly and Jones, 2009)
– “How to Develop a Data Management and Sharing Plan” (Jones, 2011) Edinburgh: Digital Curation Centre
– “Data Management Plans and Planning” (Donnelly, 2012) in Pryor (ed.) Managing Research Data, London: Facet
Links to all DCC resources via http://www.dcc.ac.uk/resources/data-management-plans
Key things to remember
All research projects are different
The DMP will depend upon the nature of the research AND the context (funder, domain, institution(s) etc)
DMPs are useful communication tools
Not a UK phenomenon
Read about the international policy and DMP landscape in:
“Data Management Plans and Planning” (Donnelly, 2012) in Pryor (ed.) Managing Research Data, London: Facet
“Research data policies: principles, requirements and trends” (Jones, 2012) in Pryor (ed.) Managing Research Data, London: Facet
4. DMP Online v3.0 (www.dcc.ac.uk/dmponline)
What does DMP Online do?
A web-based tool that enables users to...
i. Create, store and update multiple versions of Data Management Plans across the research lifecycle
ii. Meet a variety of specific data-related requirements (from funders, institutions, publishers, etc.)
iii. Get tailored guidance on best practice and helpful contacts, at the point of need
iv. Customise, export and share DMPs in a variety of formats in order to facilitate communications within and beyond research projects
* N.B. The templates have varying degrees of endorsement from funders, stakeholder communities, etc. More on this shortly…
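To make point (i) concrete, the sketch below shows one way a plan could keep multiple versions across the research lifecycle. It is only an illustration in Python: the class names, fields and stage labels are hypothetical and do not describe how DMP Online (a Ruby on Rails application) stores its data.

```python
from dataclasses import dataclass, field
from datetime import date


@dataclass
class PlanVersion:
    stage: str       # hypothetical label, e.g. "pre-project"
    saved_on: date
    answers: dict    # question -> answer text


@dataclass
class DataManagementPlan:
    title: str
    versions: list = field(default_factory=list)

    def save(self, stage, saved_on, answers):
        """Append a new version; earlier versions are never overwritten."""
        # Copy the answers so later edits do not alter stored versions.
        self.versions.append(PlanVersion(stage, saved_on, dict(answers)))
        return len(self.versions)  # 1-based version number

    def latest(self):
        """Return the most recent version, or None if nothing is saved."""
        return self.versions[-1] if self.versions else None


plan = DataManagementPlan("Example project")
answers = {"sharing": "to be decided"}
v1 = plan.save("pre-project", date(2012, 3, 1), answers)
answers["sharing"] = "deposit in a data centre"
v2 = plan.save("during project", date(2012, 9, 1), answers)
```

Appending rather than overwriting keeps the earlier plan available, so the version submitted with a grant application can still be compared with later revisions.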
Technologies involved (v3.0)
– Ruby on Rails (v3.1.3)
– JavaScript (jQuery v1.7.1)
– MySQL database (v5+)
– Hosting: University of Edinburgh Information Services Virtual Hosting (13 managed servers across 2 sites)
– Authentication: registered users with passwords encrypted in DB (we are also testing Shibboleth for integration with UK Access Management Federation for Education and Research)
– Various export formats (DOCX, PDF, XML, CSV, etc.)
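The export formats listed above include XML and CSV. As a hedged sketch of what serialising one plan into both might look like, the Python below uses only the standard library. The function names, the XML element names (`dmp`, `item`, `question`, `answer`) and the sample answers are invented for illustration and are not DMP Online's actual export schema.

```python
import csv
import io
import xml.etree.ElementTree as ET


def to_csv(answers):
    """Serialise question/answer pairs as CSV text with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["question", "answer"])
    for question, answer in answers.items():
        writer.writerow([question, answer])
    return buf.getvalue()


def to_xml(title, answers):
    """Serialise the same pairs as a small XML document (invented element names)."""
    root = ET.Element("dmp", attrib={"title": title})
    for question, answer in answers.items():
        item = ET.SubElement(root, "item")
        ET.SubElement(item, "question").text = question
        ET.SubElement(item, "answer").text = answer
    return ET.tostring(root, encoding="unicode")


# Invented sample answers; the questions are from the "Common DMP questions" slide.
answers = {
    "How will the data be documented and described?": "README files, codebook",
    "What is the strategy for long-term preservation?": "Deposit in a data centre",
}
csv_text = to_csv(answers)
xml_text = to_xml("Example plan", answers)
```

Both functions read from the same dictionary, so adding another format would mean adding one more function rather than changing how plans are stored.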
DMP Online v3.0: Spring 2012
- Improved user interface, inc. customisable institutional versions
- New features
  - Overlaying multiple templates for ‘hybrid’ DMPs
  - Template phases (e.g. pre- / during / post-project)
  - Granular read / write / share permissions
  - API for systems interoperability (e.g. this project)
  - Shibboleth authentication
  - Multilingual support / boilerplate text
- Endorsement from funders
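One of the new features listed is overlaying multiple templates to produce a ‘hybrid’ DMP. The slides do not say how this is implemented, so the Python sketch below is only one plausible reading: merge the question lists of several templates, keep each question once, and record which templates asked for it. The `overlay` function, the template structure and the third sample question are assumptions made for this example.

```python
def overlay(*templates):
    """Merge templates into one ordered set of questions without duplicates.

    Each template is a hypothetical dict: {"name": str, "questions": [str, ...]}.
    The result maps each question to the names of the templates that include it.
    """
    merged = {}  # dicts preserve insertion order, so first-seen order is kept
    for template in templates:
        for question in template["questions"]:
            merged.setdefault(question, []).append(template["name"])
    return merged


funder = {
    "name": "Funder",
    "questions": [
        "What data will be created (format, types) and how?",
        "What are the plans for data sharing and access?",
    ],
}
institution = {
    "name": "Institution",
    "questions": [
        "What are the plans for data sharing and access?",
        "Where will the data be stored during the project?",  # invented question
    ],
}
hybrid = overlay(funder, institution)
for question, sources in hybrid.items():
    print(question, "<-", ", ".join(sources))
```

Recording the source templates alongside each question lets a researcher answer a shared question once while still seeing which stakeholder requires it.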
Collaborations
- Generic data management guidance ( in conjunction with )
- Funder-specific guidance developed in collaboration with the funders themselves
- Institution-specific guidance developed with key institutional contacts
- Discipline-specific guidance developed and deployed with JISC MRD projects (e.g. DMT Psych at York)
- Joint training programmes organised and delivered by DCC and UKDA
- Provided advice to US consortium
Templates: Stakeholder Liaison (i)
RCUK funders | Status
Arts and Humanities Research Council (AHRC) | Discussions beginning
Biotechnology and Biological Sciences Research Council (BBSRC) | Discussions ongoing
Engineering and Physical Sciences Research Council (EPSRC) | No explicit data management plan requirements: DCC referenced in roadmap requirements
Economic and Social Research Council (ESRC) | Template and guidance developed in collaboration with ESRC and ESDS. Funder’s online guidance points applicants towards tool.
Medical Research Council (MRC) | Template in preparation through collaboration with funder
Natural Environment Research Council (NERC) | Discussions ongoing
Science and Technology Facilities Council (STFC) | DCC resources referenced in data requirements
Other funders | Status
The Wellcome Trust | Template and guidance endorsed by funder
National Science Foundation (US) | Template developed by Sherry Lake, University of Virginia
Templates: Stakeholder Liaison (ii)
Disciplinary templates | Status
History | Developed in conjunction with University of Hull and University of Hertfordshire
Psychology | Developed by DMT Psych project, led by University of York
Mechanical Engineering | Developed as part of REDm-MED project, led by University of Bath
Health sciences | Developed by DATUM for Health project, led by University of Northumbria
Spatial information (INSPIRE) | Developed in conjunction with EDINA (UK national data centre) and trialled with Freshwater Biological Association
Institutional templates | Status
University of Northampton | Developed in collaboration with Information Services department
More institutional and subject-based templates are being developed through the JISC RDM projects and UMF institutional engagements…
Institutional Engagements: Putting it into practice
- Working with eighteen institutions over approximately 18 months to improve data management capabilities
- A broad variety of institutional types and sizes, from research intensive ancient universities, to new universities and small specialist institutions (e.g. art colleges)
- Institutions select from a ‘menu’ of tools and services, e.g. (next slide)
The Menu
Components of a Data Management Strategy (Research and Admin)
- Policy
- Planning
- Advocacy
- Tools
- Training
DCC Tools
- Data Asset Framework (DAF)
- DMP Online
- CARDIO
- DRAMBORA
DCC Services
- Policy development
- Strategy development
- Training
- Workflow assessment
- Costing
- Institutional data catalogues (discovery)
Workflow connections
DMP Online can also be used in conjunction with other tools that support the data management/curation lifecycle, e.g.…
- DAF (Data Asset Framework)
- DRAMBORA (Digital Repository Audit Method Based On Risk Assessment)
- CARDIO (Collaborative Assessment of Research Data Infrastructure and Objectives)
Also non-DCC tools:
- LIFE
- Planets tools
- and more
How to connect: six export formats
For machine readership…
- Facilitates quick public sharing
- Compatible with API for linking with other systems
- Minimal formatting
For human readership…
- Pleasant formatting
- Editable. Can be used in conjunction with (e.g. MS Sharepoint)
- Removes all formatting
External connections
Systems
– CRIS / admin systems
– RCUK Je-S system
– Institutional Repositories
– DDI repository
– DMP Tool (US)
– Other instances of DMP Online via federated model (? - TBC)
Standards / protocols
– CERIF*
– SWORD2
– DDI*
– RDF (? - TBC)
* via RESTful API
Researcher(s)
Research Support Office
Computing Support
Faculty Ethics Committee
Etc...
DATA MANAGEMENT PLAN
UNRULY DATA
Data Library / Repository / Archive
To sum...
All of our DMP-related resources available online via:
www.dcc.ac.uk/dmponline/
Thank you
Image credits:
Slide 1 - http://upload.wikimedia.org/wikipedia/commons/8/88/LernaeanHydraRephael.jpg
Slide 5 - http://www.dcc.ac.uk/resources/curation-lifecycle-model
Slide 6 (The Scream) - http://www.flickr.com/photos/terryfreedman/6548040049
Slide 6 (OAIS) - http://public.ccsds.org/publications/archive/650x0b1.pdf
Slide 29 - http://en.wikipedia.org/wiki/File:Hercules_slaying_the_Hydra.jpg
Slide 30 - http://www.treehugger.com/picture-is-worth-sum-car-parts.jpg
This work is licensed under the Creative Commons Attribution 2.5 UK: Scotland License.
Martin Donnelly, Digital Curation Centre, University of Edinburgh
Twitter: @mkdDCC
Sarah Jones, Digital Curation Centre, University of Glasgow
Twitter: @sjDCC
Check out DCC at: www.dcc.ac.uk or follow us on twitter @digitalcuration and #ukdcc