Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations


Page 1: Edinburgh DataShare –  A DSpace Data Repository: Achievements and Aspirations

Edinburgh DataShare – A DSpace Data Repository: Achievements and Aspirations

Stuart Macdonald

EDINA National Data Centre & Edinburgh University Data Library

Fedora-UK&I&EU Meeting, Oxford, 8 December

Flickr CC Image – http://www.flickr.com/photos/laszlo-photo/1899390628/

Page 2

Flickr CC Image - http://www.flickr.com/photos/artimagesmarkcummins/300173269/

• EDINA and the Data Library (EDL) together form a division within Information Services at the University of Edinburgh.

• EDINA is a JISC-funded National Data Centre providing national online resources for education and research.

• The Data Library service (established in 1983) assists Edinburgh University users in the discovery, access, use and management of research data assets.

• The Data Library builds relationships with researchers through postgraduate teaching activities, IS Skills workshops, Research Data Management training and traditional reference interviews.

• Edinburgh DataShare is a digital repository of multi-disciplinary research datasets produced at the University of Edinburgh, hosted by the Data Library.

Page 3

Flickr CC Image - http://www.flickr.com/photos/ronin691/2285257955/

DISC-UK DataShare Project

DISC-UK DataShare Project – funded by JISC (March 2007 – March 2009) – a collaborative project which investigated the legal, cultural and technical issues surrounding research data sharing within the UK tertiary education community

• Explore new pathways to assist academics wishing to share their data over the Internet via Institutional Repositories (IRs)

• Green, A., Macdonald, S. and Rice, R. (2009). Policy-Making for Research Data in Repositories: A Guide.

• Edinburgh DataShare digital repository – post project output – embedded into University Information Services policy

Page 4

Edinburgh Repository Landscape

• ShareGeo: DSpace, Restricted

• EDINA repository projects

• Edinburgh DataShare: DSpace, Open Access

• Publications Repository: DSpace, closed

• Open Access Publications Policy: passed by Senate, February 2009

• Informatics Publications Repository (incl. Artificial Intelligence Applications Institute, Institute for Adaptive and Neural Computation, Centre for Speech Technology Research, Institute for Computer Systems Architecture): Open Access

• Research Publication Service

• Cloud: online remote-storage services, e.g. Dropbox, Mozy, Humyo

• Remote storage, e.g. Andrew File System

• Restricted

• ECDF SAN (Edinburgh Compute and Data Facility)

• Parallel processing; Infrastructure SAN

• Grid, iRODS etc.

• Hosted DSpace Repositories: St Andrews, (Heriot Watt / UHI / Aberdeen?)

• ‘Named entity’ recognition

• JORUM: DSpace, Open Access

• Edinburgh Research Archive (ERA): DSpace, Open Access

• Launch Jan. 2010

• Repository Junction

Page 5

Edinburgh DataShare – technical development

• DSpace v.1.5.1 with new theme aligned with University corporate style

• Embargo option - coded to restrict full data download with open metadata until specified date

• Open Data Commons License option (PDDL)

• Dynamically queries Geonames, a community-generated spatial database, to ensure consistency in metadata entry for the Spatial Coverage field

• Extension to DSpace to record bitstream downloads in usage statistics

• Implementation of JACS (Joint Academic Coding System) for assigning subject keywords to content

• Download All option (zip file of all item components)

• Citation field automatically generated based on specified metadata values

• Dublin Core-based metadata schema for datasets
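Two of the features listed above, the embargo option and the automatically generated citation field, can be illustrated with a minimal Python sketch. This is not DataShare's DSpace code: the `DatasetRecord` class, its field names, the citation layout and the sample values are all assumptions made for illustration, as is the choice to release files on the embargo date itself.

```python
from dataclasses import dataclass
from datetime import date
from typing import List, Optional


@dataclass
class DatasetRecord:
    """Hypothetical dataset record; field names are illustrative, not DSpace's schema."""
    creators: List[str]
    title: str
    publisher: str
    issued_year: int
    identifier: str
    embargo_until: Optional[date] = None  # None means no embargo


def can_download(record: DatasetRecord, today: date) -> bool:
    """Metadata stays open; files are released on the embargo date (assumed rule)."""
    return record.embargo_until is None or today >= record.embargo_until


def build_citation(record: DatasetRecord) -> str:
    """Assemble a citation string from the record's metadata values."""
    authors = "; ".join(record.creators)
    return (f"{authors} ({record.issued_year}). {record.title} [dataset]. "
            f"{record.publisher}. {record.identifier}")


# Invented sample record, for illustration only.
record = DatasetRecord(
    creators=["Smith, J.", "Jones, A."],
    title="Example Survey Data",
    publisher="University of Edinburgh",
    issued_year=2009,
    identifier="http://example.org/handle/0000/1",
    embargo_until=date(2010, 1, 1),
)
print(build_citation(record))
print(can_download(record, date(2009, 12, 8)))
```

Deriving the citation from the stored metadata, rather than asking depositors to type it, keeps the citation consistent with the record if a field is later corrected.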

Flickr CC Image – http://www.flickr.com/photos/59414209@N00/3367225630/
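The Geonames lookup mentioned on this slide might look roughly like the following Python sketch. The slides do not say how DataShare calls Geonames, so this is an assumption: it uses the public Geonames `searchJSON` web service, a placeholder `username`, and a trimmed canned response so that it runs offline. The function names are hypothetical.

```python
import json
from urllib.parse import urlencode

GEONAMES_SEARCH = "http://api.geonames.org/searchJSON"


def build_search_url(place: str, username: str, max_rows: int = 5) -> str:
    """Build a Geonames search URL for a free-text place name."""
    params = {"q": place, "maxRows": max_rows, "username": username}
    return f"{GEONAMES_SEARCH}?{urlencode(params)}"


def extract_place_names(response_text: str) -> list:
    """Turn a searchJSON response into 'Name, Country' labels for a pick-list."""
    data = json.loads(response_text)
    return [f"{g['name']}, {g['countryName']}" for g in data.get("geonames", [])]


# Canned response, reduced to the two fields used here (illustrative only).
sample = '{"geonames": [{"name": "Edinburgh", "countryName": "United Kingdom"}]}'
print(build_search_url("Edinburgh", "demo"))
print(extract_place_names(sample))
```

Offering depositors a pick-list of authoritative place names, instead of a free-text box, is what keeps the Spatial Coverage values consistent across records.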

Page 6

Flickr CC Image - http://www.flickr.com/photos/silvertje/3512611046/

Potential Next Steps:

• Semantification of content – Open Linked Data & MIT’s SIMILE project (which aims to leverage and extend DSpace by supporting semantic web techniques)

• Visualisation tools – APIs that can be utilised within the repository environment (Timeplot, Timeline), unlike existing open utilities

• Spatial analysis using open geo-browsers and mapping utilities

• Annotation, tagging, data citation

• Implementation of Deposit tool(s) – SWORD

• Streaming & viewing heterogeneous content
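As a rough illustration of the SWORD deposit step listed above, the Python sketch below builds (but does not send) an HTTP request in the style of SWORD 1.3, where a zipped package is POSTed to a collection's deposit URL. The collection URL and the `build_sword_deposit` helper are hypothetical, and the packaging identifier is an assumption that would need checking against the target repository's service document.

```python
import urllib.request


def build_sword_deposit(collection_url: str, package: bytes,
                        filename: str) -> urllib.request.Request:
    """Prepare a SWORD-style deposit request; nothing is sent over the network."""
    headers = {
        "Content-Type": "application/zip",
        "Content-Disposition": f"filename={filename}",
        # Assumed packaging format; confirm against the repository's service document.
        "X-Packaging": "http://purl.org/net/sword-types/METSDSpaceSIP",
        # Ask the server for a dry run rather than a real deposit.
        "X-No-Op": "true",
    }
    return urllib.request.Request(collection_url, data=package,
                                  headers=headers, method="POST")


# Hypothetical deposit URL and placeholder package bytes.
req = build_sword_deposit(
    "http://repository.example.org/sword/deposit/123456789/1",
    b"placeholder zip bytes",
    "dataset.zip",
)
print(req.get_method(), req.full_url)
```

A real client would add authentication and pass the request to `urllib.request.urlopen`, then read the Atom entry returned by the server.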

Page 7

Research Data Management Training

Developed on the back of the Data Audit Framework led by HATII (Edinburgh being one of the pilot implementation projects): developing online tools and methodologies that enable information specialists to engage and build relationships with researchers in order to ascertain the data holdings of a research school, department or group

Developing online and face-to-face modules in conjunction with the Postgraduate Transferable Skills Unit - http://www.transkills.ed.ac.uk/ and PG Essentials - http://www.transkills.ed.ac.uk/pgessentials.htm

Research data management guidance & Data sharing and preservation – http://www.ed.ac.uk/is/data-management

Flickr CC Image – http://www.flickr.com/photos/sgrantarch/3563676104/

Engaging researchers through RDM exercises, teaching, reference interviews and funded projects is crucial to the bottom-up development of the tools, services and infrastructures meant to serve them, which ultimately improve both the efficiency and the effectiveness of their research.

Page 8

Image courtesy of the periodic table printmaking project – http://azuregrackle.com/periodictable/table/58.html

Engaging with the Research Community (1)

Human Geography (Professor Jane Jacobs)

High Rise project - an interdisciplinary research programme involving architects and geographers:

“It investigates two cases that encapsulate the varied fortunes of the highrise experience: the UK, where the form is routinely condemned, even demolished; and Singapore, where it is embraced enthusiastically and continues to be built at greater heights and densities” - http://www.ace.ed.ac.uk/highrise/

• Content of the High Rise Digital Archive includes images, sound recordings, video, transcripts, architectural drawings

• Customise DataShare to allow streaming or embedded player functionality, ingest heterogeneous content, employ multi-media metadata standards

• Develop customised learning and teaching materials for deposit in JORUM Open

Page 9

Engaging with the Research Community (2)

Centre for Earth System Dynamics (Dr Mike Mineter, Dr Magnus Hagdorn)

The aim of the CESD is to develop climate models across SAGES and other multi-disciplinary and international partners to quantify and predict climate and environmental change

• Use DataShare to provide federated access via Shibboleth for international partners to ‘working’ datasets (e.g. from the Arctic Biosphere Atmosphere Coupling at Multiple Scales (ABACUS) programme)

• Store data via DataShare but also link to large datasets (c. 1 TB) held on remote storage (Andrew File System, ECDF SAN)

• Content for ingestion includes large climate models/simulations, fieldwork and experimental output in proprietary formats

• Employ discipline-specific metadata standards to describe content

Page 10

Creative Commons images from Flickr (unless otherwise stated)

Flickr CC Image – http://www.flickr.com/photos/hippie/2556161507/

END

Thank You