increasing research impact: the national data registry - alex ball - jisc digital festival 2014

17
because good research needs good data Increasing research impact The national data registry Alex Ball DCC/UKOLN Informatics, University of Bath 11 March 2014 Except where otherwise stated, this work is licensed under the Creative Commons Attribution 4.0 International licence: http://creativecommons.org/licenses/by/4.0/ Supported by Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Upload: jisc

Post on 26-Jan-2015

109 views

Category:

Education


0 download

DESCRIPTION

Evidence shows that all forms of research output have a role in increasing the impact and value of research. Data is particularly valuable, which is why research funders are placing so much emphasis on its retention, management and discoverability. However, few universities have data collections large enough to make their data globally visible, and few have the resources to connect data held locally with data in international data centres. Jisc’s data registry service plans to cost-effectively solve this problem for universities, whilst also providing feedback for them and their researchers on how to increase the impact of their research data. This session will explain the goals and approach of the pilot, relate it to lessons from other countries and in government open data, and explain how Jisc and the community can work together to drive future developments in data discovery.

TRANSCRIPT

Page 1: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

because good research needs good data

Increasing research impactThe national data registry

Alex Ball

DCC/UKOLN Informatics, University of Bath

11 March 2014

Except where otherwise stated, this work is licensed underthe Creative Commons Attribution 4.0 International licence:http://creativecommons.org/licenses/by/4.0/

Supported by

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 2: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

UK Research Data (Metadata) Registry Pilot Project

Project TeamÉ Kevin Ashley, DCC (Edinburgh)É Alex Ball, DCC (Bath)É Patrick McCann, DCC (Glasgow)É Laura Molloy, DCC (Glasgow)É Veerle Van den Eynden, UKDA

Funded by Jisc

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 3: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Outline

Motivation

Project overview

Architecture

Collaborators

Metadata

Evaluation

Future

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 4: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

UK data landscape

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 5: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Motivation

É Not just specialist data centres any more. . .É Institutional data repositoriesÉ Generalist repositories related to journals

É Interdisciplinary and multidisciplinary research requires datadrawn from diverse sources.

É Data as a first class research outputÉ Funder impactÉ Research Excellence Framework

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 6: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Brainstorming the national data registry

Researchdata registry

Gateway toResearch

Equipment.data

DataCitationIndex

DMPs

Metadatascheme

Interop-erability

Useful fordiscovery

Harvestfrom. . .

Institutionaldata

repositories

CRISes

Datacentres

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 7: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Research Data Australia About Collections Parties Activities Services Themes

What’s in Research Data Australia

Collections (92633)Research datasets or collections of research

materials.

Parties (25467)Researchers or research organisations that create

or maintain research datasets or collections.

Activities (40674)Projects or programs that create research

datasets or collections.

Services (184)Services that support the creation or use of

research datasets or collections.

Spotlight on research data

N.C.W. Beadle Herbarium

The N.C.W. Beadle Herbarium (NE) at University of

New England contains around 90,000 pressed,

dried, incorporated and databased plant specimens.

The collection includes more than 150 TYPE

specimens that anchor scientific names as cited in

the original publication of those names. This rich

resource contains many collections that are of great

interest to local and international researchers. The

specimen sheet collection of the N.C.W. Beadle

Herbarium is databased and available to registered users for online data entry

and data query.

Explore the N.C.W. Beadle Herbarium Collection through Research Data

Australia >>>

Browse by Subject Area Browse by Map Coverage

Advanced Search

Page 8: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Research Data UK?

Attractions of the Research Data Australia software:

É Familiar to project teamÉ Proven technologyÉ Plays nicely with search enginesÉ Displays sample citations and access/rights information up front

Challenges of using the software in the UK:

É Not used before outside AustraliaÉ Uses uncommon metadata standard (RIF-CS) internallyÉ Original implementation only harvests in RIF-CSÉ No UK data centre can output RIF-CS metadata

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 9: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Project overview

1. Implement a working instance of the ANDS software.

2. Assemble a group of contributors and establish how theirmetadata will be harvested.

3. Write crosswalks for transforming contributed metadata intoRIF-CS.

4. Harvest metadata from contributors.5. Reports on

É using the Research Data Australia software;É how harvesting from data centres went;É how harvesting from university repositories went;É the value of continuing to develop the registry.

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 10: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Architecture

CentOS LinuxMS Azure

Access management

Front end

Metadata registry

OAI-PMH harvester

Indexer (Apache Solr)

CMS editor

ID manager

UKRDR

Collectionswithout

OAI-PMHsupport

HTTP

Collectionswith

OAI-PMHsupport

OAI-PMH

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 11: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Collaborators

Data centres:

É UK Data ArchiveÉ NERC Data Catalogue

ServiceÉ BADCÉ BODCÉ EIDCÉ NEODCÉ NGDCÉ PDCÉ UKSSDCÉ ADS

Universities:

É EdinburghÉ GlasgowÉ HullÉ LincolnÉ LeedsÉ OxfordÉ Oxford BrookesÉ St AndrewsÉ Southampton

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 12: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Metadata crosswalks

DDI Codebook 2.5É UK Data Archive

DataCite 3É Edinburgh (TBC)É Oxford (TBC)É Hull (TBC)

OAI-PMH Dublin CoreÉ Oxford Brookes (TBC)

UK Gemini 2.2É NERC Data Catalogue

Service

EPrints 3É GlasgowÉ LeedsÉ Lincoln (TBC)É Southampton

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 13: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

DDI Codebook 2.5 CrosswalkMapping from DDI to —IF-CS

The following table provides a mapping to populate a RIF-CS Collection record from aUKDA DDI record. The value of the UKDA ID is recorded in the DDI record at codeBook >std⁴Dscr > citation > titlStmt > IDNo[AGENCY=UKDA].

RIF-CS . element Source using DDI . recordcollection[dateAccessioned] codeBook > std⁴Dscr > citation > distStmt >

depDate[date]

identifier[t⁴pe=doi] codeBook > std⁴Dscr > citation > titlStmt >IDNo[agenc⁴=datacite]

identifier[t⁴pe=local] codeBook > std⁴Dscr > citation > titlStmt > IDNo[agenc⁴] content

name[t⁴pe=primar⁴] > namePart codeBook > std⁴Dscr > citation > titlStmt > titl

name[t⁴pe=alternative] > namePart codeBook > std⁴Dscr > citation > titlStmt > altTitl

dates[t⁴pe=dc.available, dc.issued]> date[t⁴pe=dateFrom]

codeBook > std⁴Dscr > citation > distStmt > distDate

dates[t⁴pe=dc.dateSubmitted] >date[t⁴pe=dateFrom]

codeBook > std⁴Dscr > citation > distStmt >depDate[date]

location > address >electronic[t⁴pe=url] > value

codeBook > std⁴Dscr > citation > holdings[U—I]

subject[t⁴pe=hasset] codeBook > std⁴Dscr > std⁴Info > subject >ke⁴²ord[vocab=S]

subject[termIdentifier] codeBook > std⁴Dscr > std⁴Info > subject >ke⁴²ord[vocab=S vocabU—I]

subject[t⁴pe=ukdasc] codeBook > std⁴Dscr > std⁴Info > subject > topClas

description[t⁴pe=full] codeBook > std⁴Dscr > std⁴Info > abstract

coverage > temporal >date[t⁴pe=dateFrom]

codeBook > std⁴Dscr > std⁴Info > sumDscr >collDate[event=start, single date],timePrd[event=start, single date]

coverage > temporal >date[t⁴pe=dateTo]

codeBook > std⁴Dscr > std⁴Info > sumDscr >collDate[event=end date], timePrd[event=end date]

coverage > spatial[t⁴pe=te³t] codeBook > std⁴Dscr > std⁴Info > sumDscr >geogCover, geogUnit, nation codeBook > std⁴Dscr >std⁴Info > subject > ke⁴²ord[vocab=G]

relatedInfo[t⁴pe=metadata] >identifier[t⁴pe=uri]

‘http://esds.ac.uk/DDI /’ + UKDA ID + ‘.xml’

06/03/2014 Ddi2p5ToRifcs.php 1

/home/ab318/Data/git/ANDS-Registry-Core/applications/registry/core/crosswalks/Ddi2p5ToRifcs.php

<?phpclass Ddi2p5ToRifcs extends Crosswalk {private $oaipmh = null;private $rifcs = null;private $ddiProviders = array("http://oai.ukdataservice.ac.uk/oai/provider" => "UK Data Archive",

);function __construct(){require_once(REGISTRY_APP_PATH . "core/crosswalks/_crosswalk_helper.php");$this->rifcs = simplexml_load_string(CrosswalkHelper::RIFCS_WRAPPER);

}public function identify(){return "DDI v2.5 to RIF-CS (Experimental)";

}public function metadataFormat(){return "ddi_2.5";

}public function payloadToRIFCS($payload){$this->load_payload($payload);foreach ($this->oaipmh->ListRecords->children() as $record){if ($record->getName() != "record") {continue;

}$reg_obj = $this->rifcs->addChild("registryObject");if (array_key_exists((string) $this->oaipmh->request, $this->ddiProviders)) {$reg_obj->addAttribute("group", $this->ddiProviders[(string) $this->oaipmh->request]);

}$key = $reg_obj->addChild("key", $record->header->identifier);$originatingSource = $reg_obj->addChild("originatingSource", $this->oaipmh->request);$coll = $reg_obj->addChild("collection");$coll->addAttribute("type", "dataset");$coll->addAttribute("dateModified", date(DATE_W3C));$citation = $coll->addChild("citationInfo");$citation_metadata = $citation->addChild("citationMetadata");$coverage = $coll->addChild("coverage");$rights = $coll->addChild("rights");foreach ($record->metadata->codeBook->stdyDscr->children() as $node){foreach ($node->children() as $subnode) {$func = "process_".$subnode->getName();if (is_callable(array($this, $func))){call_user_func(array($this, $func),$subnode,array("registry_object" => $reg_obj,"key" => $key,"collection" => $coll,"citation_metadata" => $citation_metadata,"coverage" => $coverage,"rights" => $rights

));

}}

}}return $this->rifcs->asXML();

}public function validate($payload){$this->load_payload($payload);if (!$this->oaipmh){return false;

}if ($this->oaipmh->getName() != "OAI-PMH") {return false;

}if (empty($this->oaipmh->request)) {return false;

}if (empty($this->oaipmh->ListRecords)) {return false;

}foreach($this->oaipmh->ListRecords as $record) {if ($record->getName() == "record") {

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 14: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

From UKDA to UKRDR

Documentation Related Studies Download/Order Get full DDI XML

Catalogue

UK Data Service data catalogue record for:

Attitudes of Students at the London School of Economics, February 1980

TITLE DETAILS

SN: 1354

Title: Attitudes of Students at the London School of Economics, February 1980

Persistent identifier: 10.5255/UKDA-SN-1354-1

Series: Attitudes of Students at the London School of Economics, 1980-

Depositor: Husbands, C., London School of Economics and Political Science. Department of Sociology

Principal investigator(s): Husbands, C., London School of Economics and Political Science. Department of Sociology

SUBJECT CATEGORIES

Higher and further

ABSTRACT

To conduct a course exercise that collects questionnaire-based information each year from a sample of students at the London School ofEconomics. The studies focus on background characteristics relevant to a student population, on attitudes to selected political and socialissues, and on participation in various activities at LSE. Questions vary somewhat from year to year.

COVERAGE, UNIVERSE, METHODOLOGY

Dates of fieldwork: 6 February 1980 - 22 February 1980

Country: England

Geography: London

Observation units: IndividualsGroups

Universe: SubnationalStudentsA sample of registered part-time and full-time students at London School of Economics and Political Science eachyear between 1980-1992

Time dimensions: Repeated cross-sectional studysurveys conducted annually

Sampling procedures: Quota samplebased on sex, undergraduate/graduate status, domestic/overseas status, and department

Number of units: 288 (target) 280 (obtained)

Method of data collection: Face-to-face interview

Weighting: No information recorded

KEYWORDS

ABORTION (INDUCED) ALCOHOL CONSUMPTION ATTITUDESEDUCATIONAL FEES EDUCATIONAL FINANCE EDUCATIONAL GRANTSFAMILY INFLUENCE FOREIGN STUDENTS GENDERGREATER LONDON NARCOTIC DRUGS OCCUPATIONSPARENTS PART-TIME COURSES POLITICAL PARTICIPATIONPORNOGRAPHY SEXUAL BEHAVIOUR SMOKINGSOCIAL ACTIVITIES (LEISURE) SOCIAL CLASS SOCIAL PROTESTSTUDENT HOUSING STUDENT LEISURE STUDENT PARTICIPATIONSTUDENTS UNIVERSITY COURSES

ADMINISTRATIVE AND ACCESS INFORMATION

Date of release:

First edition: 01 January 1981

Access conditions: The depositor has specified that registration is required and standard conditions of use apply. The depositor maybe informed about usage. See terms and conditions of access for further information.

Availability: UK Data Service

Contact: Get in touch

DOCUMENTATION

Title File Name Size (KB)

User Guide 1354userguide.pdf 1199

Study information and citation UKDA_Study_1354_Information.htm 13

RELATED STUDIES AND GUIDES

Related studies:

Attitudes of Students at the London School of Economics, January 1981 (SN 1517)Attitudes of Students at the London School of Economics, January 1982 (SN 1676)Attitudes of Students at the London School of Economics, January 1983 (SN 2068)Attitudes of Students at the London School of Economics, January - February, 1985 (SN 2088)Attitudes of Students at the London School of Economics, 1984 (SN 2570)Attitudes of Students at the London School of Economics, 1986 (SN 2571)Attitudes of Students at the London School of Economics, 1987-1988 (SN 3110)Attitudes of Students at the London School of Economics, 1989-1990 (SN 3111)Attitudes of Students at the London School of Economics, 1991-1992 (SN 3112)

UK DATA SERVICE makes use of browser cookies.By continuing to use this website you are agreeing to our use of cookies. Tell me more

Attitudes of Students at the LondonSchool of Economics, February 1980

IdentifiersLocal: sn1354

DOI: 10.5255/UKDA-SN-1354-1

Additional MetadataURI: http://esds.ac.uk/DDI25/1354.xml

Spatial Coverage:text: GREATER LONDON

text: England

text: London

Temporal Coverage:From 1980-02-06 to 1980-02-22

Access

Access rightsThe depositor has specified thatregistration is required and standardconditions of use apply. The depositormay be informed about usage. See

forfurther information.

ConnectionsPeople

Suggested LinksInternal Records

with matching subjects

External Recordsfrom DataCite

Home / UK Data Archive / Collection

To conduct a course exercise that collects questionnaire-based information each year from asample of students at the London School of Economics. The studies focus on backgroundcharacteristics relevant to a student population, on attitudes to selected political and socialissues, and on participation in various activities at LSE. Questions vary somewhat from year toyear.

How to Cite this CollectionCitation (Metadata):

Husbands, C. ( 1 Ja,1 Ja,1 Ja,1 Ja ): Attitudes of Students at the London School of Economics, February 1980.UK Data Service. DOI: 10.5255/UKDA-SN-1354-1.http://dx.doi.org/10.5255/UKDA-SN-1354-1http://dx.doi.org/10.5255/UKDA-SN-1354-1

SubjectsKeywords

ABORTION (INDUCED) ALCOHOL CONSUMPTION ATTITUDES

EDUCATIONAL FEES EDUCATIONAL FINANCE EDUCATIONAL GRANTS

FAMILY INFLUENCE FOREIGN STUDENTS GENDER NARCOTIC DRUGS

OCCUPATIONS PARENTS PART-TIME COURSES

POLITICAL PARTICIPATION PORNOGRAPHY SEXUAL BEHAVIOUR

SMOKING SOCIAL ACTIVITIES (LEISURE) SOCIAL CLASS

SOCIAL PROTEST STUDENT HOUSING STUDENT LEISURE

STUDENT PARTICIPATION STUDENTS UNIVERSITY COURSES

Higher and further

http://dx.doi.org/10.5255/UKD...

terms and conditions of access

C. Husbands (PI)

258 records

1 records

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 15: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Evaluation questions

É Does the software work as intended?É Do the harvested records look useful and accurate?É Is the system straightforward to use?É What might be improved?É What additional functions would be desirable?

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 16: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

Future work

Formal evaluationÉ ROAMEF = Rationale, Objectives, Appraisal, Monitoring,

Evaluation, Feedback

Questions to considerÉ Would another platform suit us better?É Would another internal metadata scheme suit us better than

RIF-CS?É What use cases should the registry target?É How can we add value to the registry’s records?É Could the registry add value to other systems?

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr

Page 17: Increasing research impact: the national data registry - Alex Ball - Jisc Digital Festival 2014

because good research needs good data

Thank you for your attention

DCC Website: http://www.dcc.ac.uk/Alex Ball: http://alexball.me.uk/

UKRDR Pilot Project: http://www.dcc.ac.uk/projects/research-data-registry-pilot

Jisc Digifest, ICC, Birmingham 2014-03-11 #jiscrdr