data management: international challenges, national infrastructure, and institutional responses

34
Data Management: International challenges, National Infrastructure, and Institutional Responses - an Australian Perspective Dr Andrew Treloar Director of Technology Australian National Data Service

Upload: andrew-treloar

Post on 13-Jan-2015

1.212 views

Category:

Education


1 download

DESCRIPTION

Presentation delivered to UKOLN on April 1, 2011.

TRANSCRIPT

Page 1: Data management: international challenges, national infrastructure, and institutional responses

Data Management: International challenges, National Infrastructure, and Institutional Responses - an Australian Perspective

Dr Andrew TreloarDirector of Technology

Australian National Data Service

Page 2: Data management: international challenges, national infrastructure, and institutional responses

INTERNATIONAL CHALLENGES

Page 3: Data management: international challenges, national infrastructure, and institutional responses

Inconvenient data

DOI: 10.1098/rsta.2005.1569

Page 4: Data management: international challenges, national infrastructure, and institutional responses

Imprisoneddata

DOI 10.1098/rsta.2006.1793

Page 5: Data management: international challenges, national infrastructure, and institutional responses

Invisible data

DOI 10.1098/rsta.2006.1793

Page 6: Data management: international challenges, national infrastructure, and institutional responses

Inaccessible data

Page 7: Data management: international challenges, national infrastructure, and institutional responses

Incomprehensible data

ands.org.au 7

Survey ID Ind. Cat.(O) T-PC F-Views A-Convenience

12345 O Y a sa

Date Depth (m) Temperature (Celsius) Salinty (ppt) Sigma -T (kgm-3)

30/10/80 10 -1.875 34.555 27.841

Date Depth Temperature Salinity Density

30/10/80 10 -1.875 34.555 27.841

Page 8: Data management: international challenges, national infrastructure, and institutional responses

8

Summary Not a first class object Unmanaged Disconnected Unfindable Unreusable

Page 9: Data management: international challenges, national infrastructure, and institutional responses

Why re-use data? Efficiency Validation Integrity Value for money Self-interest

Page 10: Data management: international challenges, national infrastructure, and institutional responses

10

Astronomy case study Hubble Space Telescope (HST) operating since 1990 Observations are proposed, and if accepted, data is collected and

made available to the proposers – who then write a research paper

Each year around 1,000 proposals are reviewed and approximately 200 are selected, for a total of 20,000 individual observations

Data is stored at the Space Telescope Science Institute and made available after embargo period

There are now more research papers written by “second use” of the research data, than by the use initially proposed

Page 11: Data management: international challenges, national infrastructure, and institutional responses

11Source: http://archive.stsci.edu/hst/bibliography/pubstat.html

Page 12: Data management: international challenges, national infrastructure, and institutional responses

Cancer micro-array trial case study Piwowar, et. al., “Sharing Detailed Research Data Is

Associated with Increased Citation Rate” http://www.plosone.org/article/info:doi/10.1371/journal.pone.

0000308 Looked at the citation history of cancer microarray

clinical trial publications Found that publicly available data was associated with

a 69% increase in citations, independent of journal impact factor, date of publication, and author country of origin

12

Page 13: Data management: international challenges, national infrastructure, and institutional responses

Alzheimer’s Disease NeuroImaging Initiative Collaborative effort to find

brain biomarkers for Alzheimer’s disease

Key: All brain scans and other data freely available to scientific community without embargo.

Over 3K full downloads and 1M scan downloads by over 400 investigators world-wide

Over 100 publications13

http://www.fnih.org/work/areas/chronic-disease/adni

Institut Douglas CC BY-NC-ND

Page 14: Data management: international challenges, national infrastructure, and institutional responses

14

NATIONAL INFRASTRUCTURE

Page 15: Data management: international challenges, national infrastructure, and institutional responses

National approaches Number of different countries: UK, US, DE, NL Different environments => different ecosystems

and so some local tradeoffs But some common themes emerging:

Do the things that only you can do Be the ‘voice for data’ Prime the pump

Page 16: Data management: international challenges, national infrastructure, and institutional responses

Australian National Data Service An initiative of the Australian Government being

conducted as part of the National Collaborative Research Infrastructure Strategy ($A24M) and the Super Science Initiative ($A48M)

A collaboration between Monash University, the Australian National University and CSIRO

Nearly 50 staff, funded to mid 2013 More researchers re-using more data more often Data as a first-class object

ands.org.au 16

Page 17: Data management: international challenges, national infrastructure, and institutional responses

ANDS is enabling the transformation of:

Data that are: Unmanaged Disconnected Invisible Single use

17

Collections that are: Managed Connected Findable Reusable

so that Australian researchers can easily discover, access and re-use data

Page 18: Data management: international challenges, national infrastructure, and institutional responses

18

Defining characteristics of ANDS Building national services Engaging with institutions not researchers (mostly) Working within funding constraints

use, not amount! Building the Australian Research Data Commons

Page 19: Data management: international challenges, national infrastructure, and institutional responses
Page 20: Data management: international challenges, national infrastructure, and institutional responses

20

ANDS Programs Frameworks and Capability Seeding the Commons Data Capture Metadata Stores ARDC Core Public Sector Data Applications

Page 21: Data management: international challenges, national infrastructure, and institutional responses

21

Spending profile

Page 22: Data management: international challenges, national infrastructure, and institutional responses

22

RDA Demo http://www.google.com/

Page 23: Data management: international challenges, national infrastructure, and institutional responses

INSTITUTIONAL RESPONSES

Page 24: Data management: international challenges, national infrastructure, and institutional responses

24

Driven by Australian Code for Responsible Conduct of Research Equivalent of UKRIO’s Code of Practice for Research:

Promoting good practice and preventing misconduct Takes significant time to get accepted ANDS providing models of good practice Seeding the Commons U->M

Data management policy and planning

Page 25: Data management: international challenges, national infrastructure, and institutional responses

25

Retrospective data description Different selection mechanisms Seeding the Commons U->M

Fixing the past

Page 26: Data management: international challenges, national infrastructure, and institutional responses

26

Improving internal CRIS systems Better integration Moving beyond publications Better links to data collection descriptions Seeding the Commons, Metadata Stores D->C

Page 27: Data management: international challenges, national infrastructure, and institutional responses

27

Facilitating easier/better capture of data and metadata from selected ‘instruments’

Making the right thing easier Improving quality of metadata Data Capture U->M S->R

Fixing the future

Page 28: Data management: international challenges, national infrastructure, and institutional responses

28

Describing institutions research data assets Series of metadata stores rollouts plus some

ancillary activity Metadata Stores, Seeding the Commons, Data

Capture D->C I->F

Page 29: Data management: international challenges, national infrastructure, and institutional responses

29

Page 30: Data management: international challenges, national infrastructure, and institutional responses

30

ONGOING ISSUES

Page 31: Data management: international challenges, national infrastructure, and institutional responses

Country-Institution-Discipline Who wins? Who should win?

31

Page 32: Data management: international challenges, national infrastructure, and institutional responses

Sustainability, sustainability, sustainability… Institutional activity National services/resources Developed software

32

Page 33: Data management: international challenges, national infrastructure, and institutional responses

33

Priming the pump, or continuing to pump? If institutions/researchers/disciplines don’t care,

why should the funders?

Role of Government