UPData: A data curation experiment at the University of Porto using DSpace. João Rocha da Silva (FEUP), Cristina Ribeiro (DEI-FEUP / INESC-Porto), João Correia Lopes (DEI-FEUP / INESC-Porto), Eugénia Matos Fernandes (U.PORTO Reitoria, Central Services)

Upload: alison-hopkins

Post on 12-Jan-2016

UPData

A data curation experiment at the University of Porto

using DSpace

João Rocha da Silva, FEUP

Cristina Ribeiro, DEI-FEUP / INESC-Porto

João Correia Lopes, DEI-FEUP / INESC-Porto

Eugénia Matos Fernandes, U.PORTO Reitoria (Central Services)

Contents

• Motivation
  – Goals of the experiment

• Our users, the researchers
  – Researcher concerns & needs
  – Adding data curation to the research workflow

• Building a repository
  – Using DSpace for research data curation

• Conclusions

MOTIVATION

The “standard” research workflow

However…

GOALS

Evaluating the research data management effort

• Interviewing researchers in several areas

• Collecting data samples

• Documenting use cases for research data

• Identifying data curation practices

Project Phases

Phase 1: Interviews

Our users, the researchers

• …are not data preservation experts

• …use many document formats

• …create and gather data from many sources

Researcher concerns and needs

• Repositories cannot be “graveyards for data”; they have to provide effective ways to access the stored data

• Data has to be well annotated, or else it cannot be reused (experiment contexts, meanings of variables…)

• Better ways to find data (e.g. domain-specific search restrictions, not just generic metadata)

Researcher concerns and needs

• Easy sharing of data (e.g. sending a link to the place where a user can find a specific dataset)

• Researchers can be cited by their peers through the datasets that they offer

• Ensuring reproducibility of scientific findings

Project Phases

Phase 2: Determine changes to current workflow

ADDING A DATA CURATION STEP TO THE RESEARCH WORKFLOW

The role of the “Data Curator”

Data curation meeting

Annotating data

After the meeting

Data + metadata in Excel format

How other researchers will see it

• Explore

• Filter

• Download just what you need
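To make the explore / filter / download idea concrete, here is a minimal sketch in Python. It is not the UPData or DSpace implementation: the sample table, the column names and the helper functions `load_rows`, `filter_rows` and `export_csv` are all hypothetical and exist only to illustrate filtering a tabular dataset and exporting just the requested rows and columns.

```python
import csv
import io

# Hypothetical sample data: a small tabular dataset, as a researcher
# might deposit it. Not taken from the UPData repository.
SAMPLE_CSV = """site,year,temperature_c
Porto,2010,15.2
Porto,2011,15.9
Braga,2010,14.1
Braga,2011,14.8
"""


def load_rows(text):
    """Explore: parse the table into a list of dicts keyed by column name."""
    return list(csv.DictReader(io.StringIO(text)))


def filter_rows(rows, **equals):
    """Filter: keep the rows whose columns match every given value."""
    return [r for r in rows if all(r[col] == val for col, val in equals.items())]


def export_csv(rows, columns):
    """Download: serialise only the selected columns of the selected rows."""
    out = io.StringIO()
    writer = csv.DictWriter(
        out, fieldnames=columns, extrasaction="ignore", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)
    return out.getvalue()


rows = load_rows(SAMPLE_CSV)
subset = filter_rows(rows, site="Porto")
download = export_csv(subset, ["year", "temperature_c"])
print(download)
```

The point of the sketch is that a user who needs only part of a dataset gets only the matching rows and chosen columns, not the whole deposited file.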

Project Phases

Phase 3: Build tools to support the workflow

Project Phases

Phase 4: Test tool using real-world data

DATA DEPOSIT - DEMO

VIDEO 1

DATA EXPLORING AND DOWNLOAD - DEMO

VIDEO 2

FIND DATASETS - DEMO

VIDEO 3

Conclusions + Future Work

• Some data management requirements of the researchers at U.Porto have been analysed and addressed

• DSpace has been successfully customized to include data exploration capabilities for tabular data

• Future work
  – Gather feedback on the data repository extension from the group of researchers who were interviewed