20160414 23 research data things
TRANSCRIPT
How do I access the 23 Things?
Overview
• ands.org.au/23-things
How often do I need to do a Thing?
• 1 Thing is released each week
• Complete at your own pace
UWA participation
• Monthly catch-ups in BJM which include a
national ANDS webinar
• Discussions in our Google + UWA-only group
Week Mon Tue Wed Thu Fri1 MarchKick-off Webinar
2 3 4
1 - Getting started with data 7 8 9 10 11
2 - Issues in research data management 14 15 16 17 183 - Data in the research lifecycle 21 22 23 24 25Easter Week 28 29 30 31 1 April
4 - Repositories for data discovery 4 5 6 7 811 12
Webinar Catch-up13 14 15
5 - Repositories for data sharing 18 19 20 21 226 - Long-lived data: curation & preservation 25 26 27 28 297 - Data citation for access & attribution 2 May 3 4 5 68 - Citation metrics for data 9 10 11 12 139 - Licensing data for reuse 16 17 18 19 20
23 24Webinar Catch-up
25 26 27
10 - Sharing sensitive data 30 31 1 June 2 311 - What's my schema? 6 7 8 9 1012 - Vocabularies for data description 13 14 15 16 1713 - Walk the crosswalk 20 21 22 23 24
27 28Webinar Catch-up
29 30 1 July
14 - Identifiers and linked data 4 5 6 7 811 12 13 14 15
15 - Data management plans 18 19 20 21 2216 - What are publishers & funders saying about data?
25 26 27 28 29
1 August 2Webinar Catch-up
3 4 5
17 - Data literacy & outreach 8 9 10 11 1218 - Data interviews: talk the talk 15 16 17 18 1919 - Exploring APIs and Apps 22 23 24 25 2620 - Do it with data! 29 30 31 1 September 2
5 6Webinar Catch-up
7 8 9
12 13 14 15 16
21 - Tools of the trade 19 20 21 22 2322 - What's in a name? 26 27 28 29 3023 - Making connections 3 October 4 5 6 7
10 11 12 13 14
17 18Webinar Catch-up
19 20 21
Calendar of Events
Catch-up webinars and topic review at BJM
each month in 2016
• 1 March
• 12 April
• 24 May
• 28 June
• 2 August
• 6 September
• 18 October
• Open up this
record of research
data collected during
a CSIRO voyage
which explored the
sea floor (i.e.
Benthic zone) of the
Marmion Lagoon,
located just off
Perth, in 2007
Thing 1
• your favourite research data tech or software story or
experience (eg did you compete in GovHack 2015? or
Science Hackfest Melbourne 4-6 March 2016?)
• a software tool or service for research data you think others
might be interested in
• a question or research data problem to crowdsource a
solution
Thing 1
Issues in research data management
Research data is for everyone. Governments and universities
all around Australia and the world are now encouraging
researchers to better manage their data so others can use it.
Research data might be critical to solving the big questions of
our time, but so much data are being lost or poorly managed.
Thing 2
Issues in research data management
• https://www.yout
ube.com/watch?
v=66oNv_DJuPc
• As you watch the
cartoon note the
data
management
mistakes which
interest or appal
you.
Thing 2
Issues in research data management
"Big Data" is a term we're hearing with
increasing frequency. Data management
for Big Data brings much complexity -
citing dynamic data, software, high volume
compute, storage costs, transfer of
petabytes of data, preservation,
provenance, more.
• Read this post and presentation titled:
"Big Data: The 5Vs Everyone Must
Know.
Thing 2
• Laboratory Notebooks are used by researchers to formally record their
research activities. As research has become increasingly digital and
collaborative the utility of traditional hard copy Lab Notebooks has been
challenged. Not surprisingly then, eLab Notebooks (ELN) have emerged
as an alternative.
• Effective data management for constantly updated data, such as that
within ELNs, is a real challenge for projects who wish to publish their
data during the project.
Thing 2
Issues in research data management
• Definition
Electronic lab notebook (ELN) software allows scientists to access, search
and share results of their experiments. An ELN is essentially a computer
program that is meant to replace traditional paper laboratory notebooks so
that scientists and researchers can search their records more easily and have
more efficient means to backup and copy their data onto other electronic
devices. ELNs encourage collaboration, as it is possible for multiple
researchers or scientists to view lab data at the same time. ELNs also have
the capacity to work alongside other research instruments so that additional
data can be incorporated quickly and efficiently. ELNs should be supported by
strong security measures to ensure that the data and the researchers’
process of creating the data are not jeopardized in any way. Additionally ELNs
should be flexible to change if a particular research process is altered or new
data is required. This flexibility is best addressed when developing the
specific software for an ELN.
Thing 2
Issues in research data management
• International team of scientists
open sources search for malaria
cure about how an international
team of scientists and citizen
scientists are using open source
ELNs to speed up a cure for
malaria.
Thing 2
Issues in research data
management
• You can see their
open ELNs here
Thing 2
Issues in research data
management
• You can see their
open ELNs here
Thing 2
Issues in research
data management
Thing 2
Issues in research
data management
http://www.wellcome.ac.uk/News/Media-office/Press-releases/2016/WTP060169.htm
Data in the research lifecycle
• Data and its management change over time. Here we look at data and
research lifecycles and make connections between them.
• Data often have a longer lifespan than the research project that creates
them.
• Follow-up projects may analyse or add to the data, and data may be
reused by other researchers.
• Journals publishers are increasingly mandating that the data underpinning
a journal article be retained and made accessible for the long term.
Thing 3
Data in the research lifecycle
• A data lifecycle shows the
different phases a dataset
goes through as the
research project moves
from
o "having a brilliant idea" to
o "making ground breaking
discoveries" to
o "telling the world about it"
Thing 3
http://www.data-archive.ac.uk/create-
manage/life-cycle
Data in the research lifecycle
• A data lifecycle shows the
different phases a dataset
goes through as the
research project moves
from
o "having a brilliant idea" to
o "making ground breaking
discoveries" to
o "telling the world about it"
Thing 3
http://www.data-archive.ac.uk/create-
manage/life-cycle
Data in the research lifecycle
Thing 3
http://www.data-archive.ac.uk/create-
manage/life-cycle
• A data lifecycle shows the
different phases a dataset
goes through as the
research project moves
from
o "having a brilliant idea" to
o "making ground breaking
discoveries" to
o "telling the world about it"
Data in the research lifecycle
Thing 3
http://www.data-archive.ac.uk/create-
manage/life-cycle
• A data lifecycle shows the
different phases a dataset
goes through as the
research project moves
from
o "having a brilliant idea" to
o "making ground breaking
discoveries" to
o "telling the world about it"
Data in the research lifecycle
Thing 3
http://www.data-archive.ac.uk/create-
manage/life-cycle
• A data lifecycle shows the
different phases a dataset
goes through as the
research project moves
from
o "having a brilliant idea" to
o "making ground breaking
discoveries" to
o "telling the world about it"
Data in the research lifecycle
Thing 3
http://www.data-archive.ac.uk/create-
manage/life-cycle
• A data lifecycle shows the
different phases a dataset
goes through as the
research project moves
from
o "having a brilliant idea" to
o "making ground breaking
discoveries" to
o "telling the world about it"
Data in the research lifecycle
Thing 3
• A data lifecycle shows the
different phases a dataset
goes through as the
research project moves
from
o "having a brilliant idea" to
o "making ground breaking
discoveries" to
o "telling the world about it"
Thing 3
Data in the research lifecycle
http://www.dcc.ac.uk/resources/curation-lifecycle-model
• Digital Curation
Centre
• Take a look at the
DCC Curation
Lifecycle Model
which
concentrates of
preservation and
curation within
data
management.
Thing 3
Data in the research lifecycle
http://www.dcc.ac.uk/resources/curation-lifecycle-model
• What could we
add???
Thing 3
Data in the research lifecycle
http://www.dcc.ac.uk/resources/curation-lifecycle-model
Thing 3
Data in the research lifecycle
http://www.library.uwa.edu.au/research/services
Data Discovery
• Repositories enable discovery of data by publishing data descriptions
("metadata") about the data they hold - like a library catalogue describes
the materials held in a library.
• Most repositories provide access to the data itself, but not always.
Thing 4
Data Discovery
• Data portals or aggregators draw together research data records from a
number of repositories.
• eg Research Data Australia (RDA) aggregates records from over 100
Australian research repositories.
• https://researchdata.ands.org.au/measuring-effects-human-leptonychotes-
weddellii/640511/
Thing 4
Data Discovery
Thing 4
• What data repositories exist and how are Australian researchers sharing
their data?
• Start by going to re3data.org
Data Discovery
Thing 4
DCC checklist for evaluating data repositories
What does this checklist cover and what does it exclude?
Choosing a long-term service to look after data means asking questions similar to
those you ask when choosing a publisher; ‘if I hand this over, will they review it,
safeguard the content, and make sure it is accessible for as long as it is of
value?’ This checklist relates these questions to the following key considerations:
1. Is a reputable repository available?
2. Will it take the data you want to deposit?
3. Will it be safe in legal terms?
4. Will the repository sustain the data value?
5. Will it support analysis and track data usage?
See more at: http://www.dcc.ac.uk/resources/how-guides-checklists/where-keep-
research-data#1
Contacts
Contact UWA 23 Things Coordinators:
Caroline Clark
Nola Steiner
Katina Toufexis