Data organisation & description Library – RDM Support Project Basic training course for information specialists

Upload: jaren-hadaway

Post on 16-Dec-2015


Data organisation & description

Library – RDM Support Project

Basic training course for information specialists


Reasons for organisation & description

Human memory is fallible

All employment contracts come to an end sometime

Explaining the same thing over and over again gets boring


Aim of organisation & description

Ensuring that the origin, structure and content of a dataset are comprehensible to everyone at all times


Test question

What does someone who is not familiar with your data need in order to find, assess, understand and reuse your data?


Temperature: 32.3

Measured? Calculated? How?

When?

Where?

Temperature of what? Unit? Fahrenheit? Celsius?

Measured with what?

Was the equipment calibrated, and if so how?


Organisation


Benefit of organisation

Finding files easily (saving time)

Separating ongoing and completed work (maintaining an overview)

Separating master files and working files (preventing loss of files)


Do's

Assign clear names to folders and subfolders

Limit the number of levels in hierarchy

Describe structure in a readme.txt
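
A minimal sketch of how the folder structure could be written to a readme.txt automatically. The function names, the indentation style and the depth limit are assumptions made for illustration; they are not part of the course material.

```python
from pathlib import Path


def describe_structure(root: Path, max_depth: int = 3) -> str:
    """Return an indented listing of the folders below root."""
    lines = [f"Folder structure of {root.name}", ""]

    def walk(folder: Path, depth: int) -> None:
        # Stop descending once the (assumed) depth limit is reached.
        if depth > max_depth:
            return
        for child in sorted(p for p in folder.iterdir() if p.is_dir()):
            lines.append("  " * (depth - 1) + child.name + "/")
            walk(child, depth + 1)

    walk(root, 1)
    return "\n".join(lines) + "\n"


def write_readme(root: Path, max_depth: int = 3) -> Path:
    """Write the listing to readme.txt in the root folder."""
    readme = root / "readme.txt"
    readme.write_text(describe_structure(root, max_depth), encoding="utf-8")
    return readme
```

A generated listing only shows which folders exist; what each folder is meant to contain still has to be explained by a person.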


Don’ts

Name folders and subfolders after researchers

Make identification of files dependent on the folder that contains them

Have organisation determined by software from which file structure cannot be exported


File names

Piled Higher and Deeper by Jorge Cham

http://www.phdcomics.com/comics/archive.php?comicid=1531


Good file names

make files findable

help to distinguish between different files and versions of files

prevent confusion when sharing files


Possible building blocks

Brief description of content

Maker or indication of measuring instrument

Creation, recording or publication date

Version number

Project number and/or name of the research group


Practical tips

Make file names relevant and short (approx. 25 characters)

Do not use any special characters in file names

Use _underscores_ instead of spaces
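
These tips can be sketched as a small helper. Which characters count as "special" is an assumption here: the sketch keeps only unaccented letters, digits, underscores and hyphens, and the function names are illustrative.

```python
import re

MAX_LENGTH = 25  # approximate limit taken from the tip above


def clean_name(stem: str) -> str:
    """Replace runs of whitespace with underscores and drop other characters."""
    stem = re.sub(r"\s+", "_", stem.strip())
    # Assumption: anything outside A-Z, a-z, 0-9, "_" and "-" is "special".
    return re.sub(r"[^A-Za-z0-9_-]", "", stem)


def is_short_enough(stem: str, limit: int = MAX_LENGTH) -> bool:
    """Check a file name (without extension) against the length guideline."""
    return len(stem) <= limit
```

Note that this sketch also drops accented letters; a real project may prefer to transliterate them instead.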


Practical tips

Always enter dates in file names in the same way (yyyymmdd / yymmdd)

Abbreviations or initials in the file name? Enter the meaning in a readme.txt
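
One way to keep dates uniform is to generate them rather than type them. The sketch below assumes Python's standard `datetime` module; the function name `date_prefix` is illustrative.

```python
from datetime import date


def date_prefix(day: date, short: bool = False) -> str:
    """Format a date as yyyymmdd, or as yymmdd when short is True."""
    return day.strftime("%y%m%d" if short else "%Y%m%d")
```

Because the year comes first, file names that start with such a prefix sort chronologically in a file browser.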


Version management

Successive versions? Use numbers (v01, v02, etc.) instead of designations such as "draft" or "final"

Different versions of the same file? Indicate this in the file name (e.g. ‘tb’ for thumbnail)

If applicable: software for version management (SVN)
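
For files that are not under version control, numbering can be sketched as follows. The `_v01` suffix convention and the function name are assumptions based on the numbering suggested above.

```python
import re

# Assumed convention: the version is the last part of the name, e.g. "_v02".
VERSION_SUFFIX = re.compile(r"_v(\d+)$")


def next_version(stem: str) -> str:
    """Return the file name (without extension) for the next version."""
    match = VERSION_SUFFIX.search(stem)
    if match is None:
        # No version yet: start numbering at v01.
        return f"{stem}_v01"
    number = int(match.group(1)) + 1
    return f"{stem[:match.start()]}_v{number:02d}"
```

Zero-padded numbers keep versions in order when sorted by name, which labels such as "draft" or "final" do not.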



Test question

Does the file name contain the information needed to identify the file, regardless of where the file is stored?


Example

../Research Data Management/Second outline for the course January 2014.docx

Better:

../201401_RDMCourseOutline_v02.docx

../201401RDMCourseOutline02.docx
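
A sketch of why the improved names are easier to work with: a name assembled from fixed building blocks can also be taken apart again by a program. The pattern below matches the underscore variant only, and both function names are illustrative assumptions.

```python
import re

# Assumed layout: <date>_<description>_v<version>.<extension>
NAME_PATTERN = re.compile(
    r"^(?P<date>\d{6}|\d{8})_(?P<description>[A-Za-z0-9]+)"
    r"_v(?P<version>\d{2})\.(?P<extension>[A-Za-z0-9]+)$"
)


def build_file_name(date_part: str, description: str, version: int, extension: str) -> str:
    """Assemble a file name from its building blocks."""
    return f"{date_part}_{description}_v{version:02d}.{extension}"


def parse_file_name(name: str):
    """Return the building blocks as a dict, or None if the name does not fit."""
    match = NAME_PATTERN.match(name)
    return match.groupdict() if match else None
```

The original name, "Second outline for the course January 2014.docx", does not fit the pattern, so `parse_file_name` returns `None` for it.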


Renaming files

Bulk Rename Utility (Windows)

Renamer 4 Mac (Mac)

Renamer (Mac)

PS Renamer (Windows, Mac, Linux)
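
Where none of these tools is available, a few lines of script can do a simple bulk rename. This is a minimal sketch that only replaces spaces with underscores; the two-step "plan first, then apply" design and the function names are assumptions, not features of the tools listed above.

```python
from pathlib import Path


def plan_renames(folder: Path) -> list[tuple[Path, Path]]:
    """List (old, new) pairs for files whose names contain spaces."""
    plan = []
    for path in sorted(folder.iterdir()):
        if path.is_file() and " " in path.name:
            plan.append((path, path.with_name(path.name.replace(" ", "_"))))
    return plan


def apply_renames(plan: list[tuple[Path, Path]]) -> None:
    """Carry out the renames, refusing to overwrite existing files."""
    for old, new in plan:
        if new.exists():
            raise FileExistsError(new)
        old.rename(new)
```

Inspecting the plan before applying it makes it possible to check the new names without touching any file.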


Metadata

Dan Cohen is Executive Director of the Digital Public Library of America (DPLA)

https://twitter.com/dancohen/status/431440078951682049

Metadata: functions

Make datasets findable: metadata provide the building blocks that a repository's search function needs

Make datasets citable: metadata provide the elements for a citation of a dataset


Types of metadata

Descriptive: identification, location, classification

Technical: file formats, equipment settings, software or hardware used

Administrative: property rights, licence

Use: access rights, embargo

Retention: checksums, migration, conversion
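
As an illustration of retention metadata, a checksum can be computed with Python's standard `hashlib` module. The choice of SHA-256 and the function name are assumptions; a repository may prescribe a different algorithm.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Return the SHA-256 checksum of a file as a hexadecimal string."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in chunks so that large data files do not fill the memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()
```

Storing the checksum alongside the file makes it possible to verify later that the file has not changed, for example after a migration.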


When to use metadata?

Generally assigned when sharing, publishing and/or filing data

Embedded metadata: saved in the file automatically (but no guarantee that it survives when the file is moved)

http://www.flickr.com/photos/surfannemiek/3038325213/

http://www.flickr.com/photos/surfannemiek/3038325213/meta

Standards

Generic: Dublin Core, DataCite

Field-specific: e.g. Data Documentation Initiative (DDI, social sciences)

Standards of repositories and data portals

Searchable overview on DCC website


Dublin Core® Metadata Initiative (DCMI)

Contributor

Coverage

Creator

Date

Description

Format

Identifier

Language

Publisher

Relation

Rights

Source

Subject

Title

Type
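
A sketch of how a description using these fifteen elements could be serialised as XML with Python's standard library. The wrapper element `metadata` and the function name are assumptions; repositories define their own record layout. Dublin Core writes the element names in lower case.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"  # Dublin Core element set namespace
DC_ELEMENTS = (
    "contributor", "coverage", "creator", "date", "description",
    "format", "identifier", "language", "publisher", "relation",
    "rights", "source", "subject", "title", "type",
)


def to_dublin_core_xml(record: dict[str, str]) -> str:
    """Serialise a flat record as XML with one dc: element per field."""
    unknown = set(record) - set(DC_ELEMENTS)
    if unknown:
        raise ValueError(f"Not Dublin Core elements: {sorted(unknown)}")
    ET.register_namespace("dc", DC_NS)
    root = ET.Element("metadata")  # assumed wrapper element
    for name in DC_ELEMENTS:
        if name in record:
            child = ET.SubElement(root, f"{{{DC_NS}}}{name}")
            child.text = record[name]
    return ET.tostring(root, encoding="unicode")
```

All elements are optional in this sketch, and a record such as `{"title": "Example dataset", "creator": "Example Researcher"}` (hypothetical values) is enough to produce output.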

Example taken from https://library.uoregon.edu/datamanagement/metadata.html

Search function

DataCite properties

AlternateIdentifier (O)

Contributor (R)

Creator (M)

Date (R)

Description (R)

Format (O)

GeoLocation (R)

Identifier (M)

Language (O)

PublicationYear (M)

Publisher (M)

RelatedIdentifier (R)

ResourceType (R)

Rights (O)

Size (O)

Subject (R)

Title (M)

Version (O)


M = mandatory, R = recommended, O = optional
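
The mandatory properties lend themselves to a simple completeness check before a dataset is deposited. The sketch below uses the five properties marked (M) in the list above; other versions of the DataCite schema may mark different properties as mandatory, and the function name is illustrative.

```python
# Properties marked (M) in the list above.
MANDATORY = ("Creator", "Identifier", "PublicationYear", "Publisher", "Title")


def missing_properties(record: dict, required=MANDATORY) -> list:
    """Return the required properties that are absent or empty in a record."""
    return [name for name in required if not record.get(name)]
```

An empty result means that all mandatory properties have a value; it says nothing about whether those values are correct.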

DataCite properties

More examples at http://schema.datacite.org/meta/kernel-3/index.html

Choosing a standard

Which type of data is the researcher collecting?

What is customary in the field or research group?

Where is the data going to be deposited (= what does the repository want and what can it do)?

Challenge

Convert the information a researcher has on his or her dataset into the metadata fields used in a repository

Researchers often ask beforehand for far more fields than are actually filled in and used when searching (= experience from a previous data storage project)
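
The conversion can be pictured as a lookup table from the researcher's own terms to repository fields. Everything in this sketch is hypothetical: the researcher's terms, the target field names and the function name are chosen for illustration only.

```python
# Hypothetical mapping from a researcher's terms to repository fields.
FIELD_MAP = {
    "project title": "Title",
    "principal investigator": "Creator",
    "year": "PublicationYear",
    "keywords": "Subject",
}


def map_fields(info: dict, field_map: dict = FIELD_MAP):
    """Split a researcher's information into mapped and unmapped items."""
    mapped, unmapped = {}, {}
    for key, value in info.items():
        target = field_map.get(key.strip().lower())
        if target is None:
            unmapped[key] = value  # needs a decision by a person
        else:
            mapped[target] = value
    return mapped, unmapped
```

The unmapped items are the interesting part: they show which information would be lost unless it is moved into the documentation or a free-text description field.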

Metadata: tools

http://library.stanford.edu/research/data-management-services/data-best-practices/creating-metadata/metadata-tools-comparison


Documentation


(Meta)data vs documentation

Data

Metadata: description of data for computers

Documentation: description of data for humans


(Meta)data vs documentation - example

Data: answers to questions

Metadata: maker of survey, date survey taken, etc.

Documentation: survey itself, description of method used, etc.


Documentation at three levels

Research project: context, methodology, instruments

Dataset or database: relationship between files

File: content and structure of individual files (variables, codes, etc.)
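
File-level documentation often takes the form of a codebook. The sketch below writes one as plain text; the layout, the dictionary keys (`name`, `description`, `unit`, `codes`) and the function name are assumptions made for illustration.

```python
def codebook_text(file_name: str, variables: list) -> str:
    """Describe the variables of one data file as plain text."""
    lines = [f"Codebook for {file_name}", ""]
    for variable in variables:
        lines.append(f"{variable['name']}: {variable['description']}")
        if variable.get("unit"):
            lines.append(f"  unit: {variable['unit']}")
        # Codes explain what the stored values stand for, e.g. 0 = no.
        for code, meaning in variable.get("codes", {}).items():
            lines.append(f"  {code} = {meaning}")
    return "\n".join(lines) + "\n"
```

A variable described this way, with its unit and codes, does not leave a later reader guessing what a bare value such as 32.3 means.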


Test question

What does someone who is not familiar with your data need in order to assess, understand and reuse your data?

In practice…

Edinburgh University Data Library [Data Library]. (2012, May 4). MANTRA – John MacInnes – Data documentation in secondary data analysis. Retrieved from http://youtu.be/Ebaiwg08CW8


Information sources

Digital Curation Centre (DCC), Resources for digital curators: Disciplinary Metadata
http://www.dcc.ac.uk/resources/metadata-standards

Dublin Core® Metadata Initiative User Guide
http://wiki.dublincore.org/index.php/User_Guide

DataCite Metadata Schema
http://schema.datacite.org/meta/kernel-3/index.html

Stanford University Libraries, Metadata tools
http://library.stanford.edu/research/data-management-services/data-best-practices/creating-metadata/metadata-tools


Publication information

Presentation:

Mariëtte van Selm | [email protected]

Images, unless credited otherwise:

Jørgen Stamp | http://www.digitalbevaring.dk/

M. van Selm, RDM Support - basic training course for information specialists, course material for session 4. February 2014.

This work is licensed under the Creative Commons Attribution-ShareAlike 4.0 International licence