![Page 1: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/1.jpg)
Where’s Your Data?
Amanda WhitmireMaura Valentino
OSU Libraries
OPP Workshop Series5 December 2012
![Page 2: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/2.jpg)
Why is a Librarian asking?
We are curious.
We manage information.
Data are a kind of
information.
![Page 3: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/3.jpg)
TAKING CARE OF YOUR DATA
What’s your plan?
![Page 4: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/4.jpg)
GOAL:
Achievable habits for implementing
data management best practices into your workflow
![Page 5: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/5.jpg)
“…the recorded factual material commonly accepted in the scientific community
as necessary tovalidate research findings.”
Research data is:
U.S. Office of Management and Budget, Circular A-110
![Page 6: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/6.jpg)
“…management activities required to maintain research
data long-termsuch that it is available for reuse and preservation.”
Data curation is:
Wikipedia
CURATION ≠ ARCHIVAL
![Page 7: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/7.jpg)
“It is obvious that making data widely
available is an essential element of scientific research.”
Science editorial, “Making Data Maximally
Available,”11 Feb 2011
![Page 8: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/8.jpg)
The case for data managementstewardship
curationetc.
$
![Page 9: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/9.jpg)
Common missteps
“Why can’t I open this WordPerfect document?”“I think those data are on a ZipDisk somewhere…”“Oh, that dataset is on our group server…” “I never actually gave my advisor the final dataset…”“My laptop got stolen, so I lost the data…”“It was so long ago, I can’t remember …”
![Page 10: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/10.jpg)
Research data lifecycle
New research question
posedResearch
planning & design
Data collection & description
Data processing &
analysisDissemination &
publication of findings
Data archiving
Accessible data located
Data transformed / repurposed
Research Cycle
![Page 11: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/11.jpg)
How can we help?
New research question
posedResearch
planning & design
Data collection & description
Data processing &
analysisDissemination &
publication of findings
Data archiving
Accessible data located
Data transformed / repurposed
Research Cycle
![Page 12: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/12.jpg)
Where to start?
How much data?
Resources needed
Roles & responsibilities
Metadata
Data formats
Data storage
Ethics & consent
Copyright (open data)
Sharing
Make a plan. Consider:
![Page 13: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/13.jpg)
A fewtidbits
![Page 14: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/14.jpg)
Data storage & curation
Anticipate: Volume/File type(s) Raw data vs. processed/analyzed data File Naming Conventions Privacy Concerns Storage practice Backup plans (LOCKSS, checksums)
![Page 15: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/15.jpg)
File naming conventions
1. Be consistent• Have conventions for naming: (1) Directory structure
(2) Folder names(3) File names
• Always include the same information (e.g. date and time)• Retain the order of information (e.g. YYYYMMDD, not
MMDDYYY )
2. Be descriptive• Try to keep file and folder names under 32 characters
example: Project_instrument_location_YYYYMMDDhhmmss_extra.ext
SG157_20100426_001.raw (raw data)
SG157_20100426_001.mat (working data)
ESPOMZ_SG157_20100426_001.txt
(shareable)
![Page 16: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/16.jpg)
Legal and ethical considerations
Intellectual property• Office for Commercialization & Corporate Development (OCCD)• Copyright
LicensingCharging for data?Data attribution & citation
Human subjects? Informed consent & anonymization prior to publishingResources @ OSU:• Office of Research Integrity, Institutional
Review Board (IRB)• Responsible Conduct of Research (RCR)
Program
![Page 17: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/17.jpg)
Archiving and preservation
PoliciesPreservation optionsTypes of repositoriesCosts and benefits
![Page 18: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/18.jpg)
University of SouthamptonSchool of Electronics & Computer ScienceSouthampton, UK, 2005
A word about backups…
![Page 19: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/19.jpg)
Metadata
“The metadata accompanying your data should be written for a user 20 years into the future -- what does that person need to know to use your data properly? Prepare the metadata for a user who is unfamiliar with your project, methods, or observations.”
Oak Ridge National Laboratory Distributed Active Archive Center for Biogeochemical
Dynamics(ORNL DAAC)
![Page 20: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/20.jpg)
What is Metadata?
Metadata is “data about data”
WHO created the data? WHAT is the content of the data? WHEN were the data created? WHERE is it geographically? HOW were the data developed? WHY were the data developed?
![Page 21: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/21.jpg)
Metadata schemes
Dublin Core (DC), Darwin Core (DwC), EML, DDI, NBII,
FGDC/CSDGM, ISO 19139,
ISO 19115, DIF, LDIF, e-GMS,
AGLS, METS, MODS, PREMIS,
OAI-PMH, MARC, CDWA, CIDOC/CRM, DACS, DIG35,
GILS, GML, ISBD, LCSH, KML,
MARCXML, MEI, MODS, MIX,
OAIS, ANSI/NISO Z39.88, PB
Core, PRISM, QDC, RDF, SGML, VSO, XML, XMP
X
![Page 22: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/22.jpg)
Metadata schemes
“Metadata schemes are like toothbrushes – everybody agrees that you should use one, but nobody wants to use someone else’s.”
![Page 23: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/23.jpg)
You already use metadata…
-23
87
48
![Page 24: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/24.jpg)
Metadata in use
State City Location Date Time Temperature (F)
Alaska Anchorage City Hall 2/12/2010 1400 -23
Florida Miami Weather Center 2/12/2010 1400 87
New York New York Empire State Building 2/12/2010 1400 48
![Page 25: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/25.jpg)
Metadata in real life
You use it all the time…
![Page 26: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/26.jpg)
Darwin Core | biological diversity, taxonomy
Dublin Core | general
DDI (Data Documentation Initiative) | social and
behavioral sciences data
DIF (Directory Interchange Format) |
environmental sciences
EML (Ecological Metadata Language) | ecology
FGDC/CSDGM (Federal Geographic Data
Committee/Content Standard for Digital
Geospatial Metadata) | geographic data
NBII (National Biological Information
Infrastructure) | biology
Major metadata standards
http://sbc.lternet.edu/cgi-bin/showDataset.cgi?docid=knb-lter-sbc.10
![Page 27: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/27.jpg)
Metadata activity!
Take it away, Maura…
![Page 28: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/28.jpg)
Let’s Describe this Dataset
Bright orange Garibaldi fishHypsypops rubicundusCalifornia, USA
Ornate Butterfly fishChaetodon ornatissimusIndo-Pacific
![Page 29: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/29.jpg)
Scenario 1
Research for preschoolers to see if they learn colors and
patterns better from real life examples
![Page 30: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/30.jpg)
Scenario 2
Research on what fish are local to a particular area. The
photos are the data
![Page 31: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/31.jpg)
Scenario 3
Research into specific details of specific types of fish
![Page 32: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/32.jpg)
File/Folder Organization
You have monitors attached to 18 athletes (6 tennis players, 6 golfers, 6 rowers) for 7 days. Each day you get 2 readouts for each athlete, 1 for heart rate and 1 for body temperature. You transfer the data to Excel. Name and organize the files for this experiment.
![Page 33: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/33.jpg)
Think about your own data– What types of data need to be described?
– What are the relationships between them?
– What descriptive metadata can you find?
– What metadata is being captured automatically?
– What other descriptive metadata do you need to help users find your data?
– What metadata do you need to help other scientists reproduce your data or use it for comparison?
– What events has/will the data undergo?
– For how long do you want to retain the data?
– How intensive are your preservation needs?
– How diverse is your user base? Does this influence your preservation needs?
![Page 34: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/34.jpg)
Data Management Plans
![Page 35: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/35.jpg)
Data Management Plans
The types of dataData & metadata standards | format
and content
Policies for access and sharingPolicies and provisions for re-usePlans for archiving data{Budget} $$$
![Page 36: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/36.jpg)
Use available resources
http://www.dataone.org/data-management-planning
https://dmp.cdlib.org/
![Page 37: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/37.jpg)
Contact information
Amanda Whitmire | Data Management
Specialist
Maura Valentino| Metadata Librarian
![Page 38: Amanda Whitmire Maura Valentino OSU Libraries OPP Workshop Series 5 December 2012](https://reader034.vdocuments.net/reader034/viewer/2022051401/56649e625503460f94b5d767/html5/thumbnails/38.jpg)
fin