managing digital content over time: module 1:...

42
Digital Preservation Outreach and Education (DPOE) Managing Digital Content over Time: Protect Module CUWL June 3, 2016

Upload: others

Post on 26-Aug-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Digital Preservation Outreach and Education (DPOE)

Managing Digital Content over Time:

Protect Module

CUWL

June 3, 2016

Page 2: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Modules

DPOE Baseline Modules: Intro, version 2.0, Nov 2011

Select - what portion of that content will be preserved?

Identify - what digital content do you have?

Store - what issues are there for long term storage?

Protect - what steps are needed to protect your digital content?

Manage - what provisions are needed for long-term management?

Provide - what considerations are there for long-term access?

2

Page 3: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

identify

select

store

protect

manage

provide

DPOE Baseline Modules

3

Page 4: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• Understand the various types of

threats facing digital materials

and how to address them.

• Learn about the types of policies

that can assist with protecting

your content.

Session Goals

4

Page 5: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Identify and evaluate risks

Protection of content

requires you to identify

and evaluate risks:

– related to

management of the

content

– related to emergencies

Photo courtesy of NARA 5

Page 6: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• Hardware/equipment

• Digital files

• Content of the digital files

– Including sensitive information

• Your investment!

• Your reputation!

What are you protecting?

6

Page 7: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Dangers! to digital objects

Change and loss – accidental

7

Page 8: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Dangers! to digital objects

Change and loss – intentional

8

Page 9: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Dangers! to digital objects

Inappropriate access – e.g., confidential data

9

Page 10: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Dangers! to digital objects

Obsolescence – as technology evolves

10

Page 11: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Dangers! to digital objects

https://archive.org/details/historicalsoftware

Obsolescence – as software evolves

11

Page 12: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Dangers! to digital objects

Non-compliance – standards and requirements

12

Page 13: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Dangers! to digital objects

Disasters – Natural Disasters

13

Page 14: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Dangers! to digital objects

Disasters – Physical Disasters

14

Page 15: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Readiness

Proper planning should

allow you to:

• Prevent

• Predict

• Detect

• Respond

• Repair

The American Fireman Facing the Enemy

Image ID: WHi-22879 15

Page 16: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

How to achieve readiness?

Ulti atel , a orga izatio ould use a suite of plans to properly prepare

response, recovery, and continuity

activities for disruptions affecting the

orga izatio s IT systems, business

processes, and the facility.

Source: NIST Contingency Planning Guide for Information Technology Systems, pg. 7.

16

Page 17: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Steps for Effective Risk Management

Steps to protect your content:

• Identify possible risks

• Define those risks (nature and scope)

• Assess potential impact (possible damage)

• Develop appropriate, feasible responses (plans)

• Respond to risks, threats (implement plans)

17

Page 18: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Ask Yourself…..

During an emergency, could your agency access

its digital content?

Photo courtesy of NARA 18

Page 19: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Risk Analysis

19

Page 20: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Emergency Protection

• Engage in ongoing disaster planning

– Establish committee and share information

– Develop and maintain documents

– Update plans regularly

• Identify possible outcomes and prepare

– e.g., server goes down, media is damaged

– Simulations and drills

20

Page 21: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• Safet first…

– Physical safety of guest and employees comes first then

prote t the stuff

• K o hat ou eed e t…

– Identify core functions of your operation

– Determine allowable downtime for each

– Consider steps to re-establish each function

– Develop relevant response documents

• Keep planning and response documents accessible!!

Identify Priorities

21

Page 22: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

NIST Planning Components

From NIST Contingency Planning Guide for Information Technology Systems, pg. 10.

Business Continuity

Plan (BCP)

Business Recovery

Plan (BRP)

Continuity of Operations Plan

(COOP)

IT Contingency

Plan

Crisis Communication

Plan

Cyber Incident Response Plan

Disaster Recovery Plan (DRP)

Occupant Emergency Plan

(OEP)

22

Page 23: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

From NIST Contingency Planning Guide for Information Technology Systems, pg. 11.

23

Page 24: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• Still need to do the Risk Analysis

• Review existing disaster planning documents

• Create a response team to develop a plan

• Make sure copies are offsite (along with the inventory)

• Add to your inventory

– Where are the copies?

– Which collections need to be restored first?

– What is the recovery time?

What About the Digital Content?

24

Page 25: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Data Criticality

25

Page 26: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Treating Electronic Media (AKA - he ad thi gs happe to good I stitutio s…….)

Simple fact is … No matter how well you plan, some things are

JUST GONNA’ HAPPEN

However - There are several actions you can take

within the first 48 hours to mitigate the damage to

your digital collections.

26

Page 27: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Treating Electronic Media

• Some materials should be kept wet until they

can be recovered by a contractor who

specializes in the recovery of those materials.

• Examples include:

– Hard drives from computers

– CDs

– Floppies

– Microfilm

– Motion picture film

27

Page 28: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• Priority Action:

– Act quickly

– Do NOT attempt to recover the data yourself

• Keep hard drives wet

• Do NOT:

– Rinse in clean water

– Dry

– Shake

– Turn on

Computers and Hard Drives

Photo courtesy of NARA

Orleans Parish -Post Katrina - 2006 28

Page 29: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• Priority Action:

– Clean and air dry discs within 48 hours

– Never freeze, vacuum dry,

or expose wet discs to

heat

– It is usually easier to

discard wet discs if

backup copies are

readily available

CDs and Floppy Disks

Photo courtesy of NARA—Suitland—NIH material—2006 29

Page 30: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• Priority Action:

– Air dry discs within 48 hours

– Do not freeze dry

or use heat to dry

– Do not attempt to

play back

Audio and video

Photo courtesy of NARA—WNRC—NARA —2006

30

Page 31: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Disaster Planning Resources

31

Page 32: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Heritage Preser atio s Field Guide a d Wheel

Heritage Preservation’s Field Guide to Emergency Response

Heritage Preservation’s Emergency Response and Salvage Wheel

32

Page 33: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Everyday Protection

• Have multiple copies

• Know where your content is located

– Onsite and offsite; online and offline

• Know who can have access to it

– DP staff, IT staff, others? (Permissions)

• Know who accesses your secure information

– For staff, depositors, users (Authentication)

• Know about your users to improves service

– Web use, internal use and activities, maintenance

33

Page 34: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• How do you know if a digital file has changed?

– Generate and monitor checksums!

Checksum = Mathematical algorithm run

on/against a file and its resulting value

• Resulting value may also be

called:

– Fingerprint

– Hash Value

– Cryptographic Hash Function

Monitor the Files Themselves

Ex: from a Unix cksum utility 34

Page 35: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Why is this a good thing?

• Checksums help maintain the INTEGRITY of your collections because they will tell you when things change over time

• If two files are exactly the same, the checksums of those files will also be exactly the same

• If a file becomes corrupted, degraded or is changed in some way, the next time you run the utility on it, the checksum will change

35

Page 36: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

• Open MD5summer

• Select your

root folder

• Select

Create Su s

MD5 Summer

36

Page 37: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

37

Page 38: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

If The Files Are Differe t……

Uh-Oh! 38

Page 39: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Exact File Checksum on a Single File

Checksums on Group of Files (Create Digest)

Checksums to put with files burned to CD/DVD (TestFile Applet)

Multiple checksum algorithms available

Verify Checksums (Test Digest)

39

Page 40: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Creati g the Che ksu …

Exact File 40

Page 41: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

Outcomes

Good practice should result in:

• Keeping content accessible for the long-

term

• Practices in place to manage day-to-day

protection – an implemented security plan

• Disaster planning in place to prevent,

predict, detect, respond, repair –

preparation in the event of an emergency

41

Page 42: Managing Digital Content over Time: Module 1: Identifyrecollectionwisconsin.org/wp-content/uploads/2016/... · June 3, 2016 . Modules DPOE Baseline Modules: Intro, version 2.0, Nov

42