digital curation workshop - archivematica...mar 14, 2013  · the digital preservation problem: 7....

Post on 04-Oct-2020

3 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Digital Curation Workshop

March 14, 2013SFU Wosk Centre for Dialogue

Vancouver, BC

Introduction to

What is Archivematica?

● digital preservation/curation system ● designed to maintain standards-based, long-

term access to collections of digital objects● free and open-source (AGPLv3)● supported by Artefactual Systems Inc.

digital preservation consulting open-source sofware for archives and libraries

The Digital Preservation Problem:

1. Rapid technological change drives constant system upgrades,

migrations and retirement of legacy technologies.

2. Incompatible, obsolete, obscure or proprietary systems and file

formats.

3. Loss or damage to bitstreams due the fragility of digital storage

media, system error, or human error.

The Digital Preservation Problem:

4. The overwhelming volume of digital information objects created

daily, each with many possible copies and versions.

5. The lack or loss of adequate metadata describing digital

information objects.

6. Accidental or malicious content alteration.

The Digital Preservation Problem:

7. Doubts about the reliability and integrity of electronic records

and the inability to vouch for their authenticity.

8. The complexity of digital information objects which requires

preservation of their content, structure, context, presentation,

behaviour as intellectual entities as well as bitstreams.

9. The lack of formally recognized organizational responsibility,

resources and enterprise architecture components that facilitate

digital curation, preservation and long-term access.

now future

bitstream

storage media

packaging

storage device

storage driver

file system

error correction operating system

application software user interface

input / output devices

metadata

find

relate / bind

authenticate

contextualize

stored

copied

protected

Accessible?Usable?Authentic?

compression

decryption

file format

character encoding fonts

codec

Responsible?Architecture?Resources?

contentcontext

structurepresentationbehaviour

The Digital Preservation Problem for University Libraries

Digitized collections Born-digital faculty publications and research output

Student E-theses University e-records Born-digital donor collections Born-digital research data sets etc...

The Digital Preservation Problem for University Libraries

ILS Discovery engines / platforms Website CMS Custom departmental / collections websites Dspace Fedora OJS Islandora LOCKSS etc...

Capacity Gap

● No preservation features in key systems

● No digital preservation planning● Format obsolescence

● System & platform incompatibility/obsolesence

● Digital preservation metadata

● External media processing

● Dedicated storage and geo-remote backup

● No Trusted Digital Repository (TDR)

● No obvious next steps to improve capacity

TRAC: Trustworthy Digital Repositories

ISO 16363:2012

Data Management

Preservation Planning

Archival Storage

Ingest

Administration

SIP

MANAGEMENT

AIP Access DIP

PRODUCER

CONSUMER

Open Archival Information System (OAIS) reference model (ISO-STD 14721)

What is Archivematica?

● Allows users to process digital objects from ingest to access in conformance with the ISO-OAIS functional model

● Archivematica creates high-quality, standards-compliant Archival Information Packages (AIP)

● Archivematica provides an architecture for implementing preservation strategies

● Archivematica provides a framework for evaluating and implementing format policies

`

web-based dashboard

monitor and control

web server

MCP server

micro-service processing clients

watched directory

success

error

fileshare

successdigital curationmicro-services

pythonscripts

FOSStools

AIP

DIP

SIP or

transfer of digital objects

& metadata

The METS file<dmdSec> (descriptive metadata) Dublin Core XML EAD XML MODS XML [whatever] XML<amdSec> (administrative metadata) <techMD> PREMIS: object <digiProvMD> PREMIS: events PREMIS: agents <rightsMD> PREMIS: rights<fileSec> (a list of the files and their roles and relationships)<structMap> (a representation of the physical structure of the AIP)

Preservation planning

● A two-pronged approach:– Normalization on ingest

– Preservation of the original file to support future strategies such as migration and emulation

● Normalization relies on format policies based on an analysis of the significant characteristics of file formats

● A format policy indicates the actions, tools and settings to apply to a file of a particular file format (e.g. normalization to preservation and/or access format)

https://www.archivematica.org/preservation

Archivematica format policies

● Criteria for selecting default formats:● Non-proprietary● Freely available specifications● Widely used/endorsed by major repositories● No compression/lossless compression● Tools available to write and render the format

● Format policies will change as community standards, practices and tools evolve.

PRONOM

UDFR

Format Policy Registry (FPR)

?

API GUI

Systems Integration

● Application Programming Interfaces (API)– Storage

– Ingest

– Access

● Dspace● ContentDM● ICA-AtoM● Archivist Toolkit● LOCKSS● Islandora● Fedora● Dataverse

Archivematica Clients / Partners

● 30 – 50 users worldwide● Active discussion list, twitter feed, website● Courses, community participation● Current Artefactual clients:

– UBC Library

– UofA Library

– SFU Library

– SFU Archives

– City of Vancouver Archives

– Rockefeller Archive Center

– International Monetary Fund Archives

– Columbia University Library

– Museum of Modern Art (MoMA)

– Yale University Library

Foundation orSteering Committee

Governance

Coordination

Funding

Promotion

Users

Lead institutions Funding DevelopmentAll users Bug reports Enhancement requests Code patches Documentation Promotion Open Source Software

Code

Knowledge

Community

Service Providers

Development

Technical Support

Hosting

Training

Promotion

CodeTime

MoneyKnowledge

CodeTimeMoneyKnowledge

TimeMoney

Knowledge

Free Beer!

“They’ll never take our freedom”

© 1995 Paramount Pictures & 20th Century FoxSee fair use rationale: http://en.wikipedia.org/wiki/File:Brave_mel.jpg

http://archivematica.org

top related