islandora as an access system for irods managed information packages kilian amrhein marco klindt...
TRANSCRIPT
Islandora as an access system for iRODS managed information packages
Kilian AmrheinMarco Klindt
Zuse Institute Berlin (ZIB)
LTDP is too complex to be implemented by a single (small) institution!
2010
• Concept for digitization of cultural heritage in Berlin
2012
• Funding scheme
• Servicecenter Digitization (digiS) @ZIB
Digitization in Berlin
3
ZIB - Infrastructure
Tape Library• Peta-Scale• independent silos
“Konrad“• 1872 nodes• 44928 cores• 120TB RAM• 1.4 Pflop/s
Small photo by naezmi
Berlin‘s Cultural Heritage digitizedCC-BY-SA 3.0 DE: Berlinische Galerie – Zustiftung der Dr. Jörg Thiede-Stiftung
CC BY 3.0: StadtmuseumBerlin
PD: StadtmuseumBerlin
PD: StadtmuseumBerlin
CC-BY-SA: ZLB (Roehrensee)
CC BY-SA 4.0: HTW-Berlin Sammlungen
CC-BY-SA: Museum für Naturkunde Berlin
PD: ZLB
6
Collection / Creation
Processing
Analysis
Publishing
Preservation
Discovery and Re-use
Research Data
Photos by janneke staaks
ZIB LTDP Pipeline
Archivematica
• AIP/DIP Generation
iRODS
• abstraction of storage resources (online/offline)
• storage metadata
Fedora / Islandora
• data management
• access
Pre-
Inge
st
ZIB-LTDP Architecture
Access requirements
• Two views on objects:– administrative– content
• Access to iRODS– metadata– AIP retrieval
9
Different object viewsObject Display
Compound Object[PID]
Islandora Compound Object
RDF Statements (RELS-EXT)
Dublin Core (DC)
Thumbnail (TN)
Child[PID]
RDF Statements (RELS-EXT)
Dublin Core (DC)
Thumbnail (TN)
0-n
Content(JPG|XML|…)
11
Admin Access Compound Object
(AACO)
XML-child
DC
Content Access Compound Object
(CACO)
JPG-child
POLICY
PREMIS
RELS-EXT
TN
AIP-METS TN
DC(Submission-)DC
object type
datastream
JPGXML
DC
POLICY
PREMIS
RELS-EXT
POLICYRELS-EXT
POLICYRELS-EXT
12
Datastream structure
Info
rmati
on P
acka
ge
isConstituentOf
digiSzib:digisislandora:collectionCModel
Bröhan Museum Berlinzib:DE-MUS-019910islandora:collectionCModel
LABEL
ID
ContentModel
Digitization Project 2013 zib:bac186cd-4d11-48ac-bb1d-4ab2cd7593cc_aacoislandora:compoundCModel
Selfportrait with bananazib:bac186cd-4d11-48ac-bb1d-4ab2cd7593cc_jpgislandora:sp_large_image_cmodel
Selfportrait with bananazib:bac186cd-4d11-48ac-bb1d-4ab2cd7593cc_xmlislandora:sp_xml_cmodel
…
Info
rmati
on P
acka
ge
Selfportrait with banana
zib:bac186cd-4d11-48ac-bb1d-4ab2cd7593cc_caco
islandora:compoundCModel
… …
…
IDs & Collection assignment
isMemberOfCollection
isConstituentOf
Colle
ction
s
14
Views
Administrative Content
Compound Navigation Block
15
16
Access Control
• Drupal roles• access/manage role in RELS-EXT and XACML
StorageAIP access and delivery
Storage Metadata
iRODS can store metadata for data objects
18
AVUs defined for dataObj bac186cd-4d11-48ac-bb1d-4ab2cd7593cc.tar:
attribute: uuidvalue: bac186cd-4d11-48ac-bb1d-4ab2cd7593ccunits:---- attribute: producerIDvalue: DE-MUS-019910units: ----attribute: submissionIDvalue: DE-MUS-019910-201505131006units: ----attribute: checksumvalue: sha2:E4dMTd7/J4z9qg36CSjSzdXXIa4ltgAak+MKfSuPKww=units: ----attribute: lastFixityCheckvalue: 2015-05-13T08:06:16Zunits: ----attribute: typevalue: AIPunits:
Storage Abstraction
• AIPs managed by iRODS• need to be accessed through front end
• Retrieval:– AIP-uuid in iRODS metadata– reference in RELS-EXT of AACO
19
Islandora iRODS
20
Wrap up
22
Conclusion
• New data model – added support for different user views on a
preservation item
• New Islandora-module – (Basic) iRODS access
23
Future development
• Trigger preservation workflow actions from Islandora– migration– fixity check– metadata amendment
• Reporting– Total Objects– occupied storage
• Managing iRODS (resources)
24
Thanks!
https://github.com/kamrhein
[email protected]@zib.de
http://www.zib.de/sihttp://www.servicestelle-digitalisierung.de/