save this book: posterity’s challenge

54
Peter Brantley Internet Archive

Upload: bookrepublic

Post on 20-Aug-2015

1.176 views

Category:

Education


1 download

TRANSCRIPT

Page 1: Save this book: posterity’s challenge

Peter Brantley Internet Archive

Page 2: Save this book: posterity’s challenge

A. About preservationB. National policies

Page 3: Save this book: posterity’s challenge
Page 4: Save this book: posterity’s challenge

Preservation for digital materials –

A. Heritage is obligatory for social continuity.B. Maintenance of assets is business requisite.

Page 5: Save this book: posterity’s challenge

The Library of Alexandria continues to be a platform for democracy and has, since its inception opened its doors to all Egyptians to participate in open dialogues about reform […]. …

I have no doubt that once life returns to its norms, the Library of Alexandria will continue its peaceful dialogue and civic responsibility as an institution of learning.

Ismail Serageldin (via telephone)Director of the Library of Alexandria29 January 2011

Page 6: Save this book: posterity’s challenge
Page 7: Save this book: posterity’s challenge

• Myriads of ebook formats• Digital rights management• No national policy in place

No one is in charge of the preservation of our growing cultural heritage in digital books.

Page 8: Save this book: posterity’s challenge

Digital storage means it is easy to preserve in multiple locations, in different formats. More redundancy than paper could ever afford.

Storage costs trending toward insignificant on a per-byte basis.

Page 9: Save this book: posterity’s challenge

High density storage: Each rack stores between 0.5 and 0.75 petabytes.

Page 10: Save this book: posterity’s challenge

A book is a complex assembly of content –Text, video, foreword, illustrations, photos.Each item needs careful management.

Page 11: Save this book: posterity’s challenge

Digital production workflow requires:

- content management system - sophisticated rights / use tracking - skilled personnel esp. engineering

Page 12: Save this book: posterity’s challenge

Possibility of loss should motivate preservation.Loss can arise for oneself – or one’s partners –

via –

1. complex, tightly coupled systems2. struggle of memory vs. forgetting

Page 13: Save this book: posterity’s challenge
Page 14: Save this book: posterity’s challenge

From an engineering perspective, complexity in work flow systems raises the chance of catastrophic loss. Interactions become “tightly-coupled” as system efficiency increases.

Tightly coupled systems are prone to routine accidents with unforeseen cascading effects.

Page 15: Save this book: posterity’s challenge

- Charles PerrowSociology Dept, Yale University

Ex.: Apollo 13

“[The] accident was not the result of a chance malfunction in a statistical sense, but rather resulted from an unusual combination of mistakes, coupled with a somewhat deficient and unforgiving design.” source: Wikipedia.

Page 16: Save this book: posterity’s challenge

Traditional (print) publishers are constructing baroque and complex systems to maintain heritage production while transitioning to newer business systems.

Potential trigger events are ubiquitous.

Page 17: Save this book: posterity’s challenge

“We're working quickly to recover from a major issue in one of our database clusters. We're incredibly sorry for the inconvenience.”

Page 18: Save this book: posterity’s challenge

:

“Unfortunately, I have mixed up the accounts and accidentally deleted yours. I am terribly sorry for this grave error and hope that this mistake can be reconciled. ” …

“Our teams are currently working hard to try to restore the contents of this user's account. We are working on a process that would allow us to easily restore deleted accounts and we plan on rolling this functionality out soon.” (em. added)

Page 19: Save this book: posterity’s challenge
Page 20: Save this book: posterity’s challenge

Many archives are lost by simply forgetting where or how they were stored. Canisters of film, shelves of books, or backup tapes.

“Those books are in a warehouse in Jersey.” “The servers are in one of our datacenters.”

Page 21: Save this book: posterity’s challenge

Commonly, companies get acquired or shutter business operations; records are abandoned.

Servers are redeployed without an audit. Databases run with no backup routines.

Page 22: Save this book: posterity’s challenge

Preserving without metadata is not helpful. Metadata provides necessary context.

Need to point backwards (provenance) and forward (to find superseded content)

Without necessary metadata infrastructure, preservation architectures are rickety.

Page 23: Save this book: posterity’s challenge

If you are a publisher, do you safeguard your digital assets? Has that process been audited against security threats? Technical accidents? Are your workflows well understood? Have you conducted trial asset recovery exercises?

Does your insurance company know?Do you mind if I give them a call?

Page 24: Save this book: posterity’s challenge
Page 25: Save this book: posterity’s challenge

For 100s of years, libraries preserved books.Sort of anyway (witness Alexandria).

By-product of the inefficiency of distribution:Lots of libraries wound up with same books.

Lots of copies keep stuff safe.

Page 26: Save this book: posterity’s challenge

Books were once bespoke – monks with pens.

Over around 500 years, books became industrialized – mass produced. Although ebooks are the ultimate industrial product, we come full circle.

Digital books stand on threshold of tremendous mutability.

Page 27: Save this book: posterity’s challenge

It is easy to store objects now, arguably, but … It is also easy to save objects that cannot be easily recovered.

E.g., PDF can be a stew of non standard formats and arbitrary data. EPUB with DRM locks data with risk that key will be lost.

Page 28: Save this book: posterity’s challenge

We may be able to preserve digital editions.Can we preserve the book of the future?

• Scripted and Interactive• Networked and Distributed • Personalized and Mobile

Page 29: Save this book: posterity’s challenge
Page 30: Save this book: posterity’s challenge

In U.S., beyond public and academic library collections, the Library of Congress plays a unique role:

§407. The Required copies or phonorecords shall be deposited in the Copyright Office for the use or disposition of the Library of Congress.

Page 31: Save this book: posterity’s challenge

Current U.S. code privileges print as best edition for preservation. Slowly being changed.

(Congress approved e-journal demand deposit for LoC in the 2010 legislative session).

Page 32: Save this book: posterity’s challenge

The Copyright Office historically held that transmission to the LoC of deposited digital books would be an infringing faithful copy. despite –

§ 704. In the case of published works, all copies, phonorecords, and identifying material deposited are available to the Library of Congress for its collections ….

Page 33: Save this book: posterity’s challenge
Page 34: Save this book: posterity’s challenge

I led an initiative to confront this issue in Summer 2008. Fizzled.

Convened meeting in NYC via Digital Library Federation with LoC, Mellon, Portico, NISO, IDPF, BISG, and publisher consultants and representatives to discuss digital archives that would also serve as escrow.

Page 35: Save this book: posterity’s challenge

The group decided that we should attempt a trial project with a small set of publishers who would deposit sample books into Portico’s repository; the pilot would inform against business, legal, and policy issues.

With that in hand, LoC would be in stronger position to solicit rule changes by Congress.

Page 36: Save this book: posterity’s challenge

Portico is a not for profit jointly representing publishers and libraries preserving a growing volume of digital content with high reliability in a rights respecting, secure archive. Serving as an escrow, access to the archive by its members is triggered only under carefully-defined and contractually-specified circumstances.

Initial funding came from Mellon and LoC.

Page 37: Save this book: posterity’s challenge
Page 38: Save this book: posterity’s challenge

The AAP evidenced support.

Ed McCoyd, Director of Digital Policy:

“Thank you for speaking at the AAP Digital Issues Working Group meeting on [11 June 2008], regarding your interest in bringing parties together to develop digital book archives for preservation.

Page 39: Save this book: posterity’s challenge

“Your points about preserving cultural patrimony, and assuring permanent access by libraries and other digital content customers as well as by the publishers themselves, certainly resonated with the group.

“I look forward to talking further about your initiative in the coming months, and will keep the publishers apprised of the additional details as they develop.”

Page 40: Save this book: posterity’s challenge

It fizzled due to the inability of the LoC to determine whether it had the capacity to pursue a deposit initiative, and if so, what department of the Library should proceed.

Page 41: Save this book: posterity’s challenge
Page 42: Save this book: posterity’s challenge

Portico decided to continue the pursuit of ebooks deposits on behalf of its members.

Members are primarily research libraries –‘cuz, Who else has a mandate to care about preservation?

With inevitable focus on academic ebookpublishers, e.g. Elsevier, starting in 2008.

Page 43: Save this book: posterity’s challenge

Portico has been unable to penetrate trade publishing sector through private initiative.

No national policy requiring digital deposit.

Page 44: Save this book: posterity’s challenge
Page 45: Save this book: posterity’s challenge

Google Book Search (GBS) has emerged in the absence of international rulemaking as the default archive for some institutions.

GBS does not do preservation quality imaging, and there is requirement for comprehensive publisher participation.

This is NOT a good solution.

Page 46: Save this book: posterity’s challenge

Europeana digital library seeking to build a collection preserving Europe’s vast cultural heritage.

In September 2010,Ghent Univ. Library became first in Europe to deposit public domain books scanned by Google into Europeana.

Page 47: Save this book: posterity’s challenge

GBS partners’ collection management of older public domain books is a pale shadow of the comprehensive international policy framework needed to mandate preservation of our cultural heritage in digital books.

Preservation must mandate participation for libraries and publishers in a legal framework.

Page 48: Save this book: posterity’s challenge

Widespread recognition of need for new copyright regime supporting digital use.

Internationally interoperable network of rights assertions in rights registries capable of automated status query in geographic and national domains is a very useful support.

Page 49: Save this book: posterity’s challenge
Page 50: Save this book: posterity’s challenge

Mandating the deposit of digital works for preservation purposes to ensure adequate representation of copyright assertions for a content manifest would be one approach.

Arguably avoids Berne / TRIPS.

Page 51: Save this book: posterity’s challenge

Most publishers are demonstrably willing to engage in discussion of efforts supporting the development of persistent heritage archives.

Requires hard political work conjoining some subset of copyright policies, the imperatives of cultural heritage, technical architecture.

Page 52: Save this book: posterity’s challenge

If nothing is done, and we cannot solve this problem for the simple digital books of the 20th Century, what are we going to do with the books of the 21st Century?

Page 53: Save this book: posterity’s challenge
Page 54: Save this book: posterity’s challenge

peter brantley

director, bookserver projectinternet archivesan francisco, ca

(twitter) @naypinya (slideshare)

peter at archive.org