DE Conferentie 2006: Paul Wouters

Paul Wouters: Web archives for research - is there promise in them? (Web archieven voor onderzoek - zit daar muziek in?)

Upload: digitaal-erfgoedconferentie

Post on 29-Nov-2014


Page 1: DE Conferentie 2006 Paul Wouters

Paul Wouters

Web archives for research - is there promise in them?

Page 7

Reasons and dilemmas

Reasons:

• online activities

• no reliable archive available

• longitudinal research

• comparative research

• sharing Website data

• cultural heritage

Dilemmas:

• new archiving paradigm

• huge technical and financial task

• storage difficult challenge

• what to archive?

• how often to archive?

• at what level & granularity?

• future questions?

Page 8

Collecting multiple impressions of the “same” site over time

[Figure: impressions of a site at moments T1 to T4 along a "Time" axis, starting with the same base URL. Diagram labels: "Link across base URLs" and "Link across similar URLs".]
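As an illustration of the idea on this slide, captures of the “same” site can be grouped by base URL and then ordered in time. The sketch below is not taken from the talk: the function names, the example URLs and dates, and the decision to treat scheme plus host as the base URL are all assumptions made for the example.

```python
from collections import defaultdict
from datetime import date
from urllib.parse import urlsplit


def base_url(url: str) -> str:
    """Reduce a URL to scheme and host, dropping path, query, and fragment.

    Treating scheme + host as the "base URL" is an assumption of this sketch.
    """
    parts = urlsplit(url)
    return f"{parts.scheme}://{parts.netloc.lower()}"


def group_by_base(snapshots):
    """Map each base URL to its captures, sorted by capture date."""
    timeline = defaultdict(list)
    for captured_on, url in snapshots:
        timeline[base_url(url)].append((captured_on, url))
    return {base: sorted(items) for base, items in timeline.items()}


# Hypothetical captures; the dates stand in for the moments T1 to T4.
snapshots = [
    (date(2005, 3, 1), "http://example.org/news"),
    (date(2004, 3, 1), "http://example.org/"),
    (date(2006, 3, 1), "http://EXAMPLE.org/about?lang=en"),
    (date(2005, 9, 1), "http://example.net/"),
]

timeline = group_by_base(snapshots)
for base, captures in sorted(timeline.items()):
    print(base, [captured_on.isoformat() for captured_on, _ in captures])
```

Grouping by host alone would miss sites that move to a different address, which is where the slide's "link across similar URLs" comes in; that matching step is left out here because it depends on a research-specific notion of similarity.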

Page 9

Preserving bits, content, experience

• …bits – “retain the exact bit sequence of the original.”

• …content – “retain the content (e.g., the words in a text or the appearance of an image) but not the full interactive nature of a Web site.”

• …experience – “retain the entire experience of interacting with the digital material, including the look and feel, and execution of dynamic elements.”

Source: Arms et al., 2001: Collecting and Preserving the Web: The Minerva Prototype, http://www.rlg.org/preserv/diginews/diginews5-2.html#feature1
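The difference between the first two levels can be made concrete with a small check. The sketch below is an assumption-laden illustration, not part of the cited prototype: it treats "bits" as an identical SHA-256 digest and approximates "content" as identical visible text after whitespace normalisation. The function names and sample pages are hypothetical, and the "experience" level is not something a comparison of this kind can capture.

```python
import hashlib
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects the text nodes of an HTML document, ignoring the tags."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def visible_text(html: str) -> str:
    """Return the document's text with runs of whitespace collapsed."""
    collector = _TextCollector()
    collector.feed(html)
    collector.close()
    return " ".join("".join(collector.chunks).split())


def bits_preserved(original: bytes, archived: bytes) -> bool:
    """Bits level: both byte sequences have the same SHA-256 digest."""
    return hashlib.sha256(original).digest() == hashlib.sha256(archived).digest()


def content_preserved(original: str, archived: str) -> bool:
    """Content level (assumed definition): same words, markup layout may differ."""
    return visible_text(original) == visible_text(archived)


# Hypothetical original page and a copy whose markup was re-indented.
original = "<html><body><p>Hello,  Web</p></body></html>"
migrated = "<html>\n<body><p>Hello, Web</p>\n</body></html>"

print(bits_preserved(original.encode("utf-8"), migrated.encode("utf-8")))
print(content_preserved(original, migrated))
```

Here the re-indented copy fails the bits check while passing the content check, which is the distinction the quoted definitions draw.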

Page 10

In other words …

• The document may not be the best basic concept

• Both boundaries and links need to be considered

• Specific attention for design needs to be developed

• Web archiving lies at the heart of history and heritage

• Scholarship central both to creation and use of Web archives

Page 11

What is already clear (1)

• Dynamics of Web behaviour difficult to archive

• All technical puzzles (what/how/how often/storage) are also conceptual choices

• Hyperlink structure central feature

• Ways of searching the archives key issue

• Generic archive interface should not “blackbox” styles of research

Page 12

What is already clear (2)

• Web is highly skewed representation of culture

• Complex relationship between action & online representation

• Context still counts - the need for off-line studies

• Metadata critical, both human-made and machine-generated

• Creating Web archives entails shaping the historical record of cultural representation

Page 14

Elements of a research agenda

• balance broad and narrow crawls

• heritage archiving <-> research archiving

• relationship research styles and archive accessibility

• multilevel infrastructures to support small projects

• role of scholars in constructing culture

• ethics of Web archiving

• non-academic use of Web archives

• unexpected consequences (lock-ins)

Page 15

Last but not least ...

Serious Web archiving requires “grand coalition”