Presentation: Rose Holley
DESCRIPTION
1st International Seminar of the Library of Galicia. Digital Libraries
TRANSCRIPT
Rose Holley
Manager - Trove
National Library of Australia
1st International Seminar of the Library of Galicia: Digital Libraries
Santiago de Compostela 7-9 April 2011
Collecting, sharing and improving data: Changing roles for librarians and users
Friday 22 April 2011
Overview
• Changes in librarianship 1907-2010.
• New strategies for 2011.
National Library of Australia
• Australian Newspapers Digitisation Program.
• Trove single discovery service.
Women librarians 1907
Cite: http://nla.gov.au/nla.news-article14849217
Reference Librarian 1985
Arrival of the Internet 1998
Photo courtesy Genevieve Bell. Location: near Morgan, South Australia
Digitisation 2001
• Millions of items digitised by cultural heritage institutions
• Maps, photos, artworks, architectural plans, journals, archives, documents, books, newspapers, music.
Collaborative Delivery 2002
Single search vision (2002)
Search and Navigation Interface
Image Collections
Websites
Databases
E-Journals
Library Catalogues
Mass digitisation 2008
Digital Librarian 2010
• Digitising resources.
• Collecting/creating born digital objects.
• Making resources accessible online.
• Giving users online tools to interact with data, each other and support research.
• Encouraging addition of knowledge to resources and creation of new resources.
• Preserving digital objects.
The scope 2011…
• Digital AND non-digital
• Galleries, libraries, archives, museums (GLAM)
• Full-text (books, newspapers) GOOGLE
• User-generated content Flickr, YouTube, Wikipedia
Changing roles…
Technology has turned librarianship on its head:
Why libraries still matter
• Long term preservation and access
• No commercial motives
• Universal access
• “Free for all”
ALWAYS and FOREVER….
Libraries have:
Librarians who can open doors with
Who are the researchers?
• Once content is liberated anyone can become a ‘researcher’.
• The ‘ivory tower’ of gated and protected knowledge is gone.
• ‘Formal’ scholars are replaced by the crowd in the cloud.
• Today’s public are educated and engaged, demonstrated by their
What are their expectations?
“Self service, satisfaction and seamlessness are definitive of information seekers expectations. Ease of use, convenience and availability are equally as important to information seekers as information quality and trustworthiness.”
2003 OCLC Environmental Scan
To interact with content, other users and the organisation (web 2.0)
To be able to annotate content and contribute their own
Important Things
• Connections
• Linkages
• Related
• Context
• Sharing
• Re-purposing
• Mashing
• Adding
Giving users
• Access to resources
• Tools to do stuff
• Freedom and choices
• Ways to work collaboratively together
Where are the walls?
There are no walls, only bridges:
People outside your building are accessing information within it.
People inside your building are accessing information from outside.
Changing use of spaces.
Change institutional thinking
“Freedom is actually a bigger game than power. Power is about what you can control. Freedom is about what you can unleash.”
Harriet Rubin
Librarians are gatekeepers who need to focus on opening rather than closing doors….
New ways of developing services
Learning the ‘art of with’ Charles Leadbeater
Not to people
Not for people
WITH PEOPLE (USERS)
Public feedback should drive development of services:
CRITICAL, RELEVANT, INTERESTING, FUN
“Libraries need to think they are leading a mass movement, not just serving a
NLA Strategic Directions
“We will explore new models for creating and sharing information and for collecting materials, including supporting the creation of knowledge by our users.”
(not just NLA resources… all Australian content)
“The changing expectations of users that they will not be passive receivers of information, but rather contributors and participants in
2007 http://www.nla.gov.au/
National Program and
• Initial focus on major titles from each state and territory
• ‘Regional’ titles being contributed by libraries 2010 onwards
• Coverage: published between 1803 – 1954
West Australian
Northern Territory Times
Courier Mail
Advertiser
Sydney Morning Herald
Sydney Gazette
Argus
Mercury
Canberra Times
Aims
• Increase access to historic Australian newspapers
• Key Features
– Online Access
– Freely available
The Argus 12 October 1951
1803 to 1954
http://www.nla.gov.au/
Sydney Morning Herald
Finding missing pages not on
Australian Women’s Weekly
Building National
• Storage
• Newspaper Content Management system (digitisation workflow)
• Public delivery system
• Panel of digitisation contractors (mass digi)
• Quality assurance processes and team
Microfilm scanned into digital
Page sequence
Metadata creation
Missing page
Checking Pages
Tapes with digital images sent to India
Article zoning and categorising,
150 data operators Chennai
Final quality assurance checks
Articles go into public beta system
Text correction - testing user engagement
Greatest fears!
• No one will do it OR
• People will deliberately vandalise the text.
Questions?
• Moderation?
• Login?
• Integration of data?
Interaction at article level
Add a tag ‘titanic sinking’
Add a comment
Fix text – power edit mode
After enhancements
Text Correction Activity
Chart: lines corrected (millions, axis 0 to 15), Aug-08 to Apr-10
30 million lines January 2011
Public feedback on the
‘OCR text correction is great! I think I just found my new hobby!’
‘It’s looking like it will be very cool and the text fixing and tagging is quite addictive.’
‘An interesting way of using interested readers “labour”! I really like it.’
‘A wonderful tool - the amount of user control is very surprising but refreshing.’
‘I applaud the capability for readers to correct
Why do it?
• I love it
• It’s interesting and fun
• It is a worthy cause
• It’s addictive
• I am helping with something important e.g. recording history, finding new things
• I want to do some voluntary work
• I want to help non-profit making organisations like libraries
• I want to learn something
• It’s a challenge
• I want to give something back to the community
Achievements
March 2011 (2.5 yrs since release)
30,000+ volunteer text correctors
32 million lines of text corrected in 1.3 million articles
811,000 tags added
18,800 comments added
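A rough back-of-envelope reading of these figures can be sketched as follows. The averages are derived here and were not reported in the talk. They assume the rounded slide figures, treat "30,000+" as a lower bound, and take "2.5 yrs" as 30 months.

```python
# Figures as quoted on the slide (rounded; "30,000+" is treated as a lower bound).
correctors = 30_000
lines_corrected = 32_000_000
articles_corrected = 1_300_000
months_since_release = 30  # assumption: "2.5 yrs since release" taken as 30 months

# Derived averages: illustrative only, not figures reported in the talk.
lines_per_article = lines_corrected / articles_corrected
lines_per_corrector = lines_corrected / correctors  # an upper bound, since correctors >= 30,000
lines_per_month = lines_corrected / months_since_release

print(f"Lines corrected per article: {lines_per_article:.1f}")
print(f"Lines corrected per volunteer (at most): {lines_per_corrector:,.0f}")
print(f"Lines corrected per month: {lines_per_month:,.0f}")
```

These are plain means and say nothing about how the work is distributed among volunteers.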
Significant newspaper research
• Climate change
• Influenza in Australia
• Australian words and first usage e.g. ‘jumbuck’
• Dating early colonial music
• Building of railways and tramways
• Convicts and outlaws
Trove – single search 2009
Migrate NLA discovery services into Trove:
• Australian Newspapers
• Picture Australia
• Australian Research Online
• Libraries Australia
• Register of Archives and Manuscripts
• Australia Dancing
• Music Australia
• PANDORA
Single search vision (2002)
Search and Navigation Interface
Image Collections
Websites
Databases
E-Journals
Library Catalogues
Browse zones
Single search
Restrict search
Refine/limit search results
Get item
Groups results in zones
Use APIs for Wikipedia, Amazon, Google video…
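Grouping a mixed result set into zones can be sketched as below. This is an illustration only, not Trove's implementation: the record fields (`title`, `zone`), the zone names and the sample records are all hypothetical.

```python
from collections import defaultdict


def group_by_zone(results):
    """Group a flat list of result records by their 'zone' field, keeping input order."""
    zones = defaultdict(list)
    for record in results:
        zones[record["zone"]].append(record["title"])
    return dict(zones)


# Hypothetical records returned by one search across several collections.
results = [
    {"title": "Titanic sinking report", "zone": "newspapers"},
    {"title": "Photograph of a liner", "zone": "pictures"},
    {"title": "Shipping news", "zone": "newspapers"},
]

for zone, titles in group_by_zone(results).items():
    print(f"{zone}: {len(titles)} result(s)")
```

A single query then yields one short list per zone, which the user can browse or restrict, instead of one undifferentiated list.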
Features
Tag, comment, list, send link to, cite, check copyright
Trove Strategy 2010-2011
1. Grow
2. Develop
3. Engage
4. Promote
1. Grow – existing
Content Collectors
1100 organisations:
• Libraries
• Museums
• Galleries
• Archives
Open sources
• Open Library (Internet Archive)
• Hathi Trust
• OAISTER
Targets – websites
• Amazon
• Wikipedia
• Google Books
• YouTube
120 million items
Content Creators
Australian Broadcasting Commission
Grow – new contributors
• Large aggregators e.g. Atlas of Living Australia, Bio-Diversity Heritage Library – Australian node
• Large Australian cultural institutions especially museums and archives
• National Libraries with Australian content e.g. UK, New Zealand.
• Collection specific e.g. Australian sport
2. Develop
• Agile development based on user feedback
• In 2010 - 17 new releases v1-v3
• Usability testing
• IT team of 5
– 2 Programmers
– Business analyst
– Web developer
– IT Manager
Version 4: April 2011
New homepage
Access to subscription e-journal content
‘Contribute’ has greater prominence
3. Engage: with content and each
User generated content via Flickr: objects
User generated content:
http://trove.nla.gov.au/work/37255844 By Nomad Tales
http://trove.nla.gov.au/work/37288101 Flexigel
Family photos – identify people
Context – Tools - Lists
Personal List to record your finds and add notes
Institutional list for virtual
Educators List – Teaching aid
Alerting to new content
Text correctors - Hall of
Profile - overall ranking and history
Wikipedia citation style
Lionel Logue – The King’s Speech
Wikipedia links to Trove sources
Wikipedia links
Feedback Christmas Day
3000 comments and feedback received in 2010
User Forum
Trove Blog
Trove Tweets
New Year's Eve 2010
Public raise money for
Rockhampton ‘Trovers’
http://climatehistory.com.au This landmark project, spanning the sciences and the humanities, draws together a team of leading climate scientists, water managers and historians to better understand south-eastern Australian climate history over the past 200–500 years. It is the first study of its kind in Australia.
Re-purposing information and sharing
Blog using newspaper articles
http://lynnwalsh.wordpress.com
http://
4. Promote use
Spikes caused by media
http://www.abc.net.au/news/video/2010/04/29/2885984.htm
Trove screencasting on
Trove promotional video
Incoming Trove traffic
Google 70%
Referrals: Bing, Yahoo, Wikipedia, NLA sites 16%
Direct 14%
January 2011
Trove dependent on…
Collaboration across cultural heritage institutions (digitisation, storage, service delivery, crowdsourcing, standards).
Data sharing
Being ‘open’ e.g. OAI, APIs
Changing institutional strategic thinking from power/control to freedom
New ideas and revisiting old ideas
Rose
The site you manage is a nightmare! It’s addictive. Keeps me awake at night. Congratulations!
Mary
Trove finds the pieces and puts them together for you