an institutional repository for the university of pretoria ina smith dspace platform manager...
TRANSCRIPT
An Institutional Repository for the University of Pretoria
Ina SmithDSpace Platform Manager
University of Pretoria
20 May 2005DITCHE Conference & Workshop
Nelson Mandela Metropolitan University
Institutional Repository
• Set of services• Management & dissemination of digital
materials• Organisational commitment• Stewardship• Long-term preservation• Organisation, access, distribution
Approaches
• Digitally born resources• Digitisation
“Champions”
• Africana & Special Collections• Architecture• Scholarly Publications• Veterinary Science
Evaluation of software
• EPrints• Greenstone• DSpace• Fedora• MyCoRe• CDSware• ARNO• Innopac Millennium• WebCT
Why Open Source?
• Promote collaboration• Promote knowledge sharing• Benefit all – not only the vendor• Belongs to all – lots of support• No marketing
Why Open Source?
• Flexible• Openness and creative thinking• Open for scrutiny• Searchable & retrievable via the WWW• Used by institutions with minimal
resources• Uses world standards and open standards
Why DSpace?
• MIT & Hewlett Packard• Archiving & permanent (long-term)
preservation• Easy retrieval• Scalability• Supports full text• Good separation of data and metadata• Supports OAI
Why DSpace?
• Availability of code• Integration into portal• Well defined workflow• Lucene Search Engine• Taxonomic Structure• UP IT Architecture• Open source• GAP Authentication
DSpace Architecture
Three layers with components– Storage layer: Physical storage of
metadata and content– Business logic layer: Manage content,
e-people, authorisation, workflow– Application layer: Communicate with
outside, e.g. Web User Interface, Metadata Harvesting
National & International DSpace Support
• Informative & active web-site• 100+ instances running• Active mailing lists• Enthusiastic community of developers• DSpace wiki – sense of community,
visibility
DSpace Open Source Software License
• Open Software License V. 2.1http://www.opensource.org/licenses/osl 2.1.php
• Open Source Initiative (OSI)http://www.opensource.org/index.php
“ a non-profit corporation dedicated to managing and promoting the Open Source Definition for the good of the community, specifically through the OSI Certified Open
Source Software certification mark and program. “
DSpace for Digital Archiving
• Institutional Repositories• Learning Object Repositories• E-theses• Electronic Records Management• Digital Preservation• Publications
System Requirements
• Linux• Sun Java Compiler• Tomcat (Java servlet engine)• Ant Java build tool: generate .jar or .war files• PostgreSQL (Relational Database
Management System)• Mail Server
About DSpace
• DSpace Stable 1.2.2 May 2005• Version.major.minor• Minor releases: bug fixes, no migration of
data• Major releases: changes to database,
migration needed• Versions: restructuring of architecture,
significant data migration
Default Look & Feel
UPSpace Look & Feel
Information Model
Top-level Community (e.g. Faculty)
Sub-Community (e.g. Department)
Collections (e.g. Pretoriana, Freedom Struggle)
Top-level Communities
Top-level Community
Collections within Top-level
Community
Items in DSpace
• Articles (preprints & postprints) • Technical reports • Working papers • Conference reports • E-theses • E-books
Items in DSpace
• Datasets: statistical, geospatial, etc. • Images: visual, scientific, etc. • Audio files • Video files • Learning objects • Reformatted digital library collections
An Item in DSpace
• Metadata• Files (Bitstreams) with digital content• Many formats are accepted• Largest file 500 MB video (Infosys)• File formats e.g. xml, tiff, jpeg, wav, mov,
qt
Thumbnails
Roles in DSpace
• DSpace Administrator• Collection Administrators• Submitters• Reviewers• Metadata Editors• Approvers
Permission policies in DSpace
• Set permission policies for various collections (read, remove, add, write)
• E-person can be removed from Collection • Items in Collections can only be viewed by
authenticated persons
Workflow
Submit
Accept
Reject
Edit Metadata Commit DSpace
Submission
• Default Submission Interface• Community/ Collection Specific Interface• Mandatory fields: Title, Language• Distributed input
About Metadata
• Descriptive information about an object or resource
• E.g. title, subjects, keywords, author/s, date
Metadata in DSpace
• Qualified DC Metadata• 15 Elements + Qualifiers• Edit DC Registry• Add / delete elements/qualifiers (not
recommended)• Mandatory / Optional fields
Harvesting Metadata
• OAISTER• Search Engines e.g. Google
Bitstreams
• Bitstream Registry
Persistent Identifiers
• CNRI Handle System• Valid citations• Unique handle for each item• E.g. http://hdl.handle.net/1849/171
1849 = Naming authority assigned by handle system to e.g. UP
171 = Unique local name assigned to an item in the repository
Lucene Search Engine
• Simple, high performance, powerful search engine
• Open source• Features
– Boolean searches– Phrase and proximity searching– Relevancy ranking– Field searching– Advanced and basic search– Browse by Community/Collection, Author, Title,
Date
Managing the Project http://www.ais.up.ac.za/edu/dspace
Evaluation
Proposal
Needs Analysis
Design
Development
Implementation
Evaluation
Evaluation
• DSpace on UPDev• DSpace on UP Q&A• DSpace on Portal Server• Evaluation Instrument Program Usability• Evaluation Instrument Design
Reflection & Communication
• DSpace Listserve [email protected] • DSpace web www.ais.up.ac.za/edu/dspace • Update documents & policies• Meetings• Community of Practice
Marketing
• Meetings• UP Library Community • Faculties & Departments• UP Dept. of Marketing• UPSpace E-Newsletter• New staff orientation program• Online brochure• Statistics to Faculties• Support from UP Management
Training
• Information Specialists• Cataloguers• End-users
User Support
• Help landline 8/5• Online help 24/7
– Viewlets e.g. Macromedia Captivate, PPT, Open Office
• Tutorial• User policy• Pamphlet• E-mail [email protected]
Policy Issues
• Submission• Communities and Collections• Responsibilities & rights• Licensing, copyright, privacy, intellectual
property• Preservation support• Withdrawals• Workflow• System back-up and availability
Copyright, Rights & Licensing
• License stored with each item• Submitter “Grants License”• Default DSpace License• Community/ Collection License• Copyright note when creating Community• DC Element Rights• DSpace StyleSheet: © University of
Pretoria
IT Support & Back-up
• Dept. of Information Technology• Back-up report• DSpace• Master .tiff images
Limitations
• Limited resources e.g. scanner• Java expertise
The way forward …
• Federated Search Engine• Emerging IR software• Long-term budget• Migration of digital objects• Repository size• Bandwidth• Collaboration e.g. DISA• Collaboration between institutions
The way forward …
• Register with – Open Archives Institute– DSpace Wiki– OAISTER
Conclusions
• Great “digital” libraries will last• Comfortable with change, open source• Traditional library activities
Curator shipCollection DevelopmentQuality Assurance etc.
The WWW can be a library …
Success …
“Courage to explore
Knowledge to exceed
Technology to excel”
Thank you!
Tel.: +27 12 420 3082
Many thanks to TENET for this opportunity!