the impact of linked data in digital curation and application to the cataloguer's workflow
TRANSCRIPT
The Impact of Linked Data in Digital Curation and Application
to the Cataloguer’s WorkflowJenny JingErin TrippIan BigelowStacey BoileauDanielle Emon
Access 2015TorontoSept. 10, 2015
http://www.ekoob.com/tag/linked-data/
Library Staff & Linked Data
Managers
Metadata staff
IT staff
Strategy/ Resource
Skills / WorkflowSkills / Workflow
What/Why
How
How
Interface, System, Data & Linked Open Data
System/AdminInterface/Web Data/DB
Data: Queries, Perl, XML,DB: Oracle, MySQL, AccessTools: Perl, Excel, XSLT
Programing: php, Javascript, html, etc.Publish Tools: Drupal, WordPress
Server: Shell scripting, config, maintain, etc. OS: UNIX, WindowsEditor tools: Vi, Pico
Linked Data
Web pages
ILS
DAMPreservation
IR Local dbs
DS
Mobile Access Full Text
Library Data(metadata)
Library Data: Where, What, How to control?
Metadata Schema
What is Linked Data?
• Use URIs • Use HTTP URI: http://viaf.org/viaf/xxx• Use open standards such as RDF,SPARQL• Refer to other things: Triples
• The Semantic Web: Promotes common data formats on the web.
• The four principles of Linked Data
Why is it important for libraries? • Librarians: We put information in Information Technology.
• Add value to information:• Create, update• Store, preserve• Index, catalog• Retrieve, access
Catalog web pagesFindability, data reuse
Authority Control
Software Infrastructure/DCPlatformsILS: Voyager, AlephILL, Etc.
Link resolver : 360 Link library guides: LibGuidesOnline databases
IR (Scholarly Publications): DSpaceDAM (Digital Preservation): FedoraWebsites: Library websites
Digital Curation & Linked Data
Bring traffic? Add AU control? Catalog/find them easily?
Linked Data can:
• Web traffic: 80% of hits from Google Home page redesign
• Data reuse: by other web applications • Authority control: subject headings
with URIs. Don’t have to pay vendors.• BIBFRAME: edit records faster
Digital Curation, Data Formats & sample works
RDF/XMLMARC
Linked Data
Query: SQL, Access Programming: VB (BatchCat), PerlTools: MarcEditSystem: UNIX
Query: XQuery, Programming: XSLT, Perl, JavascriptTools: OxygenSystem: UNIX
Linked Data: LifecycleInter-
linkingEnrichment
Analysis
Evolution
Discovery
Extraction
Querying
Authoring
1.Extraction2. Querying 3. Authoring4. Interlinking5. Enrichment 6. Quality Analysis 7. Evaluation 8. Discovery
Search/Browse/ Exploration
http://www.slideshare.net/JensLehmann/ld-lifecycle
Linked Data in Digital Curation: IR & DAM
Data, Linked Data & Library Services
Acknowledgements & Resources• Robert Chavez, Senior Content Solutions Architect, Massachusetts Medical Society• Alison Hitchens, Cataloguing & Metadata Librarian, University of Waterloo Library• Jodi Schneider, Digital Enterprise Research Institute, National University of Ireland• Sharon Farnel, Metadata Coordinator, University of Alberta Libraries
Resources: http://linkeddata.org/faq• http://tomheath.com/papers/bizer-heath-berners-lee-ijswis-linked-data.pdf• Fedora 4.0 in Action at Penn State and Stanford• Oregon Digital , UNLV's Linked Data Project• Linked Data for Libraries (Cornell, Harvard & Stanford) • http://jingjenny.wordpress.com/linked-data/
Linked Open Data & Digital Repositories
Erin Tripp, discoverygarden [email protected] @eeohalloran
With files from: Frits van Latum, TU Delft Project Manager
Linking Digital Content in Islandora
TU Delft Repository ExampleNavigate here to browse the repository:
http://bit.ly/1Nfy5GW
TU Delft Repository Example
Making the Connection Content types:
● Buildings, ● Maps, ● Images, ● Films, ● Documents, ● Persons, ● Organisations ● AAT Concepts (from
AAT)● Geoname Concepts
http://bit.ly/1JEW1gA
Making the Connection ● Extra RELS EXT triples
between these objects
○ Triple = object + predicate + subject
○ Generation of triples was custom coded by discoverygarden (Delftdora module)
Making the Connection
Impact on Users Approach has potential ● Aid intuitive discovery of
repository objects● Promote re-use of
objects● Generate and encourage
the discussion about the significance, management, and the use of materials.
http://bit.ly/1i1UNVY
For More Information ● The Colonial Architecture & Town Planning Repository,
http://colonialarchitecture.eu/
● In depth presentation from TU Delft Project Manager Frits van Latum, http://bit.ly/1VDeQs7
● The discoverygarden Website, http://www.discoverygarden.ca/
● Contact Erin Tripp, [email protected]
THE OCLS CATALOGUING WORKFLOW
THE DEVELOPMENT OF A FRBR BASED RDA WORKFLOW FOR THE ONTARIO COLLEGES
STACEY BOILEAU, DANIELLE EMON, & IAN BIGELOW ON BEHALF OF: OCLS AND BSWG
WELCOME TO JURASSIC MARC!
LINKED DATA• LINKED DATA WILL BE A NEW REALITY FOR LIBRARIES. THIS MAY BE THROUGH
BIBFRAME, OR OTHER APPLICATIONS (SCHEMA.ORG, ALIADA … LOCAL DEVELOPMENT).
“RDA REPRESENTS A REVISION TO AACR2 THAT IMPLEMENTS FRBR CONCEPTS AND INCORPORATES FRBR TERMINOLOGY. THE CONVERGENCE OF RDA AND FRBR WILL FOCUS CATALOGING, OR RESOURCE DESCRIPTION, ON THE RESOURCES’ RELATIONSHIPS WITH EACH OTHER AND STEER THE PROCESS OF RETRIEVAL AND ACCESS TOWARD NAVIGATING LINKS THROUGH A HIERARCHY OF RELATIONSHIPS. THIS CHANGE IN EMPHASIS POSITIONS LIBRARIES TO PARTICIPATE IN THE EMERGING SEMANTIC WEB.” (SZETO, 2013, P. 306)
••HOWEVER, RDA WILL ONLY POSITION LIBRARIES TO PARTICIPATE IN THE SEMANTIC WEB IF WE EMBRACE THESE ASPECTS OF RDA.
TIME TO MARC THE IMPORTANCE OF LINKED DATA FOR LIBRARIES!•THROUGHOUT THE DEVELOPMENT OF THE OCLS CATALOGUING WORKFLOW THERE WAS A CERTAIN TENSION EVIDENT BETWEEN DETAILING CURRENT PRACTICE AND PLANNING FOR FUTURE CHANGES. IN CREATING A NEW RDA WORKFLOW, IT SEEMED PRUDENT TO ALSO TAKE SOME SMALL STEPS IN ADAPTING PRACTICE FOR LINKED DATA.
“THE RESOURCE DESCRIPTION AND ACCESS (RDA) STANDARD ATTEMPTS TO BRIDGE THE GAP BETWEEN LEGACY LIBRARY DATA AND LINKED DATA, BUT MOST CATALOGERS CONTINUE TO CREATE MARC RECORDS IN TRADITIONAL CLOSED LIBRARY DATABASES. IN THIS TRANSITIONAL ENVIRONMENT, CAN CATALOGERS DO SOMETHING MORE TO MAKE THEIR DATA SMARTER AND RICHER? IN SHORT, WHAT CAN THE PRESENT-DAY CATALOGER DO TO “PREPARE THE WAY” FOR FUTURE DATA NEEDS?” (SEEMAN & GODDARD, 2015, P. 331)
IN ORDER TO PREPARE FOR FUTURE BIBLIOGRAPHIC METADATA NEEDS, WE NEED TO:
1. ACKNOWLEDGE THAT MARC WILL BE HERE FOR SOME TIME TO COME…AND THUS PROVIDE INSTRUCTION ON USING RDA IN A MARC ENVIRONMENT
2. AT THE SAME TIME MARC WILL LIKELY EVOLVE OR BE REPLACED
-INCORPORATE MARC AND EXAMPLES IN A MODULAR WAY AS MUCH AS POSSIBLE, SUCH THAT THE WORKFLOW STANDS SEPARATELY (FROM MARC) AND CAN MORE READILY BE ADAPTED IN FUTURE.-BUILD IN TRAINING ON RDA, FRBR, AAP AND OTHER CONCEPTS AND FUNDAMENTALLY ALTER THE APPROACH OF RESOURCE DESCRIPTION FROM DESCRIBING RECORDS TO DESCRIBING METADATA ELEMENTS.
DON’T PANIC!
• MARC IS NOT COMPLETELY DISJOINT FROM LINKED DATA CONCEPTS• LINKED DATA MAY WELL COME IN STAGES:
1. MARC (PRESENT)2. MARC TRANSFORMED INTO RDF SO LIBRARY HOLDINGS CAN BE MADE
VISIBLE ONLINE (PRESENT/NEAR FUTURE)3. LIBRARY METADATA NATIVE TO THE WEB (FUTURE )
• SO, WE NEED TO WORK WITH MARC IN SUCH A WAY BUILD FUTURE COMPATIBLE LIBRARY METADATA!
LIBHUB INITIATIVE
CORE SERVICES
CUC / COLLECT *
Eresource License Management
Contract Management with selection, acquisition and cataloguing service providers
Research
ONTARIO COLLEGES LIBRARY SERVICE AND THE 24 ONTARIO COLLEGES
OPT-IN SERVICES
Management of consortial ILS (Symphony)
Remote Access
Federated Search
Video On Demand
askON Virtual Reference Service
Professional Services
● Under the direction of the CUCLDRSG, Ontario Colleges Library Service manages and maintains the College Union Catalogue
● BSWG with the support of OCLS maintains the minimum standards for records being uploaded into the union catalogue
● Various sizes and systems - Symphony, Mandarin, Evergreen, Voyager, Surpass Centriva
RDA has arrived. Now what?
HTTP://LITTLELANTERNPRESS.COM/WP-CONTENT/UPLOADS/2015/02/ROLLERSKATES.JPG
OUR INITIAL PROBLEM:● INTRODUCTION OF RDA POSED CERTAIN TRAINING CHALLENGES● ORIGINAL CATALOGUING MANUAL MARC DRIVEN -- AACR2 ● HEAVY EMPHASIS ON DESCRIPTION AT MANIFESTATION LEVEL● LINEAR, TEMPLATE DRIVEN WORKFLOW MADE IT DIFFICULT TO SUPPORT
UNDERLYING MODEL FROM WHICH RDA WAS BUILT
QUESTIONS WE ASKED OURSELVES:
1. COULD WE PROVIDE AN UPDATED MANUAL / WORKFLOW THAT WOULD HELP FRONTLINE STAFF FURTHER UNDERSTAND AND ADOPT THE UNDERLYING FRBR MODEL FROM WHICH RDA WAS DEVELOPED?
2. OUR SYSTEMS ARE MARC 8. INSPIRED BY WORK DONE BY LC, COULD WE DEVELOP A MANUAL THAT SUPPORT A FRBR BASED WORKFLOW WHEN THE EXISTING ENCODING ENVIRONMENT WAS LIFTED AWAY?
OUR JOURNEY:● THINKING ABOUT CURRENT FRONTLINE CATALOGUING
WORKFLOWS, OUR INITIAL EFFORTS EXPLORED THE PATH OF DESCRIBING THE MANIFESTATION IN HAND, THEN TRANSCRIBING ELEMENTS FOR WORK AND EXPRESSION AND BUILDING APPROPRIATE RELATIONSHIPS
INITIAL INSPIRATION GUIDED CATALOGUER FROM MANIFESTATION TO WORK & EXPRESSION:
● CAMBRIDGE, ● BRITISH LIBRARY AND THE ● PAN-CANADIAN WORKING GROUP ON RDA
AN EYE TO THE FUTURE:
● COULD WE PRESENT OUR WORKFLOW IN A WAY THAT WOULD HELP TO PREPARE US FOR CATALOGUING IN LINKED ENVIRONMENTS?
● THE WEMI APPROACH WAS ADOPTED IN RECOGNITION OF ITS FUTURE POTENTIAL FOR WORKFLOW ADVANTAGES IN LINKED ENVIRONMENTS
STEP 1: DESCRIBE THE WORK, CREATE AAP FOR WORK & RECORD RELATIONSHIPS
STEP 2: DESCRIBE THE EXPRESSION, CREATE AAPS FOR EXPRESSION & RECORD RELATIONSHIPS
STEP 3: DESCRIBE THE MANIFESTATION, CREATE AAPS FOR MANIFESTATION & RECORD RELATIONSHIPS
STEP 4: ITEM
Focus on relationship building opportunities at each stage of the workflow with generous use of authority data …for now.
UPCOMING IMPLEMENTATION HURDLES:
● CURRENT CATALOGUE RECORDS HEAVILY DESCRIBE ELEMENTS OF MANIFESTATION, AND BY THEIR VERY NATURE CAUSE US TO ‘KEEP OUR HEADS’ IN THIS PARADIGM
● TEMPLATES/SYSTEMS/CODING ENVIRONMENTS CURRENTLY USED REINFORCE LINEAR MARC DRIVEN APPROACH TO CATALOGUING
● PRE-CAT PROCESS OF CHECKING FOR CATALOGUING COPY CHECKS FOR ELEMENTS OF MANIFESTATION -- THE CATALOGUER WILL NEED TO ‘SHIFT HIS VIEW’ TO CONSIDER ELEMENTS OF WORK AND EXPRESSION AS THEY COMPLETE DERIVATIVE CATALOGUING TASKS
BENEFITS: ● REGARDLESS OF ENCODING OR LINKING ENVIRONMENT ... WEMI WORKFLOW WILL PROMPT
FRONTLINE STAFF TO ASK THE BIG QUESTIONS UP FRONT AS THEY DESCRIBE RESOURCES. I.E., WHAT RELATIONSHIPS ARE IMPORTANT AS I CATALOGUE THIS RESOURCE?
● WHAT RELATIONSHIPS THAT ARE IMPORTANT TO USERS AS THEY IDENTIFY AND SELECT INFORMATION
● EVENTUAL SHIFT FOCUS FROM TRANSCRIBING ITEM HAND TO BUILDING RELATIONSHIPS. TO EXPAND ON SEEMAN & GODDARD -- CONCENTRATE ON ASPECTS OF LEGACY DATA THAT ARE “REUSABLE IN LINKED DATA”
● BY BUILDING A WEMI WORKFLOW, WE ENCOURAGE CATALOGUERS TO THINK ABOUT AUTHORITIES AND CONTROLLED ACCESS POINTS AT EVERY LEVEL
Welcome to the OCLS CATALOGUING WORKFLOW!
HTTP://PHOTOS1.BLOGGER.COM/BLOGGER/1505/1307/320/A%20STACK-OF-PAPERS.JPG
● Online + Print
● 10 FormatsBookeBookSerialElectronic SerialDVDStreaming VideoCD, AudioStreaming AudioWebsitesSoftware / Objects
Two Functions:
1. Process / Workflow
2. Reference / Training
PROCESS / WORKFLOW
● Step by step
● How to use guide for copy cataloguing, derivative cataloguing and original cataloguing
PROCESS / WORKFLOW
● WEMI STRUCTURE AND ORDER
● EACH WEMI ELEMENT REPRESENTS A STEP
● EACH WEMI ELEMENT HAS AN INDEX
PROCESS / WORKFLOW
● EACH ELEMENT HAS A CORRESPONDING LINK TO THE RDA RULE IN THE TOOLKIT AND THE MARC FIELD INSTRUCTION
● INDICATES WHETHER THE ELEMENT IS REQUIRED
● DEFINITION
● SOURCE OF INFORMATION
PROCESS / WORKFLOW
● Step by step how to record with illustrative examples
● Outline of any OCLS policy
● Examples for the pertinent physical formats and/or special format instructions
PROCESS / WORKFLOWDIFFERENT ENTRANCE POINTS
Adapted from Business Mazes by Joni Farthing, Hart-Davis, 1981
● Linear Beginning to End
● How to Use Guide
● MARC Index
OCLS CATALOGUING WORKFLOW AS A REFERENCE DOCUMENT
OCLS CATALOGUING WORKFLOW AS A REFERENCE DOCUMENT
WHAT DOES THIS MEAN FOR US?!“CATALOGERS MUST CREATE DATA IN MARC THAT WILL ALLOW IT TO BE BATCH PROCESSED INTO RDF AUTOMATICALLY WITH THE LEAST AMOUNT OF DIFFICULTY. HERE AGAIN THEY CAN LOOK TO THE FUNDAMENTALS OF DATA QUALITY AND REQUIRE THAT THE DATA PRODUCED DURING BATCH PROCESSING WILL BE DISCRETE, CONSISTENT, AND SEMANTICALLY UNAMBIGUOUS.” (SEEMAN & GODDARD, 2015, P. 335)
1. OVERALL, FOCUS ON DESCRIBING RESOURCES WITH A NUMBER OF RELATED ELEMENTS, RATHER THAN BIBLIOGRAPHIC RECORDS.
2. EMPHASIZE THE USE OF AUTHORIZED ACCESS POINTS3. USE IDENTIFIERS WHENEVER POSSIBLE4. LEVERAGE THE USE OF RELATIONSHIPS IN RDA AND MARC
WHEREVER POSSIBLE
DESCRIBE ELEMENTS, NOT RECORDSWHEN YOU BREAK SOMETHING DOWN TO COMPONENT PARTS, AND ENSURE THAT STANDARD ESTABLISHED LINKS ARE USED TO CONNECT VARYING SETS AND PIECES, THE SKY IS THE LIMIT TO THE FUN YOU CAN HAVE!
LET’S ASSUME THAT THE TRACK IN THIS PICTURE IS NAILED TO THE TABLE – THIS CAN BE LIKENED TO CURRENT CATALOGUING PRACTICE. WE DESCRIBE OUR “RECORDS” AS A WHOLE, SUCH THAT IT IS DIFFICULT TO PULL APART COMPONENT ELEMENTS OR LINK OR PLAY WELL WITH OTHER METADATA SETS OR STANDARDS.
RELATIONSHIPS, AAP & IDENTIFIERS“THE CREATION OF AUTHORITIES FURTHERS THE CAUSE OF CREATING DIFFERENTIATED VALUES, AS UNIQUE TEXTUAL HEADINGS ARE CONSTRUCTED TO CLEAR AWAY SEMANTIC AMBIGUITY AND MAKE DISTINCTIONS BETWEEN ENTITIES CLEAR.” (SEEMAN & GODDARD, 2015, P. 335)
THE OCLS CATALOGUING WORKFLOW EMPHASIZES THE USE OF AAP, IDENTIFIERS AND RELATIONSHIPS THROUGHOUT, BUT IN PARTICULAR WE HAVE TRIED TO OUTLINE DESCRIPTION OF RELATIONSHIPS IN A MORE COMPREHENSIVE WAY.
LINKED DATA FOR LIBRARIES (HTTP://EXPERIMENT.WORLDCAT.ORG/ENTITY/WORK/DATA/1840249565) HAS AUTHOR (HTTP://ID.LOC.GOV/VOCABULARY/RELATORS/AUT) HOOLAND, SETH VAN (HTTP://ID.LOC.GOV/AUTHORITIES/NAMES/NO2014097600)
LINKED DATA FOR LIBRARIES → HAS AUTHOR → HOOLAND, SETH VAN
THIS IS STARTING TO LOOK LIKE A TRIPLE!
When the track changes, it would be best if our metadata and cataloguers are prepared (We want outcome B)
A.B.
ACKNOWLEDGEMENTSMANY THANKS TO ALL THE AUTHORS OF THE OCLS RDA WORKFLOW AND FOR ALL THE SUPPORT FROM HLLR, OCLS AND THE BSWG.
AUTHORS: IAN BIGELOW, STACEY BOILEAU, DANIELLE EMON, ROSINA LEUNG, MARINA MORGAN, IRENE SILLIUS
SUPPORT: HEADS OF LIBRARIES & LEARNING RESOURCES (HLLR)
OWNER/ISSUING BODY: ONTARIO COLLEGES LIBRARY SERVICE (OCLS)
STANDARDS/APPROVING BODY: BIBLIOGRAPHIC STANDARDS WORKING GROUP (BSWG) / JOINT RDA IMPLEMENTATION SUBCOMMITTEE (JRDA)
WITH ACKNOWLEDGMENTS TO DORIS RANKIN & SALLY PRESS FOR EARLY PROJECT CONTRIBUTIONS
BIBLIOGRAPHYALEMU, G., STEVENS, B., ROSS,, P., & CHANDLER. (2012). LINKED DATA FOR LIBRARIES: BENEFITS OF A CONCEPTUAL SHIFT FROM LIBRARY-SPECIFIC RECORD STRUCTURES TO RDF-BASED DATA MODELS. NEW LIBRARY WORLD, 113 (11/12), 549. doi:10.1108/03074801211282920
AMERICAN LIBRARY ASSOCIATION. (2010). RDA TOOLKIT. CHICAGO, ILL: AMERICAN LIBRARY ASSOCIATION. RETRIEVED FROM HTTP://ACCESS.RDATOOLKIT.ORG/
CLARKE, R. I. (2015). BREAKING RECORDS: THE HISTORY OF BIBLIOGRAPHIC RECORDS AND THEIR INFLUENCE IN CONCEPTUALIZING BIBLIOGRAPHIC DATA. CATALOGING & CLASSIFICATION QUARTERLY, 53(3/4), 286-302. DOI:10.1080/01639374.2014.960988
COYLE, K. (2013, JULY 23). LINKED DATA FIRST STEPS & CATCH-21 [BLOG POST]. RETRIEVED FROM: HTTP://KCOYLE.BLOGSPOT.CA/2013/07/LINKED-DATA-FIRST-STEPS-CATCH-21.HTML
HILARIO, A. B. R., FERNANDEZ, T.R. AND CAMPO, D.M. (2014). FROM BIBLIOGRAPHIC RECORDS TO DATA: CHANGES IN THE LIBRARY ENVIRONMENT WITH THE APPLICATION OF LINKED OPEN DATA TECHNOLOGIES. INFORMATION RESOURCES MANAGEMENT JOURNAL, 27.3 (JULY-SEPTEMBER 2014).
BIBLIOGRAPHY CONT’DMCCALLUM, S. (2015). BIBFRAME UPDATE. Library of Congress. https://www.youtube.com/watch?v=0-B0sUhGnKo
MILLER, E. (2014, DECEMBER). MOVING FROM MARC: HOW BIBFRAME MOVES THE LINKED DATA IN LIBRARIES CONVERSATION TO LARGE-SCALE ACTION. PRESENTATION AT THE SEMANTIC WEB IN LIBRARIES CONFERENCE, BONN, GERMANY. SLIDES RETRIEVED FROM HTTP://SWIB.ORG/SWIB14/SLIDES/MILLER_SWIB14_57.PDF
SEEMAN, D., & GODDARD, L. (MAY 19, 2015). PREPARING THE WAY: CREATING FUTURE COMPATIBLE CATALOGING DATA IN A TRANSITIONAL ENVIRONMENT. CATALOGING & CLASSIFICATION QUARTERLY, 53, 331-340.
STAHMER, C. (2015). LINKED DATA IN THE LIBRARY WORKFLOW ECOSYSTEM. https://www.youtube.com/watch?v=6MEyLLQshJY
SZETO, K. (2013). POSITIONING LIBRARY DATA FOR THE SEMANTIC WEB: RECENT DEVELOPMENTS IN RESOURCE DESCRIPTION. JOURNAL OF WEB LIBRARIANSHIP, 7(3), 305-321. DOI:10.1080/19322909.2013.802584
MORE INFORMATION ON LINKED DATA INITIATIVES FOR LIBRARIES:
• BIBFRAME HTTP://WWW.LOC.GOV/BIBFRAME/• SCHEMA BIBEX HTTPS://WWW.W3.ORG/COMMUNITY/SCHEMABIBEX/• OCLC LINKED DATA WEBSITE
HTTPS://WWW.OCLC.ORG/DEVELOPER/DEVELOP/LINKED-DATA.EN.HTML• OCLC LINKED DATA RESEARCH
HTTP://WWW.OCLC.ORG/RESEARCH/THEMES/DATA-SCIENCE/LINKEDDATA.HTML• WC3 LINKED DATA HTTP://WWW.W3.ORG/STANDARDS/SEMANTICWEB/DATA• LIBHUB INITIATIVE HTTP://WWW.LIBHUB.ORG/• LINKED DATA WEBSITE HTTP://LINKEDDATA.ORG/HOME• ALIADA HTTP://WWW.ALIADA-PROJECT.EU/