deconstructing cataloging a web services approach to bibliographic control thomas hickey

24
Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

Upload: luke-bird

Post on 27-Mar-2015

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

Deconstructing Cataloging

A Web Services Approach to

Bibliographic Control

Thomas Hickey

Page 2: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 20022

What is Deconstruction?

• Allowing access to what are now monolithic systems

• Do it using standard protocols

Page 3: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 20023

Why?

• Metadata being created in many places

• Even within libraries– Digitization projects seldom involve

cataloguers– MARC21 isn’t used

Page 4: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 20024

Why Not?

• How things work now– Tech processing departments– Books, serials

• Limitations– MARC21– Cataloguers– Integrated library systems

Page 5: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 20025

How could we do it differently?

• Deconstruct integrated systems

• Expose the pieces with Web services

• Move metadata with standard protocols

• Goal: use Web services

to build new, flexible systems

create metadata anywhere

Page 6: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 20026

What are Web Services?

• Machine-to-machine communication• Run over standard Web protocols

– XML syntax, HTTP packaging

• Several approaches– REST

• Representational State Transfer

– SOAP• Simple Object Access Protocol

Page 7: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 20027

REST

• REST is simplest– Send an HTTP request– Get back a response (XML, HTML)– Examples

• Web browser• OpenURL• Z39.50 as a Web service: SRU

http://sfx.bath.ac.uk/sfxmenu?genre=book&isbn=1234-5678

http://www.kb.nl/cgi-zoek/srw.pl?query=test&maxRec=10&recSchema=dc_record

Page 8: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 20028

REST: OAI-PMH

• Now in version 2.0

• Allows synchronization of databases

• Other services can be layered– ODL: Open Digital Libraries– XTCat name authority service

Page 9: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 20029

SOAP is more complex

• Simple Object Access Protocol

• Z39.50 as a SOAP service: SRW– http://www.loc.gov/z3950/agency/zing/srwu/wsdl/zing_srw_binding.wsdl

Page 10: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200210

What Could be Deconstructed?

• Classification services

• Name authority services

• Catalogue subsets

• Search (via SRU/SRW)

• Holdings

• Duplicate resolution/FRBR

• Rights management

Page 11: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200211

An Example: ePrints UK

• Records harvested from archives

• Web services to enhance metadata

Page 12: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200212

Institutionale-printarchives

Non-institutionale-printarchives

Personale-printarchives

SubjectClassification

Service

NameAuthorityService

CitationAnalysisService

RDNGateway/portal

Service

RDNGateway/portal

Service

RDNGateway/portal

Service

OAI-PMH

SOAP

SOAPHTTPZ39.50

e-Prints UK

Page 13: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200213

Web Services for ePrints UK

• Personal name authorities

• Subject classification

• Citation analysis

Page 14: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200214

Subject Classification Service

• Receives metadata, full text

• Sends back– List of DDC numbers, captions– Subject headings

Page 15: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200215

Classification Service

DocumentText

Metadata

PDF Document

e-Prints UK

ClassificationService

ClassificationMetadata

Page 16: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200216

Name Authority Service

• Receives metadata record, full text

• For each name, sends back – Authorized form– URI representing persona

• If in interactive mode:– List of possibilities

Page 17: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200217

Example: XTCat

• Experimental Theses Catalog

• Union catalog of thesis and dissertation metadata

Page 18: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200218

Example: XTCat

• Records extracted from WorldCat

• Available via OAI-PMH– Separate ‘set’ for electronic theses

• We harvest from other repositories

• Integrate with existing records

• Allow re-harvesting (e.g. by VTLS)

Page 19: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200219

FRBR Services

• Send in list of metadata records– Get back FRBRized list– Get back Uri's to works

Page 20: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200220

Other Considerations

• More complicated flows

• Services in records

• Getting Google in the picture

Page 21: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200221

Two-Way flows

• Union catalog could harvest– Integration done for union catalog

• Local systems could harvest– Use enhanced record

• All movement with OAI-PMH

Page 22: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200222

Embedding Services in Records

• Name authority

• Library– Providers– Requests

• Records stand on their own

• Allow branding

Page 23: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200223

Allowing General Web Harvesting

• HTML Tree

• Services lead back to libraries

Page 24: Deconstructing Cataloging A Web Services Approach to Bibliographic Control Thomas Hickey

T. Hickey Access 200224

Questions?