oclc research lorcan dempsey vp research, oclc february 2004 (see next slide for where this...

41
OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Upload: isaiah-thompson

Post on 27-Mar-2015

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

OCLC Research

Lorcan Dempsey

VP Research, OCLCFebruary 2004

(see next slide for where this presentation was given)

Page 2: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Different versions of this presentation were given at the following meetings:

OCLC Australian Advisory CouncilMelbourne, February 1, 2004

National Library of Australia, Canberra, February 6, 2004

OCLC Members’ CouncilDublin, Ohio, February 9, 2004

Page 3: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Overview

• MARC 21

• MARC-XML, MODS, Dublin Core, Onix, LOM

• EAD, TEI, DC, MARC

• METS, SCORM, DIDL, …

• DDI, FGDC, ..

• MARC AMC, EAD, DC, RSLP

• OAIS, METS, OCLC/RLG, …

• Z39.50, SRU/W, Xquery, …

• SOAP, WSDL, UDDI, …

• GIF, TIFF, PNG, JPEG, …

• XML, RDF, DAML+OIL, ..

• DDC, LCSH, LCC, TGN, AAT, …

• PURL, DOI, ISTC, URN, ERROL, POI, …

• XRML, ODRL, ..

• ZTHES, VDEX, TIF, ..

Page 4: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Research possibilities …

• .. are endless!

• Becoming more complex as more activities enter a network space.

• Focus …– on maximizing impact of a limited

resource. – on where can make an internal and

external impact.– on making valuable work more visible– on engaging external partners in useful

collaboration.

Page 5: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Overview

Collection and useranalysis

Interoperability

System &service architecture

Knowledgeorganization

Contentmanagement

Page 6: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)
Page 7: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Collection and user analysis

• Change creates demand for better data.

• Growing interest in knowing more about:– Characteristics– Gaps and overlaps– Use

• Tuning collections based on data.

• Focus collection spending where creates most value.

The idea of the balanced—but unread—collection is disappearing.

Librarians cannot change user behavior so they need to meet the user.

Page 8: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

OR objectives

• Support better management decisions by– Making data work – Exploring user behaviors.

Page 9: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Some projects

• Characteristics of collections– WorldCat– CIC

• Compare ILL, circulation and holdings data.

• Last copy: what is irreplaceable?

• ARL Global Resources.– Exploring coverage of overseas

titles in ARL libraries.

• Large scale user behavior study– IMLS

project with OSU and OCLC

Page 10: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Comparing CIC Collection Profiles

Page 11: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)
Page 12: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Content management

• Digital asset management a growing concern– Cultural heritage, special collections, …– Learning objects– Institutional repositories

• Issues– Repository selection and interoperability– Securing long term access to digital assets

Page 13: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Content management

• Digital preservation– Economics of digital

preservation– Consensus making –

OCLC/RLG working groups

– Preservation metadata (PREMIS)

• Repository architectures– Contributions to

Dspace codebase to support its interoperability

OAI SRW

– Reference models IMS repository

interoperability

Page 14: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)
Page 15: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

System and service architecture

• The library systems environment is getting more complex– ILS– Digital asset

management– Resolution– Portal– Resource sharing– License management– Auth*

• Build, buy, opensource?

• Integration– Integrated workflow

Portal Cataloging …

Page 16: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

OR objectives

• Investigate new ways of structuring and viewing WorldCat and associated knowledge structures

• Exploit emerging technologies, open standards and protocols to prototype new services

Page 17: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Some projects

• ‘Unplug and play’– Metadata schema

transformation– E-prints UK– Terminology

services– Name authority

services– XISBN

• Text searching– Fast searching on

Beowulf clusters

• Harvesting– NDLTD Union

Catalog

Page 18: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Metadata schema transformation

Metadata schematranslator

Web services layer

Crosswalkrepository

client

Record translationclient

A transformed record

A record

A metadata crosswalk

Page 19: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)
Page 20: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

xISBN

• An experimental web service– Give it an ISBN, it returns all related ISBNs– Based on WorldCat– Designed for machine-to-machine data exchange

• Examples:– Check user ILL requests against all editions/versions in

OPAC– Find library’s editions when user finds any

edition/version of item on Amazon– Check OPAC for all editions during

selection/acquisitions/gift book processing

Page 21: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Searchingfor the book on Amazon

Searchingfor the book on Amazon

Page 22: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

LibraryLookup bookmarklet

LibraryLookupLibraryLookup

http://www.amazon.co.uk/exec/obidos/ASIN/1860464955/qid=1075134526/sr=1-1/ref=sr_1_10_1/202-6426661-8213436

Is the book at my library?Is the book at my library?

SingleISBN

Page 23: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

xISBN bookmarklet

http://www.amazon.co.uk/exec/obidos/ASIN/1860464955/qid=1075134526/sr=1-1/ref=sr_1_10_1/202-6426661-8213436

xISBNserver

LibraryLookupLibraryLookup xISBNxISBN

Multiple ISBNs

ADDED

ADDED

ADDED

ADDED

ADDED

Is the book at my library?Is the book at my library?

Page 24: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)
Page 25: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)
Page 26: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Knowledge organization and semantic web

"The Semantic Web is an extension of the current web in which information is given well-defined meaning, better enabling computers and people to work in cooperation." -- Tim Berners-Lee, James Hendler, Ora Lassila, The Semantic Web, Scientific American, May 2001

Page 27: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Mmmm….

Page 28: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

OR objectives

• To release the value of the historical library investment in controlled vocabularies and knowledge structures– Redeploy tools for accessing or assigning

names, subjects, and classification numbers

– Make knowledge organization services more accessible.

Page 29: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Projects

• FAST

• Terminology services

• FRBR

• Automatic classification

• VIAF – Virtual International Authority File– Library of Congress, Die Deutsche

Bibliothek

Page 30: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

FAST Geographic Search by Area

Avalon Lake

Bellaire, Lake

Charlevoix, Lake

Fletcher Pond

Munro Lake

Ocqueoc Lake

Bar 1Bay 5Bridge 1Channel 2Civil 23Forest 4Island 4Lake 6Park 10Ppl 92Stream 10

Page 31: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Knowledge org systems

• Plethora of vocabularies

• Incompatible approaches to encoding

• Few connections– Education

GEM Subjects, ERIC Thesaurus, LCSH, CIP (Classification of instructional programs)

– Cultural Heritage AAT, Thesaurus for Graphic Materials (TGM) Subjects

& Genre Terms

• Not built for the web– Link to concepts

Page 32: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Terminology services:‘Webulating’ knowledge organization

• The goal of this project is to offer accessible, modular, web-based terminology services.

• Make vocabularies more available for – Metadata creation– Searching– …

• Refine and extend mappings

• Represent vocabularies in major encoding standards, e.g., MARC, Zthes, TIF

• Prototype custom web services as appropriate

Page 33: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

2.6+ million fiction records from Worldcat, clustered by OCLC’s FRBR algorithm

Make greater use of data (genres, settings, imaginary characters, etc)

Page 34: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Work display

Page 35: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Work/expression display

Page 36: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Work/expression/manifestation

Page 37: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)
Page 38: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Interoperability

• Extract maximum value from investment in – Metadata– Content– Services

• By ensuring that they are – Sharable– Reusable– Recombinable

Page 39: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

OR objectives

• Provide leadership in Internet and information standardization

• Help to raise the visibility of the values and value of librarianship

Page 40: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)

Some examples

• Dublin Core– Central to library,

cultural heritage and related communities.

– Harvested data: OAI– 8 Governments – Corporations and

NGOs

• Protocols– Z39.50, SRW/U, OAI,

Zthes

• Identifiers– INFO URI, PURL

• Registries– DCMI, OpenURL, Info

URI

• Everywhere …!

Cliff Lynch on Info URI: … it represents an important new step in collaboration ACROSS standards organizations, and … I think the work is of real importance to the CNI community.

Page 41: OCLC Research Lorcan Dempsey VP Research, OCLC February 2004 (see next slide for where this presentation was given)