International Bibliographic Standards, Linked Data, and the Impact on Library Cataloging. Gordon Dunsire. A NISO/DCMI Webinar, 24 August 2011.

Page 1: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

International Bibliographic Standards, Linked Data, and the Impact on Library Cataloging

Gordon Dunsire

A NISO/DCMI Webinar

24 August 2011

Page 2: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Overview

Bibliographic standards
  International Federation of Library Associations and Institutions (IFLA)
  Others

Representation in Resource Description Framework (RDF)

Creating triples from catalogue records

Impact and implications for catalogues

Page 3: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

IFLA standards

RDF representations of standards for “universal” bibliographic control are being developed

“FR” (Functional Requirements) family of models
  For Bibliographic Records (FRBR)
  For Authority Data (FRAD)
  For Subject Authority Data (FRSAD)

International Standard Bibliographic Description (ISBD)
  Record structure and content standard for exchange of national metadata

UNIMARC
  Encoding for ISBD records (Bibliographic) and FRAD (Authorities)

Page 4: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Representation in RDF

Entities => RDF classes
  E.g. FRBR “Person”

Attributes, tags, (sub)fields, relationships => RDF properties
  E.g. ISBD “title proper”
  E.g. UNIMARC “200 $a” (title proper)
  E.g. FRBR “title of the manifestation”

Controlled term values => SKOS vocabularies
  E.g. ISBD Area 0 (content and media type)

Page 5: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

FR family

Each model has its own namespace
  To reflect historical development
  Each re-uses earlier RDF elements

Consolidated model under development
  Being informed by analysis of RDF representation

FRBR RDF published
  FRBRer (entity-relationship) ontology
    Namespace elements plus OWL
  FRBRoo (object-oriented)
    Extension of CIDOC Conceptual Reference Model (for museums)

FRAD and FRSAD imminent
  Approved at IFLA 2011 conference

Page 6: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

ISBD

Element set and vocabularies for content and media types
  Namespace now published

DC Application Profile in development
  Models the ISBD record
    What properties (fields)
    Mandatory? Repeatable?
    Aggregated statements
      Sub-elements and punctuation

Page 7: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

ISBD AP snippet

<!-- Area 0 is mandatory and non-repeatable -->
<StatementTemplate ID="hasContentFormAndMediaTypeArea" minOccurs="1" maxOccurs="1" type="nonliteral">
  <Property>http://iflastandards.info/ns/isbd/elements/P1158</Property>
  <!-- Area 0 is an aggregated statement with SES -->
  <NonLiteralConstraint descriptionTemplateRef="DThasContentFormAndMediaTypeArea">
    <ValueStringConstraint>
      <SyntaxEncodingScheme>http://iflastandards.info/ns/isbd/elements/C2003</SyntaxEncodingScheme>
    </ValueStringConstraint>
  </NonLiteralConstraint>
</StatementTemplate>

Page 8: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

UNIMARC

Proposal for RDF representation made at IFLA 2011
  http://conference.ifla.org/sites/default/files/files/papers/ifla77/187-dunsire-en.pdf

Discussed with Permanent UNIMARC Committee
  Decision taken to proceed

Page 9: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Other library standards in RDF (1)

RDA: resource description and access
  Content standard based on FR models
  Refines the FR properties
  Many more controlled vocabularies than AACR
    Anglo-American Cataloguing Rules

MODS/MADS (Metadata Object/Authority Description Schema)
  Metadata structure based on MARC21
  Library of Congress Name Authority File in MADS RDF
  RDF representation of MODS just beginning ...

Page 10: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Other library standards in RDF (2)

BIBO: Bibliographic Ontology
  Classes and properties for citations and bibliographic references

DCMI Metadata Terms (Dublin Core)
  High-level common-denominator classes and properties for memory institution metadata

Lots of controlled vocabularies
  Library of Congress Subject Headings, Rameau (French subject headings), SWD (German subject headings), Dewey Decimal Classification, RDA vocabularies, etc.

Page 16: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

From record to triples (in 9 stages)

Very large numbers of records
  Catalogue records, finding aids, etc.
  300 million; 1 billion?

High quality metadata
  In comparison with other communities

Each record may generate many triples
  30 “raw” triples (no inferences) per MARC record?

Very, very large numbers of triples
  Billions? Trillions?
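
The scale suggested above can be checked with simple arithmetic. The following is a minimal sketch that takes the slide's tentative figures (300 million or 1 billion records, roughly 30 raw triples per record) at face value; the function and constant names are illustrative only.

```python
# Rough triple-count estimate using the slide's tentative figures.
RAW_TRIPLES_PER_RECORD = 30  # the slide's guess, marked there with a question mark


def estimate_triples(record_count: int, per_record: int = RAW_TRIPLES_PER_RECORD) -> int:
    """Return the number of raw (non-inferred) triples for a given record count."""
    return record_count * per_record


for records in (300_000_000, 1_000_000_000):
    print(f"{records:,} records -> {estimate_triples(records):,} raw triples")
```

On these inputs the raw total comes to 9 billion and 30 billion triples respectively, before any inferred triples are counted.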

Page 17: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

1. Take a record

Field/attribute   Value
Record ID         54321
Title             Museum archives: an introduction
Author            Wythe, Deborah
Date              2004
LCSH              Museum archives
Media/GMD         Electronic
Content form      Text

Page 18: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

2. Disaggregate to single statements

Record   Attribute            Value
54321    (has) title          Museum archives: an introduction
54321    (has) author         Wythe, Deborah
54321    (has) date           2004
54321    (has) LCSH           Museum archives
54321    (has) media type     Electronic
54321    (has) content form   Text
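
Stage 2 can be sketched in a few lines of Python. This is an illustration only: the record is held as a plain dictionary, and the function name and the choice of "Record ID" as the identifying field are assumptions made for the example.

```python
# The example record from stage 1, held as a simple mapping.
record = {
    "Record ID": "54321",
    "Title": "Museum archives: an introduction",
    "Author": "Wythe, Deborah",
    "Date": "2004",
    "LCSH": "Museum archives",
    "Media type": "Electronic",
    "Content form": "Text",
}


def disaggregate(rec: dict, id_field: str = "Record ID") -> list:
    """Split one record into (record id, attribute, value) statements."""
    record_id = rec[id_field]
    return [
        (record_id, attribute, value)
        for attribute, value in rec.items()
        if attribute != id_field
    ]


statements = disaggregate(record)
for statement in statements:
    print(statement)
```

Each statement carries the record ID, so it remains meaningful when separated from the others.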

Page 19: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

3. Create URI for record

Must be unique, so 54321 no good on its own
http URIs are a good thing (W3C)
So add record ID to a unique http domain
  E.g. http://MyLibraryX.com (unique to the library) + 54321
  http://MyLibraryX.com/54321 (or http://MyLibraryX.com#54321)

This is not a URL!
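
As a sketch of stage 3, the helper below joins a library's base http URI to a local record ID, in either the slash or the hash form shown above. The function name and its parameters are hypothetical.

```python
def mint_record_uri(base: str, record_id: str, use_hash: bool = False) -> str:
    """Combine a library's base http URI with a local record ID."""
    separator = "#" if use_hash else "/"
    # Strip any trailing separator from the base so it is not doubled.
    return base.rstrip("/#") + separator + record_id


print(mint_record_uri("http://MyLibraryX.com", "54321"))
print(mint_record_uri("http://MyLibraryX.com", "54321", use_hash=True))
```

The result is an identifier; nothing in the code assumes that a web page exists at that address.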

Page 20: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

4. Replace record ID with URI

URI         Attribute            Value
mlx:54321   (has) title          Museum archives: an introduction
mlx:54321   (has) author         Wythe, Deborah
mlx:54321   (has) date           2004
mlx:54321   (has) LCSH           Museum archives
mlx:54321   (has) media type     Electronic
mlx:54321   (has) content form   Text

“mlx” = qname (xmlns) = shorthand for “http://MyLibraryX.com/”
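
The qname shorthand is a simple string substitution. A minimal sketch, using only the "mlx" prefix defined above; the helper names expand and compact are illustrative.

```python
# Prefix table: only "mlx" is taken from the slide.
PREFIXES = {"mlx": "http://MyLibraryX.com/"}


def expand(qname: str, prefixes: dict = PREFIXES) -> str:
    """Turn a qname such as 'mlx:54321' into the full URI."""
    prefix, _, local = qname.partition(":")
    if prefix not in prefixes:
        raise KeyError(f"unknown prefix: {prefix}")
    return prefixes[prefix] + local


def compact(uri: str, prefixes: dict = PREFIXES) -> str:
    """Turn a full URI back into a qname when a known namespace matches."""
    for prefix, namespace in prefixes.items():
        if uri.startswith(namespace):
            return f"{prefix}:{uri[len(namespace):]}"
    return uri


print(expand("mlx:54321"))
print(compact("http://MyLibraryX.com/54321"))
```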

Page 21: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

5. Find URIs for attributes

Attributes are modelled as RDF properties (predicates) in “element set” namespaces
  E.g. Dublin Core terms (dct); ISBD (isbd); FRBR (frbrer); RDA (rdaxxx); Bibliographic Ontology (bibo); etc.

Choose a namespace, find property with same (or closest) “meaning” (e.g. definition) as attribute
  Nearest property minimises loss of information

Get URI for property
  If no suitable property, choose another namespace

Properties do not have to come from single namespace
  Match and mix!

Page 22: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

5 (cont). Find URI for title

http://purl.org/dc/terms/title (dct:title)

http://iflastandards.info/ns/isbd/elements/P1014 (isbd:P1014)
  hasTitleProper

http://RDVocab.info/Elements/titleProper (rdaGR1:titleProper)

Page 23: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

5 (cont). Find URI for author

dct:creator

rdarole:author

(isbd does not cover “headings”)

Page 24: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

5 (cont). Find URI for date

dct:date

isbd:P1018
  hasDateOfPublicationProductionDistribution

rdaGr1:dateOfPublication

Page 25: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

5 (cont). Find URI for LCSH

LCSH is a subject vocabulary
  Controlled terms

So attribute is really “subject”
  And the term itself is the value

dct:subject

Page 26: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

5 (cont). Find URI for media type

Assuming record uses new ISBD Area 0 ...

isbd:P1003
  hasMediaType

Page 27: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

5 (cont). Find URI for content form

Assuming record uses new ISBD Area 0 ...

isbd:P1001
  hasContentForm

Page 28: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

6. Replace attributes with URIs

URI         URI              Value
mlx:54321   isbd:P1014       Museum archives: an introduction
mlx:54321   rdarole:author   Wythe, Deborah
mlx:54321   isbd:P1018       2004
mlx:54321   dct:subject      Museum archives
mlx:54321   isbd:P1003       Electronic
mlx:54321   isbd:P1001       Text
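
Stage 6 amounts to a lookup from local attribute names to the property qnames chosen in stage 5. The sketch below uses the choices shown on this slide; the dictionary and function names are illustrative, and an unmapped attribute is reported rather than guessed.

```python
# Property chosen for each attribute, as listed on the stage 6 slide.
PROPERTY_FOR_ATTRIBUTE = {
    "title": "isbd:P1014",
    "author": "rdarole:author",
    "date": "isbd:P1018",
    "LCSH": "dct:subject",
    "media type": "isbd:P1003",
    "content form": "isbd:P1001",
}

statements = [
    ("mlx:54321", "title", "Museum archives: an introduction"),
    ("mlx:54321", "author", "Wythe, Deborah"),
    ("mlx:54321", "date", "2004"),
    ("mlx:54321", "LCSH", "Museum archives"),
    ("mlx:54321", "media type", "Electronic"),
    ("mlx:54321", "content form", "Text"),
]


def replace_attributes(stmts: list, mapping: dict) -> list:
    """Swap each local attribute name for its chosen property qname."""
    replaced = []
    for subject, attribute, value in stmts:
        if attribute not in mapping:
            raise KeyError(f"no property chosen for attribute: {attribute}")
        replaced.append((subject, mapping[attribute], value))
    return replaced


for triple in replace_attributes(statements, PROPERTY_FOR_ATTRIBUTE):
    print(triple)
```

The mapping mixes the isbd, rdarole and dct namespaces, in line with the "match and mix" advice in stage 5.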

Page 29: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

7. Find URIs for values

If object of a triple is a URI, it can link to the subject of another triple with the same URI
  Linked data!

Values from controlled vocabularies may have URIs
  Possible vocabularies: author, subject, ISBD Area 0
  NOT: title, date

For author: Virtual International Authority File (VIAF)
For LCSH: Library of Congress Authorities & Vocabularies
For ISBD Area 0: Open Metadata Registry

Page 30: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

7 (cont). Find URI for author

Author: Wythe, Deborah
VIAF: http://www.viaf.org/
viaf:31899419/#Wythe,+Deborah

Page 31: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

7 (cont). Find URI for subject (LCSH)

LCSH: Museum archives
LoC: http://id.loc.gov/authorities/
lcsh:/sh85088707#concept

Page 32: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

7 (cont). Find URIs for ISBD Area 0

Media type: Electronic
  ISBD media type
  isbdmt:T1002

Content form: Text
  ISBD Content form
  isbdcf:T1009

Page 33: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

8. Replace values with URIs

subject     predicate        object
mlx:54321   isbd:P1014       “Museum archives: an introduction”
mlx:54321   rdarole:author   viaf:31899419/#Wythe,+Deborah
mlx:54321   isbd:P1018       “2004”
mlx:54321   dct:subject      lcsh:/sh85088707#concept
mlx:54321   isbd:P1003       isbdmt:T1002
mlx:54321   isbd:P1001       isbdcf:T1009
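
Stage 8 can be sketched as a second lookup, this time on values. The value URIs below are copied from the slides in their qname form; the ("uri", ...) and ("literal", ...) tagging is an assumption made for the example so that linkable objects can be told apart from plain strings such as the title and date.

```python
# Controlled-vocabulary values and the URIs (as qnames) found in stage 7.
VALUE_URIS = {
    ("rdarole:author", "Wythe, Deborah"): "viaf:31899419/#Wythe,+Deborah",
    ("dct:subject", "Museum archives"): "lcsh:/sh85088707#concept",
    ("isbd:P1003", "Electronic"): "isbdmt:T1002",
    ("isbd:P1001", "Text"): "isbdcf:T1009",
}

TRIPLES = [
    ("mlx:54321", "isbd:P1014", "Museum archives: an introduction"),
    ("mlx:54321", "rdarole:author", "Wythe, Deborah"),
    ("mlx:54321", "isbd:P1018", "2004"),
    ("mlx:54321", "dct:subject", "Museum archives"),
    ("mlx:54321", "isbd:P1003", "Electronic"),
    ("mlx:54321", "isbd:P1001", "Text"),
]


def replace_values(triples: list, value_uris: dict) -> list:
    """Tag each object as a URI when a lookup exists, otherwise as a literal."""
    result = []
    for subject, predicate, value in triples:
        uri = value_uris.get((predicate, value))
        obj = ("uri", uri) if uri is not None else ("literal", value)
        result.append((subject, predicate, obj))
    return result


linked = replace_values(TRIPLES, VALUE_URIS)
for triple in linked:
    print(triple)
```

Keying the lookup on the predicate as well as the value keeps a string such as "Text" from being matched against the wrong vocabulary.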

Page 34: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

9. Publish triples (linked data)

mlx:54321 | isbd:P1014 | “Museum archives: an introduction”

mlx:54321 | rdarole:author | viaf:31899419/#Wythe,+Deborah

mlx:54321 | isbd:P1018 | “2004”

mlx:54321 | dct:subject | lcsh:/sh85088707#concept

mlx:54321 | isbd:P1003 | isbdmt:T1002

mlx:54321 | isbd:P1001 | isbdcf:T1009
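
The slides do not name a serialization for publishing. As one possibility, the sketch below writes literal-valued triples in N-Triples, a line-based W3C RDF syntax. It covers only the title and date triples, because mlx and isbd are the two namespaces whose full URIs are spelled out in this presentation; the helper names are illustrative.

```python
# Namespaces given in full on the slides.
PREFIXES = {
    "mlx": "http://MyLibraryX.com/",
    "isbd": "http://iflastandards.info/ns/isbd/elements/",
}


def expand(qname: str) -> str:
    """Expand a qname to a full URI using the prefix table."""
    prefix, _, local = qname.partition(":")
    return PREFIXES[prefix] + local


def escape_literal(text: str) -> str:
    """Escape the characters that N-Triples requires to be escaped in literals."""
    return (
        text.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )


def to_ntriples_line(subject: str, predicate: str, literal: str) -> str:
    """Serialize one triple with a plain literal object as an N-Triples line."""
    return f'<{expand(subject)}> <{expand(predicate)}> "{escape_literal(literal)}" .'


print(to_ntriples_line("mlx:54321", "isbd:P1014", "Museum archives: an introduction"))
print(to_ntriples_line("mlx:54321", "isbd:P1018", "2004"))
```

Triples whose objects are URIs would be written with the object in angle brackets instead of quotes, once the remaining namespaces are known.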

Page 35: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Linked data chains

mlx:54321 | dct:subject | lcsh:/sh85088707#concept

lcsh:/sh85088707#concept | skos:related | rameau:XXX

rameau:XXX | frbrer:isSubjectOf | mly:98765

rameau:XXX | skos:prefLabel | “archives du musée”

mly:98765 | rda:titleOfTheWork | “Managing archives in museums”
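
The chain above can be followed mechanically: whenever the object of one triple appears as the subject of another, the link is traversed. A minimal sketch using the five triples from this slide, with a breadth-first walk; the function name is illustrative, and as a simplification any object string that also occurs as a subject is treated as a link.

```python
from collections import defaultdict, deque

TRIPLES = [
    ("mlx:54321", "dct:subject", "lcsh:/sh85088707#concept"),
    ("lcsh:/sh85088707#concept", "skos:related", "rameau:XXX"),
    ("rameau:XXX", "frbrer:isSubjectOf", "mly:98765"),
    ("rameau:XXX", "skos:prefLabel", "archives du musée"),
    ("mly:98765", "rda:titleOfTheWork", "Managing archives in museums"),
]


def follow_links(start: str, triples: list) -> list:
    """Return every triple reachable from start by object-to-subject links."""
    by_subject = defaultdict(list)
    for subject, predicate, obj in triples:
        by_subject[subject].append((predicate, obj))

    seen = {start}
    queue = deque([start])
    visited = []
    while queue:
        node = queue.popleft()
        for predicate, obj in by_subject[node]:
            visited.append((node, predicate, obj))
            if obj not in seen:
                seen.add(obj)
                queue.append(obj)
    return visited


for triple in follow_links("mlx:54321", TRIPLES):
    print(" | ".join(triple))
```

Starting from one library's record, the walk reaches a work described by a different library (mly) through the two subject vocabularies.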

Page 36: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Linked data cluster = “record”

mlx:54321 | isbd:P1014 | “Museum archives: an introduction”

mlx:54321 | rdarole:author | viaf:31899419/#Wythe,+Deborah

mlx:54321 | isbd:P1018 | “2004”

mlx:54321 | dct:subject | lcsh:/sh85088707#concept

mlx:54321 | isbd:P1003 | isbdmt:T1002

mlx:54321 | isbd:P1001 | isbdcf:T1009
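
The idea that a "record" is the cluster of triples sharing a subject can be sketched by grouping on the subject URI. The function name is illustrative, and the extra mly:98765 triple is included only to show that other subjects form their own clusters.

```python
from collections import defaultdict

TRIPLES = [
    ("mlx:54321", "isbd:P1014", "Museum archives: an introduction"),
    ("mlx:54321", "rdarole:author", "viaf:31899419/#Wythe,+Deborah"),
    ("mlx:54321", "isbd:P1018", "2004"),
    ("mlx:54321", "dct:subject", "lcsh:/sh85088707#concept"),
    ("mlx:54321", "isbd:P1003", "isbdmt:T1002"),
    ("mlx:54321", "isbd:P1001", "isbdcf:T1009"),
    ("mly:98765", "rda:titleOfTheWork", "Managing archives in museums"),
]


def cluster_by_subject(triples: list) -> dict:
    """Group triples into one predicate-to-objects mapping per subject."""
    clusters = defaultdict(dict)
    for subject, predicate, obj in triples:
        clusters[subject].setdefault(predicate, []).append(obj)
    return dict(clusters)


records = cluster_by_subject(TRIPLES)
for predicate, objects in records["mlx:54321"].items():
    print(predicate, objects)
```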

Page 37: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Duplication and legacy records

Many copies of legacy records
  Copied and amended for local use

Danger of minting multiple URIs for the same resource

National bibliographic agencies have significant role to play
  As memory/cultural institutions
  The linked-data memory/culture of a nation
  Proposal from IFLA Namespaces Task Group to IFLA Bibliography Section

Page 38: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

FRBRization

FRBR splits record into four functional parts
  User-centred functions

Subject of a FRBR triple is one of the parts, not the resource as a whole

But subject of ISBD triple is the resource as a whole

Class collisions can be avoided by using unbounded (no domain or range) versions of properties

Page 39: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

A short history of the evolution of the library catalogue record

Page 40: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Lee, T. B.

Cataloguing has a future. - Audio disc (Spoken word). - Donated by the author.

1. Metadata

In the beginning ...

... the catalogue card

Page 41: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

From flat-file record ...

Author: Lee, T. B.
Title: Cataloguing has a future
Content type: Spoken word
Carrier type: Audio disc
Subject: Metadata
Provenance: Donated by the author

... to relational record

Bibliographic description

Name authority
  Name:
  Biography:
  ...

Subject authority
  Term:
  Definition:
  ...

Page 42: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

From flat-file description ...

Author: Lee, T. B.
Title: Cataloguing has a future
Content type: Spoken word
Carrier type: Audio disc
Subject: Metadata
Provenance: Donated by the author

... to FRBR record

Bibliographic description
  Item
  Manifestation
  Expression
  Work

Name authority
  Name:
  Biography:
  ...

Subject authority
  Term:
  Definition:
  ...

Labels repeated in the diagram: Author:, Content type: Spoken word, Subject:

Page 43: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

From FRBR record ...

... to extinction!

Work
Expression
Manifestation
Item

Author: Lee, T. B.
Title: Cataloguing has a future
Content type: Spoken word
Carrier type: Audio disc
Subject: Metadata
Provenance: Donated by the author

Sources shown in the diagram:
  Name authority (Name:)
  Subject authority (Term:)
  RDA content type (Term:)
  RDA carrier type (Term:)
  Donor:
  Amazon/Publisher (Title:)

Page 44: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Where is the record?

Implicit, not explicit
  Everywhere and nowhere

A semantic Web will allow machines to create the record just-in-time
  We will not have to maintain records just-in-case

The user will have control over the presentation
  I want to see an archive or library or museum or Amazon or Google or Flickr or ? display

And by avoiding duplication, we can all get on with describing new stuff ...

Page 45: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

The hyperdimensional (Tardis) card

Lee, T. B.

Cataloguing has a future. - Audio disc (Spoken word). - Donated by the author.

1. Metadata

Audio shop

Lee Museum

Spoken word archive

W3C Library

“TARDIS four port USB hub, for office-bound Time Lords:Open a time vortex on your desk” – Pocket-lint

Page 46: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Metadata focus

Shift of focus of metadata creation, maintenance, storage, preservation (by professionals, amateurs, machines)

From Record
To Statement(s) = triple(s)

But metadata display ...
... aggregates triples (from multiple sources) to create records on the fly
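
A display built on the fly from several sources might look like the sketch below. It merges a library's triples with labels from an element-set source and renders one line per statement. The use of rdfs:label as the labelling property, the source names and the function name are assumptions for illustration; the label strings are the ISBD property names quoted earlier in the presentation.

```python
# Source 1: statements published by the library.
LIBRARY_TRIPLES = [
    ("mlx:54321", "isbd:P1014", "Museum archives: an introduction"),
    ("mlx:54321", "isbd:P1018", "2004"),
]

# Source 2: labels for the properties, assumed to come from the element set.
ELEMENT_SET_TRIPLES = [
    ("isbd:P1014", "rdfs:label", "hasTitleProper"),
    ("isbd:P1018", "rdfs:label", "hasDateOfPublicationProductionDistribution"),
]


def build_display(subject: str, *sources: list) -> list:
    """Merge triples from all sources and render labelled lines for one subject."""
    merged = []
    for source in sources:
        for triple in source:
            if triple not in merged:  # drop exact duplicates across sources
                merged.append(triple)

    labels = {s: o for s, p, o in merged if p == "rdfs:label"}
    # Fall back to the property qname when no label is available.
    return [f"{labels.get(p, p)}: {o}" for s, p, o in merged if s == subject]


for line in build_display("mlx:54321", LIBRARY_TRIPLES, ELEMENT_SET_TRIPLES):
    print(line)
```

Nothing is stored as a record here; the display exists only for as long as it is needed, and a different label source would give a different presentation of the same statements.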

Page 47: International Bibliographic Standards, Linked Data, and the Impact on Library  Cataloging

Thank you!

[email protected]

http://metadataregistry.org/
  Open Metadata Registry

http://www.ifla.org/en/node/5353
  IFLA Namespaces Task Group (needs updated)

http://dublincore.org/dcmirdataskgroup/
  DCMI/RDA Task Group (in revision)