beyond marc: marc, linked data, and bibframe

48
Beyond MARC MARC, Linked Data, and Bibframe 17 April 2013 Thomas Meehan Head of Current Cataloguing [email protected]

Upload: thomas-meehan

Post on 26-Jan-2015

118 views

Category:

Education


9 download

DESCRIPTION

 

TRANSCRIPT

Page 1: Beyond MARC: MARC, linked data, and Bibframe

Beyond MARCMARC, Linked Data, and Bibframe

17 April 2013Thomas Meehan

Head of Current [email protected]

Page 2: Beyond MARC: MARC, linked data, and Bibframe

Card Index Catalogue

http://cardcat.ucl.ac.uk/cgi-bin/carddisplay.pl?card=887;drawer=13;max=931;ctype=C

Page 3: Beyond MARC: MARC, linked data, and Bibframe

AACR2

Models for decision : a conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society / edited by C.M. Berners-Lee. -- London : English Universities Press, 1965. x, 149 p. : ill. ; 23 cm. Includes bibliographical references.• Berners-Lee, C. M.

Page 4: Beyond MARC: MARC, linked data, and Bibframe

MARC

MAchine

Readable

Cataloguing

Page 5: Beyond MARC: MARC, linked data, and Bibframe

AACR2 in MARC21245 00 $a Models for decision : $b a conference under the auspices of the United Kingdom

Automation Council organised by the British Computer Society and the Operational Research Society /

$c edited by C.M. Berners-Lee.260 __ $a London : $b English Universities Press, $c 1965.300 __ $a x, 149 p. : $b ill. ; $c 23 cm.504 __ $a Includes bibliographical references.700 1_ $a Berners-Lee, C. M.

Page 6: Beyond MARC: MARC, linked data, and Bibframe

AARC2 in .mrc00788nam a2200181 a 4500001002700000005001700027008004100044024001500085245021000100260004900310300003200359504004100391650003300432700002300465710003900488710003000527710004900557_UCL01000000000000000477125_20061112120300.0_850710s1965 enka b 000 0 eng _8 _ax280050495_00_aModels for decision :_ba conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society /_cedited by C.M. Berners-Lee._ _aLondon :_bEnglish Universities Press,_c1965._ _ax, 149 p. :_bill. ;_c23 cm._ _aIncludes bibliographical references._ 0_aDecision making_vCongresses._1 _aBerners-Lee, C. M._2 _aUnited Kingdom Automation Council._2 _aBritish Computer Society._2 _aOperational Research Society (Great Britain)__

Page 7: Beyond MARC: MARC, linked data, and Bibframe

RDA in MARC21245 00 $a Models for decision : $b a conference under the auspices of the United Kingdom Automation

Council organised by the British Computer Society and the Operational Research Society /

$c edited by C.M. Berners-Lee.264 _1 $a London : $b The English Universities Press Limited, $c 1965.264 _4 $c ©1965300 __ $a x, 149 pages : $b illustrations ; $c 23 cm.336 __ $a text $2 rdacontent337 __ $a unmediated $2 rdamedia338 __ $a volume $2 rdacarrier504 __ $a Includes bibliographical references.700 1_ $a Berners-Lee, C. M., editor of compilation.

Page 8: Beyond MARC: MARC, linked data, and Bibframe

What is MARC for?

Page 9: Beyond MARC: MARC, linked data, and Bibframe

What is MARC for?

• Storage• Exchange and distribution• Manipulation• Display• Input (http://www.aurochs.org/zz/marc_input/marc_input.html)

• “Lingua franca of library cataloguing”

Page 10: Beyond MARC: MARC, linked data, and Bibframe

Finite Notation Problem

Too many subject schemes650 _0 for LCSH650 _1 for LC for Childrens650 _2 for MeSH…650 _7 Source specified in subfield $2

Not enough indicators246 184 $aThe title on the spine

Page 11: Beyond MARC: MARC, linked data, and Bibframe

Data in More Than One Place

Languages008 (positions 35-37) eng041 __ $a eng240 10 $l English546 __ $a In English.

Page 12: Beyond MARC: MARC, linked data, and Bibframe

Double Encoding: ISBD and MARC

Blanket : Constellation of Orion, 3.

260 __ $a Blanket$b Constellation of Orion$c 3

260 __ $a Blanket :$b Constellation of Orion,$c 3.

Page 13: Beyond MARC: MARC, linked data, and Bibframe

Data Mixed UpGMD245 10 $a Data on the web

$h [electronic resource] :$b research and applications /$c Antonis Bikakis, Adrian Giurca (eds.).

245 10 $a Data on the web$b research and applications /$c Antonis Bikakis, Adrian Giurca (eds.).

Nothing allowed after 245$c245 10 $a Enduring resistance :

$b cultural theory after Derrida /$c edited by Sjef Houppermans, Rico Sneller, Peter van Zilfhout. = La résistance

persérvère : la théorie de la culture (d')aprés Derrida / edité par Sjef Houppermans, Rico Sneller, Peter van Zilfhout.

Page 14: Beyond MARC: MARC, linked data, and Bibframe

Text, Not DataISBN020 __ $a 9780285638976 (pbk.)020 __ $a 012002618X (ebook)

Title245 10 $a British goblins :

Place of publication260 __ $a Köln

Copyright date260 __ $c c2005264 _4 $c ©2002260 _4 $c copyright 2005260 _4 $c ℗1983260 _4 $c phonogram 1993

Extent300 __ $a ix, 300 p.

Dimensions300 __ $c 23 cm300 __ $c 9 mm

Page 15: Beyond MARC: MARC, linked data, and Bibframe

Changing Text as Primary Key for Headings and Authorities

Author heading for deceased personNiemeyer, Oscar, 1907-

Different preferences for writing nameMao, Tse-tung, 1893-1976 [Former heading]Mao, Zedong, 1893-1976毛泽东 , 1893-1976

Small differences could break matchMao, Zedong, 1893-1976.Mao, Zedong, 1893-1976

Page 16: Beyond MARC: MARC, linked data, and Bibframe

Expressing Relationships

What does this mean?

700 0_ $a Homer.$t Iliad.

700 1_ $a Berners-Lee, Tim.

Page 17: Beyond MARC: MARC, linked data, and Bibframe

Record Not Data

00788nam a2200181 a 4500001002700000005001700027008004100044024001500085245021000100260004900310300003200359504004100391650003300432700002300465710003900488710003000527710004900557_UCL01000000000000000477125_20061112120300.0_850710s1965 enka b 000 0 eng _8 _ax280050495_00_aModels for decision :_ba conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society /_cedited by C.M. Berners-Lee._ _aLondon :_bEnglish Universities Press,_c1965._ _ax, 149 p. :_bill. ;_c23 cm._ _aIncludes bibliographical references._ 0_aDecision making_vCongresses._1 _aBerners-Lee, C. M._2 _aUnited Kingdom Automation Council._2 _aBritish Computer Society._2 _aOperational Research Society (Great Britain)__

LeaderDirectoryData245 field, final 710 field

Page 18: Beyond MARC: MARC, linked data, and Bibframe

Other Considerations

• Only libraries use MARC– Libraries tied to library-specific software/processes– Outside agencies can’t take advantage of library data and

standards (See Also: RDA not freely available)• Not even all of libraries use MARC– Archives– Repositories– Non-MARC LMSs

• US RDA test demanded progress be made on a replacement before agreeing to adopt RDA

Page 19: Beyond MARC: MARC, linked data, and Bibframe

Linked Data: “the web of data”

• Use URIs as names for things • Use HTTP URIs so that people can look up

those names. • When someone looks up a URI, provide useful

information, using the standards (RDF, SPARQL)

• Include links to other URIs. so that they can discover more things.

Tim Berners-Lee (2006)

Page 20: Beyond MARC: MARC, linked data, and Bibframe

English sentence

Brideshead Revisited was written by Evelyn Waugh.

Page 21: Beyond MARC: MARC, linked data, and Bibframe

ERM written out

Brideshead revisited created by Evelyn Waugh

Page 22: Beyond MARC: MARC, linked data, and Bibframe

Adding URIs: Brideshead revisited

http://id.loc.gov/authorities/names/no97080492

created by Evelyn Waugh

Page 23: Beyond MARC: MARC, linked data, and Bibframe

Adding URIs: Waugh

http://id.loc.gov/authorities/names/no97080492

created by

http://id.loc.gov/authorities/names/n79049248

Page 25: Beyond MARC: MARC, linked data, and Bibframe

RDF Statement<http://id.loc.gov/authorities/names/no97080492><http://purl.org/dc/terms/creator><http://id.loc.gov/authorities/names/n79049248> .

Page 26: Beyond MARC: MARC, linked data, and Bibframe

RDF (Turtle)@prefix lc_names: <http://id.loc.gov/authorities/names/> .@prefix dc: <http://purl.org/dc/terms/> .

lc_names:no97080492 dc:creator lc_names:n79049248 .

Page 27: Beyond MARC: MARC, linked data, and Bibframe

Brideshead Revisited @prefix lc_names: <http://id.loc.gov/authorities/names/> .@prefix lc_languages: <http://id.loc.gov/vocabulary/languages> .@prefix dc: <http://purl.org/dc/terms/> .

lc_names:no97080492 dc:creator lc_names:n79049248 .dc:created "1945" .dc:extent "1 volume" .dc:language lc_languages:eng .dc:title "Brideshead revisited" .dc:type <http://purl.org/dc/dcmitype/Text> .

Page 28: Beyond MARC: MARC, linked data, and Bibframe

Brideshead Revisited @prefix lc_names: <http://id.loc.gov/authorities/names/> .@prefix lc_languages: <http://id.loc.gov/vocabulary/languages/> .@prefix dc: <http://purl.org/dc/terms/> .

lc_names:no97080492 dc:creator lc_names:n79049248 .

dc:created "1945" .dc:extent "1 volume" .dc:language lc_languages:eng .dc:title "Brideshead revisited" .dc:type <http://purl.org/dc/dcmitype/Text> .

Page 29: Beyond MARC: MARC, linked data, and Bibframe

LC Name Authority for Waugh (excerpt)

@prefix lc_names: <http://id.loc.gov/authorities/names/> .@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .@prefix mads: <http://www.loc.gov/mads/rdf/v1#> .@prefix viaf: <http://viaf.org/viaf/sourceID/> .

lc_names:n79049248 rdf:type mads:PersonalName . rdf:type mads:Authority . mads:authoritativeLabel "Waugh, Evelyn, 1903-1966"@en . mads:hasExactExternalAuthority viaf:68937142 .

Page 30: Beyond MARC: MARC, linked data, and Bibframe

Microdata, RDFa, Schema.org

OCLC Worldcat uses embedded Schema.org:

http://www.worldcat.org/oclc/221944758

Page 31: Beyond MARC: MARC, linked data, and Bibframe

Worldcat Schema.org data for a book@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .@prefix schema: <http://schema.org/> .@prefix worldcat: <http://www.worldcat.org/oclc/> .@prefix library: <http://purl.org/library/> .@prefix viaf: <http://viaf.org/viaf/> .@prefix lc_authorities: <http://id.loc.gov/authorities/names/> .@prefix mads: <http://www.loc.gov/mads/rdf/v1#> . worldcat:221944758 rdf:type schema:Book; library:oclcnum "221944758"; schema:name "Models for decision : a conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society"; library:placeOfPublication _:1; schema:publisher _:4 . schema:datePublished "[1965]"; schema:numberOfPages "149"; schema:contributor viaf:149407214; schema:contributor viaf:130073090; schema:contributor viaf:137135158; schema:contributor viaf:36887201;

_:1 rdf:type schema:Place; schema:name "London :" ._:4 rdf:type schema:Organization; schema:name "English Universities Press" .viaf:149407214 rdf:type schema:Organization; madsrdf:isIdentifiedByAuthority lc_authorities:n79056431; schema:name "British Computer Society." .viaf:130073090 rdf:type schema:Organization; madsrdf:isIdentifiedByAuthority lc_authorities:n85076053; schema:name "Operational Research Society." .viaf:137135158 rdf:type schema:Organization; madsrdf:isIdentifiedByAuthority lc_authorities:n79063901; schema:name "Institution of Electrical Engineers." .viaf:36887201 rdf:type schema:Person; schema:name "Berners-Lee, C. M." .

(http://www.aurochs.org/rdfv/rdfv.html : click Get Sample Data (OCLC))

Page 32: Beyond MARC: MARC, linked data, and Bibframe

Author Information in Worldcat RDF (Turtle)

@prefix worldcat: <http://www.worldcat.org/oclc/> .@prefix schema: <http://schema.org/> .@prefix viaf: <http://viaf.org/viaf/> .

worldcat:221944758 schema:contributor viaf:36887201 .

Page 34: Beyond MARC: MARC, linked data, and Bibframe

Lots of Ways To Do It@prefix schema: <http://schema.org/> .@prefix dc: <http://purl.org/dc/terms/> .@prefix viaf: <http://viaf.org/viaf/> .@prefix rda_roles: <http://rdvocab.info/roles/> .@prefix cam: <http://data.lib.cam.ac.uk/id/entity/> .@prefix bnb_person: <http://bnb.data.bl.uk/id/person/> .

example:book0001 dc:creator cam:cambrdgedb_eeacef63d900c2acffc3daa400f3d4e4 .example:book0001 dc:creator bnb_person:WaughEvelyn1903-1966 .example:book0001 schema:creator viaf:68937142 .example:book0001 rda_roles:creator viaf:68937142 .example:book0001 dc:creator lc_names:n79049248 .example:book0001 dc:creator "Waugh, Evelyn, 1903-1966" .

[from CUL, BNB, OCLC Worldcat, RDA+VIAF, Dublic Core+LC Names, made-up]

Page 36: Beyond MARC: MARC, linked data, and Bibframe

Linked Data: Is It Any Good?• Not-library specific

– Detailed library data becomes part of the web– Libraries can benefit from wider software, community, and expertise; less tied to specific vendors– Non-librarians can use our data

• Not catalogue-specific: e.g. if archives, repositories, and catalogues, and others can publish linked data and share identifiers (URIs) then it can be mixed and re-used in interesting ways

• Can be linked with other schemes. E.g. authorities such as VIAF with Wikipedia, ORCID, and ISNI• Backbone of other big initiatives:

– Schema.org used by major search engines (Google, Bing, Yahoo, Yandex)– UK government open data: data.gov.uk– Dbpedia– BBC websites, e.g. wildlife finder (takes data from Wikipedia) and World Cup sites.

• Based on very basic and flexible Entity Relationship Model (ERM), the same structure as e.g. FRBR

• Provenance of deconstructed data hard to determine• Can get complex very quickly• Linked data often synonymous with linked open data (a good thing for libraries)• No standard way of presenting bibliographic information as linked data, although…

Page 37: Beyond MARC: MARC, linked data, and Bibframe

BIBFRAME

BIBliographic

FRAMEwork Initiative

Page 38: Beyond MARC: MARC, linked data, and Bibframe

BIBFRAME Model

Page 39: Beyond MARC: MARC, linked data, and Bibframe

BIBFRAME Model: ResourceA BIBFRAME Resource can be anything: a Work, Instance, Authority, or Annotation

bf:authorizedAccessPointbf:descriptionbf:identifierbf:labelbf:subjectbf:relatedResource

Page 40: Beyond MARC: MARC, linked data, and Bibframe

BIBFRAME Model: WorkWork: A resource reflecting a conceptual essence of the cataloging resource. (A FRBR Work/Expression)

bf:creatorbf:notebf:languagebf:titlebf:subjectbf:relatedWorkbf:hasInstancebf:hasExpressionbf:expressionOf

Page 41: Beyond MARC: MARC, linked data, and Bibframe

BIBFRAME Model: InstanceInstance: A resource reflecting an individual, material embodiment of the Work. (A FRBR Manifestation)

bf:titlebf:contributorbf:placePubbf:providerbf:pubDatebf:extentbf:otherFeaturesbf:dimensionsbf:isbnbf:languagebf:notebf:instanceOf

Page 42: Beyond MARC: MARC, linked data, and Bibframe

BIBFRAME Model: Authority (Person)

Authority: A resource reflecting key authority concepts that have defined relationships reflected in the Work and Instance.

bf:resourceRolebf:isnibf:orcidbf:viaf

Page 43: Beyond MARC: MARC, linked data, and Bibframe

BIBFRAME Model: AnnotationAnnotation: A resource that decorates other BIBFRAME resources with additional information, e.g. holdings, cover art, reviews.

bf:annotatesbf:annotationAssertedBybf:annotationBody

Page 45: Beyond MARC: MARC, linked data, and Bibframe

BIBFRAME: Is It Any Good?

• Still very much in draft• Uses own scheme and namespace– ensures security– but against spirit and usual practice of linked data

• Reliant on successful conversion of MARC records (more so than move from AACR2 to RDA)

• Not limited to encoding AACR2 or RDA or FRBR or Dublin Core or…

• Basically under the control of the Library of Congress

Page 46: Beyond MARC: MARC, linked data, and Bibframe

More InformationMARC• MARC21 Standards http://www.loc.gov/marc/• MARC21 Bibliographic http://www.loc.gov/marc/bibliographic/ecbdhome.html• MARC21 Record Structure http://www.loc.gov/marc/specifications/specrecstruc.html• UKMARC Manual http://www.bl.uk/bibliographic/ukmarc.html• MARC Must Die / Roy Tennant. http://www.libraryjournal.com/article/CA250046.html

Linked Data• Library of Congress Linked Data Service. http://id.loc.gov/ Includes LC Name Authorities, LCSH, geographic and language codes, and

others.• Virtual International Authority File (VIAF). http://viaf.org/• The RDA (Resource Description and Access) Vocabularies at the Open Metadata Registry. http://rdvocab.info/• Schema.org for books. http://schema.org/Book• Dbpedia. http://dbpedia.org/About A linked data version of Wikipedia. • Linked Open BNB. http://bnb.data.bl.uk/search• data.lib.cam.ac.uk http://data.lib.cam.ac.uk• BBC Wildlife Finder, http://www.bbc.co.uk/nature/wildlife. Compare e.g. http://www.bbc.co.uk/nature/life/Desert_locust and http://

www.bbc.co.uk/nature/life/Desert_locust.rdf• Bookmarklet for searching catalogues from Wikipedia.

http://www.aurochs.org/aurlog/2013/03/25/bookmarklet-for-searching-catalogues-from-wikipedia/

Bibframe• LC Bibliographic Framework Transition Initiative. http://www.loc.gov/marc/transition/• BIBFRAME.org : New Bibliographic Framework. http://bibframe.org/• Bibframe examples / Karen Coyle. http://kcoyle.net/bibframe/ Converted from MARCXML. Mosly in JSON, but one in turtle.• NISO Bibliographic Roadmap Development Project. http://www.niso.org/topics/tl/BibliographicRoadmap/

Page 47: Beyond MARC: MARC, linked data, and Bibframe

Beyond MARC24510$aBeyond MARC

dc:title “Beyond MARC”bf:title “Beyond MARC”

17 April 2013Thomas Meehan

Head of Current [email protected]

Page 48: Beyond MARC: MARC, linked data, and Bibframe

Appendix: Look at your own BIBFRAME examples!

How to get some BIBFRAME RDF/XML1. Go to http://bibframe.org/tools/compare/2. Enter an LC system ID into the box (e.g. 10342843) and click on Search.3. Click on BIBFRAME RDF/XML.

How to convert RDF/XML to RDF/Turtle4. Select and copy some RDF/XML.5. Go to http://www.rdfabout.com/demo/validator/ and make sure Input Format is set to RDF/XML6. Paste the BIBFRAME data into the box, overwriting anything that’s already there.7. Click on Validate. You will get three versions of the RDF: as Notation 3 (of which Turtle is a subset); N-

Triples; and RDF/XML.

How to view the RDF/Turtle8. Select and copy some Turtle, Notation 3 or N-Triples. Notation 3 or Turtle are by far the easiest to read and

look much the same.9. Go to http://www.aurochs.org/rdfv/rdfv.html10. Paste the RDF data into the box.11. Click on Submit.12. Click on the data itself to highlight various bits.