Beyond MARC: MARC, Linked Data, and Bibframe
17 April 2013
Thomas Meehan
Head of Current [email protected]
Card Index Catalogue
http://cardcat.ucl.ac.uk/cgi-bin/carddisplay.pl?card=887;drawer=13;max=931;ctype=C
AACR2
Models for decision : a conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society / edited by C.M. Berners-Lee. -- London : English Universities Press, 1965.
x, 149 p. : ill. ; 23 cm.
Includes bibliographical references.
• Berners-Lee, C. M.
MARC
MAchine
Readable
Cataloguing
AACR2 in MARC21
245 00 $a Models for decision : $b a conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society / $c edited by C.M. Berners-Lee.
260 __ $a London : $b English Universities Press, $c 1965.
300 __ $a x, 149 p. : $b ill. ; $c 23 cm.
504 __ $a Includes bibliographical references.
700 1_ $a Berners-Lee, C. M.
AACR2 in .mrc
00788nam a2200181 a 4500001002700000005001700027008004100044024001500085245021000100260004900310300003200359504004100391650003300432700002300465710003900488710003000527710004900557_UCL01000000000000000477125_20061112120300.0_850710s1965 enka b 000 0 eng _8 _ax280050495_00_aModels for decision :_ba conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society /_cedited by C.M. Berners-Lee._ _aLondon :_bEnglish Universities Press,_c1965._ _ax, 149 p. :_bill. ;_c23 cm._ _aIncludes bibliographical references._ 0_aDecision making_vCongresses._1 _aBerners-Lee, C. M._2 _aUnited Kingdom Automation Council._2 _aBritish Computer Society._2 _aOperational Research Society (Great Britain)__
RDA in MARC21
245 00 $a Models for decision : $b a conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society / $c edited by C.M. Berners-Lee.
264 _1 $a London : $b The English Universities Press Limited, $c 1965.
264 _4 $c ©1965
300 __ $a x, 149 pages : $b illustrations ; $c 23 cm.
336 __ $a text $2 rdacontent
337 __ $a unmediated $2 rdamedia
338 __ $a volume $2 rdacarrier
504 __ $a Includes bibliographical references.
700 1_ $a Berners-Lee, C. M., $e editor of compilation.
What is MARC for?
• Storage
• Exchange and distribution
• Manipulation
• Display
• Input (http://www.aurochs.org/zz/marc_input/marc_input.html)
• “Lingua franca of library cataloguing”
Finite Notation Problem
Too many subject schemes
650 _0 for LCSH
650 _1 for LC subject headings for children's literature
650 _2 for MeSH
…
650 _7 Source specified in subfield $2

Not enough indicators
246 184 $aThe title on the spine
Data in More Than One Place
Languages
008 (positions 35-37) eng
041 __ $a eng
240 10 $l English
546 __ $a In English.
Double Encoding: ISBD and MARC
Blanket : Constellation of Orion, 3.
260 __ $a Blanket$b Constellation of Orion$c 3
260 __ $a Blanket :$b Constellation of Orion,$c 3.
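The double encoding can be seen by deriving the punctuation from the subfield codes alone. The following is a minimal Python sketch, assuming a much-simplified punctuation table for field 260; the names `ISBD_260_PREFIX` and `render_260` are made up for illustration and are not part of any MARC library, and real ISBD rules cover many more cases.

```python
# Illustrative only: a simplified ISBD punctuation table for MARC field 260.
# Real ISBD rules cover repeated subfields and other cases not handled here.
ISBD_260_PREFIX = {
    "a": "",     # place of publication: no preceding punctuation
    "b": " : ",  # publisher name: preceded by space, colon, space
    "c": ", ",   # date of publication: preceded by comma, space
}


def render_260(subfields):
    """Build an ISBD display string from punctuation-free (code, value) pairs."""
    parts = [ISBD_260_PREFIX[code] + value for code, value in subfields]
    return "".join(parts) + "."


# The punctuation-free version of the 260 field shown above.
field_260 = [("a", "Blanket"), ("b", "Constellation of Orion"), ("c", "3")]
print(render_260(field_260))  # Blanket : Constellation of Orion, 3.
```

If the subfield codes are enough to regenerate the ISBD string, storing the punctuation inside the subfields as well records the same information twice.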
Data Mixed Up
GMD
245 10 $a Data on the web $h [electronic resource] :$b research and applications /$c Antonis Bikakis, Adrian Giurca (eds.).
245 10 $a Data on the web$b research and applications /$c Antonis Bikakis, Adrian Giurca (eds.).
Nothing allowed after 245$c
245 10 $a Enduring resistance : $b cultural theory after Derrida /$c edited by Sjef Houppermans, Rico Sneller, Peter van Zilfhout. = La résistance persévère : la théorie de la culture (d')après Derrida / édité par Sjef Houppermans, Rico Sneller, Peter van Zilfhout.
Text, Not Data
ISBN
020 __ $a 9780285638976 (pbk.)
020 __ $a 012002618X (ebook)
Title
245 10 $a British goblins :
Place of publication
260 __ $a Köln
Copyright date
260 __ $c c2005
264 _4 $c ©2002
264 _4 $c copyright 2005
264 _4 $c ℗1983
264 _4 $c phonogram 1993
Extent
300 __ $a ix, 300 p.
Dimensions
300 __ $c 23 cm
300 __ $c 9 mm
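To illustrate what "text, not data" costs in practice, here is a small Python sketch that has to pick the ISBN apart from the qualifier stored in the same 020 $a string. The pattern and the helper name `split_020a` are assumptions made for this example, not a standard routine, and the pattern only handles the simple shape shown on the slide.

```python
import re

# Hypothetical pattern: a 13- or 10-character ISBN, optionally followed by a
# qualifier in parentheses, as in the 020 $a examples above.
ISBN_WITH_QUALIFIER = re.compile(
    r"^\s*([0-9]{13}|[0-9]{9}[0-9Xx])\s*(?:\((.*)\))?\s*$"
)


def split_020a(value):
    """Return (isbn, qualifier) from an 020 $a string; qualifier may be None."""
    match = ISBN_WITH_QUALIFIER.match(value)
    if match is None:
        raise ValueError(f"Unrecognised 020 $a value: {value!r}")
    return match.group(1), match.group(2)


for raw in ["9780285638976 (pbk.)", "012002618X (ebook)"]:
    print(split_020a(raw))
```

Every system that wants the bare number has to do some version of this parsing, because the field holds a display string rather than separate data elements.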
Changing Text as Primary Key for Headings and Authorities
Author heading for deceased person
Niemeyer, Oscar, 1907-
Different preferences for writing name
Mao, Tse-tung, 1893-1976 [Former heading]
Mao, Zedong, 1893-1976
毛泽东, 1893-1976
Small differences could break match
Mao, Zedong, 1893-1976.
Mao, Zedong, 1893-1976
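The fragility of text as a key can be shown in a few lines of Python. This is only a sketch: the identifier `PERSON_ID` is invented for the example and does not come from any real authority file.

```python
# Two headings for the same person, differing only by a final full stop.
heading_a = "Mao, Zedong, 1893-1976."
heading_b = "Mao, Zedong, 1893-1976"

# Matching on the text itself fails.
print(heading_a == heading_b)  # False

# Hypothetical identifier, invented for this sketch.
PERSON_ID = "http://example.org/person/0001"

# Each variant label points at the same identifier, so labels can change
# without breaking the match.
labels_to_id = {
    "Mao, Tse-tung, 1893-1976": PERSON_ID,
    heading_a: PERSON_ID,
    heading_b: PERSON_ID,
    "毛泽东, 1893-1976": PERSON_ID,
}
print(labels_to_id[heading_a] == labels_to_id[heading_b])  # True
```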
Expressing Relationships
What does this mean?
700 0_ $a Homer.$t Iliad.
700 1_ $a Berners-Lee, Tim.
Record Not Data
00788nam a2200181 a 4500001002700000005001700027008004100044024001500085245021000100260004900310300003200359504004100391650003300432700002300465710003900488710003000527710004900557_UCL01000000000000000477125_20061112120300.0_850710s1965 enka b 000 0 eng _8 _ax280050495_00_aModels for decision :_ba conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society /_cedited by C.M. Berners-Lee._ _aLondon :_bEnglish Universities Press,_c1965._ _ax, 149 p. :_bill. ;_c23 cm._ _aIncludes bibliographical references._ 0_aDecision making_vCongresses._1 _aBerners-Lee, C. M._2 _aUnited Kingdom Automation Council._2 _aBritish Computer Society._2 _aOperational Research Society (Great Britain)__
Leader
Directory
Data
245 field, final 710 field
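The record structure can be unpicked by hand. The Python sketch below reads the leader and directory copied from the record above (its first 180 characters) and lists where each field sits in the data. It assumes the spacing of the leader survived transcription intact, and that the underscores in the slide stand in for the record's control characters.

```python
# Leader and directory copied from the record above.
LEADER = "00788nam a2200181 a 4500"
DIRECTORY = (
    "001002700000" "005001700027" "008004100044" "024001500085"
    "245021000100" "260004900310" "300003200359" "504004100391"
    "650003300432" "700002300465" "710003900488" "710003000527"
    "710004900557"
)

record_length = int(LEADER[0:5])   # total length of the record
base_address = int(LEADER[12:17])  # offset at which the field data begins

# Each directory entry is 12 characters: tag (3), field length (4), start (5).
entries = []
for i in range(0, len(DIRECTORY), 12):
    entry = DIRECTORY[i:i + 12]
    entries.append((entry[0:3], int(entry[3:7]), int(entry[7:12])))

for tag, length, start in entries:
    print(tag, length, start)
```

The 245 entry, for example, says the title field is 210 characters long and starts 100 characters into the data. The last 710 field starts at 557 and runs for 49 characters, which together with the base address of 181 and one closing terminator accounts for the full record length of 788.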
Other Considerations
• Only libraries use MARC
  – Libraries tied to library-specific software/processes
  – Outside agencies can’t take advantage of library data and standards (See Also: RDA not freely available)
• Not even all libraries use MARC
  – Archives
  – Repositories
  – Non-MARC LMSs
• US RDA test demanded progress be made on a replacement before agreeing to adopt RDA
Linked Data: “the web of data”
• Use URIs as names for things
• Use HTTP URIs so that people can look up those names.
• When someone looks up a URI, provide useful information, using the standards (RDF, SPARQL)
• Include links to other URIs, so that they can discover more things.
Tim Berners-Lee (2006)
English sentence
Brideshead Revisited was written by Evelyn Waugh.
ERM written out
Brideshead revisited created by Evelyn Waugh
Adding URIs: Brideshead revisited
http://id.loc.gov/authorities/names/no97080492
created by Evelyn Waugh
Adding URIs: Waugh
http://id.loc.gov/authorities/names/no97080492
created by
http://id.loc.gov/authorities/names/n79049248
Adding URIs: creator
http://id.loc.gov/authorities/names/no97080492
http://purl.org/dc/terms/creator
http://id.loc.gov/authorities/names/n79049248
RDF Statement
<http://id.loc.gov/authorities/names/no97080492> <http://purl.org/dc/terms/creator> <http://id.loc.gov/authorities/names/n79049248> .
RDF (Turtle)
@prefix lc_names: <http://id.loc.gov/authorities/names/> .
@prefix dc: <http://purl.org/dc/terms/> .

lc_names:no97080492 dc:creator lc_names:n79049248 .
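The Turtle prefixes are only shorthand for the full URIs. The Python sketch below expands them again to recover the full statement; the helper names `expand` and `to_ntriple` are made up for this example, and a real application would use an RDF library instead.

```python
# Prefixes as declared in the Turtle example above.
PREFIXES = {
    "lc_names": "http://id.loc.gov/authorities/names/",
    "dc": "http://purl.org/dc/terms/",
}


def expand(prefixed_name):
    """Expand a prefixed name such as 'dc:creator' into a full URI."""
    prefix, local = prefixed_name.split(":", 1)
    return PREFIXES[prefix] + local


def to_ntriple(subject, predicate, obj):
    """Write one triple of prefixed names as a full-URI statement."""
    terms = (subject, predicate, obj)
    return " ".join(f"<{expand(term)}>" for term in terms) + " ."


print(to_ntriple("lc_names:no97080492", "dc:creator", "lc_names:n79049248"))
```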
Brideshead Revisited
@prefix lc_names: <http://id.loc.gov/authorities/names/> .
@prefix lc_languages: <http://id.loc.gov/vocabulary/languages/> .
@prefix dc: <http://purl.org/dc/terms/> .

lc_names:no97080492 dc:creator lc_names:n79049248 ;
    dc:created "1945" ;
    dc:extent "1 volume" ;
    dc:language lc_languages:eng ;
    dc:title "Brideshead revisited" ;
    dc:type <http://purl.org/dc/dcmitype/Text> .
LC Name Authority for Waugh (excerpt)
@prefix lc_names: <http://id.loc.gov/authorities/names/> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix mads: <http://www.loc.gov/mads/rdf/v1#> .
@prefix viaf: <http://viaf.org/viaf/sourceID/> .

lc_names:n79049248 rdf:type mads:PersonalName ;
    rdf:type mads:Authority ;
    mads:authoritativeLabel "Waugh, Evelyn, 1903-1966"@en ;
    mads:hasExactExternalAuthority viaf:68937142 .
Microdata, RDFa, Schema.org
OCLC Worldcat uses embedded Schema.org:
http://www.worldcat.org/oclc/221944758
Worldcat Schema.org data for a book
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix schema: <http://schema.org/> .
@prefix worldcat: <http://www.worldcat.org/oclc/> .
@prefix library: <http://purl.org/library/> .
@prefix viaf: <http://viaf.org/viaf/> .
@prefix lc_authorities: <http://id.loc.gov/authorities/names/> .
@prefix madsrdf: <http://www.loc.gov/mads/rdf/v1#> .

worldcat:221944758 rdf:type schema:Book ;
    library:oclcnum "221944758" ;
    schema:name "Models for decision : a conference under the auspices of the United Kingdom Automation Council organised by the British Computer Society and the Operational Research Society" ;
    library:placeOfPublication _:1 ;
    schema:publisher _:4 ;
    schema:datePublished "[1965]" ;
    schema:numberOfPages "149" ;
    schema:contributor viaf:149407214 ;
    schema:contributor viaf:130073090 ;
    schema:contributor viaf:137135158 ;
    schema:contributor viaf:36887201 .

_:1 rdf:type schema:Place ;
    schema:name "London :" .
_:4 rdf:type schema:Organization ;
    schema:name "English Universities Press" .
viaf:149407214 rdf:type schema:Organization ;
    madsrdf:isIdentifiedByAuthority lc_authorities:n79056431 ;
    schema:name "British Computer Society." .
viaf:130073090 rdf:type schema:Organization ;
    madsrdf:isIdentifiedByAuthority lc_authorities:n85076053 ;
    schema:name "Operational Research Society." .
viaf:137135158 rdf:type schema:Organization ;
    madsrdf:isIdentifiedByAuthority lc_authorities:n79063901 ;
    schema:name "Institution of Electrical Engineers." .
viaf:36887201 rdf:type schema:Person ;
    schema:name "Berners-Lee, C. M." .
(http://www.aurochs.org/rdfv/rdfv.html : click Get Sample Data (OCLC))
Author Information in Worldcat RDF (Turtle)
@prefix worldcat: <http://www.worldcat.org/oclc/> .
@prefix schema: <http://schema.org/> .
@prefix viaf: <http://viaf.org/viaf/> .

worldcat:221944758 schema:contributor viaf:36887201 .
Author Information in Worldcat RDF
<http://www.worldcat.org/oclc/221944758> <http://schema.org/contributor> <http://viaf.org/viaf/36887201> .
Lots of Ways To Do It
@prefix schema: <http://schema.org/> .
@prefix dc: <http://purl.org/dc/terms/> .
@prefix viaf: <http://viaf.org/viaf/> .
@prefix rda_roles: <http://rdvocab.info/roles/> .
@prefix cam: <http://data.lib.cam.ac.uk/id/entity/> .
@prefix bnb_person: <http://bnb.data.bl.uk/id/person/> .

example:book0001 dc:creator cam:cambrdgedb_eeacef63d900c2acffc3daa400f3d4e4 .
example:book0001 dc:creator bnb_person:WaughEvelyn1903-1966 .
example:book0001 schema:creator viaf:68937142 .
example:book0001 rda_roles:creator viaf:68937142 .
example:book0001 dc:creator lc_names:n79049248 .
example:book0001 dc:creator "Waugh, Evelyn, 1903-1966" .

[from CUL, BNB, OCLC Worldcat, RDA+VIAF, Dublin Core+LC Names, made-up]
Going from Wikipedia to a Catalogue Search
1. “Francis Bacon” (Wikipedia)
2. http://viaf.org/viaf/31992319 (VIAF)
3. “Bacon, Francis, 1561-1626” (LC Authorities)
4. UCL Explore search

5. “Francis Bacon (artist)” (Wikipedia)
6. http://viaf.org/viaf/14789843 (VIAF)
7. “Bacon, Francis, 1909-1992” (LC Authorities)
8. UCL Explore search
Linked Data: Is It Any Good?
• Not library-specific
  – Detailed library data becomes part of the web
  – Libraries can benefit from wider software, community, and expertise; less tied to specific vendors
  – Non-librarians can use our data
• Not catalogue-specific: e.g. if archives, repositories, catalogues, and others publish linked data and share identifiers (URIs), then it can be mixed and re-used in interesting ways
• Can be linked with other schemes, e.g. authorities such as VIAF with Wikipedia, ORCID, and ISNI
• Backbone of other big initiatives:
  – Schema.org, used by major search engines (Google, Bing, Yahoo, Yandex)
  – UK government open data: data.gov.uk
  – Dbpedia
  – BBC websites, e.g. Wildlife Finder (takes data from Wikipedia) and World Cup sites
• Based on very basic and flexible Entity Relationship Model (ERM), the same structure as e.g. FRBR
• Provenance of deconstructed data hard to determine
• Can get complex very quickly
• Linked data often synonymous with linked open data (a good thing for libraries)
• No standard way of presenting bibliographic information as linked data, although…
BIBFRAME
BIBliographic
FRAMEwork Initiative
BIBFRAME Model
BIBFRAME Model: Resource
A BIBFRAME Resource can be anything: a Work, Instance, Authority, or Annotation
bf:authorizedAccessPoint
bf:description
bf:identifier
bf:label
bf:subject
bf:relatedResource

BIBFRAME Model: Work
Work: A resource reflecting a conceptual essence of the cataloging resource. (A FRBR Work/Expression)
bf:creator
bf:note
bf:language
bf:title
bf:subject
bf:relatedWork
bf:hasInstance
bf:hasExpression
bf:expressionOf

BIBFRAME Model: Instance
Instance: A resource reflecting an individual, material embodiment of the Work. (A FRBR Manifestation)
bf:title
bf:contributor
bf:placePub
bf:provider
bf:pubDate
bf:extent
bf:otherFeatures
bf:dimensions
bf:isbn
bf:language
bf:note
bf:instanceOf

BIBFRAME Model: Authority (Person)
Authority: A resource reflecting key authority concepts that have defined relationships reflected in the Work and Instance.
bf:resourceRole
bf:isni
bf:orcid
bf:viaf

BIBFRAME Model: Annotation
Annotation: A resource that decorates other BIBFRAME resources with additional information, e.g. holdings, cover art, reviews.
bf:annotates
bf:annotationAssertedBy
bf:annotationBody
BIBFRAME Examples
http://kcoyle.net/bibframe/
http://www.aurochs.org/rdfv/rdfv.html
BIBFRAME: Is It Any Good?
• Still very much in draft
• Uses own scheme and namespace
  – ensures security
  – but against spirit and usual practice of linked data
• Reliant on successful conversion of MARC records (more so than move from AACR2 to RDA)
• Not limited to encoding AACR2 or RDA or FRBR or Dublin Core or…
• Basically under the control of the Library of Congress
More Information
MARC
• MARC21 Standards. http://www.loc.gov/marc/
• MARC21 Bibliographic. http://www.loc.gov/marc/bibliographic/ecbdhome.html
• MARC21 Record Structure. http://www.loc.gov/marc/specifications/specrecstruc.html
• UKMARC Manual. http://www.bl.uk/bibliographic/ukmarc.html
• MARC Must Die / Roy Tennant. http://www.libraryjournal.com/article/CA250046.html

Linked Data
• Library of Congress Linked Data Service. http://id.loc.gov/ Includes LC Name Authorities, LCSH, geographic and language codes, and others.
• Virtual International Authority File (VIAF). http://viaf.org/
• The RDA (Resource Description and Access) Vocabularies at the Open Metadata Registry. http://rdvocab.info/
• Schema.org for books. http://schema.org/Book
• Dbpedia. http://dbpedia.org/About A linked data version of Wikipedia.
• Linked Open BNB. http://bnb.data.bl.uk/search
• data.lib.cam.ac.uk. http://data.lib.cam.ac.uk
• BBC Wildlife Finder. http://www.bbc.co.uk/nature/wildlife Compare e.g. http://www.bbc.co.uk/nature/life/Desert_locust and http://www.bbc.co.uk/nature/life/Desert_locust.rdf
• Bookmarklet for searching catalogues from Wikipedia. http://www.aurochs.org/aurlog/2013/03/25/bookmarklet-for-searching-catalogues-from-wikipedia/

Bibframe
• LC Bibliographic Framework Transition Initiative. http://www.loc.gov/marc/transition/
• BIBFRAME.org : New Bibliographic Framework. http://bibframe.org/
• Bibframe examples / Karen Coyle. http://kcoyle.net/bibframe/ Converted from MARCXML. Mostly in JSON, but one in Turtle.
• NISO Bibliographic Roadmap Development Project. http://www.niso.org/topics/tl/BibliographicRoadmap/
Beyond MARC
245 10 $a Beyond MARC
dc:title “Beyond MARC”
bf:title “Beyond MARC”
17 April 2013
Thomas Meehan
Head of Current [email protected]
Appendix: Look at your own BIBFRAME examples!
How to get some BIBFRAME RDF/XML
1. Go to http://bibframe.org/tools/compare/
2. Enter an LC system ID into the box (e.g. 10342843) and click on Search.
3. Click on BIBFRAME RDF/XML.

How to convert RDF/XML to RDF/Turtle
4. Select and copy some RDF/XML.
5. Go to http://www.rdfabout.com/demo/validator/ and make sure Input Format is set to RDF/XML.
6. Paste the BIBFRAME data into the box, overwriting anything that’s already there.
7. Click on Validate. You will get three versions of the RDF: as Notation 3 (of which Turtle is a subset); N-Triples; and RDF/XML.

How to view the RDF/Turtle
8. Select and copy some Turtle, Notation 3 or N-Triples. Notation 3 or Turtle are by far the easiest to read and look much the same.
9. Go to http://www.aurochs.org/rdfv/rdfv.html
10. Paste the RDF data into the box.
11. Click on Submit.
12. Click on the data itself to highlight various bits.