national digital library and lld
TRANSCRIPT
National Digital Library of Korea
(Dibrary) and its Linked Data Strategy
Jinho Park
Senior Researcher, National Library of Korea
ISO/TC46 SC9 Korea Secretariat
Member of the International Relations
Committee, Korean Library Association
CJKDLI Korea Working Group Leader
December 12, 2012 | Kuala Lumpur, Malaysia | Broadband and Information Technology Summit for Libraries 2012
Innovation and Business Transformation for Information Service Excellence
Sam Oh
Professor, Sungkyunkwan University LIS
Affiliate Professor, University of Washington,
iSchool
ISO/IEC JTC1/SC34 Chair
ISO TC46/SC9 Chair
DCMI Oversight Committee
1. Dibrary Brand
1
Bra
nd
DIGITAL + LIBRARY
Co
nc
ep
t
• High digital
information technology + • Analog space where
nature and human
become one
“creating new digilog culture”
Brand & Concept
The brand name of the National Digital Library of Korea.
2. Dibrary’s Vision
2
A multicultural space harmonizing nature and cutting-edge digital facility.
Culture
Openness
High-tech
Nature
• Providing cultural
space to all users
• Integrating various
high digital
technologies
• Offering convenient access
for everyone
• Sitting close to nature-
friendly park
Dibrary
3. Establishment Background
3
Establishment of a new infrastructure, Dibrary.
External Issues
• Acceleration of
digitalizing data
processing infrastructure.
• Necessity of The NLK’s
new role in the digital
information era.
Internal Issues
“Integrate the latest
Information technology
into the library
infrastructure”
Dibrary
• Increasing needs for the
role and function as a
digital multicultural space.
• Providing solutions to the
limited stacks issue and
creating a new service
area.
“High quality digital
information available to all
users”
4. Vision, Principles, Goals
4
Establishment of a new infrastructure, Dibrary.
Integration
openness
Portal
Library Portal
Simple Search Interface
A Large Scale of
Information Resource
International Standards
Participations, Shared
Environment
Digital Archiving
Digital Technology
Principles Vision
Integration
Openness
Integrated Management of
Digital Collection
Creation of the Portal Site
Offering the Information Commons
Maintenance of the Digital Archives
Operational Innovation
on Various Digital System
Goals
5. Functions & Roles
5
Building a general digital information center considering synergy with The
National Library of Korea.
Considering synergy
with the National
Library of Korea
• Location
– a public square in front of
the National Library of Korea
• Area
– 38,014㎡
(3 stories above ground
5 stories underground)
• Construction
– 2002~2008`(7years)
– May 2009(grand opening)
• Total Cost
– 117.9 million dollars
• General Audio-Video Space, Global
Lounge, Digital Cluster, Multiplex, Media
Center, Digital Meeting Room, Space for
the Underprivileged Class, Exhibition
Hall, Digital Book Café…
• Book stacks, non-book materials stacks,
A thermo hygrostat control room,
Document transfer room
• General planning group office,
Information systems office, Digital
information use office
• Pathway connecting the main building,
Machinery room, Electricity room,
Parking lot
Information & Culture Facilities
Stacks
Management Facilities
• Incorporation with the
main Library building
• Differentiation with
digital knowledge and
information service
Main Facilities
6
Making an integrated IT
infrastructure to make the
base of service.
Integrated IT infrastructure Portal System and Cooperation Network
Management System Integrated Service System
Setting environment for
integrated search of digital
information
Building an integrated
service space of digital
information
Creating a management
system of digital collections
6. Four Main Projects
4 main projects for The Dibrary(NDLK)
7
7. Dibrary Service
Providing an integrated search service through on-line and off-line.
Digital
contents
Digital
facility
Dibrary Space Dibrary Portal
On-line service Off-line service
contents experience using cutting-
edge digital facility
– Entertainment , media creation,
digital contents viewing, active
exchange, information service,
service for the underprivileged class
• Dibrary visitor influx through
space distinction
• Providing knowledge and
information using on-line/mobile
– Sorted by Data Type, Target,
Region, and Topic
– Personalized service and
cooperation network
• Dibrary visitor influx through
Internet
8
8. Dibrary Portal
A global portal providing high-quality digital knowledge and information to
anyone, anytime, anywhere.
Based on huge high-quality digital resources.. ..Providing integrated portal service to users
Main
Portal
Sub
Portal
• Search Service
– Integrated search, directory
service
• Participation Service
– Q&A, recommendation
service
• Sharing Service
– Sharing special knowledge,
blog/community service
• Application Service
– API/RSS,
statistics/evaluation
• Policy information portal
• Regional portal
• Multicultural portal
• Portal for the disabled
• Book information
• Academic data
• Government
information
• Regional information
• Foreign information
• Author information
• Knowledge sharing
Contents
scope
9
9. Dibrary Portal – Main portal service
Providing integrated search service
Main portal service
menu
Search
Academic data
Research information
Foreign information
Directory
Q&A
Knowledge sharing
Recommendation
Participation
My library
10. Dibrary Portal – Sub portal service
4 sub-portal services characterized by topic and users.
Contents
• Providing an integrated search service on the policy
data of government departments and public institutions
• Codifying and securing regional information
• Providing an integrated search service on regional
academic research/knowledge and information
• Providing service for foreign workers, married immigrants, new
settlers, foreign students, and natives
• Providing an integrated search & community service based on 7
languages
– Vietnamese, Chinese, English, Japanese, Tagalog, Thai and
Korean
• Providing an integrated search & a reading service on
the available contents for the disabled
– Strengthen users’ information access and convenience
Policy information
Regional
Multicultural
Portal for
the disabled
11. Dibrary Portal – Cooperation network sharing digital resources
Developing cooperation network sharing and linking digital contents at
home and abroad.
Main Portal
• Private contents
• Key sources of
knowledge &
information
Organizations,
individual
Policy Information
• Policy information
sources of public
institutions
• International
organizations
Multicultural Information
• Overseas cultural
centers
• Multicultural
information centers
• Overseas public
institutions
Regional Information
• Regional libraries
• Public libraries
• Collecting centers of
regional knowledge
Information for the disabled
• Organizations &
institutions for the
disabled
• Braille libraries
12. Dibrary Space – Service
Providing new service considering users’ convenience and participation.
Extending users’ participation Ensure simple management
Strengthen user convenience
• Booking/guiding facility service
• Real time online Information helper
• Dibrary radio service
• Mobile terminal rental service
• Providing facility service
• Face-to-face service &
education program
• Digital text participation service
• UCC exhibition service
• Remote Management System
Dibrary Space Service
Consider the underprivileged class
Providing
open
information
service
13. Dibrary Space – Facility Overview
Introduction to the B3 ~ B1 Dibrary space.
Main Facilities
B1 • Digital Book
Café
• Pathway
connecting the
main building
B2 • Digital Cluster
• Digital Meeting
Room
• Media Center
• Multiplex
• Space for the
Underprivileged
Class
B3 • Lobby
• Exhibition Hall
• General Audio-
Video Space
디지털북카페 (지하 1층)
Dibrary is preparing for a change after 4 years opening
25
The new direction of Dibrary can be summarized as follows
26
Semantic Web(Linked Open Data)
• What Dibrary (NLK) can do to open its library data?
Converting national bibliographic data, subject and author
authority files into linked data and let anyone to use them
via the Web.
• What will be benefits of NLK LD?
Merging NLK Linked Data with other LLD so new associations
of knowledge can be made.
"The Semantic Web is an extension of
the current web in which information is
given well-defined meaning, better
enabling computers and people to work in
cooperation."
Tim Berners-Lee, James Hendler, Ora Lassila,
The Semantic Web, Scientific American, May 2001
[출처 : http://www.slideshare.net/sandhaus/all-about-rnews-evan-sandhaus]
structured
unstructured
Linked Data Principles
1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful RDF
information.
4. Include RDF statements that link to other URIs so that they
can discover related things.
Tim Berners-Lee 2007
http://www.w3.org/DesignIssues/LinkedData.html
The RDF Data Model
Richard Cyganiak
dbpedia:Berlin
foaf:name
foaf:based_near
foaf:Person rdf:type
pd:cygri
Data items are identified with HTTP URIs
pd:cygri
Richard Cyganiak
dbpedia:Berlin
foaf:name
foaf:based_near
foaf:Person rdf:type
pd:cygri = http://richard.cyganiak.de/foaf.rdf#cygri
dbpedia:Berlin = http://dbpedia.org/resource/Berlin
Resolving URIs over the Web
dp:Cities_in_Germany
3.405.259 dp:population
skos:subject
Richard Cyganiak
dbpedia:Berlin
foaf:name
foaf:based_near
foaf:Person rdf:type
pd:cygri
Dereferencing URIs over the Web
dp:Cities_in_Germany
3.405.259 dp:population
skos:subject
Richard Cyganiak
dbpedia:Berlin
foaf:name
foaf:based_near
foaf:Person rdf:type
dbpedia:Hamburg
dbpedia:Muenchen
skos:subject
skos:subject
pd:cygri
The Disco – Hyperdata Browser
W3C Linking Open Data (LOD) Project
• Grassroots community effort to
– publish existing open license datasets as Linked Data on the Web
– interlink things between different data sources
LOD Datasets on the Web: May 2007
Over 500 million RDF triples
Around 120,000 RDF links between data sources
Example RDF Links
• RDF links from DBpedia to other data sources
<http://dbpedia.org/resource/Berlin> owl:sameAs <http://sws.geonames.org/29501
59> .
<http://dbpedia.org/resource/Tim_Berners-Lee> owl:sameAs <http://www4.wiwis
s.fu-berlin.de/dblp/resource/person/100007> .
LOD Datasets on the Web: September 2008
LOD Datasets on the Web: March 2009
LOD Datasets on the Web: July 2009
LOD Datasets on the Web: 2010
LOD Datasets on the Web: 2011
The Principles of Linked Data (Revisited)
• What kind of library data are suitable for these principles?
– Those that are frequently referenced and updated by librarians,
in relation to their works or within the information process
system
– Those that provide users with links as other references (links)
in relation to more accurate search results
– Those that are meaningful in themselves and are independently
capable of being referenced by other organizations/systems
– Those that have values capable of being recognized as unique
information via URI
44
The Principles of Linked Data (Revisited)
• Data owned by libraries:
– Bibliographic data
– Holdings records
– Authority records
• Authors, Titles, Subject Headings
• Those being endlessly referenced and updated by librarians, in relations to their works, or within the information process system
• Those providing links as other references (links) in relations to the more accurate search results to users
• Those that are meaningful themselves and independent ones capable of being referenced by other organizations/systems
• Those capable of being recognized with their values as unique information through URI
45
LLD is about the Links
Books
– http://worldcat.org/oclc/123456
Classification numbers
– http://dewey.info/class/641/about
People
– http://viaf.org/viaf/12345679
Subject headings
– http://tspilot.oclc.org/fast/fst01234567
LLD is about the Openness
• What sort of license is the data available in?
LLD is about the Data
<rdf:RDF>
<rdf:Description
rdf:about="http://viaf.org/viaf/12345679">
<rdf:type rdf:resource=
"http://xmlns.com/foaf/0.1/Person"/>
<rdf:type rdf:resource=
"http://RDVocab.info/uri/schema/FRBRentitiesRDA/Person"/>
<foaf:name>Mozziconacci, Jean-Francois</foaf:name>
</rdf:Description>
</rdf:RDF>
URIs for Dewey System
It should be simple:
http://dewey.info/class/641/
Browsers get redirected to:
http://dewey.info/class/641/about
RDF clients get redirected to:
http://dewey.info/class/641/about.rdf
But the DDC has a long complex history
What language did you want?
http://dewey.info/class/641/about.en
http://dewey.info/class/641/about.fr
What edition did you want?
http://dewey.info/class/641/e22/about
http://dewey.info/class/641/e23/about
In what language?
http://dewey.info/class/641/e22/about.en
In what format?
http://dewey.info/class/641/e22/about.en.html
http://dewey.info/class/641/e22/about.en.rdf
URIs for Virtual International Authority File: VIAF
It is blissfully simple compared to Dewey!
http://viaf.org/viaf/12345679
Generates a 303 redirect to:
http://viaf.org/viaf/12345679/
Where content negotiation will get you either:
http://viaf.org/viaf/12345679/viaf.html
http://viaf.org/viaf/12345679/viaf.xml
http://viaf.org/viaf/12345679/viaf.rss
http://viaf.org/viaf/12345679/viaf.rdf
http://viaf.org/viaf/12345679/marc21.xml
http://viaf.org/viaf/12345679/unimarc.xml
bibo:Document
bibo:Book
nlon:Score
nlon:Electronic Document
bibo:Map
bibo:AudioVisual Document
bibo:AudioDocument
nlon:Complex Document
nlon:OldBook
nlon:Concept
nlon:Location
nlon:Country
nlon:Library
nlon:Government
nlon:University
foaf:Organization
owl:Thing
string
AdministrativeSection
Countries
University
KDATA
string
string
string
dct:references dct:isReferencedBy
dct:references dct:isReferencedBy
LinkedData
COMET, BNB, PODE,
LIBRIS,LC owl:sameAs
geo:SpatialThing
bibo:Thesis
foaf:Person
nlon:Author nlon:Librarian
53
Resource
bibo:Document
a
Identifier
id.loc.gov
dct:language
nlk:Concept
skos:Concept
rdfs:subClassOf
KDC
DDC
Code
Title
Miscellaneous Literals
geo:SpatialThing
nlk:Location
rdfs:subClassOf
Resource as Location
a
nlk:isGeographicAreaOf
dct:alternative dct:title nlk:subtitle nlk:titleOfSeries nlk:titleOfMainSeries nlk:headingOfOriginalLanguage nlk:titleOfOriginalLanguage nlk:headingOfTranslation nlk:titleOfTranslation nlk:uniformTitleOfSeries
An Instance A Literal
A Class External Link
bibo:uri bibo:number dct:requires dct:created dct:issued dct:extent dct:description dct:tableOfContents dct:format dct:abstract
nlk:audienceNote nlk:supplementNote nlk:physicalFormAvailableNote nlk:reproductionNote nlk:reproductionPlace nlk:reproductionInstitution nlk:reproductionDate nlk:originalVersionNote nlk:locationNote nlk:holdingInstitution nlk:useAndReproductionNote nlk:languageNote nlk:linkingEntryComplexityNote nlk:awardsNote nlk:holdingItemNote nlk:publicationPlace nlk:department nlk:degreeYear nlk:bibliography nlk:restriction nlk:citationReferenceNote
nlk:authenticationCode nlk:classificationNumberOfLC nlk:itemNumberOfLC nlk:classificationNumberOfNLK nlk:itemNumberOfNLK nlk:volumeOfNLK nlk:kdc nlk:itermNumberOfKDC nlk:editionOfKDC nlk:ddc nlk:itemNumberOfDDC nlk:editionOfDDC nlk:otherNumber nlk:itemNumberOfOtherNumber nlk:sourceOfOtherNumber
dct:identifier nlk:nlkcn bibo:isbn bibo:issn nlk:cip nlk:strn
foaf:Agent
Resource as Agent
a
dct:creator dct:publisher nlk:awardedFrom
nlk:create nlk:publish
geo:location dct:isPartOf / dct:hasPart
dct:isReferencedBy / dct:references dct:relation
dct:hasVersion dct:hasFormat
dct:replaces / dct:isReplacedBy dct:identifier
bibo:ThesisDegree
bibo:degree
Resource as ThesisDegree
a
SubjectHeading
dct:subject / nlk:isSubjectOf
a
a
a
dct:subject / nlk:isSubjectOf
dct:subject / nlk:isSubjectOf
nlk:birthYear
nlk:deathYear
foaf:name
id.loc.gov dct:subject
COMET
LIBRIS
BNB
owl:sameAs
owl:sameAs
owl:sameAs
Subject
Author / Organization
nlk:numberMark
KDATA
owl:sameAs
foaf:Organization
rdfs:subClassOf nlk:Government
rdfs:subClassOf owl:sameAs
nlk:Library
rdfs:subClassOf owl:sameAs
nlk:University rdfs:subClassOf owl:sameAs
Legend
PODE
owl:sameAs
skos:closeMatch
@prefix dc: <http://purl.org/dc/elements/1.1/> . @prefix nlk: <http://data.nl.go.kr/resource/> . @prefix foaf: <http://xmlns.com/foaf/0.1/> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . @prefix dct: <http://purl.org/dc/terms/> . @prefix nlon: <http://data.nl.go.kr/ontology/> . @prefix owl: <http://www.w3.org/2002/07/owl#> . @prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> . @prefix skos: <http://www.w3.org/2004/02/skos/core#> . @prefix bibo: <http://purl.org/ontology/bibo/> . @pefex geo: <http://www.w3.org/2003/01/geo/wgs84_pos#> .
54
55
56
57
58
Smart Data and its Potentials
• Implications of Linked Data and Users
– Importance of making data smarter
• Libraries provide countless amounts of information and media
• Libraries need to be a trustworthy entity
• Libraries must be placed where permanent access to and preservation
of data are guaranteed
• Libraries should remain faithful to their most basic and natural role
– Providing users with diverse solutions to problems
• How do users think and solve problems?
• Is it possible to create a system configuration (interface configuration)
to assist users’ problem-solving process?
59
NLK LLD Strategy
• The first priority is opening NLK data
– Basically NLK data is a public data that should be open to anyone.
– The NLK must be open and easily accessible to others
• Try hard to use global standard formats when opening NLK
data
– Recommend to respect linked data principles when opening
• Contribute to information ecosystem.
– The Web is most general and accessible platform and ecosystem.
Contribute to the Web.
• Think more about contributing to the global database (Web)
rather than what NLK will gain from this endeavor.
– The NLK focus should be on providing the world with new
opportunities derived from the live NLK linked data, instead of
thinking about what NLK will gain by opening its data
60
Recent Issues
• Big Data
• Linked Data
• Open Data
61
New Possibilities are rooted in curating what we
already have.
62
References
• Image Sources – http://www.flickr.com/photos/wiertz/8011916658
– http://www.lafabbricadellarealta.com/2012/05/07/an-idea-worth-
sharing-radical-openness/
– http://www.bu.edu/ceit/
– http://www.w3.org/DesignIssues/LinkedData.html
63
64