Linked Data: Emblematic Applications on Legacy Data in Libraries
Overview of Presentation
• Linked Data: Introduction
– Evolution of Syntactic Web to Linked Web
– Linked Data Working Principles
• Current Trends & Implementations
• Linking Legacy Data
– Annotating Text Data & Publishing Legacy Data as Linked Data
• Conclusion
04/18/23 2
Evolution of Syntactic Web to Linked Web
• Syntactic Web: Web of Documents; human-clickable hyperlinks; Web 1.0, Web 2.0; content not well structured; can't ask expressive queries; can't process content within applications
• Semantic Web: Web of Data; Web 3.0; machine processable & understandable; data sets isolated, unconnected
• Linked Web: meaningful links between related data sets, making the Web like a single global database
Need for Linked Data
• To enhance re-usability, findability & visibility of our data
• Publish structured data directly on the Web in addition to classic HTML pages.
• Different Approaches
– Microformats
– Web APIs & Mashups
» APIs slice the Web into separate data silos
– Linked Data
Christian Bizer: How to Publish Linked Data on the Web - Introduction (10/27/2008)
The Web of Linked Data
Use Semantic Web technologies to publish structured data on the Web and to set links between data from one data source and data within other data sources.
Linked Data Working Principles
• Use URIs as names for things.
• Use HTTP URIs, so that people can look up those names.
• When someone looks up a URI, provide useful information, using the standards (RDF).
• Include links to other URIs, so that they can discover more things.
Tim Berners-Lee 2007
http://www.w3.org/DesignIssues/LinkedData.html
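The "look up a URI and get useful information back" principle is commonly realized through HTTP content negotiation. The following is a minimal sketch, assuming the server honours an `Accept` header asking for RDF/XML; the helper name `build_lookup_request` is hypothetical, and the request is only prepared here, not sent.

```python
import urllib.request


def build_lookup_request(uri, media_type="application/rdf+xml"):
    """Prepare (but do not send) an HTTP GET that asks for RDF instead of HTML."""
    return urllib.request.Request(uri, headers={"Accept": media_type})


# URI taken from the RDF/XML example later in this deck.
req = build_lookup_request("http://dbpedia.org/resource/Robert_Silverberg")

# Sending the request needs network access, so it is left commented out:
# with urllib.request.urlopen(req) as response:
#     rdf_xml = response.read().decode("utf-8")
```

Whether a given server actually returns RDF for this header depends on how it was set up, so the response type should be checked before parsing.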
URIs to identify people & relationships
People:
http://drtc.isibang.ac.in/DRTC/students/sayon
http://drtc.isibang.ac.in/DRTC/students/vikas
http://drtc.isibang.ac.in/DRTC/students/debashis
Place:
http://live.dbpedia.org/page/Bangalore
Relationships:
http://xmlns.com/foaf/0.1/based_near
http://xmlns.com/foaf/0.1/knows
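The URIs on this slide can be combined into subject-predicate-object statements. The sketch below is illustrative only: the slide does not say who knows whom or who is based near Bangalore, so the specific pairings are assumptions, and `to_ntriples` and `objects_of` are hypothetical helper names.

```python
FOAF = "http://xmlns.com/foaf/0.1/"
STUDENTS = "http://drtc.isibang.ac.in/DRTC/students/"
BANGALORE = "http://live.dbpedia.org/page/Bangalore"

# Assumed pairings, for illustration only; the slide gives just the URIs.
TRIPLES = [
    (STUDENTS + "sayon", FOAF + "knows", STUDENTS + "vikas"),
    (STUDENTS + "vikas", FOAF + "knows", STUDENTS + "debashis"),
    (STUDENTS + "sayon", FOAF + "based_near", BANGALORE),
]


def to_ntriples(triples):
    """Serialize URI-only triples, one N-Triples statement per line."""
    return "\n".join("<%s> <%s> <%s> ." % triple for triple in triples)


def objects_of(triples, subject, predicate):
    """Follow one kind of link outward from a subject."""
    return [o for s, p, o in triples if s == subject and p == predicate]


print(to_ntriples(TRIPLES))
```

Because every name is a URI, following a link is just a lookup on the object of a statement, which is what lets separate data sets connect to each other.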
Facts about a few others
Name | Software | Events | Reasoning | SPARQL Endpoint | Triples | License
LCSH | MarkLogic & 4Store | Yes | Yes | Yes | More than 300 million | Open
Eurovoc | Sesame | No | No | Yes | N/A | Not Open
Wordnet 3.0 | 3Store | Yes | Yes | Yes | More than 2 million | Open
OpenRefine (ex-Google Refine)
• Free, open source tool used for cleaning messy data & linking it to data sets like Freebase, DBpedia, Europeana etc.
• A 2014 survey shows librarians as the largest user base.
Image Source: http://en.wikipedia.org/wiki/File:Google-refine-logo.svg
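To give a feel for what "cleaning messy data" means for name headings, here is a rough sketch in the spirit of key-collision clustering: variant spellings are reduced to a normalized key, and values sharing a key are grouped. This is not OpenRefine's own code; the functions `fingerprint` and `cluster` and the sample names are assumptions made for illustration.

```python
import string
from collections import defaultdict


def fingerprint(value):
    """Normalize a string: trim, lowercase, drop ASCII punctuation, sort unique tokens."""
    cleaned = value.strip().lower()
    cleaned = cleaned.translate(str.maketrans("", "", string.punctuation))
    return " ".join(sorted(set(cleaned.split())))


def cluster(values):
    """Group values whose fingerprints collide; keep only groups with variants."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [group for group in groups.values() if len(group) > 1]


# Made-up sample headings, as they might appear in a legacy catalogue export.
NAMES = ["Robert Silverberg", "Silverberg, Robert", "robert  silverberg ", "Isaac Asimov"]
print(cluster(NAMES))
```

Once variants are merged into one heading, that heading can be matched against an external data set such as DBpedia, which is the linking step the slide refers to.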
Linked Media Framework
• Server application that bundles central Semantic Web technologies:
– LMF Core: Linked Data Server that exposes data following the Linked Data Principles.
– LMF Modules: used to extend the functionality of the Linked Media Server.
• LMF Usage Scenarios: Publishing Legacy Data as Linked Data, Building Semantic Search over Data
Image Source: http://semanticweb.org/wiki/File:Weis_mit_bunter_schrift.png
RDF/XML using OpenRefine & csv2rdf
OpenRefine output:
<rdf:Description rdf:about="http://localhost:8080/LMF/resource/author/1">
  <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Person"/>
  <foaf:name>Robert Silverberg</foaf:name>
  <owl:sameAs rdf:resource="http://dbpedia.org/resource/Robert_Silverberg"/>
</rdf:Description>
<rdf:Description rdf:about="http://localhost:8080/LMF/resource/book/1">
  <dc:title>Passengers (short story)</dc:title>
  <owl:sameAs rdf:resource="http://dbpedia.org/resource/Passengers_(short_story)"/>
</rdf:Description>
<rdf:Description rdf:about="http://localhost:8080/LMF/resource/book/1">
  <dc:creator rdf:resource="http://localhost:8080/LMF/resource/author/1"/>
</rdf:Description>

csv2rdf output:
<rdf:Description rdf:about="#entry00001">
  <rdf:type rdf:resource="http://data-gov.tw.rpi.edu/2009/data-gov-twc.rdf#DataEntry"/>
  <id1>1</id1>
  <authors>Robert Silverberg</authors>
  <id2>1</id2>
  <book_title>Passengers (short story)</book_title>
  <author_id>1</author_id>
  <book_id>1</book_id>
</rdf:Description>
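The difference between the two outputs is that the linked form mints a URI per author and per book and connects them, while the row-oriented form keeps each CSV row as one opaque entry. A minimal sketch of the linked form follows, using only the Python standard library. The column names come from the csv2rdf output on this slide; the `dc:` namespace binding, the CSV layout, and the function name `rows_to_rdf` are assumptions, and the book is modelled as pointing to its creator.

```python
import csv
import io
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
FOAF = "http://xmlns.com/foaf/0.1/"
DC = "http://purl.org/dc/elements/1.1/"  # assumed binding for the dc: prefix
BASE = "http://localhost:8080/LMF/resource/"

for prefix, namespace in (("rdf", RDF), ("foaf", FOAF), ("dc", DC)):
    ET.register_namespace(prefix, namespace)


def q(namespace, local):
    """Build an ElementTree qualified name such as {ns}local."""
    return "{%s}%s" % (namespace, local)


def rows_to_rdf(csv_text):
    """Turn each CSV row into an author resource and a book resource linked by dc:creator."""
    root = ET.Element(q(RDF, "RDF"))
    for row in csv.DictReader(io.StringIO(csv_text)):
        author_uri = BASE + "author/" + row["author_id"]
        book_uri = BASE + "book/" + row["book_id"]
        author = ET.SubElement(root, q(RDF, "Description"), {q(RDF, "about"): author_uri})
        ET.SubElement(author, q(RDF, "type"), {q(RDF, "resource"): FOAF + "Person"})
        ET.SubElement(author, q(FOAF, "name")).text = row["authors"]
        book = ET.SubElement(root, q(RDF, "Description"), {q(RDF, "about"): book_uri})
        ET.SubElement(book, q(DC, "title")).text = row["book_title"]
        ET.SubElement(book, q(DC, "creator"), {q(RDF, "resource"): author_uri})
    return ET.tostring(root, encoding="unicode")


# Assumed CSV layout, reusing the values shown on the slide.
SAMPLE = "author_id,authors,book_id,book_title\n1,Robert Silverberg,1,Passengers (short story)\n"
rdf_xml = rows_to_rdf(SAMPLE)
print(rdf_xml)
```

Links to external data sets (the owl:sameAs statements in the OpenRefine output) are left out here, since they depend on a reconciliation step against DBpedia.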
Conclusion
• Linked Data makes the whole Web behave like a single global database.
• Designed for machines and humans.
• Links connect concepts to other concepts and concepts to their descriptions.
• Encourages reuse, reduces repetition.
• Enables adding more value to data over the network.
• People can more easily find library resources on the Web.
• More creative applications can be built on library metadata.