linked data and semantic web technologies for aid transparency aman shakya phd (national institute...
Post on 21-Dec-2015
220 views
TRANSCRIPT
Linked Data and Semantic Web Technologies for Aid Transparency
Aman ShakyaPhD (National Institute of Informatics, Japan)
Asst. Professor (Institute of Engineering, Nepal)
IT Consultant
http://nepalaid.yipl.com.np/
2
OutlineIntroduction
◦Semantic Web◦Linked Open Data
Aid Transparency Open Data◦IATI standard XML data◦Applications
IATI as Linked Data and Applications
Tools and Platforms
3
Introduction
WWW - World Wide Web page
?
URL: http://nepalaid.yipl.com.np
WWW –Web of Documents
5
Hyperlinks
Hyperlinks
Hyperlinks
URL-http://www.aidtransparency.net/
URL- http://nepalaid.yipl.com.np
6
Semantic WebSir Tim Berners-Lee (inventor of the
WWW)“.. an extension of the current web in which
information is given well-defined meaning, better enabling computers and people to work in cooperation..”Scientific American
(2001)
• Allows data to be shared and reused across application, enterprise, and community boundaries
• Collaborative effort led by W3C with participation from a large number of researchers and industrial partners.
7
The Semantic Web
1. Web of Structured Data
2. Machine understandable semanticsSemantics = Meaning
3. Data modeling and Knowledge representation
4. Interoperable data standards
7
Web of Data
8
Meaningful Data Nodes & Data Links
GGG (Giant Global Graph)Source: http://www.w3.org/2006/Talks/0123-sb-W3C-
ThingsWeb/
9
URI – Uniform Resource Identifier
URI: http://www.china.org/geography/rivers#Yangtze Yangtz
e
China
River
Beijing
6300 km
Located in
Has capital
Is a
Has length
URI: http://dbpedia.org/resource/China
URI: http://dbpedia.org/resource/Beijing
10
Triples (subject, predicate, object)
or (resource, property, value)
Graph data model
RDF - Resource Description Format
Subject ObjectPredicate
11
Syntax (computer format) for RDFRDF/XMLN-triples, turtle, etc.
Embedded formats◦RDFa, microformats, HTML5
JSON (lightweight structured format)
GRDDL
RDF/XML Example<?xml version="1.0"?><rdf:Description rdf:about="http://www.china.org/geography/rivers#Yangtze" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns="http://www.geodesy.org/river#"> <rdf:type rdf:resource="http://www.geodesy.org/river#River"/> <length>6300 kilometers</length> <startingLocation>western China's Qinghai-Tibet Plateau</startingLocation> <endingLocation>East China Sea</endingLocation></rdf:Description>
13
RDFa Example <HTML> file
Semantic Web (explained)Machine understandable Semantics by
◦Data Modeling & Knowledge Representation
Ontology“.. an explicit specification of a conceptualization”
◦Modeling of Concepts and Relationships that exist in the area of interest
14
Gruber (1993)
Example – Activities Ontology
http://intelleo.eu/ontologies/activities/spec/
16
Travel Ontology
17
Ontology LanguagesRDF Schema
◦A vocabulary description language
OWL - Web Ontology Language◦A richer ontology language◦Based on Description Logic
18
OntologiesDomain ontology
◦Open Biomedical Ontologies (OBO)◦CIDOC CRM - an ontology for cultural
heritage information … etc.
Lightweight ontology◦Vocabularies◦Taxonomy◦Thesaurus, etc.
19
SKOSSimple Knowledge Organization
System◦Used for Taxonomies, Thesaurus
Semantic Web Layers Cake
20
The Semantic Web Cake
21
Linked Data
(Web of Data on the Web)
Linked Data
Publishing and interlinking structured data on the Semantic Web
4 basic principles of Linked Data:1. Use URIs to identify things.
2. Use HTTP URIs so that we can locate and look up (dereference) these things.
3. Provide useful information when its URI is dereferenced.
4. Include links to related URIs to improve information discovery on the Web.
22
Linked Open Data Sources on the Web
2007
Open
2009
25
2010
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
26
Querying Semantic Web DataSPARQL RDF Query Language
SPARQL endpoint – query interface
Example SPARQL query
PREFIX foaf: <http://xmlns.com/foaf/0.1/> SELECT ?name ?email WHERE {
?person a foaf:Person. ?person foaf:name ?name. ?person foaf:mbox ?email.
}
Beautiful Applications over Raw Data
27Source: Berners-Lee, TED 2009
http://www.data.gov
28
29APPS SHOWCASE
Employment Market Explorer
31
Linked Open Data Rating
- Available on the web (whatever format), but with an open licence
- Available as machine-readable structured data
(e.g. excel instead of image scan of a table)
- Non-proprietary format (e.g. CSV instead of excel)
- All the above plus, Use open standards from W3C (RDF and SPARQL) to identify things
- All the above, plus: Link your data to other people’s data
(Berners-Lee, 2010)
★
★★
★★★
★★★★
★★★★★
32
Aid Transparency Open Data
33
Aid Transparency Open Data
The IATI Standard◦Activity Standard◦Organization Standard◦IATI Codelists
Available as IATI XML schemas
34
IATI Activity StandardBasic activity informationParticipating organizationsGeopolitical informationClassificationsFinancial detailsRelated documents, activitiesPerformance
35
IATI Organization Standard
36
IATI Codelists
37
IATI Data PublishingOrganisations publish IATI standard files
on their own website
Submit a link to the central IATI Registry
Use IATI Registry to find, use and stay updated on data from different publishers
Whenever an organisation updates/adds IATI files, they let the registry know
IATI Registry
39
IATI Data of Nepal
40
Making aid more transparent◦high quality, timely information on
Aid flow
Actively involved in supporting the IATI
Map of Aid Activities
43
IATI Aid Activity Tables
44
Data ExplorerCSV TransformationsQuery BuilderXPath EndpointXSLT TransformationsLists serviceXQuery service
45
Data Explorer – Table View
Faceted Search / Browsing
Data Explorer – Timeline View
Developed using …SIMILE project (MIT) http://simile.mit.edu/
51
IATI Data as Linked Data
52
IATI Open Data Rating
★★★
Machine-readable datain XML format
Towards RDF Linked Data
★★★★
53
IATI Data as Linked DataWhat is happening Already?
◦Code lists as Linked Data- API access to the code-lists- Return RDF linked data, SKOS concepts
◦Sketching Linked Data Vocabularies◦Support for linked data in the IATI
standard
ref: http://www.aidinfolabs.org/archives/384 (Tim Davies)
54
Tim Davies
http://beta.kasabi.com/dataset/iati
Blog: http://www.aidinfolabs.org/archives/582
XSLT- XML Stylesheet Transformation
Source: http://www.w3.org/Consortium/Offices/Presentations/XSLT_XPATH/
56
Further Possibilities
57
Further possibilitiesMix-and-match existing vocabulariesPerhaps
◦Payment Ontology◦Business Ontologies, Semantic XBRL◦Academic Research Project Funding
Ontology (ARPFO)◦DOAP (Description of a Project)◦…….
Generalized ‘Resource Flow’ ontology
Payments Ontology
http://data.gov.uk/resources/payments
59
XML to RDF Linked DataXML Semantics Reuse
Methodology (Roberto García)
◦XML Schema to OWL◦XML to RDF
ReDeFer◦Convert XML to RDF/OWL
60
GRDDLGleaning Resource Descriptions
from Dialects of Languages
Extract RDF data from XML formats via transformations identified by URIs and typically expressed in XSLT
61
Linking IATI to existing Linked Data
Government Data◦data.gov, data.gov.uk.. etc.
Semantic XBRL◦Financial Reports
Research/Project Data (funded by the donor)
Dbpedia (Wikipedia data)Geo Linked Data……… so on
62
Linked data from data.gov.ukhttp://linked4.org/lsd/explore.html
http://wheredoesmymoneygo.org/
Open Knowledge Foundation
68
http://data.gov.uk/apps
69
For the Developers
70
Semantic Web Platforms and ToolsSemantic Web Frameworks
◦ Triple Stores◦ Parsers and APIs◦ SPARQL query processor
Examples◦ Java – Jena, Sesame◦ Jena .NET◦ PHP – ARC, Redland, RAP◦ Ruby - ActiveRDF◦ Virtuoso, Redland, 3store, Talis platform, RDF Pro,
etc.Ontology editors
◦ Protégé, OntoStudio, TopBraid composer, etc.
71
Linked Data Platforms and Tools
Linked Data Publishing Platforms◦D2R server◦Triplify◦Talis platform◦Kasabi◦Pubby◦Paget … etc.
Linked Data/RDF Editors and Validators◦Hyena◦Vapour
72
Semantic Web Platforms and ToolsSemantic Web CMS and frameworks
◦Semantic Drupal◦Paggr◦ConStruct◦Structured Dynamics
PoolParty◦Thesaurus management and SKOS editor
Rhizomer◦publishing, querying, browsing, editing
and interacting with semantic data
73
Social Semantic Web ToolsSemantic Wikis
◦Semantic Media Wiki◦OntoWiki◦KIWI: Knowledge in a Wiki … etc.
Semantic blogging, tagging… etc.
74
Linked Data VisualizationTools
SIMILE project (MIT) http://simile.mit.edu/
◦Exhibit◦Timeline◦Longwell (faceted browser)
Many eyes◦Open platform for visualization
http://www-958.ibm.com/software/data/cognos/manyeyes/
76
ConclusionSemantic Web
◦A global Linked Open Data commonsIATI standard dataIATI as Linked Open Data
◦Ongoing work◦Further possibilities
Applications and Tools