modern trends in semantic web - cvut.cz · miroslav blaˇsko ([email protected]) modern...
TRANSCRIPT
Modern Trends in Semantic Web
Miroslav Blasko
January 15, 2018
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 1 / 23
Outline
1 Overview
2 Semantic Data Pipelines
3 Data Validation
4 Integration into Programming Languages
5 Ontology-based form generation
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 2 / 23
Overview
1 Overview
2 Semantic Data Pipelines
3 Data Validation
4 Integration into Programming Languages
5 Ontology-based form generation
Overview
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 3 / 23
Overview
Current Trends for Semantic Technologies
Linked Datadata quality and data provenanceshallow semanticsdata validationBusiness IntelligenceIoT and RDF streamingKnowledge graphs
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 3 / 23
Semantic Data Pipelines
1 Overview
2 Semantic Data Pipelines
3 Data Validation
4 Integration into Programming Languages
5 Ontology-based form generation
Semantic Data Pipelines
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 4 / 23
Semantic Data Pipelines
Semantic Data Pipelines
semantic technologies are widely used for data processing activitiesthat includes heterogeneous data sourcesRDF within data pipelines is used for
annotation of data sourcesrepresentation/interpretation of data schemaproviding common format for representation of data
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 4 / 23
Semantic Data Pipelines
LinkedPipes ETL
web-based lightweight ETL toolused primarily for processing of Open Data and publication of LinkedDatacomponents are written in Java + Javascript UImost of the components transforms to/from RDFconfiguration of transformation pipelines is in RDF
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 5 / 23
Semantic Data Pipelines
LinkedPipes ETL Example Pipeline
ETL pipeline execution from [2]. Arrows represent flow of data between components, green/rededges of rectangles represent successful/unsuccessful execution of components.
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 6 / 23
Semantic Data Pipelines
SPARQLMotion
an RDF-based scripting language with a graphical notation todescribe data processing pipelinespipelines editable by commercial version TopBraid Composer 1
provides extensions through RDF, Java, Javascriptonly SPARQL variable bindings and a RDF graph are passed betweennodes of pipelinewhole configuration is in RDFwell integrated with SPIN2, SHACL3
1https://www.topquadrant.com/tools/ide-topbraid-composer-maestro-edition/2SPARQL Inferencing Notation – RDF-based vocabulary to represent SPARQL rules
and constraints on RDF models3Shapes Constraint Language that is regarded as successor of SPIN
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 7 / 23
Semantic Data Pipelines
SPARQLMotion Script Example
Visual representation of SPARQLMotion script from tutorial at http://sparqlmotion.org/.Rectangles represent modules while arrows represent order of execution/flow of data.
Upper/lower part of rectangles shows input/output SPARQL variables of modules.
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 8 / 23
Semantic Data Pipelines
SPipes
SPipes language – an extension of SPARQLMotion scripting languageSPipes engine
command-line and REST interfaceonly partial support for SPARQLMotion language (e.g. no iterationover set of pipeline nodes)semantic logging of execution
SPipes componentsmost of the modules from SPARQLMotion core libraries are missingnew modules for generation of semantic forms, Linked Data processingand publishing, processing tabular data, NLP
SPipes editorweb based editor/debugger of SPipes scriptsfirst release within next month
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 9 / 23
Semantic Data Pipelines
SPipes Editor
Visual representation of SPARQLMotion script in SPipes Editor Prototype
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 10 / 23
Semantic Data Pipelines
RDFpro
Java command-line tool and library for RDF processingstream-oriented highly optimized RDF processorsprimarily for Linked Data integration tasks at large scaleJavascript and Groovy scripting supportsupport for RDF quads filtering and replacement, SPARQL-likeinference rules, TBox/VOID statistics extraction, owl:sameAssmushing, RDF deduplication, set/multiset operations ...
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 11 / 23
Semantic Data Pipelines
RDFpro Pipeline Example
Sketch of a RDFpro pipeline [1] – integrating RDF data from Freebase, GeoNames andDBpedia in the four languaged EN, ES, IT and NL, performing smushing, inference,
deduplication and statistics extraction.
Configuration of 4.Inference – a deductive closure of data is computed and saved, using theextracted TBox and excluding RDFS rules rdfs4a, rdfs4b and rdfs8 to avoid inferringuninformative X rdf:type rdfs:Resource quads. Output of the closed TBox is filtered
(@transform) and placed in graph <graph:vocab>. For details seehttp://rdfpro.fbk.eu/example.html.
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 12 / 23
Data Validation
1 Overview
2 Semantic Data Pipelines
3 Data Validation
4 Integration into Programming Languages
5 Ontology-based form generation
Data Validation
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 13 / 23
Data Validation
Data validation languages
Shape basedShExSHACL
SPINOWL integrity constraints[4]
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 13 / 23
Data Validation
Shapes Constraint Language (SHACL)
RDF vocabulary for validating RDF graphs against a set of conditions(called shapes)W3C Recommendation4 from July 20, 2017partial support for inference (rdf:type, rdf:Class,rdfs:subClassOf, owl:imports)optional support for entailment (sh:entailment)support for closed shapesmodular and reusable (owl:import, composition/inheritance ofshapes)support validation report, fixes for constraint violationsconstraints extensions (SHACL-SPARQL, Javascript ...)
4https://www.w3.org/TR/shacl/Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 14 / 23
Data Validation
SHACL resources
SHACL playground – http://shacl.org/playground/
SHACL by example – https://www.slideshare.net/jelabra/shacl-by-example
Reusable SHACL constrains, data model for test cases and fixes forconstraint violations – http://datashapes.org/
SHACL shapes for schema.org –http://datashapes.org/schema.ttl
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 15 / 23
Integration into Programming Languages
1 Overview
2 Semantic Data Pipelines
3 Data Validation
4 Integration into Programming Languages
5 Ontology-based form generation
Integration into ProgrammingLanguages
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 16 / 23
Integration into Programming Languages
Integration of Data Models into Programming Languages
There are three major ways to integrate usinggeneric types (e.g. Jena)mapping to the type system of a programming language (e.g. JOPA)using custom type system (e.g. λ − DL)
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 16 / 23
Integration into Programming Languages
JOPA
JOPA5 is a persistence API and implementation for accessing OWLontologies with features:
Object-ontological mapping based on integrity constraints,Explicit access to inferred knowledge,Access to unmapped properties and individual’s typesTransactionsSeparate storage access layer
5https://kbss.felk.cvut.cz/web/kbss/jopaMiroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 17 / 23
Integration into Programming Languages
JOPA Entity Example
Object-ontological mapping from JOPA library [1]
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 18 / 23
Integration into Programming Languages
λ − DL
λ − DL6 is a typed lambda calculus for semantic dataFeatures
DL concepts as typessubtype inferencestyping of queriesDL-safe queriesopen-world querying
Prototype implementation in F# using OWL reasoner HermiT
6https://west.uni-koblenz.de/de/lambda-dlMiroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 19 / 23
Integration into Programming Languages
Example of λ − DL program and data in abstract syntax
For detailed explanation of the example see [3].
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 20 / 23
Integration into Programming Languages
Example of λ − DL program in F#
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 21 / 23
Ontology-based form generation
1 Overview
2 Semantic Data Pipelines
3 Data Validation
4 Integration into Programming Languages
5 Ontology-based form generation
Ontology-based form generation
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 22 / 23
Ontology-based form generation
SForms
SForms is a JavaScript library for ontology-based interactive web forms.Forms are defined by JSON-LD ontology using predefined RDF vocabulary.
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 22 / 23
Ontology-based form generation
SForms UI
SForms – dynamically generated web formsMiroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 23 / 23
References
1 Overview
2 Semantic Data Pipelines
3 Data Validation
4 Integration into Programming Languages
5 Ontology-based form generation
References
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 24 / 23
Ontology-based form generation
[1] Francesco Corcoglioniti et al. “RDF PRO: an extensible tool forbuilding stream-oriented RDF processing pipelines”. In: Proceedingsof the 2014 International Conference on Developers-Volume 1268.CEUR-WS. org. 2014, pp. 49–54.
[2] Jakub Klımek, Petr Skoda, and Martin Necask. “LinkedPipes ETL:Evolved linked data preparation”. In: International Semantic WebConference. Springer. 2016, pp. 95–100.
[3] Martin Leinberger, Ralf Lammel, and Steffen Staab. “LambdaDL:Syntax and Semantics (Preliminary Report)”. In: arXiv preprintarXiv:1610.07033 (2016).
[4] Evren Sirin. “Data validation with OWL integrity constraints”. In:Web Reasoning and Rule Systems (2010), pp. 18–22.
Miroslav Blasko ([email protected]) Modern Trends in Semantic Web January 15, 2018 23 / 23