semantic web for hack days
DESCRIPTION
Canberra Semantic Web Meetup, 2 August 2010 The talk objective is to encourage the Meetup members to participate and prepare the Sydney Amped Hack Day (October 16 in Sydney: http://ampedweb.org/ ).TRANSCRIPT
Semantic Mashups for Hack DaysLaurent Lefort, CSIRO ICT Centre, Information Engineering Laboratory
Canberra Semantic Meetup, 2 August 2010
Photo by JJ Halans
CSIRO. Semantic Mashups for Hack days
Bio
• Over 25 years ago: Computer science engineer
• Almost 10 years ago: CSIRO ICT Centre, Canberra
• Over 5 years ago: Semantic Web researcher
• Ontology design patterns
• 1 year ago: W3C Australia Office manager & guest programmer for the W3C Track at Web Directions South
• Armin Haller (Meetup co-organiser) is now in charge of the Australian Office
• About 1 year ago: GovHack host
• Now: W3C Semantic Sensor Network XGco-chair
• To be continued …
CSIRO. Semantic Mashups for Hack days
Taste le …
I don’t use (yet)
I use
I want to use more
Linked Data Api
(Gloze)
2222
1111
PURLz
Callimachus
CSIRO. Semantic Mashups for Hack days
My semantic web journey
0.01
0.1
1
10
100
1000
o11
o12
o65
o56
o23
o22
o30
o49
o24
o57
o27
Fact++RACERPellet(Direct)CEL
WHEN
WHEREIN
WHATObservationSampling Feature
Observed
property
Metadata
Procedure
Result
Time* Geometrical types
Units
Quantities
Taxa
Chemistry
Temporal types
Coord. Sys
Vertical Coord. Sys
Medium
Fraction
Processing & interpolation
Validation &
quality flag
Sensor
(Instrument)
Station
PlatformSite
Water
Feature
Result type
Sampled Feature
Institution and project
System and author
Security classif.
Transaction type
Gauge/weir
layout/profile
Missing data
Feature property
?
?
?
?
Survey type
Process
Action
Event
Multi-dependent concepts
Feature-dep. parameters
Feature-indep. parameters
Abstract concepts
Semi-abstract concepts
Semi-primitive concepts
Primitive concepts
O&M amd GFM stereotypes
Simple classes
Classes w/ ident. instances
Onto category to be defined
Time* : two O&M stereotypes (sampling time and result time)
Features types
?
?
Surfing www to learn about o. from Gio W.?
1995 2004 2006 2007 2008
OWL-ifyingthe “motherof all colorcharts”
EL+-ifyingaircraft data to test reasonersover large o.
OWL-ifyingO&M-based stds: uml, xsd, vocabs
Semantic Sensor ontology
OWL-ifyingwater vocabs(4-in-one o.)
2009 2009 2009 2009 2010
SKOS-ifyingUoM and quantities
Semantic Markup for XML-based stds
TriplifyingUoM and quantities
Linking Open Standards !
=xlink:role C
=xlink:href myC
CSIRO. Semantic Mashups for Hack days
Semantic Web and Web of Data
SKOS without dataOntologist
Will use OWLLink
Linked Open Data user
Also uses SPARQL
SKOS with data
I want both
I need loose coupling (follow my nose) I need tight coupling (reasoner-boosted links)
CSIRO. Semantic Mashups for Hack days
The Semantic Intensity scale
• RIF: to exchange rules among rule systems (which use a logic which is different to what OWL uses)
• ---- Ontologies stop here ----• OWL2: description logic stretched as far
as possible without breaking it: richer axioms, meta-modelling, additional capabilities to handle data
• OWL 1: triples arranged according to description logic principles
• SPARQL: triples arranged according to “web-ready schema”
• SKOS: triples structured as thesaurus, simple taxonomies
• RDF: triples
2222
1111
Low Intensity apps
High Intensity
CSIRO. Semantic Mashups for Hack days
Producing good quality ontologies canhelp to get good linked data pathways
• An ontology is always an approximation of the intended model
• To avoid mismatches between the intent and the result, it is better to have:
• domain experts with a broader knowledge of the domain
• experts with a narrower field of expertise
• But for experts to work efficiently together, you’ll need to solve the linguistic issues first
Figure by Krzysztof Janowicz , “invited expert”, SSN XG
CSIRO. Semantic Mashups for Hack days
Solving linguistic issues (W3C Semantic Sensor Network XG example)
• VIM_terms• Measurement (VIM 2.1): process of experimentally obtaining one or more
quantity values that can reasonably be attributed to a quantity• Measurement (MINET): Process of associating numbers, in an empirical
and objective way, to characteristics of objects and events of the real world in a way so as to describe them
• SWE_terms• Measurement (noun) (SensorML): an observation whose result is a
measure• Measurement (verb) (SensorML): An instance of a procedure to estimate
the value of a natural phenomenon, typically involving an instrument or sensor. This is implemented as a dynamic feature type, which has a property containing the result of the measurement. The measurement feature also has a location, time, and reference to the method used to determine the value. A measurement feature effectively binds a value to a location and to a method or instrument.
• Wikipedia_terms• Measurement (Wikipedia) In science, measurement is the process of
obtaining the magnitude of a quantity, such as length or mass, relative to a unit of measurement, such as a meter or a kilogram. The term can also be used to refer to the result obtained after performing the process.
• Tip: check if Wikipedia has a disambiguation list
CSIRO. Semantic Mashups for Hack days
OWL in Protege
Protégé is coupled to a description logic reasonerwhich can infer subsumption relationships
CSIRO. Semantic Mashups for Hack days
SKOS example in SESAME (with data)
RDF data can be queried by others using a REST-based service (what is know as a SPARQL endpoint)
CSIRO. Semantic Mashups for Hack days
OWL with data in Protégé
OWL skeleton populated with data (here SysML-QUDV)
Must have: a def. and a citation
CSIRO. Semantic Mashups for Hack days
Semantic Mashup options across the 3 main families of W3C standards: XML, Semantic Web and HTML
XML + HTML(e.g. KML)
“Lifted” XMLXML annotated with OWL, SKOS, RDF
HTML + RDFa
HTML + RDFa + content sourced from annotated XML, SPARQL endpoints or even other RDFa pages
HTML + content from SPARQLendpoints: OWL, SKOS, RDF
Semantic Web resources
CSIRO. Semantic Mashups for Hack days
Linking Open Standards
• Watch for the transition from XML-based standards to either RDF-only or hybrid XML-RDF standards
• Watch for the transition from HTML-based standards to either hybrid XML-RDF-HTML standards
• Standard Development Organisations should work together to agree on common approaches to:
• Develop and apply a core set of principles and methods to mix specifications snippets from XML, RDF and HTML
• Including extensible APIs like the SPARQL and RDFa
• Allow tight and loose couplings • between ontologies• between ontologies and linked open data resources• between linked open data resources
• Split ontologies into manageable modules: the issue here is that the definition of a module can vary a lot …
CSIRO. Semantic Mashups for Hack days
For the semantic hack fest, we have the Open Data, but it’s not linked (not avail. as SPARQL)!
CSIRO. Semantic Mashups for Hack days
LOD constellation: 13.1 billions triples and 142 millions RDF links (late 2009)
CSIRO. Semantic Mashups for Hack days
Adding Australian content to the LOD constellation Geo/Gov, People/communities, Science
LinkedMDB
Gov-Track
US Census Data
riese
Geo-names
lingvoj
World Fact-book
Euro-stat
flickr wrappr
Open Calais
RevyuSIOCSites
Doap-space Flickr
exporter
FOAF Profiles
Crunch base
Open-Guides
Pub Guide
W3CWordNet
Open Cyc
UMBEL
Yago
DBpedia
Freebase
Virtuoso Sponger
DBLPHannover
RDF Book Mashup
Project Guten-berg
DBLPBerlin
IEEE
CiteSeer
DBLP RKBExplorer
eprints
RKBECS South-
ampton
Linked GeoData
TO DO: Australian clouds
TO DO: Australian links
?
?
?
Also to consider …all the cultural clouds (e.g. Music)All the health & life sciences clouds
CSIRO. Semantic Mashups for Hack days
Class-to-Class link pathways in the UMBEL LOD Constellation (Oct 2008)
damltime -- Time Zone o. doap -- Description of a Project o. foaf -- Friend-of-a-Friend o. geo -- Geo wgs84 o. geonames -- GeoNames o. opencyc -- OpenCycknowledge base sioc -- Socially Interlinked Online Communities o. sioc_types -- SIOC extension umbel -- Upper Mapping and Binding Exchange Layer o. wordnet -- WordNet lexical o.
CSIRO. Semantic Mashups for Hack days
Why a semantic hack fest?To ease the transition from Web 2.0 to Web 3.0
• Mashups developed for govhack, apps4nsw, app-my-state VIChave used semantic web technos very rarely
• Report of the Gov 2.0 Taskforce• […] Information should be: free, easily discoverable, based on
open standards and therefore machine-readable, properly documented and therefore understandable, licensed to permit freereuse and transformation by others
• […] governments have a role to play in leading and encouraging the uptake of Web 3.0 technologies in support of greater innovation based on the reuse of public sector information and enhanced citizen/government interaction.
• e.g. RDFa usage in data.gov.au
• Having a semantic hack fest can help: • To demonstrate benefits on concrete examples: hack days are a
great way to get awesome results • To accelerate adoption at a larger scale by industry
CSIRO. Semantic Mashups for Hack days
The Amped Hack day
• Two free events organised by Web Directions South at the end of the Web Directions Conference
• Atlanta September 25th• Sydney October 16th
• Sydney event• Hosted by the PowerHouse Museum• Not yet fully booked: http://ampedweb.org/
• John Allsopp “likes the idea”• Semantic Geeks! I love it. I think this is a great addition.
CSIRO. Semantic Mashups for Hack days
Semantic Mashup for Hack days: What’s required
• Ontologies which could add value to mashups: e.g. my colour ontology for images,, …
• Australian content to play with: triplified (linkable) datasets for different domains: .gov (and .com, .org), geo + …, science, libraries/museums
• Lots of tools with hackable examples • More publicity:
• Semantic hackfest talk scheduled in the W3C Australia track at the Web Directions South Conference (just before Amped).
• Public announcement elsewhere: SW Meetup, THATCamp, others?
• A workspace or two: one to prepare and one for the hack fest participants
• Helpers (before the hack day) and mentors (during the event) for all of the above
CSIRO. Semantic Mashups for Hack days
Ideas for the hack day
• 1st idea. Mix:• Paul Hagon Searching by Colour Metadata Australia 2010
• My colour ontology (need some rework)• If we put the two approaches together, we should learn if:
• We can build a better colour ontology
• We can build better links between colours and tags or topics
• We can demonstrate that ontologies are really adding value
• 2nd idea. Mix:• Govhack’s mashups based on data about organisations
(companies, government agencies)
• The organisation ontology published by Epimorphics
CSIRO. Semantic Mashups for Hack days
Paul’s idea: correlate the flickr tags and the colours of images selected with image magick
Paul Hagon Searching by ColourMetadata Australia 2010
CSIRO. Semantic Mashups for Hack days
Bonus we can have a colour ontology which provides the true tags
Color QualeSpatial Region
NBS ISCCcentroids
Individual colors
MunsellHue
• HTML tables• 267 centroids
• 1000’s of individual colours http://tx4.us/
CSIRO. Semantic Mashups for Hack days
It is also citeable: the original references are provided
House and Garden magazine, in Supplementary Index of Color names (Dictionary of Color)
U.S. Department of Agriculture Soil Charts
Postage-Stamp Color Names, William H. Beck
National Research Council, Rock-Color Chart
Commercial Standard CS156-49, Colors for Polystyrene Plastics
Commercial Standard CS147-47, Colors for Molded Urea Plastics
Horticultural Colour Charts, R. F. Wilson
Federal Specification TT-C-595, Colors; (for) Ready-Mixed Paints
Colour Terms in Biology, H.A. Dade
American Association of Textile Chemists and Colorists and Society of Dyers and Colourists
Textile Color Card Association Standard Color Card of America and U.S. Army Color Card
Taylor, Knoche & Granville, Descriptive Color Names Dictionary
Ridgway, Color Standards and Color Nomenclature
Plochere Color System
Maerz and Paul, Dictionary of Color
Kelly and Judd (1976)Color Universal Language and Dictionary of Names. National Bureau of Standards special publication 440
CSIRO. Semantic Mashups for Hack days
Random example Amber Yellow
http://tx4.us/nbs/nbs-a.htm
CSIRO. Semantic Mashups for Hack days
Another example: organizations (.gov, .com and .org.)
• Some effort required to fix the missing links: ontology then data• Company codes: ANZSIC vs. UNSPCC
• Government codes: AGIFT thesaurus vs. UK’s Government Category List
• Epimorphics (UK) and W3C eGov IG working on this topic• Organization ontology developed to be LOD-ready
CSIRO. Semantic Mashups for Hack days
Linked Open Data (LOD) opportunities
• W3C• eGov IG or incubators
• local initiative (W3C Australia to help – status to be defined)• W3C track speakers (Web Directions South)
• Linked Open Data Around the Clock project (EU)• Contact: Michael Hausenblas (DERI) • Semantic Web Atlanta already involved (advisory committee)
• Amped hack day • Opportunity to engage with front end developers• Opportunity to liaise with other SW Meetups (Sydney in Australia,
Atlanta and maybe Austin in the US)
CSIRO. Semantic Mashups for Hack days
CSIRO ICT CentreLaurent Lefort @laurentlefortOntologist, Information Engineering Lab W3C Australia Office @W3CAustralia
Photo: Web DirectionsCC BY-NC 2.0