Show Me The Semantics - Semantic Identity (semanticidentity.com/resources/prez/meta2011-slides.pdf)

Show Me The Semantics Renato Iannella, Semantic Identity <[email protected]> Canberra, Australia, 25-27 May 2011


Page 1: Show Me The Semantics

Renato Iannella, Semantic Identity

<[email protected]>

Canberra, Australia, 25-27 May 2011

Page 2: Web Generations

• Web 1.0 - First Web: Data (We Started Here)
• Web 2.0 - Social Web: People & Sharing (We Are Here Now)
• Web 3.0 - Semantic Web: Understanding (We Are Supposed to Be Here)
• Web 4.0 - Policy-Oriented Web: Behaviour (Can We Be Here?)

Page 3: Elephant in the Room?

• “Semantic Web”
  • W3C Technologies - model the world
• “Semantic Technology”
  • Given a question, semantic technologies can directly search topics, concepts, associations that span a vast number of sources (wikipedia)
• “Semantics”
  • Text-mining/Analytics (data to concepts/facts)
• Be clear on the “Semantics”

Page 4: Back to the Future (1997)

Page 5: 1997

Metadata - Future

DSTC Pty Ltd, Resource Discovery Unit, rdu-info@dstc.edu.au

Day In The Life of Metadata

Registries

• Need for a Global Metadata Registry
  – human-readable descriptions
  – machine-readable descriptions
• Metadata.Net
  – http://metadata.net
• Schema Extensibility
  – common semantics
  – interoperability

Conclusion

• Describing resources is becoming important (if not critical)
  – For discovery
  – For management
• Metadata technologies are maturing at a significant rate
  – RDF will solve infrastructure
  – Semantics is community specific

Page 6: The Semantic Web

Page 7: SW Layer Cake

[Figure: Semantic Web layer cake, with callouts: Linked Data, Vocabularies/Ontologies, Query, Inference, Metadata]

Page 8: Semantic Web Technologies

• The Semantic Web provides a common framework for sharing information and consistent access
• W3C Activities
  • RDF/RDFa
  • OWL
  • SPARQL
  • RIF
  • POWDER
  • GRDDL
  • SKOS
  • RDB2RDF

Other Activities

• Contacts: vCard RDF, FOAF
• Vocabs: Dublin Core
• Services: Sindice
• Enterprise: GoodRelations
• Social: SIOC
• Repositories: DBpedia
• ...
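To make the RDF item on this slide concrete, here is a minimal sketch of a few statements serialised as N-Triples, using the Dublin Core and FOAF vocabularies listed above. The `example.org` identifiers and the helper names are assumptions made for illustration only; they are not part of the talk.

```python
# Namespaces of two vocabularies named on the slide.
DC = "http://purl.org/dc/elements/1.1/"
FOAF = "http://xmlns.com/foaf/0.1/"


def iri(value: str) -> str:
    """Wrap an IRI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def literal(value: str) -> str:
    """Quote a plain string literal, escaping backslashes and quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_ntriples(triples) -> str:
    """Serialise (subject, predicate, object) terms, one statement per line."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


# Hypothetical identifiers, chosen only for this example.
talk = "http://example.org/talks/show-me-the-semantics"
author = "http://example.org/people/renato-iannella"

triples = [
    (iri(talk), iri(DC + "title"), literal("Show Me The Semantics")),
    (iri(talk), iri(DC + "creator"), iri(author)),
    (iri(author), iri(FOAF + "name"), literal("Renato Iannella")),
]

print(to_ntriples(triples))
```

Each line of output is one subject-predicate-object statement, which is the unit the other technologies on the slide (SPARQL, OWL, SKOS) operate on.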

Page 9: SW Layer Cake (new)

Page 10: Case Studies

Page 11: BBC World Cup Football 2010

• High-performance dynamic semantic publishing framework based on a rich ontological domain model
• Ontology includes journalist-created content (e.g. stories, blogs, profiles, images, video and statistics) and how they were related to the World Cup entities
• RDF metadata representation
• Triple store technology and SPARQL approach
• IBM text analysis technology
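The triple store and SPARQL approach mentioned above can be sketched, under heavy simplification, as an in-memory set of triples with wildcard pattern matching. This is not the BBC's implementation; the `ex:` identifiers and the `TripleStore` class are hypothetical and exist only for this example.

```python
from typing import Iterator, Optional, Set, Tuple

Triple = Tuple[str, str, str]


class TripleStore:
    """A toy in-memory triple store (illustrative only)."""

    def __init__(self) -> None:
        self._triples: Set[Triple] = set()

    def add(self, subject: str, predicate: str, obj: str) -> None:
        self._triples.add((subject, predicate, obj))

    def match(
        self,
        subject: Optional[str] = None,
        predicate: Optional[str] = None,
        obj: Optional[str] = None,
    ) -> Iterator[Triple]:
        """Yield triples matching the pattern; None acts as a wildcard."""
        for s, p, o in sorted(self._triples):
            if subject is not None and s != subject:
                continue
            if predicate is not None and p != predicate:
                continue
            if obj is not None and o != obj:
                continue
            yield (s, p, o)


store = TripleStore()
# Hypothetical content items tagged with World Cup entities.
store.add("ex:story42", "rdf:type", "ex:Story")
store.add("ex:story42", "ex:mentions", "ex:England")
store.add("ex:blog7", "ex:mentions", "ex:England")
store.add("ex:blog7", "ex:mentions", "ex:Brazil")

# Roughly what a SPARQL engine does for:
#   SELECT ?content WHERE { ?content ex:mentions ex:England }
about_england = [s for s, _, _ in store.match(predicate="ex:mentions", obj="ex:England")]
print(about_england)
```

A page for an entity such as a team could then be assembled dynamically from whatever content currently matches the pattern, which is the idea behind dynamic semantic publishing.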

Page 12: Architecture

http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html

Page 13: Sport Ontology

Page 14: IPTC rNews

Page 15: rNews Model

Page 16: Good Relations Model

Page 17: GR+RDFa+Yahoo

<span rel="gr:hasPriceSpecification">
  <span property="gr:hasCurrencyValue" datatype="currency:USD" content="335000"></span>
</span>
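As a rough illustration of how a consumer might read the markup above, the following sketch collects the RDFa-style attributes (`rel`, `property`, `datatype`, `content`) with Python's standard `html.parser`. It is not a conforming RDFa processor: prefixes are not resolved and subjects are not tracked. The class name `RDFaAttributeCollector` is an assumption made for this example.

```python
from html.parser import HTMLParser

# The snippet shown on the slide.
SNIPPET = (
    '<span rel="gr:hasPriceSpecification">'
    '<span property="gr:hasCurrencyValue" datatype="currency:USD" '
    'content="335000"></span></span>'
)


class RDFaAttributeCollector(HTMLParser):
    """Collect rel/property attributes in document order (illustrative only)."""

    def __init__(self) -> None:
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        attributes = dict(attrs)
        if "rel" in attributes:
            # A rel attribute links the current subject to a nested resource.
            self.found.append(("rel", attributes["rel"], None, None))
        if "property" in attributes:
            # A property attribute carries a literal value, here in @content.
            self.found.append(
                (
                    "property",
                    attributes["property"],
                    attributes.get("datatype"),
                    attributes.get("content"),
                )
            )


collector = RDFaAttributeCollector()
collector.feed(SNIPPET)
for entry in collector.found:
    print(entry)
```

The price value lives in the `content` attribute rather than in visible text, so the page can display a formatted price to people while exposing a plain number to machines.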

Page 18: Linked Data

Page 19: Linked Data

• Linking up all the data on the Web
• Exposes relationships between data that can lead to greater knowledge creation & user experiences
  • “serendipitous connections”
• Reliable URLs return RDF metadata
• Use common vocabularies

[Figure: Linking Open Data cloud diagram, as of September 2010. Datasets shown include DBpedia, Freebase, GeoNames, MusicBrainz, UniProt, PubMed, OpenCalais, LinkedGeoData, the BBC (Music, Programmes, Wildlife Finder), New York Times, legislation.gov.uk and other data.gov.uk datasets, among many others.]
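The point that reliable URLs return RDF metadata is usually realised through HTTP content negotiation. The sketch below only builds such a request with the standard library and does not send it. The helper name `build_rdf_request` is hypothetical, and the DBpedia URL is used purely as an illustrative resource identifier.

```python
from urllib.request import Request


def build_rdf_request(resource_url: str, media_type: str = "application/rdf+xml") -> Request:
    """Build a request that asks the server for an RDF representation."""
    return Request(resource_url, headers={"Accept": media_type})


# Illustrative resource URL; nothing is fetched in this sketch.
request = build_rdf_request("http://dbpedia.org/resource/Canberra")
print(request.full_url)
print(request.get_header("Accept"))

# Passing `request` to urllib.request.urlopen would perform the lookup;
# whether RDF comes back depends entirely on the publisher's server.
```

The same URL can serve an HTML page to a browser and RDF to a program; only the `Accept` header differs.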

Page 20: Example: dbpedia

Page 21: Example: dbpedia

Page 22: New York Times

Page 23: data.gov.au

Page 24: data.gov.au

• Mashup: Link ACT BBQs to Toilet Map?
• Issues:
  • Format (Excel vs XML)
  • Consistency between National and ACT data
  • “Latitudes and longitudes are given in ACT Stromlo Projection form”
• All entities need persistent URLs
  • BBQ Class: http://www.productontology.org/doc/Barbecue
  • BBQ Instance: http://act.gov.au/bbq/4532
  • Resolve to (RDF) metadata
• (Need) Future d.g.a Technical Roadmap
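A minimal sketch of the persistent-URL idea on this slide: mint an instance URL from a stable pattern and describe it with a type statement pointing at the class URL. The two URLs are the ones shown on the slide; the URL pattern, the helper names and the label text are assumptions made for illustration.

```python
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"

# Class URL as given on the slide.
BBQ_CLASS = "http://www.productontology.org/doc/Barbecue"


def instance_url(base: str, kind: str, local_id: int) -> str:
    """Mint an instance URL from a fixed pattern (the pattern is an assumption)."""
    return f"{base.rstrip('/')}/{kind}/{local_id}"


def describe(instance: str, cls: str, label: str) -> list:
    """Return N-Triples lines that the instance URL could resolve to."""
    return [
        f"<{instance}> <{RDF_TYPE}> <{cls}> .",
        f'<{instance}> <{RDFS_LABEL}> "{label}" .',
    ]


bbq = instance_url("http://act.gov.au", "bbq", 4532)
# The label is a hypothetical placeholder, not data from the ACT dataset.
for line in describe(bbq, BBQ_CLASS, "Barbecue 4532"):
    print(line)
```

Once every barbecue and every public toilet has a URL like this, a mashup can link the two datasets by identifier instead of matching spreadsheet rows.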

Page 25: Did I Mention Reuse?

Page 26: Open Graph Protocol

title, type, description, url, image, site_name, latitude, longitude, street-address, locality, region, postal-code, country-name, email, phone_number, fax_number, video, audio

GML
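For readers unfamiliar with how these properties appear in a page, here is a small sketch that renders a dictionary of Open Graph properties as `<meta>` elements with the `og:` prefix. The function name `open_graph_meta` and the sample values are assumptions made for this example.

```python
from html import escape


def open_graph_meta(properties: dict) -> str:
    """Render Open Graph properties as HTML meta elements, one per line."""
    lines = []
    for name, value in properties.items():
        # Escape both parts so the attribute values stay well-formed.
        safe_name = escape(name, quote=True)
        safe_value = escape(str(value), quote=True)
        lines.append(f'<meta property="og:{safe_name}" content="{safe_value}" />')
    return "\n".join(lines)


# Sample values are illustrative only.
page = {
    "title": "Show Me The Semantics",
    "type": "article",
    "site_name": "People & Sharing",
}
print(open_graph_meta(page))
```

The location and contact properties in the list overlap with what other vocabularies already define, which is the reuse concern the slide raises.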

Page 27: rNews

GML

Page 28: Mapping: SKOS

Page 29: SKOS

• Simple Knowledge Organization System (SKOS) is a common data model for sharing and linking knowledge systems
• Fundamental element of SKOS vocabulary is the “concept”
  • Labelling properties with preferred and alternative terms
  • Broader and Narrower properties
    • Transitive relationships
  • Related properties
  • Mapping properties to indicate how concepts compare
  • Ordering of concepts
  • Documentary properties for notes, definitions, examples
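The broader/narrower and transitive items can be sketched as follows: direct `skos:broader` links are stored per concept, and the transitive closure (what SKOS calls `skos:broaderTransitive`) is computed by walking them. The `ex:` concepts and the function name are hypothetical.

```python
from typing import Dict, Set

# Direct skos:broader links between hypothetical concepts.
BROADER: Dict[str, Set[str]] = {
    "ex:GasBarbecue": {"ex:Barbecue"},
    "ex:Barbecue": {"ex:CookingAppliance"},
    "ex:CookingAppliance": {"ex:Appliance"},
}


def broader_transitive(concept: str, broader: Dict[str, Set[str]]) -> Set[str]:
    """Return every concept reachable by following broader links."""
    seen: Set[str] = set()
    stack = list(broader.get(concept, ()))
    while stack:
        current = stack.pop()
        if current in seen:
            continue  # guards against cycles in badly formed schemes
        seen.add(current)
        stack.extend(broader.get(current, ()))
    return seen


print(sorted(broader_transitive("ex:GasBarbecue", BROADER)))
```

Keeping the direct links separate from the derived closure mirrors the SKOS design, where `skos:broader` asserts only the immediate parent and the transitive property is inferred from it.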

Page 30: SKOS Mapping Example

Page 31: Automatic Semantics?

Page 32: Auto RDF Creation

• Open Calais - Automatically creates rich semantic metadata from the content submitted
• Uses natural language processing & machine learning methods
• Categorizes and links document with
  • entities (people, places, organizations, etc.)
  • facts (person "x" works for company "y")
  • events (person "z" was appointed chairman of company "y" on date "x")
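To show what turning text into "facts" means in triple form, here is a deliberately naive sketch that uses a single regular expression for the "works for" pattern. It is not how Open Calais works, which the slide says relies on natural language processing and machine learning; the function name, the predicate `worksFor` and the sample sentence are all made up for illustration.

```python
import re
from typing import List, Tuple

# Capitalised name, the phrase "works for", then a capitalised organisation name.
WORKS_FOR = re.compile(
    r"(?P<person>[A-Z][a-z]+(?: [A-Z][a-z]+)*) works for "
    r"(?P<company>[A-Z][A-Za-z]+(?: [A-Z][A-Za-z]+)*)"
)


def extract_facts(text: str) -> List[Tuple[str, str, str]]:
    """Return (person, 'worksFor', company) triples found in the text."""
    return [
        (match.group("person"), "worksFor", match.group("company"))
        for match in WORKS_FOR.finditer(text)
    ]


sample = "Alice Smith works for Example Corp. Bob Jones works for Acme."
for fact in extract_facts(sample):
    print(fact)
```

A real extraction service would also resolve each name to an entity identifier, so that the resulting facts can be linked to other datasets.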

Page 33: Open Calais: Viewer Example “OAIC Principles Press Release”

Page 34: Summary

Page 35: Principles

• Free and Open
  • SBR Taxonomy?
• Based on Open Standards
  • Check Governance processes
  • Single (proprietary) Namespace?
• Easily discoverable
  • Must be on data.gov.au
  • Query interface
• Understandable
  • All properties need to be (human) documented
  • All assumptions made explicit

Page 36: Principles

• Machine-readable
  • XML (minimum)... RDF better
  • Reuse Semantics - Don’t reinvent
  • Resolve the URL
• Freely reusable and transformable
  • CC Licenses - “all or nothing”!
  • Example: Budget 2011 Papers
• Linked Data
  • Authority for the URL
  • Will the URL be sustained?

Page 37: Conclusion

• “Global Metadata Registry” (1997)
  • Think local
  • Better documentation of semantics
  • Give all entities a URL identifier + resolve metadata
• “Schema Extensibility” (1997)
  • Reuse, Reuse, Reuse
  • Single namespace vocabularies should be rare
• “Metadata Technologies are Maturing at a Significant Rate” (1997)
  • Proliferation (yes)... Maturity (debatable)!
• Model, Reuse & Link