show me the semantics - semantic identitysemanticidentity.com/resources/prez/meta2011-slides.pdf ·...
Post on 06-Feb-2018
227 Views
Preview:
TRANSCRIPT
Show Me The SemanticsRenato Iannella, Semantic Identity
<ri@semanticidentity.com>
Canberra, Australia, 25-27 May 2011
• Web 1.0 - First Web• Data
• Web 2.0 - Social Web• People & Sharing
• Web 3.0 - Semantic Web
• Understanding• Web 4.0 - Policy-Oriented Web
• Behaviour
Web Generations
We Started Here
We Are Here Now
We Are Supposedto Be Here
Can We Be Here?
Elephant in the Room?• “Semantic Web”
• W3C Technologies - model the world• “Semantic Technology”
• Given a question, semantic technologies can directly search topics, concepts, associations that span a vast number of sources (wikipedia)
• “Semantics”• Text-mining/Analytics (data to
concepts/facts)• Be clear on the “Semantics”
Back to the Future (1997)
1997
!"#$%$#$&' ()#)*"
+,-.&/#0&1#%2"34)*5"&+63547"*0&896#*%)'69:4;%3#5<"%)<$)
+$0&=9&->"&16:"&4:&!"#$%$#$
Registries
• Need for a Global Metadata Registry– human-readable descriptions– machine-readable descriptions
• Metadata.Net– http://metadata.net
• Schema Extensibility– common semantics– interoperability
Conclusion
• Describing resources is becoming important (if not, critical)– For discovery– For management
• Metadata technologies are maturing at a significant rate– RDF will solve infrastructure– Semantics is community specific
The Semantic Web
SW Layer Cake
Linked Data
Vocabularies/Ontologies
Query
Inference
Metadata
Semantic Web Technologies• The Semantic Web provides a common framework for
sharing information and consistent access• W3C Activities
• RDF/RDFa• OWL
• SPARQL• RIF• POWDER• GRDDL• SKOS
• RDB2RDF
Other Activities• Contacts: vCard RDF, FOAF• Vocabs: Dublin Core• Services: Sindice• Enterprise: GoodRelations
• Social: SOIC• Repositories: DBpedia• ...
SW Layer Cake (new)
Case Studies
BBC World Cup Football 2010• High-performance dynamic semantic publishing framework
based on a rich ontological domain model• Ontology includes journalist-created content (eg stories,
blogs, profiles, images, video and statistics)and how they wererelated to the WorldCup entities
• RDF metadatarepresentation
• Triple store technologyand SPARQL approach
• IBM text analysistechnology
Architecture
http://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html
Sport Ontology
IPTC rNews
rNews Model
Good Relations Model
GR+RDFa+Yahoo
<span rel="gr:hasPriceSpecification"> <span property="gr:hasCurrencyValue" datatype="currency:USD" content="335000"></span></span>
Linked Data
• Linking up all the dataon the Web
• Exposes relationshipsbetween data that canlead to greaterknowledge creation &user experiences• “serendipitous
connections”• Reliable URLs return
RDF metadata• Use common
vocabularies
Linked Data
As of September 2010
MusicBrainz
(zitgist)
P20
YAGO
World Fact-book (FUB)
WordNet (W3C)
WordNet(VUA)
VIVO UFVIVO
Indiana
VIVO Cornell
VIAF
URIBurner
Sussex Reading
Lists
Plymouth Reading
Lists
UMBEL
UK Post-codes
legislation.gov.uk
Uberblic
UB Mann-heim
TWC LOGD
Twarql
transportdata.gov
.uk
totl.net
Tele-graphis
TCMGeneDIT
TaxonConcept
The Open Library (Talis)
t4gm
Surge Radio
STW
RAMEAU SH
statisticsdata.gov
.uk
St. Andrews Resource
Lists
ECS South-ampton EPrints
Semantic CrunchBase
semanticweb.org
SemanticXBRL
SWDog Food
rdfabout US SEC
Wiki
UN/LOCODE
Ulm
ECS (RKB
Explorer)
Roma
RISKS
RESEX
RAE2001
Pisa
OS
OAI
NSF
New-castle
LAAS
KISTIJISC
IRIT
IEEE
IBM
Eurécom
ERA
ePrints
dotAC
DEPLOY
DBLP (RKB
Explorer)
Course-ware
CORDIS
CiteSeer
Budapest
ACM
riese
Revyu
researchdata.gov
.uk
referencedata.gov
.uk
Recht-spraak.
nl
RDFohloh
Last.FM (rdfize)
RDF Book
Mashup
PSH
ProductDB
PBAC
Poké-pédia
Ord-nance Survey
Openly Local
The Open Library
OpenCyc
OpenCalais
OpenEI
New York
Times
NTU Resource
Lists
NDL subjects
MARC Codes List
Man-chesterReading
Lists
Lotico
The London Gazette
LOIUS
lobidResources
lobidOrgani-sations
LinkedMDB
LinkedLCCN
LinkedGeoData
LinkedCT
Linked Open
Numbers
lingvoj
LIBRIS
Lexvo
LCSH
DBLP (L3S)
Linked Sensor Data (Kno.e.sis)
Good-win
Family
Jamendo
iServe
NSZL Catalog
GovTrack
GESIS
GeoSpecies
GeoNames
GeoLinkedData(es)
GTAA
STITCHSIDER
Project Guten-berg (FUB)
MediCare
Euro-stat
(FUB)
DrugBank
Disea-some
DBLP (FU
Berlin)
DailyMed
Freebase
flickr wrappr
Fishes of Texas
FanHubz
Event-Media
EUTC Produc-
tions
Eurostat
EUNIS
ESD stan-dards
Popula-tion (En-AKTing)
NHS (EnAKTing)
Mortality (En-
AKTing)Energy
(En-AKTing)
CO2(En-
AKTing)
educationdata.gov
.uk
ECS South-ampton
Gem. Norm-datei
datadcs
MySpace(DBTune)
MusicBrainz
(DBTune)
Magna-tune
John Peel(DB
Tune)
classical(DB
Tune)
Audio-scrobbler (DBTune)
Last.fmArtists
(DBTune)
DBTropes
dbpedia lite
DBpedia
Pokedex
Airports
NASA (Data Incu-bator)
MusicBrainz(Data
Incubator)
Moseley Folk
Discogs(Data In-cubator)
Climbing
Linked Data for Intervals
Cornetto
Chronic-ling
America
Chem2Bio2RDF
biz.data.
gov.uk
UniSTS
UniRef
UniPath-way
UniParc
Taxo-nomy
UniProt
SGD
Reactome
PubMed
PubChem
PRO-SITE
ProDom
Pfam PDB
OMIM
OBO
MGI
KEGG Reaction
KEGG Pathway
KEGG Glycan
KEGG Enzyme
KEGG Drug
KEGG Cpd
InterPro
HomoloGene
HGNC
Gene Ontology
GeneID
GenBank
ChEBI
CAS
Affy-metrix
BibBaseBBC
Wildlife Finder
BBC Program
mesBBC
Music
rdfaboutUS Census
Example: dbpedia
Example: dbpedia
New York Times
data.gov.au
• Mashup: Link ACT BBQs to Toilet Map?• Issues:
• Format (Excel V XML)• Consistency between National and ACT data• “Latitudes and longitudes are given in ACT Stromlo
Projection form”• All entities need persistent URLs
• BBQ Class: http://www.productontology.org/doc/Barbecue• BBQ Instance: http://act.gov.au/bbq/4532
• Resolve to (RDF) metadata
• (Need) Future d.g.a Technical Roadmap
data.gov.au
Did I Mention Reuse?
Open Graph Protocoltitletypedescriptionurlimagesite_namelatitudelongitudestreet-addresslocalityregionpostal-codecountry-nameemailphone_numberfax_numbervideoaudio
GML
rNews
GML
Mapping: SKOS
SKOS• Simple Knowledge Organization System (SKOS) is a common
data model for sharing and linking knowledge systems• Fundamental element of SKOS vocabulary is the “concept”
• Labelling properties with preferred, and alternative terms• Broader and Narrower properties
• Transitive relationships• Related properties• Mapping properties to indicate how concepts compare• Ordering of concepts• Documentary properties for notes, definitions, examples
SKOS Mapping Example
Automatic Semantics?
Auto RDF Creation• Open Calais - Automatically creates rich semantic metadata
from the content submitted • Used natural language processing & machine learning
methods• Categorizes and links document with
• entities (people, places,organizations, etc.)
• facts (person "x" works forcompany "y")
• events (person "z" wasappointed chairman of company"y" on date "x").
Open Calais: Viewer Example “OAIC Principles Press Release”
Summary
Principles• Free and Open
• SBR Taxonomy?• Based on Open Standards
• Check Governance processes• Single (proprietary) Namespace?
• Easily discoverable• Must be on data.gov.au• Query interface
• Understandable• All properties need to be (human) documented
• All assumptions made explicit
Principles• Machine-readable
• XML (minimum)….RDF better• Reuse Semantics - Don’t reinvent • Resolve the URL
• Freely reusable and transformable
• CC Licenses - “all or nothing” !• Example: Budget 2011 Papers
• Linked Data• Authority for the URL• Will the URL be sustained?
Conclusion• “Global Metadata Registry” (1997)
• Think local • Better documentation of semantics• Give all entities a URL identifier + resolve metadata
• “Schema Extensibility” (1997)
• Reuse, Reuse, Reuse• Single namespace vocabularies should be rare
• “Metadata Technologies are Maturing at a Significant Rate” (1997)• Proliferation (yes)...Maturity (debatable)!
• Model, Reuse & Link
top related