towards a web of data?
DESCRIPTION
A presentation to Digital Sparks North West's Transmissions event on 21 July 2010 [http://transmission6.eventbrite.com/]. This presentation covers Linked Data and the Semantic Web, and uses the example of companies such as TripIt to demonstrate that a little semantics can go a long way. The presentation then explores more formal approaches, such as those underway within the UK Government, and asks whether or not this is feasible in a commercial context.TRANSCRIPT
cloudofdata.com
Paul Miller
The Cloud of Data
Towards a Web of Data?
cloudofdata.com
Today’s a “Web of Documents” ?
www.flickr.com/photos/calliope/306564541/
cloudofdata.com www.sciam.com/article.cfm?id=the-semantic-web
Tomorrow’s “Semantic Web,” 2001-style!
http://tr.im/timblhttp://tr.im/hendler
cloudofdata.com
Pipe Dream?
cloudofdata.com
Or are the pieces falling into place?
cloudofdata.com
cloudofdata.com
itinerary… enriched with maps, weather, etc
cloudofdata.com
itinerary… in calendar, on iPhone, etc
cloudofdata.com
simply making an existing task easier and richer
text processing, back-end knowledge, APIs
semantic knowledge of place, time, etc
valuable data on intentions, choices, preferences and more
adds value, without shouting about it.
cloudofdata.com
From DARPA’s CALO Project… to the iPhone
bit.ly/dawiYQ
cloudofdata.com
cloudofdata.com
voice recognition
lots and lots of data… ideally via APIs
keeps data fresh, increases opportunities for cash, makes it an SEP
leverage knowledge of time, place, and history
‘a cab from here to work, tomorrow morning’
semantic smarts
‘cheap sushi’ and ‘great sushi’ require analysis of different data, and may result in different recommendations.
cloudofdata.com
finding meaning and associations in text
cloudofdata.com
relatively expensive, computationally
attempting to understand language and infer meaning
promising technology, but...
cloudofdata.com
the first big VC engagement with ‘Semantic Web’
bit.ly/bncbEA & bit.ly/9iuTKV
cloudofdata.com
community, meet canonical
cloudofdata.com
adding structure to information
cloudofdata.com
exposing via APIs. Glue for the Web.
cloudofdata.com
canonical source for community-enriched data
commercial interest
free, open, unencumbered… and available for download or linking.
cloudofdata.com $100 million$200 million
cloudofdata.com bit.ly/am57RN [& bit.ly/bqEWsk]
cloudofdata.com www.flickr.com/photos/wyoming_1/169388525/
Data tends to remain siloised.
cloudofdata.com
“Stop hugging your data...”
Sir Tim Berners-Lee, 2009
www.flickr.com/photos/_-amy-_/3167333250/
bit.ly/ymmpj
wider movement towards ‘Open’ Data
cloudofdata.com www.sciam.com/article.cfm?id=the-semantic-web
previous examples not like this...
http://tr.im/timblhttp://tr.im/hendler
cloudofdata.com Image © World Wide Web Consortium
URI - 1994XML - 1998RDF - 1999/ 2004OWL - 2004SPARQL - 2008Applications - 2007/8
W3C-driven effort. More used - and useful - than PR might imply
cloudofdata.com
“J.R.R. Tolkien wrote The Hobbit”
The HobbitJ.R.R. Tolkien wrote
cloudofdata.com
predicate
http://dbpedia.org/page/J._R._R._Tolkien
http://dbpedia.org/page/The_Hobbit
cloudofdata.com
eg Wikipedia data boxes to DBpedia
cloudofdata.com
eg UK Gov
www.flickr.com/photos/lorentey/1438477358/
bit.ly/ztOed
cloudofdata.com www.flickr.com/photos/scobleizer/2510349462/
eg NY Times
cloudofdata.com www.flickr.com/photos/virtualsugar/316200555/
Data OPEN for use and re-use… But that’s not all...
cloudofdata.com www.flickr.com/photos/foxypar4/2124673642/
Data LINKED to other places outside firewalleg BBC trusts and relies upon MusicBrainz
bit.ly/9tBJGH
cloudofdata.com www.flickr.com/photos/tanaka/3212373419/
“the Web done right”
Sir Tim Berners-Lee, 2008
harks back to TimBL’s original vision for a Read/Write Web
cloudofdata.com
Use URIs to name things
Use HTTP URIs so that they can be followed
When someone follows a URI, provide useful information
Include links to other URIs, so that more can be discovered.
www.w3.org/DesignIssues/LinkedData.html
As of July 2009
LinkedCTReactome
Taxonomy
KEGG
PubMed
GeneID
Pfam
UniProt
OMIM
PDB
SymbolChEBI
Daily Med
Disea-some
CAS
HGNC
InterPro
Drug Bank
UniParc
UniRef
ProDom
PROSITE
Gene Ontology
HomoloGene
PubChem
MGI
UniSTS
GEOSpecies
Jamendo
BBCProgrammes
Music-brainz
Magna-tune
BBCLater +TOTP
SurgeRadio
MySpaceWrapper
Audio-Scrobbler
LinkedMDB
BBCJohnPeel
BBCPlaycount
Data
Gov-Track
US Census Data
riese
Geo-names
lingvoj
World Fact-book
Euro-stat
flickrwrappr
Open Calais
RevyuSIOCSites
Doap-space
Flickrexporter
FOAFprofiles
CrunchBase
Sem-Web-
Central
Open-Guides
Wiki-company
QDOS
Pub Guide
RDF ohloh
W3CWordNet
OpenCyc
UMBEL
Yago
DBpedia
Freebase
Virtuoso Sponger
DBLPHannover
IRIT Toulouse
SWConference
Corpus
RDF Book Mashup
Project Guten-berg
DBLPBerlin
LAAS- CNRS
Buda-pestBME
IEEE
IBM
Resex
Pisa
New-castle
RAE 2001
CiteSeer
ACM
DBLP RKB
Explorer
eprints
LIBRIS
SemanticWeb.org
Eurécom
RKBECS
South-ampton
CORDIS
ReSIST ProjectWiki
NationalScience
Foundation
ECS South-ampton
LinkedGeoData
BBC Music
cloudofdata.com richard.cyganiak.de/2007/10/lod/lod-datasets_2009-07-14.pdf
cloudofdata.com www.flickr.com/photos/joevare/3743653601/
Open Data need not be Linked Data(eg US Data.gov)
Linked Data need not be Open Data(licensing hell)
cloudofdata.com www.flickr.com/photos/ldodds/4043803502/
how much mis-licensed?‘Copyright’ MAY NOT APPLY!
cloudofdata.com www.flickr.com/photos/starrett/3123216825/
Data as Commodity
cut costs, mass produce, standardise - create opportunities for growth
What does this mean for Enterprise?
cloudofdata.com www.datacenterknowledge.com/wp-content/uploads/2009/09/aerial-1000.jpg
Microsoft Data Centre, Dublin With data in the Cloud, barriers get lower.
It’s easier to TRY...
cloudofdata.com
Boundaries blur, walls (like this one) fall.Data and processes move back and forth.Fears eventually diminish.
www.flickr.com/photos/andrei_dimofte/1590561883/
cloudofdata.com www.flickr.com/photos/dsifry/2104378305/
“from Data Centre to data-centric”JP Rangaswami, 2009
data is a costwe treat all data as Core and confidentialmost actually Context and a commodityThink Different!
cloudofdata.com www.flickr.com/photos/king-edward/2152782252/
Closing Thoughts
cloudofdata.com www.flickr.com/photos/king-edward/2152782252/
semantics on the web bring value
sometimes solving quite narrow problems
The Semantic Web is part of that
but only part
Open Data creates opportunities, and challenges business models
Linked Data takes us beyond specific applications
This isn’t just for the public sector.
cloudofdata.com
Dr Paul Miller
The Cloud of Data
skype: cloudofdata
phone: +44 7769 740083
Except where otherwise noted, this work is licensed under the Creative Commons Attribution Licence. To view a copy of this licence, visit creativecommons.org/licenses/by/2.0/uk/ or send a letter to
Creative Commons, 171 Second St, San Francisco, CA 94105, United States of America
Thank you
cloud of data
Download this presentationslideshare.net/cloudofdata
Made on a
Mac