© 2010 openlink software, all rights reserved. exploiting linked data by kingsley idehenkingsley...
TRANSCRIPT
© 2010 OpenLink Software, All rights reserved.
Exploiting Linked Data
By Kingsley Idehen
Founder & CEO, OpenLink Software
Presentation Goals
Demystify Linked Data
Demonstrate the unobtrusive nature
of Linked Data
Explore how Linked Data can
Work for You!
© 2010 OpenLink Software, All rights reserved.
Situation Analysis
User Generated Content is Growing
Exponentially
Enterprise & Individual Connectivity is Growing
The Line Between The Individual & The
Enterprise is Blurring (Social Dimension is now
Palpable)
Access, Integration, Manipulation, Dissemination
of Data, Information, and Knowledge still
underlies our Agility!
© 2010 OpenLink Software, All rights reserved.
Data, Information, and Knowledge
What is Data ?
How we express observation
What is Information?
How we use Context to perceive observation
What is Knowledge?
How we Comprehend what we perceive
© 2010 OpenLink Software, All rights reserved.
Late and loose Schema / Dictionary binding. Powerful mechanism for Data Access, Integration,
Manipulation, and Dissemination via Hyperlinks that’s platform agnostic and exploitable via: Native desktop productivity tools (spreadsheets, report writers,
business intelligence analytics, master data management, word processors, email etc.)
Native collaboration tools Native development tools and frameworks Web based collaboration tools with varying degrees of sociality
(macro and micro blogs, wikis, shared bookmarks, feeds, discussion forums etc.)
Web based development tools and frameworks .
Why is Linked Data Important?
© 2008 OpenLink Software, All rights reserved.
What is Linked Data?
Hyperlink based Structured Data Representation where: Resources consist of fine-grained Data Objects Every Data Object is uniquely identified using a
functional (resolvable) Hyperlink based Name Every Data Object is accessible via a functional
(resolvable) Hyperlink based Address Every Object has Identity distinct from its Access
Address Data Object representation takes the form of a
directed graph pictorial where Attribute=Value pairs coalesce around a Data Object Name.
© 2010 OpenLink Software, All rights reserved.
Contd: What is Linked Data?
Hyperlink based Structured Data Representation where: Object representation is expressible using a variety of
syntaxes that include: RDF family of syntaxes,
Microdata, OData, ASN.1 etc.
Data Object serialization formats for across the wire
transmission are varied and negotiable (if working with
HTTP)
© 2010 OpenLink Software, All rights reserved.
Is Linked Data New?
No! Similar to “data access by reference”
patterns already established by ODBC modulo: Relational “Tables” Model Specificity
Coarse Grained Data Access Granularity
(stops at Table/View level i.e. cannot reference
individual data objects directly)
© 2010 OpenLink Software, All rights reserved.
Contd: Is Linked Data New?
X.500 based Data Source Naming style
Data Representation & Data Access Protocol
Conflation
Operating Systems Specificity (for the most part re.
Driver Manager libraries).
© 2010 OpenLink Software, All rights reserved.
Data Source Name (DSN) Types
ODBC DSN=“CRM”;HOST=crm.example.org;SVT=Oracle;DATABASE=C
RM;TABLE=CUSTOMER
JDBC URL jdbc:openlink://crm.example.org/SVT=Oracle/DATABASE=CRM/
TABLE=CUSTOMER
LSID based Data Object Name for “Elephant-eared kangaroo rat” from Universal Biological Indexer & Organizer urn:lsid:ubio.org:namebank:762537
Data Object Access Address (DSN with one level of indirection) http://kingsley.idehen.net/dataspace/person
Data Object Name/Handle (DSN with > one level of indirection) http://kingsley.idehen.net/dataspace/person#this
© 2010 OpenLink Software, All rights reserved.
Linked Data & ODBC Similarities
© 2010 OpenLink Software, All rights reserved.
The Linked Data Journey
Computer Network Infrastructure Distributed Computing Distributed Databases Compound Documents Distributed Data Objects
© 2010 OpenLink Software, All rights reserved.
Network: Routers link Computers
© 2010 OpenLink Software, All rights reserved.
Internet Layer: Cloud hides Routers
© 2010 OpenLink Software, All rights reserved.
Internet: Computers Link to Documents
© 2010 OpenLink Software, All rights reserved.
Web Layer: Links to Compound Docs
© 2010 OpenLink Software, All rights reserved.
Web Layer: Page Links Hide Compound Docs
© 2010 OpenLink Software, All rights reserved.
Data Layer: Links To Object Descriptors
© 2010 OpenLink Software, All rights reserved.
Data Layer: Links to Object Descriptors Hide Pages
© 2010 OpenLink Software, All rights reserved.
Linked Data Web Materialization Generated "on the fly" via RDF middleware from:
Existing Web Pages (POSH, Microformats, eRDF, RDFa, GRDDL)
Web 2.0 Data Spaces (Social Networks, Blogs, Wikis, Bookmarks, Online
Discussions / Conversations etc)
Web Services (SOAP and REST)
Enterprise Data Sources SOA oriented Web Services
XML based Data Warehouses and Views
Enterprise Databases (ODBC, JDBC, OLE-DB, ADO.NET, XMLA, Native
CLIs)
Community driven extraction efforts DBpedia , Bio2RDF , and many other Linking Open Data projects
© 2010 OpenLink Software, All rights reserved.
Linked Data Generation from Existing Dynamic Web Content
© 2010 OpenLink Software, All rights reserved.
Linked Data Generation from Existing Web Services
© 2010 OpenLink Software, All rights reserved.
Linked Data Generation from Enterprise Data Sources
© 2010 OpenLink Software, All rights reserved.
Linked Data Generation from Existing Web Content
© 2010 OpenLink Software, All rights reserved.
Linked Data Generation via Linked Middleware
© 2010 OpenLink Software, All rights reserved.
Build or Buy a Linked Data Server capable of: Minting hyperlink based Identifiers that ensure Data Object
Names and Addresses are distinct
Publishing Data Objects that support a wide variety of across-
the-wire data serialization formats
Fine-grained access control lists for controlling access to Data
Objects
Create, Update, Read, and Delete operations
Generating Data Objects from a plethora of existing non Linked
Data oriented data sources
Horizontal Scale in line with Data Object growth.
How Do I Deploy Linked Data?
© 2010 OpenLink Software, All rights reserved.
Linked Data Usage Examples
Enterprise 3.0 Customer, Orders, Employees, Products, Countries, Suppliers,
Competitors etc. are all represented via Data Objects.
Social Media Blog Posts, Tweets, Tags, Comments, Threaded Discussions,
Feeds, Wikis, Bookmarks, Likes, DisLikes, +1’s etc. are all
represented via Data Objects.
Policy Based Security & Access Controls
Agent Info Cards (security tokens) for Software, People, and
Organizations are represented via Data Objects.
© 2010 OpenLink Software, All rights reserved.
Linked Data Exploitation: Enterprise
Conceptual interaction with Heterogeneous Data Sources that transcends DBMS engine models
DBMS engine vendors APIs
SOA Web Services
Host Operating Systems
Simply “Peel-Back” the containment of Applications by integrating Linked
Data Source Names into ERP Systems
CRM Systems
Enterprise Collaboration Systems
HR Systems
Marketing Systems
Web Content Management & Corporate Communications Systems
© 2010 OpenLink Software, All rights reserved.
Linked Data Exploitation: Social-Networking
Construction & Deconstruction of Social-Networks will
become a loose function of data links exposed via: Our Posts (macro and micro blogs)
Our Tags
Our Interests
Our Realms of Discourse (comments associated with Blogs,
Wikis, and other Data Spaces on the Web)
It’s Community, Stupid! Community-building prowess will express Vitality
Vitality will be an expression of “Link Density”
Co-opetition will thrive.
© 2010 OpenLink Software, All rights reserved.
Linked Data Exploitation: Security & Privacy
Verifiable Identifiers via WebID protocol and Agent
Profiles where:
PKI is decoupled from centralized CA network
Info Cards bear WebID (an Agent Identifier) watermark
which makes federated proof of identity viable.
Access Control Lists
Applied to individuals
Groups / Circles.
© 2010 OpenLink Software, All rights reserved.
Additional Information
OpenLink Software: OpenLink Software OpenLink Virtuoso OpenLink Data Explorer Universal Data Access Drivers http://www.openlinksw.com/blog/~kidehen/ (weblog)
Linking Open Data Community Projects http://dbpedia.org/ -- DBpedia http://lod.openlinksw.com -- LOD Cloud Cache
© 2010 OpenLink Software, All rights reserved.