skos as the focal point of linked data strategies
DESCRIPTION
This talk was given at SEMANTiCS 2014 in Leipzig. It gives an overview how to develop an enterprise linked data strategy around controlled vocabularies based on SKOS. It discusses how knowledge graphs based on SKOS can extended step by step due to the needs of the organization.TRANSCRIPT
SKOS as the focal point of linked data strategies
Andreas Blumauer, MSc IT
About Semantic Web Company (SWC)
SWC was founded 2001 in Vienna, Austria
Over 20 experts in linked data
technologies
Product: PoolParty Semantic Platform
Serving customers from three continents
EU- & US-based partner network
Our network: Customers & Partners
Customers● Credit Suisse● Daimler● Roche● Wolters Kluwer● Tieto● Red Bull Media House● World Bank Group● The Pokémon Company● Healthdirect Australia● Ministry of Finance (A)● Council of the E.U.● TC Media● American Physical Society● Education Services Australia● Wood Mackenzie● Pearson● Geological Survey (A)● Norwegian Direct. of
Immigration● REEEP● GBPN - Global Buildings
Performance Network● ...
Finance / Automotive / Publisher / Health Care / Public Administration / Energy / EducationPartners
● Cognizant● EPAM Systems● iQuest● DTI AG● Tenforce● OpenLink Software● Ontotext● Gravity Zero● Altotech● Wolters Kluwer● Term Management● Taxonomy Strategies● Search explained● WAND● Linked Data Factory● Taxonic● semweb● Digirati● KMSolutions
FAQs: How enterprises dive deeper into linked data technologies & methodologies
1. What is linked data?2. How do we benefit from it?
What are the use cases for linked data? 3. Will it replace our existing systems,
e.g. enterprise search?4. Can we reuse existing metadata, glossaries,
schemes & vocabularies?5. How can we integrate linked data into our DMS,
CMS, …6. Where should we start?
1. What is Linked Data? An enterprise perspective
Graphs everywhere...
Microsoft
Things are the key elements of graphs which open the doors to ‘smart data’
It’s all about things,not strings!
Add relations between things and link them with other knowledge graphs
http://www.mycom.com/taxonomy/6234672
http://www.mycom.com/taxonomy/9734585
http://www.mycom.com/taxonomy/4543567
is winner of
http://musicbrainz.org/artist/1036b808-f58c-4a3e-b461-a2c4492ecf1bhttps://twitter.com/nickiminajhttp://open.spotify.com/artist/0hCNtLu0JehylgoiP8L4Gh http://www.imdb.com/name/nm3747326/http://viaf.org/viaf/154110584http://www.youtube.com/user/NickiMinajAtVEVOhttp://dbpedia.org/resource/Nicki_Minajhttp://www.freebase.com/m/047sxrj
related
Harvest facts, references, images, videos, ….
persons
events
Link your knowledge graphs with your documents and data streams!
http://www.mycom.com/taxonomy/62346723
prefLabel
Miley Cyrus
image
http://www.mycom.com/images/90546089
http://www.mycom.com/taxonomy/97345854
prefLabel
Nicki Minaj
altLabel Onika Tanya Maraj
http://www.mycom.com/taxonomy/4543567
prefLabel
altLabel
MTV VideoMusic Award
VMA
2. How do we benefit from it?What are the use cases for linked data?
Benefit arguments
Cost effectiveness The systemic viewOperating efficiency
Basic argument
IT-Management /Software Architect
Information & Knowledge
Management
Business Process
Management
Efficient and agile
data model
Better reuse of existing information resources helps to
save costs
Better understanding of relations between things increases communication
skills
Unified views on business objects
lead to better decisions
Higher information quality
Efficient handling of metadata
Increased transparency on inconsistencies and
contradictions
Information flows adapt to the
needs of the user
Improved informati
on retrieval
Automatic structuring of
unstructured data help to save costs
Consistent use of controlled vocabularies
triggers additional network effects
BI-like, complex queries become
possible
Use cases for linked data - 3 archetypical scenarios
Unified Views Contextual awareness BI-like, complex queries
http://reegle.info/countries http://www.eip-water.eu/ http://www.gbpn.org/
3. Will it replace our existing systems?
An architectural question
Things are everywhere!A four-layered information architecture
enterprise knowledge model
domain specificknowledge model
annotation &categorization
legacy data& documents
It won’t replace existing systems, but it will change the way apps are developed
Make conceptual model & the semantics of your data explicitly available
PoolParty Semantic Integrator:Unified Views on various data sources
based on
4. Can we reuse existing metadata? Glossaries, schemes & vocabularies
Revisiting the good old ‘ontology continuum’
Glossaries &Folksonomy
Taxonomy
Thesaurus
Ontology
SemanticExpressivity
How about linkability?
Is this really a continuum?
What do we need for text mining?
From CSV and Excel to Taxonomies
From Folksonomies to Taxonomies
Free terms (candidate terms) are extracted from document collections
… and asserted into thecontrolled vocabulary.
From Taxonomies to Ontologies: Start with SKOS...
… and apply ontologies
Linkability: Modularize knowledge graphs like you do with software!
SKOS is a ‘semantic interface’ to retrieve and link distributed content
EurovocWKD German labor law thesaurus
STW Thesaurus
DBpedia
5. How can we integrate linked data
into our systems?
PoolParty PowerTagging: integrate semantics in enterprise content systems
See how it works:PoolParty components & workflows
works on
basis for
● reference taxonomies
● linked data sources● text reference
corpora
enrich
basis for
Taxonomist/Ontologist
Developer
● Confluence, WordPress SharePoint, Drupal
● search engine● database
is user ofContentManager
enrich annotate
basis foranalyzes
uses API
6. Where should we start? Start SKOS, grow big!
SKOS is at the intersection of three disciplines and their paradigms
SKOS
librarians & taxonomists
data engineers &artificial intelligenceschemas &
ontologies taxonomies &classification
systems
text mining &data analytics
computational linguists & information managers
SKOS makes taxonomies/thesauri accessible, linkable & reusable
http://www.w3.org/2004/02/skos/
SKOS as a basis to visualize and browse semantic knowledge graphs
SKOS: You are not alone...
● Eurovoc (EU)● ESCO (EU)● Jurivoc (SUI)● ScoT (AUS)● Agrovoc (UN)● MeSH (US)● Getty Vocabularies (US)● GEMET (EEA)● GeoThesaurus (AT)● STW Economy (DE)● Polythematic SH (CZ)● Canadian Subject Headings (Can)● LCSH (US)● Worldbank Taxonomy (WBG)● Labor Law Germany Thesaurus (DE)● Reegle Thesaurus (REEEP)● Austrian Tax Law Thesaurus (AT)
● UNESCO Thesaurus (UN)● New York Times SH (US)● RAMEAU subject headings (FR)● TheSoz (DE)● The General Finnish Thesaurus (FIN)● NAL Thesaurus (US)● Social Semantic Web Thesaurus (AT)● Courts thesaurus (DE)● SITC-V4 (UN)● Google Product Taxonomy (US)● NAICS 2012 (US)● Common Procurement Vocabulary (ES)● UKAT UK Archival Thesaurus (UK)● NASA taxonomy (US)● IVOA astronomy vocabularies (UK)● IPTC News Codes (UK)● WAND taxonomies (US)
Provide integrated & interlinked views on all kind of information
The SKOS/Linked Data based approach for information integration
Transforming documents into SKOS based graphs
Annotating & categorising documents
SKOS based graph of concepts
Tree of categories & terms
Standards based ontologies applied to
SKOS based concept graphs
Schemas, classes, properties,
restrictions & rules
Let your documents become part of something bigger & make them smart!
Transforming documents into SKOS based graphs
Annotating & categorising documents
http://vocabulary.semantic-web.at/semweb/367
Tim Berners-Lee TimBL
skos:altLabelskos:prefLabel
Show me biographies of all computer scientists working for an organization located near Boston.
Start SKOS, grow big.
So Long, and Thanks for All the Links
Contact points & further information
Andreas Blumauer, MSc [email protected]://at.linkedin.com/in/andreasblumauer/https://plus.google.com/115842492297705285184/
Semantic Web Company GmbHMariahilfer Strasse 70/8, A-1070 Vienna+43-1-4021235http://www.semantic-web.athttp://www.poolparty-software.comhttp://slideshare.net/semwebcompanyhttp://youtube.com/semwebcompany