using the semantic web - iribenno/talks/mbbsemantics... · 7/31/2012 · semantic web “a web of...
TRANSCRIPT
![Page 1: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/1.jpg)
Using the Semantic Web
M. Benno BlumenthalInternational Research Institute for Climate
and SocietyColumbia University
31 July 2012CU Metadata Group
![Page 2: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/2.jpg)
Semantic Web
● “a web of data that can be processed directly and indirectly by machines”
● Web 3.0
really it is about Explicit Semantics● Universal Resource Identifier (URI)● Resource Description Framework (RDF)● Web Ontology Language (OWL)
![Page 4: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/4.jpg)
Why use RDF?
Make implicit semantics explicit
Web-based system for interoperating semantics
RDF/OWL is an emerging technology, so tools are being built that help solve the semantic problems in handling data
Make implicit semantics explicit
Web-based system for interoperating semantics
RDF/OWL is an emerging technology, so tools are being built that help solve the semantic problems in handling data
![Page 5: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/5.jpg)
Standard Metadata
Users
Datasets
Tools
Standard Metadata Schema/Data Services
![Page 6: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/6.jpg)
Many Data Communities
● Semantic walls● Exchange walls
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
![Page 7: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/7.jpg)
Super Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Standard metadata schema
![Page 8: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/8.jpg)
Super Schema: direct
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Standard metadata schema/data service
![Page 9: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/9.jpg)
Flaws
● A lot of work● Super Schema/Service is the Lowest-
Common-Denominator● Science keeps evolving, so that standards
either fall behind or constantly change
![Page 10: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/10.jpg)
RDF Standard Data Model Exchange
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Tools
Users
Datasets
Standard Metadata Schema
Standard metadata schema
RDF
RDF
RDF
RDF
RDF
RDF
![Page 11: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/11.jpg)
Standard metadata schema
Tools
Users
Datasets
Standard Metadata Schema
RDF
RDFRDF
Tools
Users
Datasets
Standard Metadata Schema
RDF
RDFRDF
Tools
Users
Datasets
Standard Metadata Schem
RDF
RDFRDF
RDF Data Model Exchange
RDF
Tools
Users
Datasets
Standard Metadata Schema
RDF
RDFRDF
Tools
Users
Datasets
Standard Metadata Schema
RDF
RDFRDF
![Page 12: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/12.jpg)
Why is this better?
● Maps the original dataset metadata into a standard format that can be transported and manipulated
● Still the same impedance mismatch when mapped to the least-common-denominator standard metadata, but
● When a better standard comes along, the original complete-but-nonstandard metadata is already there to be remapped, and “late semantic binding” means everyone can use the new semantic mapping
● Can use enhanced mappings between models that have common concepts beyond the least-common-denominator
● EASIER – tools to enhance the mapping process, mappings build on other mappings
![Page 13: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/13.jpg)
RDF Architecture
RDF
RDFRDF
RDF
RDFRDF
RDF
RDFRDF
RDF
RDF
RDFRDF
RDF
RDFRDF
Virtual (derived) RDF
queries queries queries
![Page 14: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/14.jpg)
Example: Search Interface
Search Interface
Users
Datasets
Search Ontology
Dataset Ontology
Additional Semantics
![Page 15: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/15.jpg)
Sample Tool: Faceted Searchhttp://iridl.ldeo.columbia.edu/ontologies/query2.pl?...
![Page 16: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/16.jpg)
Distinctive Features of the search
● Search terms are interrelated● terms that describe the set of returns are
displayed (spanning and not)● Returned items also have structure (sub-items
and superseded items are not shown)
![Page 17: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/17.jpg)
Architectural Features of the search
● Multiple search structures possible● Multiple languages possible● Search structure is kept in the database, not in
the code
http://iridl.ldeo.columbia.edu/ontologies/query2.pl
![Page 18: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/18.jpg)
Triplets of • Subject• Property (or Predicate)• Object
URI’s identify things, i.e. most of the aboveNamespaces are used as a convenient
shorthand for the URI’s
RDF: framework for writing connections
![Page 19: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/19.jpg)
Datatype Properties
{WOA} dc:title “NOAA NODC WOA01”
{WOA} dc:description “NOAA NODC WOA01: World Ocean Atlas 2001, an atlas of objectively analyzed fields of major ocean parameters at monthly, seasonal, and annual time scales. Resolution: 1x1; Longitude: global; Latitude: global; Depth: [0 m,5500 m]; Time: [Jan,Dec]; monthly”
![Page 20: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/20.jpg)
Object Properties
{WOA} iridl:isContainerOf {Grid-1x1},
{Grid-1x1} iridl:isContainerOf {Monthly}
![Page 21: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/21.jpg)
WOA01 diagram
![Page 22: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/22.jpg)
Standard Properties
{WOA} dcterm:hasPart {Grid-1x1},{Grid-1x1} dcterm:hasPart {MONTHLY}
Alternatively
{WOA} iridl:isContainerOf {Grid-1x1},{iridl:isContainerOf} rdfs:subPropertyOf
{dcterm:hasPart}
![Page 23: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/23.jpg)
{SST} rdf:type {cfatt:non_coordinate_variable}, {SST} cfobj:standard_name {cf:sea_surface_temperature}, {SST} netcdf:hasDimension {longitude}
Data Structures in RDF
Object properties provide a framework for explicitly writing down relationships between data objects/components, e.g. vague meaning of nesting is made explicit
Properties also can be related, since they are objects too
![Page 24: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/24.jpg)
Search Interface Term
• http://iri.columbia.edu/~benno/sampleterm.pdf
![Page 25: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/25.jpg)
Virtual Triples
Use Conventions to connect concepts to established sets of concepts
Generate additional “virtual” triples from the original set and semantics
RDFS – some property/class semanticsOWL – additional property/class semantics: more
sophisticated (ontological) relationshipsSWRL – rules for constructing virtual triples
![Page 26: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/26.jpg)
Multiple Ways of Expressing Concepts in RDF
Note that there are many world views in how to express concepts: concepts as classes vs concepts as individuals vs concept as predicate values
![Page 27: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/27.jpg)
Nuanced tagging
Concepts as objects can be interrelated: specific terms imply broader terms
Object ends up being tagging with terms ranging from general to specific.
Search can then be nuancedtagging can proceed in absence of perfect
information
![Page 28: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/28.jpg)
Faceted Search Explicated
![Page 29: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/29.jpg)
Search Interface
● Items (datasets/maps)
● Terms● Facets● Taxa
![Page 30: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/30.jpg)
Search Interface Semantic API
{item} dc:title dc:description rss:link iridl:icon dcterm:isPartOf {item2} dcterm:isReplacedBy {item2}
{item} trm:isDescribedBy {term}
{term} a {facet} of {taxa} of {trm:Term},{facet} a {trm:Facet}, {taxa} a {trm:Taxa},{term} trm:directlyImplies {term2}
![Page 31: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/31.jpg)
RDF Architecture
RDF
RDFRDF
RDF
RDFRDF
RDF
RDFRDF
RDF
RDF
RDFRDF
RDF
RDFRDF
Virtual (derived) RDF
queries queries queries
![Page 32: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/32.jpg)
Data Servers
Ontologies
MMI
JPL
StandardsOrganizations
Start Point
RDF/XML-Schema CrawlerXSLT/GRDDL ingest
XML Schema to OWL translationOwl SemanticsSWRL Rules
SeRQL CONSTRUCT
Search Queries
LocationCanonicalizer
TimeCanonicalizer
Sesame
Search Interface
bibliography
IRI RDF Architecture
![Page 33: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/33.jpg)
Models, Crosswalks, and Objects
Structure of the RDF information that we are using to represent data objects in multiple frameworks (see full figure)
![Page 34: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/34.jpg)
Semantic Crosswalk for metadata translation
![Page 37: Using the Semantic Web - IRIbenno/Talks/MBBSemantics... · 7/31/2012 · Semantic Web “a web of data that can be processed directly and indirectly by machines” Web 3.0 really](https://reader035.vdocuments.net/reader035/viewer/2022071012/5fca69074da80652a70d862d/html5/thumbnails/37.jpg)
Semantic Web
● Universal ids (URIs)● Multiple partial representations adding to be a
more complete picture
John Godfrey Saxe (1816-1887)