1 extended metadata registry (xmdr) ecoterm rome, italy may 17, 2006 bruce bargmeyer, lawrence...

21
1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel: +1 510-495-2905 [email protected]

Upload: charla-williams

Post on 14-Jan-2016

218 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

1

eXtended Metadata Registry (XMDR)

EcotermRome, ItalyMay 17, 2006

Bruce Bargmeyer, Lawrence Berkley National LaboratoryUniversity of CaliforniaTel: +1 [email protected]

Page 2: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

2

XMDR Project Draws Together

UsersUsers

ISO/IEC 11179MetadataRegistries

Terminology

CONCEPT

Referent

Refers To Symbolizes

Stands For

“Rose”,“ClipArt”

Metadata Registry

Terminology Thesaurus Taxonomy

DataStandards

Ontology

StructuredMetadata

ISO/IEC JTC 1/SC 32ISO TC 37 & …

Page 3: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

3

XMDR Direction

ISO/IEC JTC 1/SC 32

UsUs

ersers

ISO/IEC 11179MetadataRegistries Terminology

CONCEPT

Referent

Refers To Symbolizes

Stands For

“Rose”,“ClipArt”

Metadata Registry

Terminology Thesaurus Taxonomy

DataStandards

Ontology

StructuredMetadata

ISO TC 37 & …

DataAdmin.

Page 4: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

4

XMDR Direction

ISO/IEC JTC 1/SC 32

UsUs

ersers

ISO/IEC 11179MetadataRegistries Terminology

CONCEPT

Referent

Refers To Symbolizes

Stands For

“Rose”,“ClipArt”

Metadata Registry

Terminology Thesaurus Taxonomy

DataStandards

Ontology

StructuredMetadata

ISO TC 37 & …

SemanticComputing

Page 5: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

5

Metadata Registry Extensions

Register (and manage) any semantics that are useful for managing data. E.g., this may include registering not only permissible

values (concepts), definitions, but may extend to registration of the full concept systems in which the permissible values are found.

E.g., may want to register keywords, thesauri, taxonomies, ontologies, axiomatized ontologies….

Support traditional data management and data administration

Lay Foundation for semantic computing: Semantics Service Oriented Architecture, Semantic Grids, Semantics based workflows, Semantic Web ….

Page 6: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

6

XMDR Project Results

Design for next generation metadata registries, expressed as a standard—ISO/IEC 11179 family of standards

XMDR Prototype, open source software Semantic content in prototype Demonstrations for healthcare and the

environment Ecoinformatics test bed

Demonstration using water data and concept systems

Page 7: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

7

XMDR: Register Any Concept System orKnowledge Organization System (KOS)

KeywordsGlossariesGazetteersThesauriTaxonomiesOntologiesAxiomatized Ontologies

Page 8: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

8

XMDR Content List (partial)

NBII Biocomplexity Thesaurus

NCI Thesaurus National Cancer Institute Thesaurus

NCI Data Elements (National Cancer Institute Data Standards Registry

UMLS (non-proprietary portions)

GEMET (General Multilingual Environmental Thesaurus)

(New project to get Chinese terms for the GEMET concepts)

EDR Data Elements (Environmental Data Registry)

USGS Geographic Names Information System (GNIS) HL7 Terminology, Data Elements

Mouse Anatomy

GO (Gene Ontology)

EPA Web Registry Controlled Vocabulary

BioPAX Ontology

NASA SWEET Ontologies

AGROVOC …

Page 9: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

9

XMDR: Register Ontologies

Concept Concept

ConceptConcept Geographic Area

Geographic Sub-Area

Country

Country Identifier

Country Name Country Code

Short Name ISO 31662-Character

Code

ISO 31663- Character

Code

Long Name

DistributorCountry Name

Mailing AddressCountry Name ISO 3166

3-Numeric CodeFIPS Code

Page 10: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

10

XMDR: Register Graphs

Directed Graph

Directed Acyclic Graph

Graph

Undirected Graph

Bipartite Graph

Partial Order Graph

Faceted Classification

Clique

Partial Order Tree

Tree

Lattice

Ordered Tree

Note: not all bipartite graphsare undirected.

Graph Taxonomy:

Page 11: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

11GooseDuck

Waterfowl

Represent concepts and relationships as nodes and edges in formal graph structurese.g., “is-a” hierarchies.

Duck Goose

Waterfowl

is-a is-a

is-ais-a

CanvasbackBufflehead

Include Concept System Semantics in Metadata Registries

Page 12: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

12

Inference

Polio Smallpox

Infectious Disease

Disease

is-a

is-a is-a

is-a

is-a

Diabetes Heart disease

Chronic Disease

is-a

Signifies inferred is-a relationship

Page 13: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

13

ISO/IEC 11179 Metadata Registries+ XMDR

Register and manage semantics that are or can be harmonized and vetted by some Community of Interest (COI)

Provide Semantic ServicesE.g., the semantics can be referenced by RDF

statements (subjects, predicates, objects)The semantics can be used for Semantic Web and

Semantic Computing A “vocabulary” that is grounded for some COI

Page 14: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

14

Ontology EditorProtege11179 OWL Ontology

XMDR Prototype: Modular Architecture-- Initial Implemented Modules

MetadataValidator

AuthenticationService

MappingEngine

RegistryExternalInterface

Generalization Composition (tight ownership) Aggregation (loose ownership)

Jena, Xerces

Java

RetrievalIndex

FullTextIndex

Lucene

LogicBasedIndex

Jena, OWI KSRacer

RegistryStore

WritableRegistryStore

Subversion

Page 15: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

15

XMDR Project Collaboration

Collaborative, interagency effort EPA, USGS, NCI, Mayo Clinic, DOD, LBNL

…& othersDraws on and contributes to

interagency/International Cooperation on Ecoinformatics

Interacts with many organizations around the world through ISO/IEC standards committees

Page 16: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

16

Where does this fit with Ecoterm?

Ecoterm organizations as sources of contentEcoterm organizations as

collaborators/testersEcoterm organizations as potential users of

Open Source software Potential collaboration on R&D projects

e.g., under European Commission Framework Program 7

WWW.XMDR.ORG

Page 17: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

17

Concept System Store

UsersUsers

Concept systems:KeywordsControlled VocabulariesThesauriTaxonomiesOntologiesAxiomatized Ontologies

(Essentially graphs: node-relation-node + axioms)

} Metadata Registry

Concept System Thesaurus Themes

DataStandards

Ontology GEMET

StructuredMetadata

Page 18: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

18

Management of Concept Systems

Metadata Registry

Concept System Thesaurus Themes

DataStandards

Ontology GEMET

StructuredMetadata

UsersUsers

Concept system:RegistrationHarmonization StandardizationAcceptance (vetting)Mapping (correspondences)

}

Page 19: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

19

Life Cycle Management

Metadata Registry

Concept System Thesaurus Themes

DataStandards

Ontology GEMET

StructuredMetadata

UsersUsers

Life cycle management:Data andConcept systems(ontologies)

}

Page 20: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

20

Grounding Semantics

Metadata Registry

Concept System Thesaurus Themes

DataStandards

Ontology GEMET

StructuredMetadata

UsersUsers

MetadataRegistries Semantic Web

RDF TriplesSubjectVerbObject

Ontologies

Page 21: 1 eXtended Metadata Registry (XMDR) Ecoterm Rome, Italy May 17, 2006 Bruce Bargmeyer, Lawrence Berkley National Laboratory University of California Tel:

22

See

www.xmdr.org