developing packages and integrating ontologies for...

1
Developing packages and integrating ontologies for Volcanoes, Plate Tectonics and Atmospheric Science Data Integration (IN53B-1204) (sesdi.hao.ucar.edu) Krishna Sinha 1 , Robert Raskin 2 , Deborah McGuinness 3,4,5 ([email protected] ), Peter Fox 6 ([email protected]), ( 1 Virginia Polytechnic University) ( 2 JPL/NASA) ( 3 McGuinness Associates) ( 4 Rensselaer Polytechnic Inst.) ( 5 Stanford University, Knowledge Systems, AI Lab), ( 6 HAO/NCAR) . Funded by NASA/ESTO/ACCESS. Abstract In support of a NASA-funded scientific application (SESDI; Semantically Enabled Science Data Integration Project; that needs to share volcano and climate data to investigate relationships between volcanism and global climate, we have generated a volcano and plate tectonic ontologies and leveraged and augmented the existing SWEET (Semantic Web for Earth and Environmental Terminology) ontology. Our goal is to create a package for integrating the relevant ontologies (meant to be shared and reused by a broad community of users) to provide access to the key volcanology, plate tectonic and atmospheric related databases. We present how we have put ontologies to work in this science application setting, and the methodologies employed to create the ontologies, map them to the underlying data and implement them for use by scientists. SESDI is an NASA/ESTO/ACCESS-funded project involving the High Altitude Observatory at the National Center for Atmospheric Research (NCAR), McGuinness Associates Consulting, NASA/JPL and Virginia Polytechnic University. Data Registration Framework Level 1: Data Registration at Discovery Level, e.g. Volcano location and activity Level 2: Data Registration at Inventory Level, e.g. list of datasets, times, products Level 3: Data Registration at Item Detail Level, e.g. access to individual quantities Ontology based Data Integration Using scientific workflows Earth Sciences Virtual Database A Data Warehouse where Schema heterogeneity problem is Solved; schema based integration Data Discovery Data Integration Determine the statistical signatures of both volcanic and solar forcings on the height of the tropopause Integrated Concept Map Overall Packaging of Ontologies Detailed Packaging of Ontologies From GEON ontologies From SWEET ontologies From VSTO, SESDI, and SWEET ontologies Leads to re-factorization of SWEET: Electromagnetic Radiation in a separate ontology Processes decomposed into mechanical, electrical, radiative, chemical, thermal, etc. EarthRealm decomposed into horizontal and vertical realms Ontologies are also involved with the data registration procedures at different levels of granularity: SESDI Ontology Package GEON (for planetary) and SWEET (for atmosphere) Ontology Packages VSTO Data Product Ontology Package Leveraged Semantic Data Framework from VSTO Use of VSTO Instrument Ontology Leads to further population of VSTO instruments: Add the sets of instruments specific to volcano and atmospheric measuring/ monitoring Specific modularization requirements are still evolving, e.g. all instruments in one ontology packages but no instances, or separate instruments and instances into distinct modules. • World Volcano database (WOVODat) USGS and community • NOVAC - Network for Observation of Volcanic and Atmospheric Change a network for measurements of the emissions of gases -- in particular SO2 and BrO -- and aerosols by volcanoes network encompasses observatories of 15 volcanoes from five continents, including some of the most active and strongest degassing volcanoes in the world • SCIAMACHY imaging spectrometer whose primary mission objective is to perform global measurements of trace gases in the troposphere and in the stratosphere (also ideally suited for the detection of clouds and aerosols). Added bonus - also measures solar irradiance Long Valley Caldera - Underlying metadata and data sources Initial development of distinct ontology efforts - SWEET, GEON, and VSTO used groupings of concepts in files for developing the ontologies SWEET used realm, physical phenomenon, units, etc. GEON used rocks, chemical species, etc. VSTO used core and community imports, e.g. instruments were in two files, one for solar physics, one for aeronomy When used for discipline oriented applications, like search, access and use, these forms were adequate When the application - data integration - required broad concept and relation integration, drawing on a variety of component concepts, and thus, the creation of a package - SWEET, GEON and VSTO needed to reform and modularize their ontologies in a more compatible and re-useable form Future: further modularization and refinement of an overall framework for ontology package population based on components Discussion and Present Lessons Learned: Future Directions Aeronomy instrument Aerosol analysis instruments ElementalConcentration element : El ement c oncentration : V al ueWit hUni t errorOfConcentration : ValueWithUnit Gas Rock = Petrology Fluid Need to add ... Solid NaturalMineral name : String size : ValueWithUnit has_cleavage : Boolean is_fractured : Boolean is_oriented : Boolean is_twinned : Boolean is_altered : Boolean is_primary : Boolean is_secondary : Boolean is_accessory : Boolean is_heavy_mineral : Boolean is_reworked : Boolean is_zoned : Boolean color : String density : ValueWithUnit age : String crystalline : Boolean amorphous : Boolean striations : Boolean hardness : Double indexOfFraction : Double fusibility : ValueWithUnit thermalConductivity : ValueWithUnit expansion : ValueWithUnit specificHeat : ValueWithUnit electricalConductivity : ValueWithUnit taste : String odor : String luster : String (from Mine... Rock color_i ndex : String is_monomineralic : Boolean is_polymetamorphic : Boolean is_polydefor med : Boolean (f ro m R.. 0..n 1 0..n 1 For registration of Physical Property data: The user needs to first select a Mineral or Rock Sample then select the attribute from the physical property list under StateOfMatter Liquid Water pH eH Additional properties to be added ChemicalCompound St ateOfMatt er density : ValueWithUnit resistivity : ValueWithUnit anisotropy : ValueWithUnit seismic_velocity : ValueWithUnit compressive_strength : ValueWithUnit magnetic_susceptibility : ValueWithUnit thermal_conductivity : ValueWithUnit other_physical_property : ValueWithUnit Temperature : ValueWithUnit Pressure : ValueWithUnit thermalCoefficientofExpansion : ValueWithUnit Micronex Classific... NormativeMineral name : String symbol : String fomula : String gramEquiv i lentWeight : Double Need a Kepler workflow to work in the background: Normative Vs. Modal PlanetaryMaterial Element name : String atomicWeight : ValueWithUnit symbol : String atomicNumber : Integer color : String atomicRadius : ValueWithUnit meltingPoint : ValueWithUnit boilingPoint : ValueWithUnit density : ValueWithUnit specificHeat : ValueWithUnit ionizationPotential : ValueWithUnit electronAffinity : ValueWithUnit heatOfFusion : ValueWithUnit heatOfVaporization : ValueWithUnit bondingRadius : ValueWithUnit electronNegativity : Double (from Elem... 1..n 1 1..n 1 AnalyticalOxide 1..n 1 1..n 1 RockSample ssID : String fieldID : String age : String dateOfCollection : String image analyticMethod : String (from R... IsotopeAbundance isotope : Is otope abundance : Val ueWit hUni t errorOfAbundance : ValueWithUni t 1..n 1 1..n 1 IsotopeRatio value : Double is_radiogenic : Boolean is_common : Boolean is_stable : Boolean error : ValueWithUnit (from Elem... 1..n 1 1..n 1 1..n 1 1..n 1 Analyt ic alOxi deConcentrat ion analyticalOxide : AnalyticalOxide concentration : ValueWithUnit errorOfConcentration : ValueWithUnit 1..n 1 1..n 1 1..n 1 1..n 1 NormativeMineralAbundance normativeMineral : NormativeMineral abundance : ValueWithUnit 1..n 1 1..n 1 MineralSample minID : String image (from Mine... 1..n 1 1..n 1 1.. n 1 1.. n 1 1..n 1 1..n 1 1..n 1 1..n 1 1..n 1 1..n 1 Minerals Instance of Normative Minerals : Basic Analyti...

Upload: dangquynh

Post on 03-May-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Developing packages and integrating ontologies for ...sesdi.hao.ucar.edu/docs/IN53B-1204_AGUFM07_SESDI_package.pdfData Discovery Data Integration Determine the statistical signatures

Developing packages and integrating ontologies for Volcanoes, Plate Tectonics and Atmospheric Science Data Integration (IN53B-1204) (sesdi.hao.ucar.edu)Krishna Sinha1, Robert Raskin2, Deborah McGuinness3,4,5 ([email protected]), Peter Fox6 ([email protected]), (1Virginia Polytechnic University) (2JPL/NASA) (3McGuinness Associates) (4Rensselaer Polytechnic Inst.) (5Stanford University, Knowledge Systems, AI Lab), (6HAO/NCAR) . Funded by NASA/ESTO/ACCESS.

Abstract

In support of a NASA-funded scientific application (SESDI; Semantically Enabled Science Data Integration Project; that needs to share volcano and climate data to investigate relationships between volcanism and global climate, we have generated a volcano and plate tectonic ontologies and leveraged and augmented the existing SWEET (Semantic Web for Earth and Environmental Terminology) ontology. Our goal is to create a package for integrating the relevant ontologies (meant to be shared and reused by a broad community of users) to provide access to the key volcanology, plate tectonic and atmospheric related databases. We present how we have put ontologies to work in this science application setting, and the methodologies employed to create the ontologies, map them to the underlying data and implement them for use by scientists. SESDI is an NASA/ESTO/ACCESS-funded project involving the High Altitude Observatory at the National Center for Atmospheric Research (NCAR), McGuinness Associates Consulting, NASA/JPL and Virginia Polytechnic University.

Data Registration Framework

Level 1:

Data Registration at Discovery Level,

e.g. Volcanolocation and activity

Level 2:

Data Registration at Inventory Level, e.g. list of datasets,

times, products

Level 3:

Data Registration at Item Detail

Level, e.g. access toindividual quantities

Ontology basedData IntegrationUsing scientific

workflows

Earth Sciences Virtual DatabaseA Data Warehouse where

Schema heterogeneity problem is Solved; schema based integration

Data Discovery Data Integration

Determine the statistical signatures of both volcanic and solar forcings on the height of the tropopause

Integrated Concept Map

Overall Packaging of Ontologies Detailed Packaging of Ontologies

From GEON ontologiesFrom SWEET ontologies

From VSTO, SESDI, and SWEET ontologies

Leads to re-factorization of SWEET:• Electromagnetic Radiation in a separate ontology• Processes decomposed into mechanical, electrical, radiative, chemical, thermal, etc.• EarthRealm decomposed into horizontal and vertical realms

Ontologies are also involved with the data registration procedures at different levels of granularity:

SESDI Ontology Package

GEON (for planetary) and SWEET (for atmosphere) Ontology Packages

VSTO Data Product Ontology Package

Leveraged Semantic Data Framework from VSTO

Use of VSTO Instrument Ontology

Leads to further population of VSTO instruments:• Add the sets of instruments specific to volcano and

atmospheric measuring/ monitoring• Specific modularization requirements are still

evolving, e.g. all instruments in one ontology packages but no instances, or separate instruments and instances into distinct modules.

• World Volcano database (WOVODat)– USGS and community

• NOVAC - Network for Observation of Volcanic and Atmospheric Change– a network for measurements of the emissions of gases -- in particular SO2 and

BrO -- and aerosols by volcanoes– network encompasses observatories of 15 volcanoes from five continents,

including some of the most active and strongest degassing volcanoes in the world

• SCIAMACHY– imaging spectrometer whose primary mission objective is to perform global

measurements of trace gases in the troposphere and in the stratosphere (also ideally suited for the detection of clouds and aerosols).

– Added bonus - also measures solar irradiance• Long Valley Caldera -

Underlying metadata and data sources

Initial development of distinct ontology efforts - SWEET, GEON, and VSTO used groupings of concepts in files for developing the ontologiesSWEET used realm, physical phenomenon, units, etc.GEON used rocks, chemical species, etc.VSTO used core and community imports, e.g. instruments were in two files, one for solar physics, one for aeronomy

When used for discipline oriented applications, like search, access and use, these forms were adequate

When the application - data integration - required broad concept and relation integration, drawing on a variety of component concepts, and thus, the creation of a package - SWEET, GEON and VSTO needed to reform and modularize their ontologies in a more compatible and re-useable form

Future: further modularization and refinement of an overall framework for ontology package population based on components

Discussion and Present Lessons Learned: Future Directions

Aeronomy instrument

Aerosol analysis instruments

ElementalConcentrationelement : El ementconcentration : Val ueWithUni terrorOfConcentrat ion : ValueWithUnit

Gas

Rock = Petrology

Fluid

Need to add ...

Solid

NaturalMineral

name : Stringsize : ValueWithUnithas_cleavage : Booleanis_fractured : Booleanis_oriented : Booleanis_twinned : Booleanis_altered : Booleanis_primary : Booleanis_secondary : Booleanis_accessory : Booleanis_heavy_mineral : Booleanis_reworked : Booleanis_zoned : Booleancolor : Stringdensity : ValueWithUnitage : Stringcrystalline : Booleanamorphous : Booleanstriations : Booleanhardness : DoubleindexOfFraction : Doublefusibility : ValueWithUnitthermalConductivity : ValueWithUnitexpansion : ValueWithUnitspecificHeat : ValueWithUnitelectricalConductivity : ValueWithUnittaste : Stringodor : Stringluster : String

(from Mine...

Rock

color_i ndex : Stringis_monomineralic : Booleanis_polymetamorphic : Booleanis_polydefor med : Boolean

(f ro m R.. .

0..n

1

0..n

1

For registration of Physical Property data: The user needs to first select a Mineral or Rock Sample then select the attribute from the physical property list under StateOfMatter

Liquid

WaterpHeH

Additional properties to be added

ChemicalCompound

StateOfMatterdensity : ValueWithUnitresistivity : ValueWithUnitanisotropy : ValueWithUnitseismic_velocity : ValueWithUnitcompressive_strength : ValueWithUnitmagnetic_susceptibility : ValueWithUnitthermal_conductivity : ValueWithUnitother_physical_property : ValueWithUnitTemperature : ValueWithUnitPressure : ValueWithUnitthermalCoefficientofExpansion : ValueWithUnit

Micronex Classific...

NormativeMineralname : Stringsymbol : Stringfomula : StringgramEquivi lentWeight : Double

Need a Kepler workflow to work in the background: Normative Vs. Modal

PlanetaryMaterial

Element

name : StringatomicWeight : ValueWithUnitsymbol : StringatomicNumber : Integercolor : StringatomicRadius : ValueWithUnitmeltingPoint : ValueWithUnitboilingPoint : ValueWithUnitdensity : ValueWithUnitspecificHeat : ValueWithUnitionizationPotential : ValueWithUnitelectronAffinity : ValueWithUnitheatOfFusion : ValueWithUnitheatOfVaporization : ValueWithUnitbondingRadius : ValueWithUnitelectronNegativity : Double

(from Elem...

1..n

1

1..n

1

AnalyticalOxide

1..n

1

1..n

1

RockSample

ssID : StringfieldID : Stringage : StringdateOfCollection : StringimageanalyticMethod : String

(from R...

IsotopeAbundanceisotope : Isotopeabundance : Val ueWithUni terrorOfAbundance : ValueWithUni t

1..n

1

1..n

1

IsotopeRatio

value : Doubleis_radiogenic : Booleanis_common : Booleanis_stable : Booleanerror : ValueWithUnit

(from Elem...1..n

1

1..n

1

1..n

1

1..n

1

Analyt icalOxi deConcentrat ionanalyticalOxide : AnalyticalOxideconcentration : ValueWithUniterrorOfConcentration : ValueWithUnit

1..n

1

1..n

1

1..n1

1..n1

NormativeMineralAbundancenormativeMineral : NormativeMineralabundance : ValueWithUnit

1..n

1

1..n

1

MineralSample

minID : Stringimage

( from Mine...

1..n

1

1..n

1

1..n

1

1..n

1

1..n

1

1..n

1

1..n

1

1..n

1 1..n

1

1..n

1

Minerals

Instance of Normative Minerals : Basic Analyti...