documentation foundation spectrum relational tables xml/relational database (with some fields) xml...

19
Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database Component Repository, Objects / Xforms Micro Interfaces, SemanticWiki Workflows Record Cache Record Repository

Upload: anthony-shields

Post on 13-Jan-2016

226 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

Documentation Foundation Spectrum

Relational TablesRelational Tables

XML/Relational Database (with some fields)XML/Relational Database (with some fields)

XML Blobs (with some fields)XML Blobs (with some fields) File SystemsFile Systems

XML Blobs in DatabaseXML Blobs in Database

Component Repository,Objects / XformsMicro Interfaces, SemanticWikiWorkflows

Record CacheRecord Repository

Page 2: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

Relational TablesRelational Tables

XML/Relational Database (with some fields)XML/Relational Database (with some fields)

XML Blobs (with some fields)XML Blobs (with some fields) File SystemsFile Systems

XML Blobs in DatabaseXML Blobs in Database

Published RecordsPublished Records

XSLT to Multiple StandardsXSLT to Multiple Standards

Harvesting to GCMD, GOSHarvesting to GCMD, GOS

Z39.50, FirstGov, GoogleZ39.50, FirstGov, Google

WWWWWW

CLASSCLASS

NMMRNMMR

GeoNetworkGeoNetwork

Extents / NOSAExtents / NOSA

SPEEDSSPEEDS

NGDC TrackingNGDC Tracking

Rich InventoryRich InventorySNAAPSNAAP

eXisteXist

Documentation@ NGDC

Component Repository,Objects / XformsMicro Interfaces, SemanticWikiWorkflows

Record CacheRecord Repository

GeoToolkitGeoToolkit

Page 3: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

RecordRepository

RecordRepository Documents

(PDF, HTML, …)

Documents(PDF, HTML,

…)

Rich Inventory

Rich Inventory

Relational Databases

RecordCache

RecordCache

Files

Documentation Repository

Page 4: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

RecordRepository

RecordRepository

Documents(PDF, HTML,

XML, …)

Documents(PDF, HTML,

XML, …)

Rich Inventory

Rich Inventory

Relational Databases

RecordCache

RecordCache Export

XML, HTML,Reports, Graphics

SQL, Web Queries HTML, XML

SemanticWiki,GeoNetwork,CRUD Tools,

FTP

XML Web Services

Users

Clients

WWW

Catalogs and Granules

Catalogs and Granules

Communication

Page 5: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

DocumentationRepositoryISO 19115,

19115-2, 19119 and extensions

DocumentationRepositoryISO 19115,

19115-2, 19119 and extensions

THREDDSTHREDDS

netCDFnetCDF

DIF, FGDCDIF, FGDC

SensorMLSensorML

WCS, WMS, WFS, SOS

WCS, WMS, WFS, SOS

ISOISO

KMLKML

Multiple Documentation Dialects

Page 6: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

netCDFDatasets

THREDDS Catalogs

netCDFDatasets

THREDDS Catalogs

netCDF Datasets

THREDDS Catalogs

netCDF Datasets

THREDDS Catalogs

DocumentationRepository

DocumentationRepository

Maximizing Benefit of Existing Documentation – Connection/Translation

MetadataDIF, FGDC, ISO

MetadataDIF, FGDC, ISOMetadata

DIF, FGDC, ISOMetadata

DIF, FGDC, ISO

Page 7: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

MD_ContentInformation

MD_Metadata

+ fileIdentifier [0..1] : CharacterString+ language [0..1] : CharacterString+ characterSet [0..1] : MD_CharacterSetCode = "utf8"+ parentIdentifier [0..1] : CharacterString+ hierarchyLevel [0..*] : MD_ScopeCode = "dataset"+ hierarchyLevelName [0..*] : CharacterString+ contact [1..*] : CI_ResponsibleParty+ dateStamp : Date+ metadataStandardName [0..1] : CharacterString+ metadataStandardVersion [0..1] : CharacterString+ dataSetURI [0..1] : CharacterString+ locale [0..*] : PT_Locale

MD_ReferenceSystem

MD_DataQuality

MD_SpatialRepresentation

MD_Identification

MD_MaintenanceInformation

MD_Distribution

MD_ApplicationSchemaInformationMD_PortrayalCatalogueReference

MD_Constraints

MD_MetadataExtensionInformation

MD_ContentInformation

Page 8: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

MD_RangeDimension

+ sequenceIdentifier [0..1] : MemberName + descriptor [0..1] : CharacterString MD_ImageDescription

+ illuminationElevationAngel [0..1] : Real + illuminationAzimuthAngel [0..1] : Real + imagingCondition [0..1] : MD_ImagingConditionCode + imageQualityCode [0..1] : MD_Identifier + cloudCoverPercentage [0..1] : Real + processingLevelCode [0..1] : MD_Identifer + compressionGenerationQuantity [0..1] : Integer + triangulationIndicator [0..1] : Boolean + radiometricCalibrationDataAvailability [0..1] : Boolean + cameraCalibrationInformationAvailability [0..1] : Boolean + filmDistortionInformationAvailability [0..1] : Boolean + lensDistortionInformationAvailability [0..1] : Boolean

MD_FeatureCatalogDescription

+ complianceCode [0..1] : Boolean + language [0..*] : CharacterString + includeWithDataset : Boolean + featureTypes [0..*] : GenericName + featureCatalogueCitation [1..*] : CI_Citation

MD_ContentInformation

MD_Metadata+contentInfo 0..*

<<Abstract>>MD_ContentInformation

MD_Band

+ maxValue [0..1] : Real+ minValue [0..1] : Real + units [0..1] : UomLength+ peakResponse [0..1] : Real + bitsPerValue [0..1] : Integer + toneGradation [0..1] : Integer + scaleFactor [0..1] : Real + offset [0..1] : Real

<<CodeList>>MD_CoverageContentTypeCode

+ image + thematicClassification + physicalMeasurement

<<CodeList>>MD_ImagingConditionCode

+ blurredImage + cloud + degradingObliquity + fog + heavySmokeOrDust + night + rain + seminDarkness + shadow + snow + terrainMasking

MD_CoverageDescription

+ attributeDescription : RecordType + contentType : MD_CoverageContentTypeCode

+dimension 0..*

Page 9: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

MD_RangeDimension

+ sequenceIdentifier [0..1] : MemberName + descriptor [0..1] : CharacterString

MI_ContentInformation 19115-2

+contentInfo 0..*

MD_Band

+ maxValue [0..1] : Real+ minValue [0..1] : Real + units [0..1] : UomLength+ peakResponse [0..1] : Real + bitsPerValue [0..1] : Integer + toneGradation [0..1] : Integer + scaleFactor [0..1] : Real + offset [0..1] : Real

<<CodeList>>MD_CoverageContentTypeCode

+ image + thematicClassification + physicalMeasurement

+dimension 0..*

MI_CoverageDescription

MI_RangeElementDescription

+ name : CharacterString+ definition : CharacterString+ rangeElement[1..*] : Record

+rangeElementDescription 0..*

MD_CoverageDescription

+ attributeDescription : RecordType + contentType : MD_CoverageContentTypeCode

MD_Metadata <<Abstract>>MD_ContentInformation

Page 10: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

MD_Band

+ sequenceIdentifier [0..1] : MemberName + description [0..1] : CharacterString+ maxValue [0..1] : Real+ minValue [0..1] : Real + units [0..1] : UomLength+ peakResponse [0..1] : Real + bitsPerValue [0..1] : Integer + toneGradation [0..1] : Integer + scaleFactor [0..1] : Real + offset [0..1] : Real

MI_CoverageDescriptions

MI_Metadata

+contentInfo 0..*

<<CodeList>>MD_CoverageContentTypeCode

+ image + thematicClassification + physicalMeasurement

MI_CoverageDescription

+ attributeDescription : RecordType + contentType : MD_CoverageContentTypeCode

+dimension 0..*

MI_RangeElementDescription

+ name : CharacterString+ definition : CharacterString+ rangeElement[1..*] : Record

+rangeElementDescription 0..*

Layers, Member = Unique Identifier

Flags

Attribute Description / Type

New Types:referenceInformationqualityInformationauxilliaryDatamodelResult

<attributeDescription xlink:href=http://motherlode.ucar.edu:8081/thredds/cdmremote/fmrc/NCEP/GFS/CONUS_191km/files/GFS_CONUS_191km_20091111_0000.grib1?req=ncml/>

Page 11: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

Simple multi-layered dataset (SST50)

Physical Measurements:Analysis Temperature (sea_surface_temperature)Gradient X+, X-, Y+, Y-, AverageIce Field

Quality Information:Age Of Most Recent ObservationNumber of ObservationsClass 1 CoverageReliability

Auxilliary Data:Physiographic Desc. (land_binary_mask)Spatial Covariance X+, X-, Y+, Y-

Quality Reports:Lineage / ProcessingProcessing Parameters 1Processing Parameters 2Mission History

contentInfo 2contentInfo3

contentInfo 1

dataQuality 1dataQuality 2dataQuality 3dataQuality 4

dataQuality 5dataQuality 6dataQuality 7dataQuality 8

Page 12: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

netCDFnetCDF

Data Set Description (DSD) / Archive

Data Set Description (DSD) / Archive

Collection MetadataCollection Metadata

Temporal and spatial extentTemporal and spatial extent

--- ISO Record ---DS_Series composedOf DS_DataSet has MI_MetadataseriesMetadata MI_Metadata

--- ISO Record ---DS_Series composedOf DS_DataSet has MI_MetadataseriesMetadata MI_Metadata

netCDF (NCML)netCDF (NCML) 3D Coverages3D Coverages

Combining Collection and Granule Information

Collection InformationGranule Information

Page 13: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

Metadata Structures – THREDDS and ISO

<catalog><dataset>

<metadata></metadata><dataset urlPath=“accessPath”>

<metadata></metadata></dataset>

</dataset></catalog>

Collection datasetCollection metadata

Direct datasetDirect metadata

<DS_Series><subset>

<DS_Series><DS_DataSet>

<MI_Metadata></MI_Metadata></DS_DataSet ><seriesMetadata>

<MI_Metadata></MI_Metadata></seriesMetadata>

</DS_Series></subset>

</DS_Series>

Collection metadata

Direct metadata

Page 14: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

DQ_Lineage (19115-2)

MD_Metadata

DQ_DataQuality

+ scope : DQ_Scope+ lineage 0..1

LI_Lineage

+ statement [0..1] : CharacterString

LE_Source

+ description [0..1] : CharacterString+ scaleDenominator [0..1] : MD_RepresentativeFraction+ sourceReferenceSystem [0..1] : MD_ReferenceSystem+ sourceCitation [0..1] : CI_Citation+ sourceExtent [0..*] : EX_Extent+ processedLevel[0..1] : MD_Identifier+ resolution[0..1] : LE_NominalResolution

LE_ProcessStep

+ description : CharacterString+ rationale [0..1] : CharacterString+ dateTime [0..1] : DateTime+ processor [0..*] : CI_ResponsibleParty

+ source 0..*+ processStep 0..*

+ output, source0..*

If(count(source) + count(processStep) =0) and(DQ_DataQuality.cope.level = 'dataset' or 'series')then statement is mandatory

LE_Processing

+ identifier : MD_Identifier+ softwareReference[0..*] : CI_Citation+ procedureDescription[0..1] : CharacterString+ documentation[0..*] : CI_Citation+ runTimeParameters[0..1] : CharacterString

LE_Algorithm

+ citation: CI_Citation+ description : CharacterString

+ processingInformation0..*

+ algorithm 0..*

LE_ProcessStepReport

+ name : CharacterString+ description[0..1] : CharacterString+ fileType[0..1] : CharacterString

+ report 0..*

+ sourceStep0..*

<attribute type="string" name="history" value=" Process Step: Orbital processing: Input level 1 data, calibration, cloud screening, sst and optical thicknesses computations. Gridded products of aerosol observations are produced weekly from the afternoon satellite data."/>

Page 15: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

MD_Band

+ sequenceIdentifier [0..1] : MemberName + description [0..1] : CharacterString+ maxValue [0..1] : Real+ minValue [0..1] : Real + units [0..1] : UomLength+ peakResponse [0..1] : Real + bitsPerValue [0..1] : Integer + toneGradation [0..1] : Integer + scaleFactor [0..1] : Real + offset [0..1] : Real

MI_CoverageDescription (Grid Coverages)MI_Metadata

+contentInfo 0..*

<<CodeList>>MD_CoverageContentTypeCode

+ image + thematicClassification + physicalMeasurement

MI_CoverageDescription

+ attributeDescription : RecordType + contentType : MD_CoverageContentTypeCode

+dimension 0..*

MI_RangeElementDescription

+ name : CharacterString+ definition : CharacterString+ rangeElement[1..*] : Record

+rangeElementDescription 0..*

<variable type="float" shape="lat lon" name="Optical Thickness"> <attribute type="string" name="long_name" value="Optical Thickness"/> <attribute type="float" name="valid_min" value="0"/> <attribute type="float" name="valid_max" value="2440"/> <attribute type="string" name="units" value="Optical Thickness Unit X 1000"/></variable>

*Need multiple names with sources (MD_Identifiers)

Page 16: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

CI_CitationInformation

<<DataType>>CI_Series

+ name [0..1] : CharacterString + issueIdentification [0..1] : CharacterString + page [0..1] : CharacterString

<<DataType>>CI_Citation

+ title : CharacterString + alternateTitle [0..*] : CharacterString + date [1..*] : CI_Date + edition [0..1] : CharacterString + editionDate [0..1] : Date + identifier [0..*] : MD_Identifier + citedResponsibleParty [0..*] : CI_ResponsibleParty + presentationForm [0..*] : CI_PresentationFormCode + series [0..1] : CI_Series + otherCitationDetails [0..1] : CharacterString + collectiveTitle [0..1] : CharacterString + ISBN [0..1] : CharacterString + ISSN [0..1] : CharacterString

<<CodeList>>CI_DateTypeCode

+ creation+ publication+ revision

<<DataType>>CI_Date

+ date : Date + dateType : CI_DateTypeCode

<<CodeList>>CI_PresentationFormCode

+ documentDigital + modelHardcopy + documentHardcopy + profileDigital + imageDigital + profileHardcopy + imageHardcopy + tableDigital + mapDigial + tableHardcopy + mapHardcopy + videoDigital + modelDigital + videoHardcopy

<<DataType>>CI_ResponsibleParty

+ individualName [0..1] : CharacterString+ organisationName [0..1] : CharacterString+ positionName [0..1] : CharacterString+ role : CI_RoleCode

+ citedResponsibleParty 0..*

<<DataType>>CI_Contact

+ hoursOfService [0..1] : CharacterString+ contactInstructions [0..1] : CharacterString+ onlineResource [0..1]: CI_OnlineResource

+ contactInfo 0..1 <attribute type="string" name="title“value="Aerosol Optical Thickness (100 KM)"/><attribute type="string" name="institution" value="DOC/NOAA/NESDIS/OSDPD >Office of Satellite Data Processing and Distribution, NESDIS, NOAA, U.S. Department of Commerce"/>

Page 17: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

ncISO Architecture

THREDDS Catalogs

THREDDS Catalogs

file1.ncfile1.nc file2.ncfile2.nc file3.ncfile3.nc file4.ncfile4.nc file1.ncfile1.nc file.ncfile.ncfile.ncfile.nc file.ncfile.ncfile2.ncfile2.nc file.ncfile.nc

THREDDS Catalogs

THREDDS Catalogs

THREDDS Catalogs

THREDDS Catalogs

Extract Dataset URLs (many times files) Extract Dataset URLs (many times files)

Extract NcMLExtract NcML

Extract DataExtract Data

Documentation RepositoryDocumentation Repository

RecordRepository

RecordRepository

Documents(PDF, HTML,

…)

Documents(PDF, HTML,

…)Rich Inventory

Rich Inventory

Relational Databases

RecordCache

RecordCache

Files

CalculatorCalculator

http://motherlode.ucar.edu:8081/thredds/cdmremote/fmrc/NCEP/GFS/CONUS_191km/files/GFS_CONUS_191km_20091111_0000.grib1?req=ncml

Page 18: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

19115 Revisions

Reviewed and developed tentative response to 150+ review comments~120 accepted or accepted in principle

Backward compatibility must be maintained (what does that mean?)

Developed several focus groups:Hierarchy level and aggregated datasets - John Hockaday, AU.Use of class identifiers, referencing classes - Jean Brodeur, CAIncorporation of expanded Service metadata - Nicolas Lesage, DGIWG

Schedule:•CD 2010-06•DIS 2010-12•FDIS 2011-12•IS 2012-06

Page 19: Documentation Foundation Spectrum Relational Tables XML/Relational Database (with some fields) XML Blobs (with some fields) File Systems XML Blobs in Database

Scientific QuestionRequirementUse Case

Scientific QuestionRequirementUse Case

Metadata ContentIndependent of standardMetadata ContentIndependent of standard

Standard Implementation / GuidanceISO 191*OGC CapabilitiesFGDC W/ extensionsNASA DIFDublin Core

Standard Implementation / GuidanceISO 191*OGC CapabilitiesFGDC W/ extensionsNASA DIFDublin Core

PresentationAnswersXSLT, …

PresentationAnswersXSLT, …

Metadata ManagementInterface (xForms, wiki)Create / Read /Update / Delete

Metadata ManagementInterface (xForms, wiki)Create / Read /Update / Delete

Data Providers / Scientists Metadata Standards Experts Technologists

Do you need different documentation for different parts of your data? Do you need different documentation for different temporal and spatial subsets? Do you have datasets with multiple sources? Do you need to reference On-Line Resources? Do you need to describe a series of related granules? Do you need to describe many kinds of aggregations? Does data quality vary within the dataset? Do you need to track processing for multiple data sources? Do you need to track compliance with standards? Do you need to use spatial features to describe quality, like grids of quality flags? Do you need to explain why you did things to the data? Do you have datasets in multiple locations? Do you need to describe instruments used to make observations? Do you need to unambiguously identify things using your own namespace? Do you want to manage metadata using a relational or XML database? Do you want to serve metadata using a REST web service? Do you need to identify people in different roles? Do you need to keep track of user problems? Do you need to explain why you did things to the data? Do you need to track requirements and plans? Do you need to share data with international partners? Do you need to describe data formats and structures? Do you need to track data transformations and processing? What might the entity and attributes look like for station level metadata?