leinfelder earth grid jam2008

19
EarthGrid Sharing data across networks Ben Leinfelder National Center for Ecological Analysis and Synthesis, University of California Santa Barbara JAM 2008 October 22th, 2008

Upload: leinfelder

Post on 18-May-2015

510 views

Category:

Education


0 download

DESCRIPTION

The EarthGrid network gives researchers from around the world seamless and persistent access to shared environmental data that are valuable for synthesis and analysis. Researchers use a consistent system to locate and download datasets housed in disparate, loosely-coupled repositories that have committed to providing a standard communication interface. Evolution of the existing EarthGrid network is underway to increase the breadth and depth of available data.

TRANSCRIPT

Page 1: Leinfelder Earth Grid Jam2008

EarthGridSharing data across networks

Ben Leinfelder

National Center for Ecological Analysis and Synthesis, University of California Santa Barbara

JAM 2008October 22th, 2008

Page 2: Leinfelder Earth Grid Jam2008

“Provide access to disparate data on different networks.”

Page 3: Leinfelder Earth Grid Jam2008

Knowledge Network for Biocomplexity

Page 4: Leinfelder Earth Grid Jam2008

• Distributed data system

• Archive data and metadata

Metacat Data Repository

attr1 | attr2 .... | .... .... | .... .... | .... .... | .... .... | ....

attr1 | attr2 .... | .... .... | .... .... | .... .... | .... .... | ....

attr1 | attr2 .... | .... .... | .... .... | .... .... | .... .... | ....

<eml>

<dataset>

..........

</dataset>

</eml>

<eml>

<dataset>

..........

</dataset>

</eml>

<eml>

<dataset>

..........

</dataset>

</eml>

Page 5: Leinfelder Earth Grid Jam2008

Global Metacat Deployments

Page 6: Leinfelder Earth Grid Jam2008

SouthAfrican

DataNetwork

Mozambique

Mapungubwe

MarakeleKrugerSAEON

Grahamstown

Cape TownSan ParksWilderness

Cape Town U

Addo

Karoo

Tsitsikama Phalabora

Savannah ClusterMarine Cluster

KNB 1KNB II

PISCOAND

... (26)

GCE LTER

NCEAS

ESA

OBFSKnowledge Network for

Biocomplexity (KNB)

Page 7: Leinfelder Earth Grid Jam2008

KNB Global Data Distribution

Page 8: Leinfelder Earth Grid Jam2008

Diverse Data Systems

• KNB Repository–Experimental data, survey data, spatial

raster and vector data–Ecological Metadata Language (EML)

• KU DiGIR–Museum specimen collection and

taxonomic information–Darwin Core

http://www.specifysoftware.org/Informatics/informaticsdigir

Page 9: Leinfelder Earth Grid Jam2008

attr1 | attr2 .... | .... .... | .... .... | .... .... | .... .... | ....

attr1 | attr2 .... | .... .... | .... .... | .... .... | .... .... | ....

attr1 | attr2 .... | .... .... | .... .... | .... .... | .... .... | ....

<eml>

<dataset>

..........

</dataset>

</eml>

<eml>

<dataset>

..........

</dataset>

</eml>

<eml>

<dataset>

..........

</dataset>

</eml>

Synthetic Data Analysis

Identify species

Morpho

Store specimen data

Publish specimen data

Document data Publish datasets

Store observation data

!

Page 10: Leinfelder Earth Grid Jam2008

• Light-weight interface to underlying systems

• Hide complexity

• Low threshold forimplementation

EarthGrid Data Providers

a

Ea

rth

Gri

d

b

c

Page 11: Leinfelder Earth Grid Jam2008

• Standard communication protocol• Common methods across systems

• Allows simplified data access by clients• Exposes data to more software

EarthGrid Data Consumers

EarthGrid

x

y

z

Page 12: Leinfelder Earth Grid Jam2008

The Usual Suspects

✓Search

✓Authenticate

✓Read

✓Write

EarthGridProvider

Page 13: Leinfelder Earth Grid Jam2008

<attribute id="att.5"> <attributeName>avesr91</attributeName> <attributeLabel>Average Species Richness for 1991</attributeLabel> <attributeDefinition>The average species richness for the field in 1991 </attributeDefinition> <storageType>float</storageType> <measurementScale> <ratio> <unit><standardUnit>dimensionless</standardUnit></unit> <precision>0.1</precision> <numericDomain id="nd.5"> <numberType>real</numberType> <bounds> <minimum exclusive="true">0</minimum> </bounds> </numericDomain> </ratio> </measurementScale> </attribute>

KNB Software Suite

Dat

a A

naly

sis

Dat

a St

orag

e

Dat

a M

anag

emen

t

Morpho

<eml/>M

etad

ata

You are here

Page 14: Leinfelder Earth Grid Jam2008

EarthGrid Search in Kepler

Page 15: Leinfelder Earth Grid Jam2008

Species Distribution Modeling

Page 16: Leinfelder Earth Grid Jam2008

Distribution Predictions

Current 2020 2050

Page 17: Leinfelder Earth Grid Jam2008

DataNetONE (Observation Network for Earth)

• ‘New institution’ for data preservation

Page 18: Leinfelder Earth Grid Jam2008

DataNetONE (Observation Network for Earth)

• Scalable. Flexible. Sustainable.

Current ????30+ years horizon

Page 19: Leinfelder Earth Grid Jam2008

Acknowledgements

• This material is based upon work supported by:

– The National Science Foundation under Grant Numbers 9980154 (KDI), 0618501 (FIRST) and 0225676 (SEEK).

– The National Center for Ecological Analysis and Synthesis, a Center funded by NSF (Grant Number 0072909), the University of California, and the UC Santa Barbara campus.

– The Andrew W. Mellon Foundation.

• Resources

– http://www.nceas.ucsb.edu/ecoinfo

– http://seek.ecoinformatics.org

– http://knb.ecoinformatics.org

– http://lno.lternet.edu/projects/pasta

– http://sbc.lternet.edu