building a data sharing community

23
Building a Data Sharing Community

Upload: ash

Post on 24-Feb-2016

24 views

Category:

Documents


0 download

DESCRIPTION

Building a Data Sharing Community. In collaboration with. The Vertebrate Networks. Facilitate open access to specimen data on the web Enhance the value of specimen collections Conserve curatorial resources Use a design easily adapted by other disciplines with similar needs. Primary Goals. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Building a Data Sharing Community

Building a Data Sharing Community

Page 2: Building a Data Sharing Community

The Vertebrate Networks

In collaboration with

Page 3: Building a Data Sharing Community

Primary Goals

Facilitate open access to specimen data on the web

Enhance the value of specimen collections

Conserve curatorial resources

Use a design easily adapted by other disciplines with similar needs

Page 4: Building a Data Sharing Community

Critical Challenge #1

Performance

Page 5: Building a Data Sharing Community

Critical Challenge #1

Page 6: Building a Data Sharing Community

Critical Challenge #1

Page 7: Building a Data Sharing Community

Critical Challenge #2

Performance

Aggregation

Page 8: Building a Data Sharing Community

Critical Challenge #3

Performance

Aggregation

Costs and Sustainability

Page 9: Building a Data Sharing Community

Critical Challenge #3

~ $200k annually reduced to ~ $20k annually

Page 10: Building a Data Sharing Community

All you need is a Darwin Core Archive

Create your DwC-A or we'll do it for you

Publish it yourself or we'll host it for you

No servers, no extra IT expertise needed

Easy

Critical Challenge #3

Page 11: Building a Data Sharing Community

Critical Challenge #4

Performance

Aggregation

Costs and Sustainability

Technological Integration

Page 12: Building a Data Sharing Community

Big Data157+ institutions + 377+ collections

= ~100M records and growing

Technical Challenge:Downloading, aggregating, caching, and

serving these data from the cloud

Technical Solution:"Gulo": aggregates archives in the cloud

Critical Challenge #4

Page 13: Building a Data Sharing Community
Page 14: Building a Data Sharing Community

Visualization: VertNet & CartoDB

Page 15: Building a Data Sharing Community

Opening Doors to Innovation

Page 16: Building a Data Sharing Community

32 institutions (79 collections) are up19 institutions (44 collections) in process106 institutions (228 collections) waiting

In CartoDB to date (44 archives):3,367,773 records processed1,606,374 mappable records

228,270 distinct, mappable coordinates162,077 distinct scientific names

Progress So Far...

Page 17: Building a Data Sharing Community

2012-2013:• Finish transitioning current networks into

VertNet• 2012-2013: Develop User Interface for data

searching• 2012-2013: Integrate with other partners

and projects2013-2014:

• Develop tools for visualization, discovery, and improvement (annotations, thesaurus, phylogenetic browser)

• Sustainability Workshop

Moving Forward

Page 18: Building a Data Sharing Community

Dave Bloom - VertNet [email protected]

Laura Russell - VertNet [email protected]

Carla Cicero - VertNet [email protected]

Page 19: Building a Data Sharing Community
Page 20: Building a Data Sharing Community

All Aves

Page 21: Building a Data Sharing Community

Field Museum of Natural History

Page 22: Building a Data Sharing Community

Hyla regilla

Page 23: Building a Data Sharing Community

Hyla regilla