processing of scientific data: from field capture to web delivery

Hector Quintero Casanova, Postgraduate in e-Science

Upload: hector-quintero-casanova

Post on 09-Jul-2015

DESCRIPTION

Short presentation on the lifecycle of scientific data and how it relates to the Glastir Monitoring and Evaluation Programme. The GMEP is effectively a "real-time" healthcheck system for the new Welsh agri-environment scheme Glastir.

TRANSCRIPT

Page 1: Processing of scientific data: from field capture to web delivery

Processing of scientific data

From field capture to web delivery

Hector Quintero Casanova

Postgraduate in e-Science

Page 2: Processing of scientific data: from field capture to web delivery

Why e-Science? Data-intensive

● GMEP ticks all the boxes:

✔ Highly multidisciplinary: social, landscape, water, birds, plants...

✔ Large volumes of data: covers the whole of Wales.

✔ Cross-organisational collaboration: 13 institutions.

Page 3: Processing of scientific data: from field capture to web delivery

Why e-Science? Metadata

● NERC's data policy says it all

– “It is essential that metadata are submitted”

● Metadata = context information about data

– Provenance = who, when, where, how

● Exposes data relationships → traceability

– Workflow = how. Essential if using models

● Enables reproducing outcome → repeatability

● Exactly what information depends on the stage.
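As a rough illustration of the provenance idea on this slide, the sketch below attaches who/when/where/how to each dataset and walks the "derived from" links to get traceability. All class names, field names and sample values are hypothetical and not taken from GMEP or NERC.

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone


@dataclass(frozen=True)
class Provenance:
    """Who, when, where and how for one dataset (hypothetical fields)."""
    who: str
    when: datetime
    where: str
    how: str  # the workflow: field method or model run that produced the data


@dataclass
class Dataset:
    name: str
    values: list
    provenance: Provenance
    derived_from: list = field(default_factory=list)  # names of parent datasets


def lineage(dataset, catalogue):
    """Trace a dataset back through its parents (traceability)."""
    chain = [dataset.name]
    for parent in dataset.derived_from:
        chain.extend(lineage(catalogue[parent], catalogue))
    return chain


# Made-up sample data, for illustration only.
raw = Dataset(
    "raw", [1.2, 1.4],
    Provenance("surveyor-a", datetime(2015, 7, 1, tzinfo=timezone.utc),
               "site-1", "field sampling"),
)
modelled = Dataset(
    "modelled", [1.3],
    Provenance("analyst-b", datetime(2015, 7, 2, tzinfo=timezone.utc),
               "office", "model run"),
    derived_from=["raw"],
)
catalogue = {d.name: d for d in (raw, modelled)}
trace = lineage(modelled, catalogue)
```

Here `trace` lists the modelled dataset followed by the raw dataset it came from, which is the relationship that provenance metadata is meant to expose.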

Page 4: Processing of scientific data: from field capture to web delivery

Data collection

● Raw data from the field

– Metadata: method, calibration, place, units...
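One way to picture the metadata that travels with a raw field reading is a simple completeness check at capture time. This is only a sketch: the required field names mirror the slide's list, while the record layout and the sample reading are assumptions.

```python
# Metadata fields named on the slide; the record layout is hypothetical.
REQUIRED_METADATA = {"method", "calibration", "place", "units"}


def missing_metadata(record):
    """Return the required metadata fields absent from a raw reading."""
    present = set(record.get("metadata", {}))
    return sorted(REQUIRED_METADATA - present)


# Made-up reading with incomplete metadata.
reading = {
    "value": 7.2,
    "metadata": {"method": "handheld probe", "units": "pH"},
}
gaps = missing_metadata(reading)
```

For this reading, `gaps` names the calibration and place fields, so the record could be flagged before it leaves the field.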

Page 5: Processing of scientific data: from field capture to web delivery

Data analysis

● Information products: e.g. data from models

– Metadata: name, conditions, where it applies

Page 6: Processing of scientific data: from field capture to web delivery

Data analysis

● Workflow metadata avoids costly reruns

– Identify model output needed → reuse

● But not enough for cross-organisational collaboration.

– 13 institutions in Glastir.

– Differences in storage structure, metadata definitions...

● Need extra layer(s) for seamless access

– Web already offers tools needed.
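The reuse point on this slide can be sketched as a lookup keyed on workflow metadata: if a model has already been run with the same name and parameters, the stored output is returned instead of rerunning. The function names, the in-memory store and the toy model are all assumptions made for illustration.

```python
import hashlib
import json

_outputs = {}  # workflow key -> stored model output (in-memory stand-in)


def workflow_key(model_name, parameters):
    """Derive a stable key from the workflow metadata of a model run."""
    payload = json.dumps(
        {"model": model_name, "parameters": parameters}, sort_keys=True
    )
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def run_model(model_name, parameters, model):
    """Run the model only if no output exists for this workflow metadata."""
    key = workflow_key(model_name, parameters)
    if key not in _outputs:
        _outputs[key] = model(**parameters)
    return _outputs[key]


calls = []  # records each real execution of the toy model


def toy_model(rainfall, slope):
    """Hypothetical stand-in for an expensive model."""
    calls.append((rainfall, slope))
    return rainfall * slope


first = run_model("runoff", {"rainfall": 10, "slope": 2}, toy_model)
second = run_model("runoff", {"slope": 2, "rainfall": 10}, toy_model)
```

Both calls return the same output, but `toy_model` executes only once because the second call matches the recorded workflow metadata. Sorting the keys before hashing means parameter order does not affect the match.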

Page 7: Processing of scientific data: from field capture to web delivery

Publication: linked data

● HTTP for generic retrieval of resources

● URIs for unique identification of those resources

– E.g. http://www.ceh.ac.uk

● Both can be used to build web services

– These amount to remote functions.

– E.g. seamless recording of workflows across institutions.

● Semantics for automated reasoning

– Acts as standardised metadata aimed at machines.
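To make the linked-data ingredients concrete, the sketch below identifies datasets by URI, states relationships as subject/predicate/object triples, and follows them automatically. The two predicates are terms from the W3C PROV vocabulary; the `example.org` URIs and dataset names are placeholders, not real GMEP identifiers.

```python
PROV = "http://www.w3.org/ns/prov#"  # W3C PROV namespace
BASE = "http://example.org/gmep/"    # hypothetical base URI

WAS_DERIVED_FROM = PROV + "wasDerivedFrom"
WAS_ATTRIBUTED_TO = PROV + "wasAttributedTo"

# Each triple is (subject URI, predicate URI, object URI).
triples = {
    (BASE + "dataset/model-output", WAS_DERIVED_FROM, BASE + "dataset/cleaned"),
    (BASE + "dataset/cleaned", WAS_DERIVED_FROM, BASE + "dataset/raw"),
    (BASE + "dataset/raw", WAS_ATTRIBUTED_TO, BASE + "org/institution-a"),
}


def to_ntriples(graph):
    """Serialise URI-only triples in N-Triples syntax, one per line."""
    return "\n".join(f"<{s}> <{p}> <{o}> ." for s, p, o in sorted(graph))


def all_sources(graph, subject):
    """Follow wasDerivedFrom links transitively from a subject URI."""
    found = set()
    frontier = [subject]
    while frontier:
        current = frontier.pop()
        for s, p, o in graph:
            if s == current and p == WAS_DERIVED_FROM and o not in found:
                found.add(o)
                frontier.append(o)
    return found


sources = all_sources(triples, BASE + "dataset/model-output")
serialised = to_ntriples(triples)
```

Because every institution would use the same predicate URIs, a program can work out that the model output ultimately depends on the raw dataset without knowing how each institution stores its data.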

Page 8: Processing of scientific data: from field capture to web delivery

… We've come full circle!

¿?

Page 9: Processing of scientific data: from field capture to web delivery

Hector Quintero Casanova Postgraduate in e-Science

Thank you

www.hqcasanova.com