data infrastructures for science

Post on 14-Jan-2016

DESCRIPTION

Mário Campolargo, European Commission - DG INFSO F, Acting Director. Data infrastructures for Science. DRIVER Summit: Towards a Confederation of Digital Repositories, Göttingen, 16th January 2008. - PowerPoint PPT Presentation

TRANSCRIPT

••• 1

Data infrastructures for Science

DRIVER Summit: Towards a Confederation of Digital Repositories

Göttingen, 16th January 2008

Mário Campolargo

European Commission - DG INFSO F

Acting Director

"The views expressed in this presentation are those of the author and do not necessarily reflect the views of the European Commission"

••• 2

a new vision for Science

Global challenges with high societal impact

Data deluge… wet-labs versus virtual-labs

Improved scientific process… role of simulation

Cross-disciplinarity

Virtual Research Communities

networking, grids, instrumentation, computing, data curation…

Technology push

value added of distributed collaborative research (virtual communities)

Application pull

••• 3

building Science through collaboration

Research Communities: common goals, complementary and shared information, tools and knowledge, awareness of research protocols, effective means of collaboration, interest in being part of the community

Virtual research: from empirical, experimental, theoretical and computational science… to intensive use of data… abstraction… models… simulation… e-Science

Virtual communities: no geographical, time or institutional boundaries

Globalisation: global challenges, global dimension, win-win situation

••• 4

global virtual research community

[Diagram: several virtual communities, each shown as a stack of Virtual Community, Meetings etc., Workspace, Virtual Labs, Scientific Data, Grid and Network, alongside a common Scientific Data, Grid and Network stack; labels: Economies of Scale, Efficiency Gains]

••• 5

ICT for Science: e-Infrastructure

Linking at the speed of light

Sharing computers, instruments and applications

Sharing and federating scientific data

Astrophysics community

Weather Forecast community

Biomedics community

Connecting the finest minds

Sharing and federating the best scientific resources

Building global virtual communities

••• 6

e-Infrastructure - implementation

network

middleware

data

virtual labs

user communities

••• 7

Framework Programme 7 (2007-13)

Capacities: 4097 M€

JRC: 1751 M€

Ideas: 7510 M€

Euratom: 4062 M€

People: 4750 M€

Cooperation: 32413 M€

Capacities breakdown: Dev. of policies, INCO, Science in Society, Research Infrastructures (42% - 1715 M€), SMEs, Research Potential, Regions of Knowledge

e-Infrastructures (ICT for Science): 572 M€
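The budget figures on this slide can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes that the 42% quoted for Research Infrastructures is measured against the Capacities budget, and that the e-Infrastructures line is a part of Research Infrastructures. The variable and function names are hypothetical; the amounts are copied from the slide.

```python
# Budget lines in M€, copied from the slide.
fp7_programmes = {
    "Cooperation": 32413,
    "Ideas": 7510,
    "People": 4750,
    "Capacities": 4097,
    "Euratom": 4062,
    "JRC": 1751,
}
research_infrastructures = 1715  # assumed to sit inside Capacities
e_infrastructures = 572          # assumed to sit inside Research Infrastructures


def share(part, whole):
    """Return part as a percentage of whole, rounded to one decimal place."""
    return round(100 * part / whole, 1)


total = sum(fp7_programmes.values())
ri_share = share(research_infrastructures, fp7_programmes["Capacities"])
ei_share = share(e_infrastructures, research_infrastructures)

print(f"Sum of the six programme lines: {total} M€")
print(f"Research Infrastructures / Capacities: {ri_share}%")
print(f"e-Infrastructures / Research Infrastructures: {ei_share}%")
```

Under these assumptions the Research Infrastructures share comes out at roughly the 42% shown on the slide, and e-Infrastructures accounts for about a third of Research Infrastructures.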

••• 8

e-Infrastructure - implementation

network

middleware

data

virtual labs

user communities

Support actions

Deployment of e-Infrastructure for scientific communities

Scientific Digital Reposit.

e-Science grid Infrast.

Scientific Data Infrast.

GÉANT

new infrastructures - PRACE

35M€

••• 9

importance of data for Science

extract knowledge

add value

publish, disseminate

store, validate, manage

capture, collect, create

From raw data to publications

From research to education

From one to multiple disciplines

From past and present to future

From one to multiple organisations

Several "continuum"

••• 10

implementation strategy

Policy actions, Studies

e-Infrastructure projects

Scientific Digital Rep., Scientific Data Infr., User communities, Support actions

ICT, e-Contents

eSciDR (Towards a European e-Infrastructure for e-Science Digital Repositories)

creating a European Scientific Data Infrastructure

Council Decision

Communication Scientific Inf. / ERA

ESFRI WG, e-IRG

••• 11

Improved access to, and sharing of, research data:

Promotes new research practices;

Makes possible the testing of new or alternative hypotheses and methods of analysis;

Enables new scientific insights by the exploration of topics not envisioned by the initial investigators;

Permits the creation of new data sets by the combination of data from multiple sources;

Facilitates the education of new researchers.

sharing scientific data

••• 12

Improved access to, and sharing of, research data:

Supports good management of public investment.

Potentially creates strong innovation value chains.

Enhances the value of global co-operation.

Impacts outside science.

sharing scientific data

Policy requirements identified by e-IRG and ESFRI

(availability, preservation/curation, quality, rights of use, interoperability)

Conclusions of eSciDR study coming soon

••• 13

qualities of repositories

repositories infrastructure

sustained

protected

discoverable

selected contents

managed

trusted

concern for quality

organisational context

source: eSciDR study (adapted)

••• 14

Council of the European Union, 22/23 Nov 2007

Considering:

Access to and dissemination of publications and data are crucial for the European Research Area and innovation

Effective long-lasting preservation is fundamental

Invites the Member States / Commission:

Enhance coordination between MS and large research organisations and funding bodies on access, preservation and dissemination policies and practices

Experiment with OA to data and publications from EU projects

Encourages research into digital preservation

Wide deployment of scientific data infrastructures with cross-border, cross-institution and cross-discipline added value for OA and preservation

council conclusions

••• 15

working with scientific communities

network: GÉANT

middleware: EGEE, DEISA

data: DRIVER

neuGRID, EUFORIA, D4Science, ETSF

IMPACT, METAFOR, EuroVO-AIDA, GENESI-DR

FEDERICA, EVALSO, EDGES

generic e-Infrastructure… user communities involvement

••• 16

FP7: data repositories

IMPACT: bio-informatics

NMDB: space physics

DRIVER II: federated digital repositories

METAFOR: climatology

EuroVO-AIDA: astronomy

GENESI-DR: geosciences

••• 17

data repositories projects - highlights

IMPACT

unifies data from 10 major databases related to protein families.

NMDB

establishes a digital repository for cosmic-ray data, and develops a real-time database from many neutron monitoring stations.

METAFOR

defines a Common Information Model (CIM) for climate data, handling heterogeneous metadata standards.

••• 18

data repositories projects - highlights

EuroVO-AIDA

unifies digital data collections of astronomy, integrating European data centres into a global Virtual Observatory.

GENESI-DR

provides access to earth science digital repositories (data from space, airborne and in-situ sensors).

DRIVER II

federates scientific repositories based on open standards, supporting complex information objects, cross-discipline.

••• 19

FP7: user communities/support actions*

EUFORIA: magnetic fusion

neuGRID: medical e-Support

D4Science: environment - e-Infrastructure

ETSF: spectroscopy

e-NMR: data analysis - biology

DORII: environmental science, seismology

PESI: biodiversity

PARSE.insight: permanent access to the records of science

* Proposals under negotiation at this stage

••• 20

data deluge?

The animals boarding Noah’s Ark two by two

painting by the American Edward Hicks (1780-1849)

••• 21

Modern Science makes increasing use of ICTs to change the way it is operating.

The amount and importance of scientific data produced are growing exponentially.

e-Infrastructure must support the management of data throughout the whole life cycle (including curation and long term preservation).

Data “is” an infrastructure for Science; scientific data infrastructure is therefore a key perspective in building a European e-Infrastructure.

Europe launched the basis for a consistent action (studies, policies, projects).

conclusions

••• 22

further information

www.cordis.europa.eu/fp7/ict/e-infrastructure/
