the aginfra germplasm working group

33
The Germplasm Working Group Dr. Vassilis Protonotarios Agricultural Biotechnologist, PhD Agro-Know Technologies, Greece e-Conference on Germplasm Data Interoperability Session 1: “The vision of Linked Germplasm Data”

Upload: vassilis-protonotarios

Post on 27-Jan-2015

112 views

Category:

Education


0 download

DESCRIPTION

Presentation about the agINFRA Germplasm Working Group (http://wiki.aginfra.eu/index.php/Germplasm_Working_Group). Presented during Session 1 of the 1st International e-Conference on Germplasm Data Interoperability (https://sites.google.com/site/germplasminteroperability/)

TRANSCRIPT

Page 1: The agINFRA Germplasm Working Group

The Germplasm Working Group

Dr. Vassilis ProtonotariosAgricultural Biotechnologist, PhDAgro-Know Technologies, Greece

e-Conference on Germplasm Data InteroperabilitySession 1: “The vision of Linked Germplasm Data”

Page 2: The agINFRA Germplasm Working Group

Structure of the presentation

1. Background– About the agINFRA project– Issues related to data sharing

2. The Germplasm Working Group– Objectives– Wiki– Link with RDA

3. The next steps

Page 3: The agINFRA Germplasm Working Group

Background

Page 4: The agINFRA Germplasm Working Group

The agINFRA project

• A project funded under the FP7 program of EC• Consortium with expertise on– Technology / infrastructures– Data / data management

Combined to facilitate agricultural data sharingMore info at:

www.aginfra.eu

Page 5: The agINFRA Germplasm Working Group

The agINFRA project

• Aims to enhance the interoperability between the agricultural data sources

– Data sharing by• Metadata aggregation & linking data• Design and deploy the linked ag-data framework

– Methodology for linking data– Provide the infrastructure needed• Both cloud- and grid-based services• Tools, APIs etc.

Page 6: The agINFRA Germplasm Working Group

agINFRA major data types

agINFRA

Bibliographic

Agri Statistics & Economics

Educational

Germplasm

Soil data

Profiles

Raw data

Other?

Page 7: The agINFRA Germplasm Working Group

agINFRA major data sourcesData Type Data provider(s)

Bibliographic FAO AGRISCASDD (CAAS)

Educational Organic.EdunetGreen Learning NetworkLAFLOR

Germplasm Chinese Crop Germplasm Information System (CAAS)Italian National Germplasm Database (CRA)

Soil Data Italian National Center for Soil Mapping

Statistical FAOSTATCountrySTAT

Researchers’ profiles, organizations & events

AGRIVIVO

Page 8: The agINFRA Germplasm Working Group

Focusing on germplasm

Local Databases

National DatabasesAggregators

GENESYSEURISCO

GBIF

Italian

Italian University

Italian research center

Chinese Chinese research center

Data flow

Page 9: The agINFRA Germplasm Working Group

Focusing on germplasm

Local Databases

National DatabasesAggregators

GENESYSEURISCO

Italian

Italian University

Italian research center

Chinese Chinese research center

Page 10: The agINFRA Germplasm Working Group

The issue ?

• Heterogeneity!– Data types– Data formats– Data management workflows– Standards used– Metadata exposure options– ….

• Lack of connectivity with other data sources

Page 11: The agINFRA Germplasm Working Group

The Germplasm Working Group

Page 12: The agINFRA Germplasm Working Group

The Germplasm Working Group

• Created in the context of the agINFRA project• Initially included agINFRA stakeholders– now expanded to host all stakeholders

• The group is NOT a group of experts on germplasm data!

Page 13: The agINFRA Germplasm Working Group

The scope of the Germplasm WG

• Aims to enable/enhance interoperability between germplasm databases – By developing the services for

• exchanging their data and • delivering their data to other partners

• Focusing on three actions:1. IDENTIFY2. ORGANIZE3. PROPOSE

Page 14: The agINFRA Germplasm Working Group

Germplasm WG objectives

• IDENTIFY: collect all information related to germplasm data

• People/groups• Namespaces (metadata, KOS)• Standards• Workflows• Events

• ORGANIZE: engage all stakeholders & available resources, analyze existing standards , facilitate collaboration

• PROPOSE: linked data framework to connect data sources • facilitate data sharing between germplasm data sources

Page 15: The agINFRA Germplasm Working Group

Germplasm related information

data management

workflows

metadata schemas

Working groups in

germplasm

Events (for connecting stakeholders)

KOS (ontologies,

thesauri, vocabularies

etc.)

Data exposure capabilities

Page 16: The agINFRA Germplasm Working Group

Germplasm related information

data management

workflows

metadata schemas

Working groups in

germplasm

Events (for connecting stakeholders)

KOS (ontologies,

thesauri, vocabularies

etc.)

Data exposure capabilities

Page 17: The agINFRA Germplasm Working Group

Proposed methodology

1. Analyze metadata schemas & KOSs used to describe germplasm resources

2. Define attributes & vocabularies that can be used to expose germplasm resources in linked data format.

3. Provide a set of recommendations for the exposure of germplasm resources as linked data

4. Embed the recommendations in the data infrastructure of agINFRA – to allow the exposure of germplasm resources as LOD.

Page 18: The agINFRA Germplasm Working Group

The Germplasm WG wiki

• Central point of reference

• Freely accessible (no login required)

http://wiki.aginfra.eu/index.php/Germplasm_Working_Group

Page 19: The agINFRA Germplasm Working Group

Information available so far

• Vision• Activities• Outcomes• Participants• Next steps• Useful resources– Data sources– Standards– Services– Stakeholders

• Events

Page 20: The agINFRA Germplasm Working Group
Page 21: The agINFRA Germplasm Working Group

Key outcomes of the group

• Dossier on Germplasm Information:– Major programs– Major information systems and services– agINFRA germplasm data sources (CGRIS & CRA)– Core standards for germplasm information – Plant nomenclature, taxonomies and ontologies– Plant genomic resources– Related references and links

• Freely available from the Germplasm Group wiki

Page 22: The agINFRA Germplasm Working Group
Page 23: The agINFRA Germplasm Working Group

Existing participants

Page 24: The agINFRA Germplasm Working Group

Our wish list (tentative list)

Reusing experiences from …and working closely with

Page 25: The agINFRA Germplasm Working Group

Connection with RDA

• RDA: Research Data Alliance (https://rd-alliance.org)

• Aims to “accelerate and facilitate research data sharing and exchange”

• Structure:– Interest Groups: Cover wider topics– Working Groups: Working on focused topics

Page 26: The agINFRA Germplasm Working Group
Page 27: The agINFRA Germplasm Working Group

Connection with RDA

• Representation of agINFRA Germplasm WG in– 1st RDA Plenary Meeting (March 2013,

Gothenburg, Sweden)– 2nd RDA Plenary Meeting (September 2013,

Washington D.C., USA)

• Suggestion for a Germplasm WG in RDA

Page 28: The agINFRA Germplasm Working Group

Link between WG and RDA Groups

Page 29: The agINFRA Germplasm Working Group

Link between WG and RDA GroupsRDA IG/WG

• Collection of large-scale data

• Collection of requirements

•Development of Best Practices

• Interaction with other IGs/WGs (e.g. metadata, LD)

• Application in more cases

• Wider exposure of outcomes

•Development of Best Practices

agINFRA WG• Interactions with data

providers

• Two (2) case studies

• Analysis of existing standards

• Collection of requirements

• Definition of data management workflows

• Development & adaptation of tools and services

•Development of Best Practices

Page 30: The agINFRA Germplasm Working Group

The next steps

Page 31: The agINFRA Germplasm Working Group

Towards the linking of germplasm data sources

1. Definition and application of the linked data for the agINFRA germplasm data sources

2. Recording and documentation of the process3. Identification of issues4. Suggestion for solutions to these issues5. Fine-tuning of workflow6. Development of Best Practices

Page 32: The agINFRA Germplasm Working Group

…and more next steps

• Update the existing analysis with new data• Collect new user requirements• (re)define the mappings between metadata

schemas and KOSs• Fine-tune the linked data approach

Page 33: The agINFRA Germplasm Working Group

Source: http://verastic.com/social/why-do-people-not-say-thank-you.html

Contact me: [email protected]