nova phd training course on pre-breeding, nordic university network (2012)

36
Focused Identification of Germplasm Strategy (FIGS) NOVA PhD training course Pre-breeding for sustainable plant production Dag Endresen, 25 January 2012, Alnarp, Sweden Oat (Avena sativa L.) at Alnarp Aug 2010, by Dag Endresen, CC-By.

Upload: dag-endresen

Post on 18-May-2015

1.248 views

Category:

Education


0 download

DESCRIPTION

Pre-breeding for sustainable plant production. Nova PhD course, January 2012 at Röstånga in Southern Sweden. Nova is a Nordic University Network. Pre-breeding provides an important element in broadening the genetic diversity and introducing new and useful traits and properties to the food crops. New traits introduced in pre-breeding activities are not least important to meet the new challenges agriculture will face from the on-going climate change. The needed genetic diversity is often available outside of the genepool of cultivars and elite breeding lines. And sources of novel genetic diversity such as the primitive crops and even the wild relatives of the cultivated plants are expected to get increased focus when facing new challenges in agriculture. The GBIF data portal provides information on in situ occurrences for many of the wild relatives to the cultivated plants that are not (yet) collected and accessioned by the ex situ seed genebank collections. The GBIF data portal will therefore provide a very valuable bridge between these data sources for genebank accessions and occurrence data sources outside of the genebank community. Occurrences from the GBIF data portal will assist in the identification of locations where potentially useful populations of crop wild relatives can be found. Ecological niche modeling provides a widely used approach for predicting species distributions and can be used for this purpose. Recent work on predictive modeling to identify a link between useful crop traits and eco-geographic data associated with the source locations for germplasm may have particular value for pre-breeding efforts. The Focused Identification of Germplasm Strategy (FIGS) provides and approach for efficient identification of germplasm material with new and useful genetic diversity for a target trait property. Such predictive modeling approaches are of particular interest when performing pre-breeding because of the high costs related to working with this material. Cultivated plants are domesticated for properties and traits such as non-shattering seed behavior and more uniform harvest time that makes conducting agricultural experiments easier and less costly. Non-domesticated germplasm material and also the older cultivars and landraces have many agro-botanical traits that was moderated in modern cultivars to better suit agricultural practices and efficiency. Pre-breeding is largely about removing such undesired traits from the non-cultivated and less intensively domesticated material while maintaining potentially useful traits. Nova PhD course home page: http://www2.nova-university.org/chome/cpage.php?cnr=03-110404-412 https://sites.google.com/site/novaplantimprovementnetwork/home/phd-course-in-sweden-january-2012

TRANSCRIPT

Page 1: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Focused Identification of Germplasm Strategy (FIGS)

NOVA PhD training coursePre-breeding for sustainable plant production

Dag Endresen, 25 January 2012, Alnarp, Sweden

Oat (Avena sativa L.) at Alnarp Aug 2010, by Dag Endresen, CC-By.

Page 2: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

TOPICS:

2

• Focused Identification of Germplasm Strategy (FIGS)– Predictive link between climate

data and trait data– Heuristic approach– Trait mining with FIGS

• Some FIGS case studies• The FIGS approach for pre-

breeding

Bread wheat at Alnarp, June 2010 by Dag Endresen , CC-By

Page 3: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Domestication and cultivated plants:Utilizing genetic potential from the wild

corn, maize

wild tomato

tomato

teosinte3

cultivation

Page 4: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

A needle in a hay stack

• Scientists and plant breeders want a few hundred germplasm accessions to evaluate for a particular trait.

• How does the scientist select a small subset likely to have the useful trait?

4

Photo from the USDA Photo archive

Page 5: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Origin of Concept:Boron toxicity for wheat and barley in Australia, late 1980s

What is Focused Identification of Germplasm Strategy

Slide made byM.C. Mackay, 1995

South AustraliaMediterranean Sea

Page 6: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Objectives of FIGS

6Bread wheat at Nöbbelöv in Lund by Dag Endresen (CC-By).

• Identify new and useful genetic diversity for crop improvement.

• Using eco-geographic data for prediction of crop traits a priori BEFORE the field trials.

• Subset with a higher density of genetic diversity for a target trait property.

Page 7: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

er

y

a

n

is

F I G SOCUSED DENTIFICATION OF ERM PLASM TRATEGY

Data la

yers sie

ve acce

ssions

ba

sed on

latitud

e &

lon

gitud

e

Illustration by Mackay (1995)

Heuristic approach

Origin of FIGS: Michael Mackay (1986, 1990, 1995)

7

• Based on heuristic experience and expert knowledge.

• Finding upper and lower boundary limits for individual environmental parameters.

Page 8: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Trait Mining

• Based on multivariate and multi-way data analysis.

• Eco-geographic data analysis using climate and other environmental data.

• Focused Identification of Germplasm Strategy (FIGS).

8Potato (Solanum tuberosum L.) at Polli in Latvia, May 2004.

Page 9: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

FIGS – Focused Identification of Germplasm Strategy

Trait mining using the FIGS approach is a new method to predict crop traits of primitive cultivated material from climate variables by using multivariate statistical methods.

9

Page 10: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

10

Trait mining: To build a predictive computer model- explaining the crop trait score - using environmental data.

Page 11: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Climate effect during the cultivation process

Wild relatives are shaped by the environment

Primitive cultivated crops are shaped by local climate and humans

Traditional cultivated crops (landraces) are shaped by climate and humans

Modern cultivated crops are mostly shaped by humans (plant breeders)

Perhaps future crops are shaped in the molecular laboratory…?

11

Page 12: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Predictive link between eco-geography and traits

It is possible that the human mediated selection of landraces contributes to the link between ecogeography and traits.

During traditional cultivation the farmer actively selects for and introduces germplasm for improved suitability of the landrace to the local conditions.

12

Page 13: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Limitations of FIGS• Landraces and wild relatives

– The link between climate data and the trait data is required for trait mining with FIGS. Modern cultivars are not expected to show this predictive link (complex pedigree).

• Georeferenced accessions– Trait mining with FIGS is based on multivariate

models using climate data from the source location of the germplasm. To extract climate data the accessions need to be accurately georeferenced.

13

Wheat in the Hulah valley (Israel), 2007 by Aviad Bublil

Page 14: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Eco-geographic climate data layers

Climate layers from the ICARDA eco-climatic database (De Pauw, 2003)

14

Page 15: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Climate dataLayers used for these early FIGS studies:

• Precipitation (rainfall)• Maximum temperatures • Minimum temperatures

Some of the other layers available:

• Potential evapotranspiration (water-loss)• Agro-climatic Zone (UNESCO classification)• Soil classification (FAO Soil map)• Aridity (dryness)

(mean values for month and year)Eddy De Pauw (ICARDA, 2008)

15

Page 16: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Climate data – WorldClimThe climate data can be extracted from the WorldClim dataset.http://www.worldclim.org/ (Hijmans et al., 2005)

Data from weather stations worldwide are combined to a continuous surface layer.

Climate data for each landrace is extracted from this surface layer.

Precipitation: 20 590 stations

Temperature: 7 280 stations

16

Page 17: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

FIGS studies:

17

• Heuristic approach:– Sunn pest– Powdery mildew, Pm3

• Multi-way approach– Morphological traits for Nordic

Barley landraces• Multivariate approach

– Net blotch on barley landraces– Stem rust on wheat landraces– Ug99 stem rust on wheat

• Wild relatives– PGR Secure (EU 7th framework)

Salix Accessions at Alnarp, 2011 by Dag Endresen, CC-By

Page 18: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Powdery mildew on wheat(Heuristic FIGS approach)

18

• A FIGS set for Powdery mildew resistance was derived based on the environmental conditions for PM hotspots.• Starting with 16,089 wheat landraces (6159 sites).• FIGS subset of 1320 wheat accessions (420 sites).• 211 accessions were scored as resistant in the field

trials.• Allele mining was made using Virus Induced Gene

Silencing (VIGS).• Only 7 resistance alleles (Pm3a to Pm3g) were

previously known at the Pm3 locus.• This study found 7 new resistance alleles (Pm3h to

Pm3n).

Bhullar, N.K., K. Street, M. Mackay, N. Yahiaoui, and B. Keller (2009). Unlocking wheat genetic resources for the molecular identification of previously undescribed functional alleles at the Pm3 resistance locus. PNAS 106(23):9519-9524. DOI: 10.1073/pnas.0904152106.

Powdery mildew on wheat. Bhullar et al (2009) PNAS 106: 9519-9524, Fig 2.

Page 19: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Sunn Pest on wheat (ICARDA)(Heuristic FIGS approach)

19

Based on a slide by Ken Street, ICARDA

• No previous sources of Sunn pest resistance had been found in hexaploid wheat.

• 2 000 accessions were screened at ICARDA without result (during 2000 to 2006).

• A FIGS set of 534 accessions was developed and screened (during 2007 and 2008). • Starting with 16 000 wheat landraces from VIR, ICARDA

and AWCC.• Excluding origin CHN, PAK, IND - were Sunn pest was

only recently reported (6 328 accessions).• One accession per collecting site (2 830 acc).• Excluding dry environments below 280 mm/year.• Excluding sites of low winter temperature below 10

degrees Celsius (1 502 accessions).• Reduced to 534 accessions, using PCA clustering.

• 10 resistant accessions were found!

Bouhssini, M., K. Street, A. Joubi, Z. Ibrahim, and F. Rihawi (2009). Sources of wheat resistance to Sunn pest, Eurygaster integriceps Puton, in Syria. Genetic Resources and Crop Evolution 56:1065-1069. DOI: 10.1007/s10722-009-9427-1

Page 20: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Morphological traits in Nordic Barley landracesField observations by Agnese Kolodinska Brantestam (2002-2003)

Multi-way N-PLS data analysis, Dag Endresen (2009-2010)

Priekuli (LVA) Bjørke (NOR) Landskrona (SWE) 20

Google Maps © 2010Tele Atlas

Page 21: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Multi-way N-PLS resultsNordic barley landraces

ExperimentSite Year

Heading days

Ripening days

Length of plant

Harvest index

Volumetric weight

Thousand grain weight

LVA 20021 n.s. n.s. n.s. n.s. *** n.s.

LVA 2003 *** n.s. ** ** *** n.s.

NOR 2002 - * ** *** ** n.s.

NOR 2003 ** *** *** * * n.s.

SWE 2002 ** *** n.s. ** * n.s.

SWE 20032 n.s. ** n.s. n.s. ** n.s.

*** Significant at the 0.001 level (p-value)

** Significant at the 0.01 level

* Significant at the 0.05 level

n.s. Not significant (at the above levels)

1 LVA 2002 Germination on spikes (very wet June)

2 SWE 2003 Incomplete grain filling (very dry June)

Endresen, D.T.F. (2010). Predictive association between trait data and ecogeographic data for Nordic barley landraces. Crop Science 50: 2418-2430. DOI: 10.2135/cropsci2010.03.0174 21

Page 22: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Net blotch on barley landraces

Green dots indicate collecting sites for resistant wheat landraces and red dots collecting sites for susceptible landraces.

USDA GRIN, trait data online: http://www.ars-grin.gov/cgi-bin/npgs/html/desc.pl?1041

Field experiments made in Minnesota, North Dakota and Georgia in the USA

22

Page 23: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Multivariate SIMCA resultsfor Net blotch on barley

Dataset (unit) PPV LR+ Estimated gain

Net blotch (accession) 0.54 (0.48-0.60) 1.75 (1.42-2.17) 1.35 (1.19-1.50)

Random 0.40 (0.35-0.45) 0.99 (0.84-1.17) 0.99 (0.87-1.12)(40 % resistant samples)

Endresen, D.T.F., K. Street, M. Mackay, A. Bari, E. De Pauw (2011). Predictive association between biotic stress traits and ecogeographic data for wheat and barley landraces. Crop Science 51: 2036-2055. DOI: 10.2135/cropsci2010.12.0717

PPV = Positive Predictive Value; LR+ = Positive Diagnostic Likelihood Ratio

23

Page 24: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Stem rust on wheat landraces

USDA GRIN, trait data online: http://www.ars-grin.gov/cgi-bin/npgs/html/desc.pl?65049

Green dots indicate collecting sites for resistant wheat landraces and red dots collecting sites for susceptible landraces.

Field experiments made in Minnesota by Don McVey

24

Page 25: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Multivariate SIMCA Resultsfor Stem rust on wheat

Dataset (unit) PPV LR+ Estimated gain

Stem rust (accession) 0.54 (0.50-0.59) 3.07 (2.66-3.54) 1.95 (1.79-2.09)

Random 0.29 (0.26-0.33) 1.04 (0.90-1.20) 1.03 (0.91-1.16)(28 % resistant samples)

Stem rust (site) 0.50 (0.40-0.60) 4.00 (2.85-5.66) 2.51 (2.02-2.98)

Random 0.19 (0.13-0.26) 0.94 (0.63-1.39) 0.95 (0.66-1.33)(20 % resistant samples)

Endresen, D.T.F., K. Street, M. Mackay, A. Bari, E. De Pauw (2011). Predictive association between biotic stress traits and ecogeographic data for wheat and barley landraces. Crop Science 51: 2036-2055. DOI: 10.2135/cropsci2010.12.0717

PPV = Positive Predictive Value; LR+ = Positive Diagnostic Likelihood Ratio

25

Page 26: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Multivariate AnalysisStem rust on wheat

Classifier method AUC Cohen’s Kappa

Principal Component Regression (PCR) 0.69 (0.68-0.70) 0.40 (0.37-0.42)

Partial Least Squares (PLS) 0.69 (0.68-0.70) 0.41 (0.39-0.43)

Random Forest (RF) 0.70 (0.69-0.71) 0.42 (0.40-0.44)

Support Vector Machines (SVM) 0.71 (0.70-0.72) 0.44 (0.42-0.45)

Artificial Neural Networks (ANN) 0.71 (0.70-0.72) 0.44 (0.42-0.46)

Bari, A., K. Street, , M. Mackay, D.T.F. Endresen, E. De Pauw, and A. Amri (2011). Focused Identification of Germplasm Strategy (FIGS) detects wheat stem rust resistance linked to environment variables. Genetic Resources and Crop Evolution [online first]. doi:10.1007/s10722-011-9775-5; Published online 3 Dec 2011.

Abdallah Bari (ICARDA)

AUC = Area Under the ROC Curve (ROC, Receiver Operating Curve)

26

Page 27: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Multivariate SIMCA resultsStem rust (Ug99) on wheat

Ug99 set with 4563 wheat landraces screened for Ug99 in Yemen 2007, 10.2 % resistant accessions. The true trait scores for 20% of the accessions (825 samples) were revealed. We used trait mining with SIMCA to select 500 accessions more likely to be resistant from 3728 accession with true scores hidden (to the person making the analysis). The FIGS set was observed to hold 25.8 % resistant samples and thus 2.3 times higher than expected by chance.

27

Page 28: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Multivariate Analysis Resultsfor Stem rust (Ug99) on wheat

Classifier method PPV LR+ Estimated gainkNN (pre-study) 0.29 (0.13-0.53) 5.61 (2.21-14.28) 4.14 (1.86 - 7.57)

SIMCA 0.28 (0.14-0.48) 5.26 (2.51-11.01) 4.00 (2.00 - 6.86)Ensemble classifier 0.33 (0.12-0.65) 8.09 (2.23-29.42) 6.47 (2.05-11.06)

Random 0.06 (0.01-0.27) 0.95 (0.13 - 6.73) 0.97 (0.16 - 4.35)(pre-study, 550 + 275 accessions)

Ensemble 0.26 (0.22-0.30) 2.78 (2.34-3.31) 2.32 (2.00-2.68)

Random 0.11 (0.09-0.15) 1.02 (0.77-1.36) 0.95 (0.77-1.32)

(blind study, 825 + 3738 accessions)

Endresen, D.T.F., K. Street, M. Mackay, A. Bari, E. De Pauw, K. Nazari, and A. Yahyaoui (2012). Sources of Resistance to Stem Rust (Ug99) in Bread Wheat and Durum Wheat Identified Using Focused Identification of Germplasm Strategy (FIGS). Crop Science [online first]. doi: 10.2135/cropsci2011.08.0427; Published online 8 Dec 2011.

PPV = Positive Predictive Value; LR+ = Positive Diagnostic Likelihood Ratio

28

Page 29: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

29

Collecting the seeds for some of the most important populations for the wild relatives of the cultivated plants, to maintain them at the ex situ genebanks, would provide improved access to these valuable plant genetic resources.

... but how to identify the most important CWR populations?

Genebank Accession at IPK Gatersleben by Dag Endresen, 2010, CC-By.

Page 30: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

GAP analysis for management of genebank collections• Advice the planning of new

collecting/gathering expeditions

– Identification of relevant areas were the crop species is predicted to be present (using GBIF data and niche models).

– Focus on areas least well represented in the genebank collection (maximize diversity).

– Focus on areas with a higher likelihood for a desired target trait (FIGS). 30

South of Tunisia, by Dag Endresen, CC-By

For more information on Gap Analysis see: http://gisweb.ciat.cgiar.org/GapAnalysis/

Page 31: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Wormwood (Artemisia absinthium L.)NordGen study: June 2010

Species distribution model(7 364 records)

Using the Maxent desktop software.

31

Wormwood (Artemisia absinthium L.)

Page 32: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

The compatibility of data standards between PGR and biodiversity collections made it possible to integrate the worldwide germplasm collections into the biodiversity community (TDWG, GBIF).

Potential of the GBIF technology

Genebank accessions: http://data.gbif.org/datasets/network/2

32

Using GBIF/TDWG technology (and contributing to its development), the PGR community can more easily establish specific PGR networks without duplicating GBIF's work.

Page 33: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Global crop collections

Moving towards… global integration of information

Threatened species

Migratory species

Spatial data

Global crop system

Crop standards

Legislation and regulations etc. Crop collections in

Europe

Genebank datasets

33

Page 34: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

GBIF Young Researcher Award

34

Suwon, Thursday, 14 October 2010:• Amy McDougall, Environmental Sciences School at the

University of East Anglia, United Kingdom, “Bridging the Gap in a Climate Changed World: Is Conservation Planning by Taxa a Realistic Aim?”

• Andrés Lira-Noriega, Department of Ecology and Evolutionary Biology and the Biodiversity Institute at the University of Kansas, “A Comparison of Correlative and Mechanistic Ecological Niche Modeling Approaches for Understanding Biodiversity Patterns.“

Buenos Aires, Argentina on 4-6 October 2011:• César Antonio Ríos-Muñoz (ornithologist from Mexico) will

investigate the evolutionary processes leading to the current distribution of species in Mesoamerican lowland tropical forests.

• Conor Ryan (marine biologist from Ireland) will study otoliths, calcium carbonate deposits found in the inner ear of fish, whose shape is highly specific to each species.

César Antonio Ríos-Muñoz (left) and Conor Ryan (right)

Amy McDougall (left) and Andrés Lira-Noriega (right)

Page 35: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

GBIF Young Researcher Award

35

The 3rd Young Researcher Award (Lillehammer, September 2012):

GBIF invites proposals from graduate students for the 2012 Young Researchers Award. This prize intends to foster innovative research and discovery in biodiversity informatics.

Two awards of €4,000 will be available to graduate students in a master’s or doctoral programme at a university in a GBIF Voting Participant or Associate Participant country.

The deadline to receive nominations from the Heads of Delegation is 15 March 2012.

http://www.gbif.org/communications/news-and-events/young-researchers-award/

Perhaps one of the students from the Nordic Plant Improvement Network will become the next GBIF Young Researcher Awardee…?

Page 36: NOVA PhD training course on pre-breeding, Nordic University Network (2012)

Thanks for listening!

NOVA University NetworkPhD training coursePre-breeding for sustainable plant production22 to 29 January 2012

Dag Endresen (GBIF)[email protected]

36