association mapping for improvement of agronomic traits in rice

36
Doctoral Doctoral seminar on seminar on Presented By Zuge Sopan Shivaji Ph.D. Scholar Dept. of Genetics and Plant Breeding, CoA, Raipur.

Upload: sopan-zuge

Post on 16-Apr-2017

39 views

Category:

Science


1 download

TRANSCRIPT

Page 1: Association mapping  for improvement of agronomic traits in rice

Doctoral seminar Doctoral seminar onon

Presented ByZuge Sopan ShivajiPh.D. ScholarDept. of Genetics and Plant Breeding, CoA, Raipur.

Page 2: Association mapping  for improvement of agronomic traits in rice
Page 3: Association mapping  for improvement of agronomic traits in rice

Introduction

• Polygenic inheritance of agronomic traits- controlled by multiple genes whose

expression is affected by many factors. Hence phenotypic selection becomes

tedious job.

• Family mapping (Limitations-Biparental population, Low resolution, Analysis

of only 2 alleles, time consuming).

• Population or Association mapping (I) increased mapping resolution, (ii)

reduced research time, and (iii) greater allele number (Yu and Buckler, 2006).

• Association mapping identifies quantitative trait loci (QTLs) by examining the

marker-trait associations that can be attributed to the strength of linkage

disequilibrium between markers and phenotype across a set of diverse

germplasm.

Page 4: Association mapping  for improvement of agronomic traits in rice

• Association mapping, also known as "linkage disequilibrium

mapping", is a method of mapping quantitative trait loci (QTLs)

that takes advantage of linkage disequilibrium to link phenotypes

to genotypes.

Offers greater precision in QTL location than family-based

linkage analysis.

Does not require family or pedigree information , can be applied to

a range of experimental and non-experimental populations.

Association mapping (AM)

Page 5: Association mapping  for improvement of agronomic traits in rice

How it works?

• Association studies are based on the assumption that a marker locus

is ‘sufficiently close’ to a trait locus so that some marker allele

would be ‘travelling’ along with the trait allele through many

generations during recombination.

Page 6: Association mapping  for improvement of agronomic traits in rice

Direct and Indirect Allelic Association

D

*

Measure disease relevance (*) directly, ignoring correlated markers nearby

Direct Association

M1 M2 Mn

Assess trait effects on D via correlated markers (Mi) rather than susceptibility/etiologic variants.

D

Indirect Association & LD

•Allele of interest is itself involved in phenotype

• Allele itself is not involved, but a nearby correlated marker changes phenotype

Page 7: Association mapping  for improvement of agronomic traits in rice

Linkage mapping

In 1913, the first individual to construct a (very small) genetic map was Alfred Sturtevant.

Genes/ markers in order, indicating the relative genetic distances between them, and assigning them to their chromosome.

Distance = Recombination frequency=No. of recombinants /Total progeny X 100

Suppose the recombination between loci A and B is 6%, that between loci B and C is 20%, and that between A and C 24%, then we can order the loci along the chromosome as…

(Hartal et al., 2010)

Page 8: Association mapping  for improvement of agronomic traits in rice

Mapping resolution

(Braulio et al., 2012)

Page 9: Association mapping  for improvement of agronomic traits in rice

Marker-trait associations in experimental and natural populations

Experimental populations (e.g. F2, RIL) 2-parental alleles; small genetic variation; few meiotic cycles; low resolution

Natural populations many alleles; large genetic variation; many meiotic cycles; high resolution

Page 10: Association mapping  for improvement of agronomic traits in rice

Approaches to mapping genes

(Yu and Buckler, 2006)

Page 11: Association mapping  for improvement of agronomic traits in rice

Advantages of AM over linkage mapping

Linkage Mapping Association Mapping

Structured Population

(e.g. Biparental population)

Un-structured population

(e.g. Germplasm lines)

Low resolution (few to several

centimorgans away from gene/QTL)

High resolution (Much closer than

those by linkage mapping)

Only few alleles can be detected Many alleles can be detected

Moderate marker density High/moderate marker density

Feasible in annual and biennial

species, not feasible in perennial

species

Feasible in annual, biennial and

perennial species

Narrow range Wide range

Time consuming Less time required(Yu et al., 2006)

Page 12: Association mapping  for improvement of agronomic traits in rice

Types of association mapping

1. Genome wide association mapping: Search whole genome for causal

genetic variation. A large number of markers are tested for association

with various complex traits and it doesn’t require any prior information

on the candidate genes.

2. Candidate gene association mapping: Dissect out the genetic control

of complex traits, based on the available results from genetic,

biochemical, or physiology studies in model and non-model plant

species (Mackay, 2001). Requires identification of SNPs between lines

within specific genes.

Page 13: Association mapping  for improvement of agronomic traits in rice

(Zu et al., 2009)

Page 14: Association mapping  for improvement of agronomic traits in rice

Steps in association mapping

Page 15: Association mapping  for improvement of agronomic traits in rice

Mapping population and Population structure

• Randomly or non-randomly mated germplasm

• Randomly mated populations represent a rather narrow group of

germplasm, likely to lower resolution and harbor only a narrow range of

alleles

• Non randomly mated germplasm is used, population structure needs to

be controlled in the statistical analysis.

• Cluster analysis is done to know the variation in population and most

diverse individuals are selected from each cluster to represent the

individuals of that cluster.

(Yu et al., 2006)

Page 16: Association mapping  for improvement of agronomic traits in rice

Phenotyping

Phenotyping

• Success of AM depends on accuracy and throughput of genotyping

• Replications across multiple years in randomized plots and multiple

locations and environments.

• Field Design:- incomplete block design (Lattice), RBD (Eskridge,

2003).

Should be done on the basis of

• Diversity:- on the basis of phenotype and genotype

• Population structure:- Systematic difference in allele frequencies

btw. sub-populations…

Page 17: Association mapping  for improvement of agronomic traits in rice

Genotyping

• Mostly multiallelic, reproducible, PCR-based markers are used.

• Microsatellites or simple sequence repeats (SSRs), and SNPs are

more revealing than their dominant counterparts and, therefore, are

more powerful.

• Due to higher genome density, lower mutation rate and wide

distribution throughout the genome SNPs are rapidly becoming the

marker of choice for complex trait

Page 18: Association mapping  for improvement of agronomic traits in rice

Linkage Disequilibrium

Linkage disequilibrium means that we don’t need to genotype the

exact causal variant, but only a variant that is correlated with it.

Page 19: Association mapping  for improvement of agronomic traits in rice

Linkage Disequilibrium Map & Allelic Association

Primary Aim of LD maps: To identify the relationship between marker and QTL or trait of interest.

Marker 1 2 3 n

LD

D

Page 20: Association mapping  for improvement of agronomic traits in rice

Linkage disequilibrium (LD)

• LD refers to non random association of allels at different loci.

• LD follows the fact that closely located genes are transmitted as a block, which only

rarely breaks up in meiosis.

• Closely located genes often express linkage disequilibrium to each other: An example:

Consider two independently segregating genes A and B with two alleles (A, a and B, b

respectively)

• At equilibrium, the frequency of the AB should equal to the product of the allele

frequencies of A and B,

• PAB Pab =PAbPaB (1:1 ratio = no LD)

• Any deviation from these values implies LD.

A a Total

B AB aB Bb Ab ab bTotal A a

Page 21: Association mapping  for improvement of agronomic traits in rice
Page 22: Association mapping  for improvement of agronomic traits in rice

Cont.…..

(Zhu et al.,2009)

Page 23: Association mapping  for improvement of agronomic traits in rice

LD Decay with time for four different recombination fractions (ϴ)

(Powell et al., 2006)

Page 24: Association mapping  for improvement of agronomic traits in rice

Factors affecting LD

LD increases due to population structure, relatedness (kinship), small

founder population size or genetic drift, selection (natural, artificial).

While factors like outcrossing, high recombination rate, high

mutation rate, gene conversion, etc., lead to a decrease/disruption in

LD. Thus, LD declines with 1) increase in genetic distance and

2) increase in number of generations.

(Huttley et al., 2005)

Page 25: Association mapping  for improvement of agronomic traits in rice

Evaluation of linkage disequilibrium and associating

genotype- phenotype

• TASSEL (http://www.maizegenetics.net) is used to measure the

extent of LD as squared allele frequency correlation estimates (R2,

Weir, 1996) and measure the significance of R2.

• Besides TASSEL there are many other softwares like DnaSP,

Arlequin etc. used to calculate D‘ and R2.

Page 26: Association mapping  for improvement of agronomic traits in rice

Softwares used in AMSr. Software Focus Description

1. TASSEL Association analysis Free, LD statistics, sequence analysis, association mapping

2. Haploview 4.2

Haplotype analysis and LD LD and haplotype block analysis, haplotype population frequency estimation, single SNP and haplotype association tests.

3. SVS 7 Stratification,LD and AM

Estimate stratification, LD, haplotypes blocks and multiple AM approaches for up to 1.8 million SNPs and 10,000 sample

4. GenStat Stratification, LD and AM SSR markers, GLM and MLM-PCA methods

5. JMP genomics

Stratification, LD and structured AM

SNPs, CG and GWAS, analysis of common and rare Variants

6. GenAMap Stratification, LD and structured AM

SNPs, tree of functional branches, multiple visualization tools

7 PLINK Stratification, LD and structured AM

SNPs, multiple AM approaches, IBD and IBS Analyses

8. STRUCTURE

Populationstructure

Compute a MCMC Bayesian analysis to estimate the proportion of the genome of an individual originating from the different inferred Populations

9. SPAGeDi Relative kinship genetic relationship analysis(Braulio et al., 2012)

Page 27: Association mapping  for improvement of agronomic traits in rice

Advantages of AM

1. Saves time, effort, and cost needed for the development of specific

mapping populations.

2. The QTL-linked markers identified by AM can be directly used for MAS

3. AM has high resolution

4. AM would assess the entire range of diversity in the trait of interest

5. Associated markers identified during AM can be used for either selection

of parents for hybridization or for selection of desirable segregants

Page 28: Association mapping  for improvement of agronomic traits in rice

Disadvantages of AM

1. The results from AM are affected by several factors like selection

history, population structure, kinship, etc., may lead to false positive

association

2. Large number (hundreds of thousands or even millions) of markers

would be required to adequately cover the entire genome.

3. High quality phenotypic data required (Multiple environment with multi

location)

4. The rate of recombination is not uniform throughout the genome.

Page 29: Association mapping  for improvement of agronomic traits in rice

Need of Association mapping in Rice

• Rice (Oryza sativa) is a staple food that feeds 3 billion people.

• Largest variability among germplasm and genomic database as

compared to any other species.

• All the agronomic traits in rice (Grain yield, Days to maturity,

Height, etc.) have quantitative inheritance.

• Challenge and opportunity is to utilize this information to

understand and predict how genotypic variation gives rise to the

abundance of phenotypic variation and its utilization in MAS.

Page 30: Association mapping  for improvement of agronomic traits in rice

Association mapping studies in RiceAssociation mapping studies in Rice

Population Sample Size

Markers used

Trait Reference

Germplasm 523 5291 SNPs 12 agronomic traits (Qing et al., 2015)

Diverse accessions 203 154 SSRs Trait of Harvest Index (Li et al., 2012)

Diverse rice accessions

383 44,000 SNPs Aluminum Tolerance (Famoso et al., 2011)

Diverse accessions 413 44K SNPs Agronomic traits (Zhao et al., 2011)

Diverse accessions 210 86 SSRs yield and grain quality

(Borba et al., 2010)

Landraces 517 3,625,200 SNPs

14 agronomic traits (Xuehui et al., 2010)

Mini core collection

90 108 SSR stigma and spikelet characteristics

(Yan et al., 2009)

Diverse accessions 103 123 SSRs Yield and its components

(Agrama et al., 2007)

Page 31: Association mapping  for improvement of agronomic traits in rice

517 landraces were phenotyped and genotyped by sequencing using

Illumina Genome Analyzer II

Aligned sequence reads to the rice reference genome for SNP

identification

Discrepancies with rice reference genome were called as candidate

SNPs.

Case Study

Page 32: Association mapping  for improvement of agronomic traits in rice

A total of 3,625,200 SNPs were identified, resulting in an average of 9.32

SNPs per kb, with 87.9% of the SNPs located within 0.2 kb of the nearest SNP

A total of 167,514 SNPs were found in the coding regions of 25,409 genes.

3,625 large-effect SNPs (representing mutations predicted to cause large

effects) were identified.

Principal-component analysis

seperated rice germaplasm in two

groups i.e. indica and japonica.

Further both indica and japonica had three subgroups.

Page 33: Association mapping  for improvement of agronomic traits in rice

Because of strong population differentiation between the two

subspecies of cultivated rice GWAS was conducted only for 373

indica lines using mixed linear model (MLM)

80 associations for the 14 agronomic traits were identified.

Heading date strongly correlated with both population structure and

geographic distribution.

Page 34: Association mapping  for improvement of agronomic traits in rice
Page 35: Association mapping  for improvement of agronomic traits in rice

• Ultimate aim of plant breeding is prediction of phenotype from

genotype

• Major agricultural economic traits are of complex nature

• It is desperate to dissect these complex traits and assign them function

• Advanced genomic tools like association mapping will be a valuable

option can be effectively and efficiently utilized to accelerate crop

improvement

• Association mapping is long term commitment, so have all the things

and then go for it

Conclusion

Page 36: Association mapping  for improvement of agronomic traits in rice