basics of aflp and usat - yale university of aflp and usat.pdf · amplified fragment length...

39
Basics of AFLP and microsatellite analysis

Upload: others

Post on 21-Oct-2019

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Basics of AFLP and

microsatellite analysis

Page 2: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Amplified Fragment Length Polymorphism

Pros:

Large number of markers with relatively little lab effort

No prior information about genome needed

Genome wide overage

Small amount of DNA needed

Cons:

Markers are dominant (i.e. heterozygotes are scores as homozygotes)

Can be tedious to score

Size homoplasy

Reproducibility?

Page 3: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

STEP 1: Restriction-Ligation

Page 4: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

EcoRI PRE-SELECTIVE PRIMER

MseI PRE-SELECTIVE PRIMER

GTAGACTGCGTACC AATT CA

CA AT GAGTCCTGAGTA

STEP 2: Pre-selective PCR

Page 5: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

SELECTIVE PRIMERSELECTIVE PRIMER

GTAGACTGCGTACC AATT CACT

GACA AT GAGTCCTGAGTA

GTAGACTGCGTACC AATT CA

CA AT GAGTCCTGAGTA

EcoRI SELECTIVE PRIMER (labeled)

MseI SELECTIVE PRIMER

STEP 3: Selective PCR

FAM

Page 6: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

MseI

MseI

MseI

MseI MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

MseI

EcoRI

EcoRI

EcoRI

MseI

EcoRI

MseI

EcoRI

MseI

EcoRI

MseI

EcoRI: 6bp cutter --> one cut every 4096 bp

MseI: 4bp cutter --> one cut every 256 bp

Selective PCR product contains many unlabeledfragments that will not be visible on ABI

Page 7: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Number of bands in AFLP profileis determined by

1 Genome size: larger genome ---> more bands

2 Number of selective nucleotides in selective primers

3 Dilution of PCR product Low (noise) peaks get magnified

Why optimize number of bands?

1 Size homoplasy !!!!!

2 Difficult to score

Page 8: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

EcoR1-AGT MseI-CGTEcoR1-AGC MseI-CGA

MseI-CGCMseI-CGGetc.

MseI-CGTG

MseI-CG

Choosing selective primer combinations

An additionalnucleotide reducesnumber of peaks 4-fold

One less nucleotideincreases number of peaks 4-fold

Use few of these(expensive),

but allows use of multiple colors(multiplex run on ABI)

Use many of these to get enough markers (cheap)

And use these to optimize number of bands

Page 9: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Reproducibility

High reproducibility has generally been reported

However, DNA quality is crucial component (use same DNA extraction protocol for all samples!)

Assess quality of data byrepeating several samples from scratch

i.e. starting with DNA extraction

Page 10: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Note: Genome size is correlated with noise level

Around 20% of primer combinations provide profiles that are suitable for high throughput genotyping.

1 Well separated peaks

2 Right number of peaks

2 Little noise

3 Peaks are distributed across size range

4 High level of Polymorphism

Ideal AFLP profile

Page 11: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

A very fine example

Page 12: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Too many peaks

Page 13: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Optimizing AFLP reactions

1 DNA quality

2 DNA quality A successful AFLP analyses depends crucially on this

3 DNA quality

4 Increase restriction time to 2 hours

5 Increase ligation time to 16 hours

6 Use fresh T4 ligase

7 Increase amount of DNA (rest-lig) added to pre-selective PCR (15 ul DNA’ in 50ul reaction)

8 Reduce amount of DNA in Selective PCR

9 Increase amount of cycles in Selective PCR

10 Increase amount of TAQ in Selective PCR

11 Several people have reported better results with TaqI vs MseI

(but this requires different adaptors)

Page 14: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Scoring AFLP profiles

Normalize samples: Arbitrary cut-off peak height has to beused and this needs to be relative since different samples have different intensity.

Set high cut-off for inclusion as marker (that is, at least one individual has to have this cut-off peak height), then reduce peak height for scoring the presence/absence for remainder of individuals.

In Genemapper do not use auto-bin option. Make your own bins

Analyze all samples for the same primer set in the same project. This allowsyou to assess the reliability of the marker by scrolling across samples. Also prevents you from including non-polymorphic markers. Also, normalization performed on all samples at the same time.

Do not include peaks that do not show clear presence or absence in most cases.

Score blindly to avoid bias.

Check for overflow from different dye

Page 15: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Normalization

Page 16: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Genemapper

Freeware for scoring AFLPfrom ABI runs:

Genographer v 1.6

GenoProfiler 2.0

Page 17: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

A few population genetic programs for AFLPanalyses

RAPDFst: Fst (Lynch and Milligam, 1994)

MVSP, NTSYS: Jaccard coeficient, Nei and Li (1979)

Arlequin, TFPGA: Amova

Genalex: Φst, analog of Fst, Amova

Structure, BAPS: inference of population structure.

Hickory: Bayesian estimation of F statistics for dominant markers

Page 18: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

A few population genetic programs for AFLPanalyses

RAPDFst: Fst (Lynch and Milligam, 1994)

MVSP, NTSYS: Jaccard coeficient, Nei and Li (1979)

Arlequin, TFPGA: Amova

Genalex: Φst, analog of Fst, Amova

Structure, BAPS: inference of population structure.

Hickory: Bayesian estimation of F statistics for dominant markers

Assumes H-W equilibrium

Page 19: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

A few population genetic programs for AFLPanalyses

RAPDFst: Fst (Lynch and Milligam, 1994)

MVSP, NTSYS: Jaccard coeficient, Nei and Li (1979)

Arlequin, TFPGA: Amova

Genalex: Φst, analog of Fst, Amova

Structure, BAPS: inference of population structure.

Hickory: Bayesian estimation of F statistics for dominant markers

Treats multilocus data as single haplotype

Assumes H-W equilibrium

Page 20: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

A few population genetic programs for AFLPanalyses

RAPDFst: Fst (Lynch and Milligam, 1994)

MVSP, NTSYS: Jaccard coeficient, Nei and Li (1979)

Arlequin, TFPGA: Amova

Genalex: Φst, analog of Fst, Amova

Structure, BAPS: inference of population structure.

Hickory: Bayesian estimation of F statistics for dominant markers

Assumes H-W equilibrium

Treats multilocus data as single haplotype

No assumption of H-W equilibrium

Low information content

Page 21: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Microsatellites

* Di- or tri-nuleotide repeats

* Ubiquitous

* High mutation rate (102-106)

High level of variability

Page 22: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Mutational mechanismSlippage during replication

(also happens during PCR)

ACCGAGTCGATCGTGTGTGTGTGTGTGTGTACGCTATGGCTCAGCTAGCACACACACACAC

ACCGAGTCGATCGTGTGTG TGTGTGTGTGTACGCTATGGCTCAGCTAGCACACAC ACACACACACATGCGAT

CA

Slippage increases with number of repeats

Reduces or decreases number of repeats

Page 23: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Obtaining Microsatellites

• Screening sequenced genomes

• Screening enriched genomic library

Glenn and Schable (2005) Methods in Enzymology 395: 202-222.

This paper is particularly useful. It comes from a Lab that has isolated microsatellites from 125+ species

Page 24: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

SELECTING LOCI

Too few repeats Low variability

Too many repeats Difficult to score, Homoplasy

Choosing loci:• 8 - 20 repeats• uninterrupted repeats

Screening of loci:

•Number of alleles Cloning pool of PCR amplicons, followed bylabeled PCR

•Heterozygosity, allelic richness

M13 labeled primers

Page 25: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

M13 tailed primer

Forward primer

Reverse primer

M13-tail

Forward primer Reverse primer

M13 primer

Forward primer

FAM

(Low concentration)

Boutin-Ganache et al (2001) Biotechniques 31, 26-28

Page 26: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

Great looking heterozygote

Page 27: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

Extra peak because of partial A overhang addition of Taq

Stutter bands of the two high peaks due to slippage

Page 28: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

Heterozygote

Page 29: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

A single large allele with many repeatsLots of slippage

Page 30: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

35 repeats

Some scoring issues

Increase in slippage with increase in repeat number

Page 31: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

How many alleles?

Page 32: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

Find a heterozygote that clearlyshows the shape of a single allele

Page 33: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

The alleles

Page 34: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

Electrophoresis artifacts

(Fernando et al (2001) Mol. Ecol. Notes 1, 325-328)

The figures shows the difference in peak shape of the samePCR products loaded at different concentration

Page 35: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some scoring issues

Electrophoresis artifacts

(Fernando et al (2001) Mol. Ecol. Notes 1, 325-328)

Do not overload your gel !

Also keep in mind that in different PCR’s the left peak or the right peak may be dominant

Page 36: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Optimizing PCR

Avoid Null Alleles (or try to)• Minimize annealing temp lowest temp that produces

clean bands• MgCl2 concentration increase reduces specificity• Different species design new primers (if possible)

(In my limited experience with cross species amplification null alleles can be big problem)

Reduce stutter:• Reduce number of cycles• Reduce amount of MgCl2• Touchdown PCR• 2/2/8 PCR (2 sec denat, 2 sec anneal, 8 sec extens.)• BSA, DMSO

Addition of A• Increase final extension time• Add Pigtail (GTTTCTT) on 5’end of reverse primer to

facilitate addition of A overhang

Seems to be most successfull

Page 37: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Analysis Issues

Null alleles Are loci in HW equilibrium?

Linkage disequilibrium?

Possible solutions:

Remove loci from analysis (if enough loci are available)

Check if HW disequilibrium influences results bytemporarily removing affected loci.

Adjust allele and genotype frequencies (Microchecker)

Microsats biggest problem Population subdivision causes both. Null alleles only cause HW disequilibrium.

Page 38: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Some population genetics software

Microsatellite toolkit: Excel plug-in for creating Arlequin, FSTAT and Genepop files.

Microchecker: Estimate null allele frequency. Adjust allele frequencies.

Arlequin: HW equilibrium, Linkage Disequilibrium, Fst, exact test of differentiation, Amova, Mantel test

FSTAT: Allelic richness, Fst per locus (to check contribution of each locus to observed pattern of differentiation)

Structure, BAPS: Population structuring, population assignment.

Migrate: Estimates of effective population size and migration rates

Bottleneck: Check for very recent population bottlenecks

Page 39: Basics of AFLP and usat - Yale University of AFLP and usat.pdf · Amplified Fragment Length Polymorphism Pros: Large number of markers with relatively little lab effort No prior information

Thanks to

Chaz, Kristen, Deborah and Bob Marra