lecture 19: mutation, selection, and neutral theory november 2, 2015

Lecture 19 : Mutation, Selection, and Neutral Theory

November 2, 2015

Last Time

Mutation introduction

Mutation-reversion equilibrium

Mutation and drift

Today

Mutation and selection

Introduction to neutral theory

Exam

Mutation-Selection Balance

Equilibrium occurs when creation of mutant allele is balanced by selection against that allele

For a recessive mutation:

pqmu

0 smu qq

At equilibrium:

2

2

1 sq

psqp

sqeq

sqeq

2

2

2

1 sq

psqqs

assuming: 1-sq21

sqeq

What is the equilibrium allele frequency of a recessive lethal with no mutation in a large (but finite) population?

What happens with increased forward mutation rate from wild-type allele?

How about reduced selection?

Balance Between Mutation and Selection

Recessive lethal allele with s=0.2 and μ=10-5

Muller’s Ratchet

Deleterious mutations accumulate in haploid or asexual lineages

Driving force for evolution of recombination and sex

Question:

Do most mutations cause reduced fitness?

Why or why not?

Relative Abundance of Mutation Types

Most mutations are neutral or ‘Nearly Neutral’

A smaller fraction are lethal or slightly deleterious (reducing fitness)

A small minority are advantageous

Types of Mutations (Polymorphisms)

First and second position SNP often changes amino acid

UCA, UCU, UCG, and UCC all code for Serine

Third position SNP often synonymous

Majority of positions are nonsynonymous

Not all amino acid changes affect fitness: allozymes

Synonymous versus Nonsynonymous SNP

Nuclear Genome Size Size of nuclear genomes varies

tremendously among organisms

Weak association with organismal complexity, especially within kingdoms

Arabidopsis thaliana 120 MbpPoplar 460 MbpRice 450 Mbp Maize 2,500 Mbp Barley 5,000 MbpHexaploid wheat 16,000 MbpFritillaria (lilly family) >87,000 Mbp

Noncoding DNA accounts for majority of genome in many eukaryotesG

enic

Fra

ction

(%)

Genome Size (x109 bp)

What is the probability of a mutation hitting a coding region in humans? Assumptions?

Lynch (2007) Origins of Genome

Architecture

Composition of the Human Genome

Classical-Balance Fisher focused on the dynamics of allelic forms of genes,

importance of selection in determining variation: argued that selection would quickly homogenize populations (Classical view)

Wright focused more on processes of genetic drift and gene flow, argued that diversity was likely to be quite high (Balance view)

Problem: no way to accurately assess level of genetic variation in populations! Morphological traits hide variation, or exaggerate it.

Molecular Markers Emergence of enzyme electrophoresis in mid 1960’s

revolutionized population genetics

Revealed unexpectedly high levels of genetic variation in natural populations

Classical school was wrong: purifying selection does not predominate

Initially tried to explain with Balancing Selection

Deleterious homozygotes create too much fitness burden

22

211 qspsi

mi for m loci

The rise of Neutral Theory Abundant genetic variation exists, but perhaps not driven by

balancing or diversifying selection: selectionists find a new foe: Neutralists!

Neutral Theory (1968): most genetic mutations are neutral with respect to each other

Deleterious mutations quickly eliminated

Advantageous mutations extremely rare

Most observed variation is selectively neutral

Drift predominates when s<1/(2N)

Infinite Alleles Model (Crow and Kimura Model)

Each mutation creates a completely new allele

Alleles are lost by drift and gained by mutation: a balance occurs

Is this realistic?

Average human protein contains about 300 amino acids (900 nucleotides)

Number of possible mutant forms of a gene:

542900 1014.74 xn

If all mutations are equally probable, what is the chance of getting same mutation twice?

Infinite Alleles Model (IAM: Crow and Kimura Model)

Homozygosity will be a function of mutation and probability of fixation of new mutants

21 )1()

2

11(

2

1

t

eet f

NNf

Probability of sampling same allele twice

Probability of sampling two alleles identical by

descent due to inbreeding in ancestors

Probability neither allele mutates

Expected Heterozygosity with Mutation-Drift Equilibrium under IAM

At equilibrium ft = ft-1=feq

Previous equation reduces to:

214

21

e

eq Nf

Ignoring μ2

14

4

e

ee N

NH

Remembering that H=1-f:4Neμ is called the

population mutation rate

21 )1()

2

11(

2

1

t

eet f

NNf

14

1

eeq Nf

Ignoring 2μ

4Neμ often symbolized by Θ

Equilibrium Heterozygosity under IAM

Frequencies of individual alleles are constantly changing

Balance between loss and gain is maintained

4Neμ>>1: mutation predominates, new mutants persist, H is high

4Neμ<<1: drift dominates: new mutants quickly eliminated, H is low

Effects of Population Size on Expected Heterozgyosity Under Infinite Alleles Model (μ=10-5)

Rapid approach to equilibrium in small populations

Higher heterozygosity with less drift

Stepwise Mutation Model Do all loci conform to Infinite Alleles Model?

Are mutations from one state to another equally probable?

Consider microsatellite loci: small insertions/deletions more likely than large ones?

14

4

e

ee N

NH

IAM:

)18(

11

ee

NH

SMM:

Which should have higher produce He,the Infinite Alleles Model, or the Stepwise Mutation Model, given equal Ne and μ?

14

4

e

ee N

NH

IAM:

)18(

11

ee

NH

SMM:

Plug numbers into the equations to see how they behave. e.g, for Neμ = 1, He = 0.66 for SMM and 0.8 for IAM

Expected Heterozygosity Under Neutrality Direct assessment of neutral

theory based on expected heterozygosity if neutrality predominates (based on a given mutation model)

Allozymes show lower heterozygosity than expected under strict neutrality

Why?

Avise 2004

Observed

1

eH

Neutral Expectations and Microsatellite Evolution

Comparison of Neμ (Θ) for 216 microsatellites on human X chromosome versus 5048 autosomal loci

Only 3 X chromosomes for every 4 autosomes in the population

Ne of X expected to be 25% less than Ne of autosomes:

θX/θA=0.75

AutosomesX

X chromosome

Correct model for microsatellite evolution is a combination of IAM

and StepwiseWhy is Θ higher for autosomes?

Observed ratio of ΘX/ΘA was 0.8 for Infinite Alleles Model and 0.71 for Stepwise model

Sequence Evolution

DNA or protein sequences in different taxa trace back to a common ancestral sequence

Divergence of neutral loci is a function of the combination of mutation and fixation by genetic drift

Sequence differences are an index of time since divergence

Molecular Clock If neutrality prevails, nucleotide divergence between two sequences should be

a function entirely of mutation rate

1

t

Expected Time Until Fixation of a New Mutation:

Since μ is number of substitutions per unit time

Time since divergence should therefore be the reciprocal of the estimated mutation rate

Probability of creation of new

alleles

Probability of fixation of new

alleles

Variation in Molecular Clock If neutrality prevails, nucleotide divergence between two sequences should

be a function entirely of mutation rate

So why are rates of substitution so different for different classes of genes?

Exam 2 Results: 86.8% Avg, 9.8% Std Dev.

lecture 19: mutation, selection, and neutral theory november 2, 2015

Documents

genetic mutations

recessive mutation

reduced selection

importance of selection

diversifying selection

otherdeleterious mutations

level of genetic variation

neutral theorynovember