introduction to molecular dating methods. principles ultrametricity: all descendants of any node are...

32
Introduction to molecular dating methods

Upload: riley-aslin

Post on 30-Mar-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Introduction to molecular dating methods

Page 2: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Principles

• Ultrametricity: All descendants of any node are equidistant from that node

• For extant species, branches, in units of time, are ultrametric

F

BA

10090 80 70 60 50 40 30 20 10

CD

E

Page 3: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Evolutionary branch length

• Expected number of substitutions/site = rate of change x branch duration

• Rate = 0.001 sub/site/Ma

• “True” length = 0.02

• Actual length ≈ 0.02

20 Ma

Page 4: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

What is a “molecular clock”?

a) All internodes have equal duration

b) All branches have equal rate of substitution

c) All tips are the same number of time units from the root

d) The expected number of substitutions per site is the same for all branches

e) The observed number of substitutions is the same for all descendants of a given node

Page 5: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

The molecular clock idea

• First proposed by Zuckerkandl and Pauling (1965) based on haemoglobin data

• If there is the same rate for all branches there will be a linear relationship between sequence distance and time since divergence

O

B

A

Page 6: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

If you know one divergence date then you can calculate others

Per

cent

seq

uenc

e di

verg

ence

Time since divergence

x

y

Page 7: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

If you know one divergence date then you can calculate others

Time since divergence

x

z

Per

cent

seq

uenc

e di

verg

ence

Page 8: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Issue 1: There will be error around the estimates

Per

cent

seq

uenc

e di

verg

ence

Time since divergence

x

z

Uncertainty in dating

Stochastic rate variation

Inferred age

Range

Page 9: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Issue 2: You need to correct for multiple hits

Per

cent

seq

uenc

e di

verg

ence

x

z

Assumed re

lationship

Actual relationship

Inferred ageActual age

Page 10: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Issue 3: Is evolution clock-like?

Page 11: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Issue 3: Is evolution clock-like?

Local clock: clade-specific rates

Page 12: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Issue 3: Is evolution clock-like?

No clock: rates vary greatly

Page 13: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Why should we expect a clock?

• Under neutral evolution: but that is too fast for most (all?) data sets

• If there is reasonable constancy of population size, mutation rate, and patterns of selection

• We can hope that rates of evolution change slowly and/or rarely

Page 14: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

The likelihood approach

• Consider two models of evolution– The usual model– The same model but

• A root is specified

• The summed branch lengths from any node to all descendants of that node are the same

• Do a likelihood ratio test

Which is the simpler model?

Page 15: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

How many degrees of freedom?

• Depends on the number of taxa (n)

• Branch length parameters in the non-clock model = 2n - 3

• Branch length parameters in the clock model = n - 1

• Difference = (2n - 3) - (n - 1) = n - 2

Page 16: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

If a clock model is not rejected• Calculate rates and then extrapolate from known

to unknown pairwise distances

DOA = 0.4 ; DAB = 0.1TOA = 90 ; TAB = (0.1/0.4) x 90 = 22.5 Ma

O A B

0.05 0.05

0.195

0.2

90

22.5

Page 17: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Should obtain confidence intervals around date estimates• Look at the curvature of the likelihood

surface (can be done with PAML)

• Use bootstrapping (parametric or non-parametric)– Generate multiple pseudoreplicate data sets– For each data set calculate relative nodal ages– Discard the upper and lower 2.5%

Page 18: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Calibrating the tree

• How does one attach a date to an internal node? How old is the fossil? Where does a fossil fit on the tree?

Page 19: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Calibrating the tree

• How does one attach a date to an internal node? How old is the fossil? Where does a fossil fit on the tree?

F (90 Ma)

Page 20: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

What does that tell us?

O

B

AF (90 Ma)

This node is at least 90 Ma

Page 21: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

What else?

O

B

AF

This node is at least 90 Ma

This node is more than 90 Ma

Page 22: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

The lineage leading to F could have been missed

O

B

AF

This node is at least 90 Ma

Page 23: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

General issues

• Fossils generally provide only minimal ages• The age is attached to the node below the

lowest place on the tree that the fossil could attach

• Maximal or absolute ages can only be asserted when there are lots of fossil data

• Geological events can sometimes be used to obtain minimal ages

Page 24: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

What if a clock is rejected?

• Until recently three (bad) choices– Give-up on molecular dating– Go ahead and use molecular dating anyway– Delete extra-fast or extra-slow taxa

• Now we have other options– Assume local clocks– Relaxed clock methods

Page 25: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Local clocks

Can use likelihood ratio tests to compare to strict clock and non-clock models

How many parameters?

Page 26: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Non-Parametric Rate-Smoothing(NPRS: Sanderson 1998)

a

d1

d2

The rate of branch a = ra= La/Ta

(L = branch length; T = time duration)

^

Node k

Page 27: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Non-Parametric Rate-Smoothing(NPRS: Sanderson 1998)

a

d1

d2

Measure of rate roughness = Rk = (ra - rd1)2 + (ra - rd2) 2^ ^ ^ ^

Node k

Page 28: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Non-Parametric Rate-Smoothing(NPRS: Sanderson 1998)

a

d1

d2

Adjust times so as to minimize overall roughness:

kRk=1

k= n−1

Page 29: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

NPRS

• Uses branch lengths only (ignores raw data)

• Quick and easy to do

• Assumes rate change is smooth

Page 30: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Penalized Likelihood(Sanderson 2001)

• Semi-parametric likelihood approach

• Uses raw data but penalizes the likelihood score by the roughness score, , weighted by a smoothness parameter ()

• Selects optimal value of using cross-validation (pick the value that minimizes the errors made in predicting branch lengths)

kRk=1

k= n−1

Page 31: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,
Page 32: Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,

Penalized Likelihood

• Uses more data than NPRS - more accurate

• More difficult to implement