phylo t2 2012 - dalhousie universityawarnach.mathstat.dal.ca/~joeb/biol3046/pdfs/slides/... ·...

18
12-11-06 1 Phylogenetics 2: Phylogenetic and genealogical homology Phylogenies distinguish homology from similarity Analogy Homology Polarity Ancestral character Derived character

Upload: others

Post on 04-Feb-2021

8 views

Category:

Documents


0 download

TRANSCRIPT

  • 12-11-06

    1

    Phylogenetics 2:

    Phylogenetic and genealogical homology

    Phylogenies distinguish homology from similarity

    •  Analogy •  Homology

    •  Polarity •  Ancestral character •  Derived character

  • 12-11-06

    2

    human cow rabbit rat opossum GTG CTG TCT CCT GCC GAC AAG ACC AAC GTC AAG GCC GCC TGG GGC AAG GTT GGC GCG CAC ... ... ... G.C ... ... ... T.. ..T ... ... ... ... ... ... ... ... ... .GC A.. ... ... ... ..C ..T ... ... ... ... A.. ... A.T ... ... .AA ... A.C ... AGC ... ... ..C ... G.A .AT ... ..A ... ... A.. ... AA. TG. ... ..G ... A.. ..T .GC ..T ... ..C ..G GA. ..T ... ... ..T C.. ..G ..A ... AT. ... ..T ... ..G ..A .GC ... GCT GGC GAG TAT GGT GCG GAG GCC CTG GAG AGG ATG TTC CTG TCC TTC CCC ACC ACC AAG ... ..A .CT ... ..C ..A ... ..T ... ... ... ... ... ... AG. ... ... ... ... ... .G. ... ... ... ..C ..C ... ... G.. ... ... ... ... T.. GG. ... ... ... ... ... .G. ..T ..A ... ..C .A. ... ... ..A C.. ... ... ... GCT G.. ... ... ... ... ... ..C ..T .CC ..C .CA ..T ..A ..T ..T .CC ..A .CC ... ..C ... ... ... ..T ... ..A ACC TAC TTC CCG CAC TTC GAC CTG AGC CAC GGC TCT GCC CAG GTT AAG GGC CAC GGC AAG ... ... ... ..C ... ... ... ... ... ... ... ..G ... ... ..C ... ... ... ... G.. ... ... ... ..C ... ... ... T.C .C. ... ... ... .AG ... A.C ..A .C. ... ... ... ... ... ... T.T ... A.T ..T G.A ... .C. ... ... ... ... ..C ... .CT ... ... ... ..T ... ... ..C ... ... ... ... TC. .C. ... ..C ... ... A.C C.. ..T ..T ..T ...

    Alignment of mammalian beta-globin gene sequences

    Homology: similarity among two or more individuals or lineages in a feature/character, or character-state, that is the result of inheritance from a common ancestor

    Homologous character verses Homologous character state

    ACG TAC TAA

    ACG TAT TAA

    ACG TAT TAA

    C

    T

    ACG TAC TAA

    ACG TAT TAA

    ACG TAT TAA

    C

    T

    Molecular evolution: positional homology

    DNA alignment

  • 12-11-06

    3

    Phylogenies distinguish homology from similarity

    ACG TAC TAA

    ACG TAT TAA

    ACG TAT TAA

    C

    T

    Phylogenetic perspective on homologous characters and homologous character states

    Ancestral character states

    ACG TAT TAA

    ACG TAC TAA

    ACG TAC TAA

    C

    C

    Ancestral character states

    C ⇒ T

    SYNAPOMORPHY: a shared derived character state in two or more lineages. These must be homologous in state. This represents a true phylogenetic similarity

    APOMORPHY: a derived character state unique to a single lineages

    Implicit in the above alignment is the assumption of positional homology for the “red” position above. At this position C and T are non-homologous character states. Note that the pair of T’s in the first example, and the pair of C’s in the second example represent homologous character states.

    Phylogenies distinguish homology from similarity

    It is possible for character-states to be identical but non-homologous (Homoplasy)

    1.  Convergence

    2.  Reversal

    3.  Parallelism

    Homoplsy arises in molecular datasets when multiple substitutions occur at a single site.

    - “multiple substitutions”

    - “multiple hits”

    - “superimposed substitutions”

  • 12-11-06

    4

    Phylogenies distinguish homology from similarity: convergence

    ACG GAT TAA

    ACG GAG TAA

    ACG GAA TAA

    T

    G G ⇒ A

    T

    ACG GAA TAA T ⇒ C ⇒ A

    Convergent (non-phylogenetic) similarity of nucleotide character states; i.e., homoplasy

    Phylogenies distinguish homology from similarity: reversal

    ACG GAA TAA

    ACG GAT TAA

    ACG GAT TAA

    A

    T

    A

    ACG GAA TAA A ⇒ C ⇒ A

    Reversal leads to non-homologous similarities in character state.

  • 12-11-06

    5

    Phylogenies distinguish homology from similarity: parallelism

    ACG GAA TAA

    ACG GAT TAA

    ACG GAA TAA

    A

    A

    A

    ACG GAT TAA A ⇒ T

    Parallel evolution leads to non-homologous similarities in character states.

    A ⇒ T

    Evolutionary dissociation

    “A change in linkage or effects that particular traits from different levels of biological organization have on each other over evolutionary time”

    ⎯ Meyer and Mindell (2001)

  • 12-11-06

    6

    Evolutionary dissociation

    Adapted from: Mindell and Meyer (2001) Trends Ecol Evol. (16) 434-440.

    Dissociation events

    GENE A is homologous as a molecular character in all five lineages GENE A’s functional role in development also is homologous in all five lineages The “character state” of GENE A is NOT homologous in lineage 5 as compared with lineages 1 and 2 even though they have the same role! This is another example of homologous characters with non-homologous character states.

    The phylogenetic concept of homology allows distinction between homologous characters and non-homologous characters states in the case of evolutionary dissociation.

    Role 2: morphologically similar structures with different molecular evolutionary basis

    Evolutionary dissociation

    Homologous genes control the basis for non-homologous structures. Apparently, Genetic co-option has occurred. Gene duplication provides one mechanism; non-homologous character states within homologous positions of a gene provides another.

    Butterfly embryo Sea urchin larva

  • 12-11-06

    7

    Alternatives to phylogenetic concept of homology

    Serial Homology refers to repeated structural units of morphology of an organism.

    •  It seems reasonable that such units are derived from homologous genes or developmental processes; whether this assumption turns out to be generally applicable to morphological features exhibiting serial homology has not yet been demonstrated.

    Functional homology is generally used when the functional attributes of organisms are similar due to shared ancestry.

    •  However, due to a long history of using this term to denote simple similarity, one should not assume this usage unless unequivocally stated.

    Biological homology is defined by Wagner (1989) as morphological structures that share “developmental constraints, caused by locally acting self-regulating mechanisms or organ differentiation”.

    •  Although phylogeny can be used to help diagnose such homology, the definition does not explicitly include the condition of shared ancestry.

    Alternatives to phylogenetic concept of homology

    percent homology

    For DNA (or amino acid) sequences use percent similarity

  • 12-11-06

    8

    Trees within trees

    A B

    C D

    E F

    I

    Conventional representation

    A BB

    C D

    E F

    I

    Conventional representation

    A

    C D

    E F

    I

    C D

    E F

    I

    B

    Path 1 Path 2

    A

    C D

    E F

    I

    AA

    CC DD

    EE FF

    II

    C D

    E F

    I

    B

    C D

    E F

    I

    BB

    Path 1 Path 2

    Trees within trees: reticulation

  • 12-11-06

    9

    Trees within trees

    Spe

    cies

    1

    Spe

    cies

    2

    Spe

    cies

    3

    Spe

    cies

    4

    Trees within trees

    Spe

    cies

    1

    Spe

    cies

    2

    Spe

    cies

    3

    Spe

    cies

    4

  • 12-11-06

    10

    Trees within trees

    Spe

    cies

    1

    Spe

    cies

    2

    Spe

    cies

    3

    Spe

    cies

    4

    Trees within trees

    Spe

    cies

    1

    Spe

    cies

    2

    Spe

    cies

    3

    Spe

    cies

    4

  • 12-11-06

    11

    Trees within trees

    Spe

    cies

    1

    Spe

    cies

    2

    Spe

    cies

    3

    Spe

    cies

    4

    Population genetics: coalescent theory to study populations

    Trees within trees

    time

    Drift

    Selection

    Population history

  • 12-11-06

    12

    Trees within trees

    Polymorphism and substitution (highly simplified) along a branch of a phylogeny

    Time

    Residence time: the time that a particular neutral polymorphism is present in a population.

    Mean residence time is determined by effective population size (Ne)

    Population substitution 1 Population substitution 2 Population substitution 3

    Coalescent

    GAC

    GAT

    GAT

    A

    T

    A

    GAG A ⇒ C ⇒ A ⇒ G

    A ⇒ C C ⇒ A A ⇒ G

    Forms of homology (for genes)

    1. ORTHOLOGY. Orthologous genes are derived from the divergence of an organismal lineage; i.e., a speciation event. Thus if we look at orthologs on a phylogeny we see that their most recent common ancestor represents the coalescence of two organismal lineages.

    2. PARALOGY. Paralogous genes are derived from the divergence event within a genomes; i.e., a gene

    duplication event. In this case if we look at paralogs on a phylogeny we see that they coalesce at a gene duplication event.

    3. PRO-ORTHOLOGY. A gene is pro-orthologous to another gene if they coalesce at a speciation event that

    predates a gene duplication event. Thus a single-copy gene in organism A is pro-orthologous to a gene that is present in multiple copies in organism B due to gene duplication events that followed the divergence of organisms A and B.

    4. SEMI-ORTHOLGY. This is a term that simply takes the reverse perspective of pro-orthology. Any one of the

    multi-copy genes in the genome of organism B is said to be semi-orthologous to a single copy gene in the genome of organism A, if the most recent common ancestors of those genes coalesce at a point in time that predates the gene duplication event.

    5. PARTIAL HOMOLOGY. This refers to the situation that arises when the evolutionary histories of different

    segments within the same gene coalesce at different ancestors. This can arise from evolutionary processes such as homologous recombination or exon shuffling.

    6. GAMETOLOGY. Gametologs coalesce at an event that isolated those genes on opposite sex

    chromosomes; i.e., they coalesce at the point when they became isolated from the process of recombination.

    7. XENOLOGY Genes that coalesce at either a speciation or duplication event, but whose evolutionary

    histories do not fit with that of the organismal lineages which carry such genes due to one or more lateral gene transfer events.

    8. SINOLOGY. Homologous genes found within the same organism’s genome have different

    evolutionary histories due to the fusion of formerly evolutionarily independent genomes, such as in endosymbiosis.

  • 12-11-06

    13

    The mammalian Ldh-A and Ldh-C gene family is used as an example to illustrate the various forms of homology (ORTHOLOGY, PAROLOGY, PRO-ORTHOLOGY, and SEMI-ORTHOLOGY) that are important when dealing with gene families.

    Pro-orthologs pre-date the involved duplication event

    Examples of different types of homology in gene families: Homo and Rattus (rat) Ldh-C are orthologous. Homo Ldh-C and Homo Ldh-A are paralogous. Homo Ldh-C and Rattus Ldh-A also are paralogous. Gallus (chicken) Ldh-A is pro-orthologous to both Homo Ldh-C and Homo Ldh-A. Homo Ldh-C is semi-orthologous to the Gallus Ldh-A. All mammalian Ldh-A genes are semi-orthologous to the non-mammalian Ldh-A. Note that the gene duplication that gave rise to the Ldh-C gene is specific to an ancestor of all present-day mammals. Mammalian Ldh-C and Ldh-A genes are paralogous. All Ldh-C and Ldh-A are homologous.

    Mus

    Cricetinae

    Homo

    Gallus

    Sceloporus

    Rattus

    Sus

    Homo

    Sus

    Rabbit

    Mus

    Rattus

    Ldh-A

    Ldh-C

    Gene duplication event

    Organism history can different from gene history

    Selected examples (in notes):

    •  Birth-death evolution in gene families

    •  Trans-species evolution •  Recombination

    •  Lateral gene transfer (LGT)

    Other sources:

    •  Statistical error

    •  Human error

  • 12-11-06

    14

    1. Ancestral polymorphism

    1. Ancestral polymorphism: residence times

    •  rate to fixation [under drift] slows with increasing in Ne •  ultimate fate is fixation or loss

    •  Larger Ne yield larger residence time of a polymorphism in a population

    If we run this simulation long enough it will go to fixation or loss; it just takes much longer

  • 12-11-06

    15

    1. Ancestral polymorphism

    What happens depends on Ne!

    1. Ancestral polymorphism

    t 2

    Tim

    e

    t 1

    Likelihood of non-phylogenetic lineage sorting:

    1.  Residence time (Ne)

    2.  Time between successive splitting events (t)

    Other terms:

    1.  Lineage sorting

    2.  Incomplete lineage sorting

  • 12-11-06

    16

    2. Birth-death evolution in gene families

    Tandem array of 2 genes in the ancestral species

    (−)

    (−)

    (+)

    (+)

    (+)

    (−) (−)

    1 2

    A1 A2 B1 B2 C1 C2

    Gene “Birth”: (+)Gene “Death”: (-)

    time

    Tandem array of 2 genes in the ancestral species

    (−)

    (−)

    (+)

    (+)

    (+)

    (−) (−)

    1 21 2

    A1 A2A1 A2 B1 B2B1 B2 C1 C2C1 C2

    Gene “Birth”: (+)Gene “Death”: (-)

    time

    A B C

    A1 B1 C1

    Species tree

    Gene (1) tree

    A B C

    A1 B1 C1

    Species tree

    Gene (1) treeSpecies A Species B

    Species C

    Gene 1 Gene 2

    3. Trans-species evolution

  • 12-11-06

    17

    3. Recombination

    A B C D

    a b c d

    A B C D

    a B C d

    Gene conversion between the same gene

    A B

    a b

    a B

    A b

    Parental contribution Resultant offspring

    Gene conversion between different genes

    Gene A Gene A Gene A

    Gene B

    Among alleles within an individual: think about patterns of population coalescence

    Among alleles within an individual: this is a nonreciprocal exchange

    Among genes (paralogs) within an individual: this is also a nonreciprocal exchange. Remember that different genes can have different phylogenetic histories

    4. Lateral gene transfer (LGT)

    time

    a b c

    A B C

    a b c

    Species tree

    Gene tree

  • 12-11-06

    18

    Organism history can different from gene history

    Selected examples (in notes):

    •  Birth-death evolution in gene families

    •  Trans-species evolution •  Recombination

    •  Lateral gene transfer (LGT)

    Other sources:

    •  Statistical error

    •  Human error

    Human error: What happens if you mistake paralogous genes for orthologous genes?

    Mus

    Cricetinae

    Homo

    Gallus Sceloporus

    Rattus

    Sus Homo

    Sus

    Rabbit Mus Rattus

    Ldh - A

    Ldh - C Gene duplication

    event Ldh–A gene to infer the relationships of mammals:

    Data set:

    Mus (Opps! Ldh-C)

    Sus (Ldh-A)

    Homo (Ldh-A)

    Rabbit (Ldh-A)

    Rattus (Ldh-A)

    Gallus (Ldh-A)

    Sceloporus (Ldh-A)