functional prediction in proteins (purifying and positive selection)

50
1 Functional prediction Functional prediction in proteins in proteins (purifying and positive (purifying and positive selection) selection)

Upload: sutton

Post on 24-Jan-2016

36 views

Category:

Documents


0 download

DESCRIPTION

Functional prediction in proteins (purifying and positive selection). 1. Introduction: evolution & sequence analysis. Darwin – the theory of natural selection. Adaptive evolution : Favorable traits will become more frequent in the population. Adaptive evolution. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Functional prediction  in proteins  (purifying and positive selection)

11

Functional prediction Functional prediction in proteins in proteins

(purifying and positive (purifying and positive selection)selection)

Page 2: Functional prediction  in proteins  (purifying and positive selection)

22

1. Introduction: evolution 1. Introduction: evolution & sequence analysis& sequence analysis

Page 3: Functional prediction  in proteins  (purifying and positive selection)

33

Darwin – the theory of natural Darwin – the theory of natural selectionselection

Adaptive evolutionAdaptive evolution::

Favorable traits will become more Favorable traits will become more frequent in the populationfrequent in the population

Page 4: Functional prediction  in proteins  (purifying and positive selection)

44

Adaptive evolutionAdaptive evolution

When natural selection favors a single allele When natural selection favors a single allele

and therefore allele frequency continuously and therefore allele frequency continuously

shifts in one directionshifts in one direction

Page 5: Functional prediction  in proteins  (purifying and positive selection)

55

Kimura – the theory of neutral Kimura – the theory of neutral evolutionevolution

Neutral evolutionNeutral evolution::

Most molecular changes have no effect Most molecular changes have no effect on the phenotype (neutral)on the phenotype (neutral)

Selection operates to Selection operates to preservepreserve a trait a trait (no change)(no change)

Page 6: Functional prediction  in proteins  (purifying and positive selection)

66

Purifying SelectionPurifying Selection

Stabilizes a trait in a population:Stabilizes a trait in a population:Small babies Small babies more illness more illness

Large babies Large babies more difficult birth… more difficult birth…

Baby weight is stabilized round 3-4 KgBaby weight is stabilized round 3-4 Kg

Page 7: Functional prediction  in proteins  (purifying and positive selection)

77

Purifying selectionPurifying selection (conservation) -(conservation) - the the molecular levelmolecular level

Histone 3Histone 3

Page 8: Functional prediction  in proteins  (purifying and positive selection)

88

Synonymous vs. non-synonymous substitutions

Purifying selection: excess of synonymous substitutions relative to non-synonymous substitutions

Synonymous substitution: GUUGUC

Non-synonymous substitution: GUUGCU

Page 9: Functional prediction  in proteins  (purifying and positive selection)

99

Synonymous vs. non-synonymous substitutions

Histone 3Histone 3

Non-synNon-syn.. SynSyn..

Page 10: Functional prediction  in proteins  (purifying and positive selection)

1010

Conservation as a means of Conservation as a means of predicting functionpredicting function

Infer the rate of evolution at each siteInfer the rate of evolution at each site

Page 11: Functional prediction  in proteins  (purifying and positive selection)

1111

Conservation as a means of Conservation as a means of predicting functionpredicting function

Low rate of evolution Low rate of evolution constraints on the site to constraints on the site to prevent disruption of function/structure: prevent disruption of function/structure: active sites, protein-protein interactions, protein active sites, protein-protein interactions, protein core etc.core etc. 11223344556677

HumanHumanDDMMAAAAHHAAMM

ChimpChimpDDEEAAAAGGGGCC

CowCowDDQQAAAAWWAAPP

FishFishDDLLAAAACCAALL

S. S. cerevisiaecerevisiae

DDDDGGAAFFAAAA

S. pombeS. pombeDDDDGGAALLGGEE

Page 12: Functional prediction  in proteins  (purifying and positive selection)

1212

Which site is more conserved?Which site is more conserved?

11223344556677

HumanHumanDDMMAAAAHHAAMM

ChimpChimpDDEEAAAAGGGGCC

CowCowDDQQAAAAWWAAPP

FishFishDDLLAAAACCAALL

S. S. cerevisiaecerevisiae

DDDDGGAAFFAAAA

S. pombeS. pombeDDDDGGAALLGGEE

Page 13: Functional prediction  in proteins  (purifying and positive selection)

1313

Use phylogenetic informationUse phylogenetic information 11223344556677

HumanHumanDDMMAAAAHHAAMM

ChimpChimpDDEEAAAAGGGGCC

CowCowDDQQAAAAWWAAPP

FishFishDDLLAAAACCAALL

S. S. cerevisiaecerevisiae

DDDDGGAAFFAAAA

S. pombeS. pombeDDDDGGAALLGGEEA

G

A

A

A

G

A

A

A

A

G

G

Page 14: Functional prediction  in proteins  (purifying and positive selection)

1414

ConSurf/ConSeq web servers:ConSurf/ConSeq web servers: Prediction of conserved residues by Prediction of conserved residues by

estimating evolutionary rates at each siteestimating evolutionary rates at each site

Page 15: Functional prediction  in proteins  (purifying and positive selection)

1515

Working processWorking processInput a protein Input a protein with a known 3D structurewith a known 3D structure

((PDB ID or file provided by the userPDB ID or file provided by the user))

Find homologous protein sequences )psi-blast(

Perform multiple sequence alignment )removing doubles(

Construct an evolutionary tree

Project the results on the 3D structure

Calculate the conservation score for each site

Page 16: Functional prediction  in proteins  (purifying and positive selection)

1616

ConSurf example: ConSurf example: potassium channel potassium channel

An integral membrane protein with sequence An integral membrane protein with sequence similarity to all known K+ channels, particularly similarity to all known K+ channels, particularly in the pore region. in the pore region.

PDB ID: 1bl8 chain A PDB ID: 1bl8 chain A

Page 17: Functional prediction  in proteins  (purifying and positive selection)

1717

ConSurf resultsConSurf results

Page 18: Functional prediction  in proteins  (purifying and positive selection)

1818

http://http://conseq.bioinfo.tau.ac.ilconseq.bioinfo.tau.ac.il//

ConSeq performs the same analysis as ConSeq performs the same analysis as ConSurf but presents the results on the ConSurf but presents the results on the sequence.sequence.

Predicts buried/exposed relation Predicts buried/exposed relation exposed & conserved exposed & conserved functionally important functionally important

sitessites buried & conserved buried & conserved structurally important sites structurally important sites

Page 19: Functional prediction  in proteins  (purifying and positive selection)

1919

2. Positive selection & drug 2. Positive selection & drug resistanceresistance

Page 20: Functional prediction  in proteins  (purifying and positive selection)

2020

Darwin – the theory of natural Darwin – the theory of natural selectionselection

Adaptive evolutionAdaptive evolution::

Favorable traits will become more Favorable traits will become more frequent in the populationfrequent in the population

Page 21: Functional prediction  in proteins  (purifying and positive selection)

2121

Adaptive evolution Adaptive evolution at the molecular levelat the molecular level

Page 22: Functional prediction  in proteins  (purifying and positive selection)

2222

Adaptive evolution Adaptive evolution at the molecular levelat the molecular level

Look for Look for changes changes

which confer which confer an advantagean advantage

Page 23: Functional prediction  in proteins  (purifying and positive selection)

2323

Naïve detectionNaïve detection

Observe a multiple sequence alignment:Observe a multiple sequence alignment:variable regions = adaptive evolution??variable regions = adaptive evolution??

Page 24: Functional prediction  in proteins  (purifying and positive selection)

2424

Naïve detectionNaïve detection The problem – how do we know which The problem – how do we know which

sites are not under any sites are not under any selectionselection pressure pressure (“non-important” sites) and which are (“non-important” sites) and which are underunder adaptive evolution adaptive evolution??

Page 25: Functional prediction  in proteins  (purifying and positive selection)

2525

Solution – we look at the DNASolution – we look at the DNA

synonymoussynonymous

non-non-synonymoussynonymous

Page 26: Functional prediction  in proteins  (purifying and positive selection)

2626

Solution – we look at the DNASolution – we look at the DNA

Purifying selectionSyn > Non-syn

Adaptive evolution = Positive selectionNon-syn > Syn

NeutralselectionSyn = Non-syn

Page 27: Functional prediction  in proteins  (purifying and positive selection)

2727

Also known as… Ka/Ks Also known as… Ka/Ks (or dn/ds, or (or dn/ds, or ωω) ) ratioratio

Purifying selection: Ka < Ks (Ka/Ks <1)Purifying selection: Ka < Ks (Ka/Ks <1) Neutral selection: Ka = Ks (Ka/Ks = 1)Neutral selection: Ka = Ks (Ka/Ks = 1) Positive selection: Ka > Ks (Ka/Ks >1)Positive selection: Ka > Ks (Ka/Ks >1)

Non-synonymous

substitution rate

Synonymous substitution

rate

Page 28: Functional prediction  in proteins  (purifying and positive selection)

2828

Examples for positive selectionExamples for positive selection

Proteins involved in the Proteins involved in the immune systemimmune system Proteins involved in Proteins involved in host-pathogen host-pathogen

interactioninteraction (‘arms-race’) (‘arms-race’) Proteins following Proteins following gene duplicationgene duplication Proteins involved in Proteins involved in reproductionreproduction systems systems

Page 29: Functional prediction  in proteins  (purifying and positive selection)

2929

Accumulation of substitutions (syn. or non-syn.) Accumulation of substitutions (syn. or non-syn.)

depends on the evolutionary time that elapsed depends on the evolutionary time that elapsed

since the divergence of the analyzed species. since the divergence of the analyzed species.

When distant species are analyzed saturation of syn.When distant species are analyzed saturation of syn.

substitutions is often encounteredsubstitutions is often encountered

Synonymous vs. non-synonymous substitutions

Page 30: Functional prediction  in proteins  (purifying and positive selection)

3030

Selecton – a server for the detection Selecton – a server for the detection of purifying and positive selectionof purifying and positive selection

http://selecton.bioinfo.tau.ac.il

Stern et al., Nucleic Acids Res 35, W506 (2007).

Page 31: Functional prediction  in proteins  (purifying and positive selection)

3131

Detecting drug resistance using Detecting drug resistance using SelectonSelecton

Page 32: Functional prediction  in proteins  (purifying and positive selection)

3232

HIV: molecular evolution paradigmHIV: molecular evolution paradigm

Rapidly evolving Rapidly evolving virus:virus:

1.1.High mutation High mutation rate (low rate (low fidelity of fidelity of reverse reverse transcriptase)transcriptase)

2.2.High High replication replication raterate

Page 33: Functional prediction  in proteins  (purifying and positive selection)

3333

Drug resistanceDrug resistance

No No drugdrug

DrugDrug

Adaptive evolution Adaptive evolution (positive selection)(positive selection)

Page 34: Functional prediction  in proteins  (purifying and positive selection)

3434

HIV ProteaseHIV Protease

Protease is an Protease is an essential essential enzymeenzyme for viral for viral

replicationreplication

Drugs against Drugs against Protease are Protease are

always part of always part of the “cocktail”the “cocktail”

Page 35: Functional prediction  in proteins  (purifying and positive selection)

3535

Ritonavir InhibitorRitonavir Inhibitor

Ritonavir (RTV) is a specific protease Ritonavir (RTV) is a specific protease inhibitor (drug)inhibitor (drug)

CC3737HH4848NN66OO55SS22

Page 36: Functional prediction  in proteins  (purifying and positive selection)

3636

Used Selecton to analyse HIV-1 protease Used Selecton to analyse HIV-1 protease gene sequences from patients that were gene sequences from patients that were treated with RTV treated with RTV onlyonly

Page 37: Functional prediction  in proteins  (purifying and positive selection)

3737

Page 38: Functional prediction  in proteins  (purifying and positive selection)

3838

Example: HIV ProteaseExample: HIV Protease

Primary mutationsPrimary mutations Secondary Secondary

mutationsmutations

novel novel predictions predictions (experimental (experimental validation)validation)

Page 39: Functional prediction  in proteins  (purifying and positive selection)

3939

Rate shifts and Rate shifts and HIV sub-typesHIV sub-types

Page 40: Functional prediction  in proteins  (purifying and positive selection)

4040

Rate shiftsRate shifts

V Chimp

V Rhesus

A Squirrel

K Rat

M Mouse

V Human

Page 41: Functional prediction  in proteins  (purifying and positive selection)

4141

Rate shiftsRate shifts

V

VA

K

M

V

Low evolutionary rateLow evolutionary rate

High evolutionary rateHigh evolutionary rate

Page 42: Functional prediction  in proteins  (purifying and positive selection)

4242

Rate shiftsRate shifts

Specificity determinants:Specificity determinants: Different phylogenetic groupsDifferent phylogenetic groups

V Chimp

V Rhesus

A Squirrel

A Rat

A Mouse

V HumanGain of Gain of function?function?

Page 43: Functional prediction  in proteins  (purifying and positive selection)

4343

Rate shiftsRate shifts

Specificity determinants:Specificity determinants: Following gene duplicationFollowing gene duplication

V S. paradoxus

V S. mikatae

A S. cervisiae

A S. paradoxus

A S. mikatae

V S. cervisiae

Tropomyosin 1Tropomyosin 1

Tropomyosin 2Tropomyosin 2

Page 44: Functional prediction  in proteins  (purifying and positive selection)

4444

Rate shifts in HIV subtypesRate shifts in HIV subtypes

Page 45: Functional prediction  in proteins  (purifying and positive selection)

4545

HIV subtypesHIV subtypes

Page 46: Functional prediction  in proteins  (purifying and positive selection)

4646

Which sites are responsible for the Which sites are responsible for the differences between the subtypesdifferences between the subtypes??

Detection of rate-shifts in all 9 subtypesDetection of rate-shifts in all 9 subtypes

Page 47: Functional prediction  in proteins  (purifying and positive selection)

4747

Significant rate shift in all HIV genesSignificant rate shift in all HIV genes

# rate-shift sites# rate-shift sitesproportionproportion

EnvEnv84840.10.1

GagGag20200.040.04

NefNef21210.170.17

PolPol33330.030.03

RevRev29290.250.25

TatTat13130.150.15

VifVif13130.070.07

VprVpr440.050.05

VpuVpu29290.350.35

Page 48: Functional prediction  in proteins  (purifying and positive selection)

4848

Gag Position12Gag Position12

Wild-type (Wild-type (EE)) Site which contributes to Site which contributes to

Protease Inhibitor (Amprenavir) Protease Inhibitor (Amprenavir) drug resistance (drug resistance (KK))

E

E

E

K

Q

R

K

K

Page 49: Functional prediction  in proteins  (purifying and positive selection)

4949

C

C

A

G

F

D

J

KK

EE

QQ

NN

RR

Page 50: Functional prediction  in proteins  (purifying and positive selection)

5050

SummarySummary

Sequence analysis can provide valuable Sequence analysis can provide valuable information about protein functioninformation about protein function

The basic signal: conservation:The basic signal: conservation:

http://http://consurf.tau.ac.ilconsurf.tau.ac.il Positive “Darwinian” selection: Positive “Darwinian” selection:

http://selecton.bioinfo.tau.ac.ilhttp://selecton.bioinfo.tau.ac.il Rate-shifts (specificity determinants)Rate-shifts (specificity determinants)