homology modeling - sta.uwi.edu · •homology modeling is an extrapolation of protein structure...

43
1/27/2010 5:59 PM Homology modeling Dinesh Gupta ICGEB, New Delhi

Upload: others

Post on 26-Jan-2020

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Homology modeling

Dinesh Gupta

ICGEB, New Delhi

Page 2: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Protein structure prediction

• Methods:

– Homology (comparative) modelling

– Threading

– Ab-initio

Page 3: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Protein Homology modeling

• Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template.

• Basis: proteins with similar sequences are likely to assume same folding

• Certain proteins with as low as 25% similarity have been observed to assume same 3D structure

Page 4: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

The accuracy of modeling is proportional

to the similarity in primary sequences

Page 5: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Steps…

• Given:

– A query sequence Q

– A database of known protein structures

• Find protein P such that P has high

sequence similarity to Q

• Return P’s structure as an approximation

to Q’s structure

• Energy minimization

Page 6: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Sofware for homology molecular

modelling

• Freeware: available for all OS

– Downloadable

• Modeller (Sali, 1998)

• DeepView (SwissPDB viewer)

• WHATIF (Krieger et al. 2003)

– Web based:

• SWISS MODEL server (www.expasy.org/swissmod/SWISS-

MODEL.html)

• CPH model server

(http://www.cbs.dtu.dk/services/CPHmodels)

• SDSC1 server (http://cl.sdsc.edu/hm.html)

Page 7: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 8: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 9: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 10: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Protein structure prediction

• Methods:

– Homology (comparative) modelling

– Threading

– Ab-initio

Page 11: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Threading

• Structure prediction that picks up where homology modelling leaves off.

• Recognize folds in proteins having no similarity to known proteins structures

• Very approximate models

• Check by forcing a sequence of structure into known folds checking the packing of aa residues, including sides chains, in each fold.

Page 12: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

2 kinds of threading

• Three dimensional threading

– Distance Based Method (DBM)

• Two dimensional threading

– Prediction Based Methods (PBM)

Page 13: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 15: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 16: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 17: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 18: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 19: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Protein structure prediction

• Methods:

– Homology (comparative) modelling

– Threading

– Ab-initio

Page 20: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Ab initio structure prediction

• Still experimental

• ROSETTA (David Baker)

Page 21: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Energy minimization (Molecular

Mechanics, MM)

• Energy minimization is an important part of both empirical and predicted structures

• MM could be used to calculate large scale conformational changes over long periods of time, but currently computationally infeasible.

Page 22: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

How does MM work?

• Three aspects:– Functions that describe the forces acting on the

atoms

– Numerical integration methods, to calculate the motion of the atoms due to the forces acting on them

– Long time propagation of the equations of motion

• Computational demands are intense– Accuracy (small errors propagate!)

– Stability

– Lots of techniques for approximation (e.g. rigid bodies) and handling artifacts (resonance).

Page 23: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

The Force Fields

• How do atoms stretch, vibrate, rotate, etc.?

• Must represent the constraints on atomic motion

(e.g. van der Waals, electrostatic, bonds, etc.)

• Must also represent solvation effects etc.

• Quantum solutions exist, but are too complex to

calculate for such large systems

• Empirical (approximate) energy functions must

be used. No single best function exists.

Page 24: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Real energetics

• Steric (conformational) energy. Additive

combination of

– Bonded: stretching, bending, stretching and bending

– Non-bonded: Van der Waals, electrostatic and

“torsional”

• Minimum energy conformation minimizes these

energies

• Rosetta energy function is an empirical attempt

to capture most of this energy function without

having to calculate it fully.

Page 25: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Bond length

• Spring-like term for energy based on

distance Estr

= ½ks,ij

(rij

-ro)2

where ks,ij is the stretching force constant

for the bond between i and j, rij is the

length, and ro is the equilibrium bond

length

Page 26: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Bond bend

• Same basic idea for bending

Ebend

= ½kb,ij

(ij–

o)2

where where kb,ij is the bending force constant,

ij is the instantaneous

bond angle, and o is

the equilibrium

bond angle

Page 27: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Stretch-bend

• When a bond is bent, the two associated bond

lengths increase, with interaction term:

Estr-bend

=½ksb,ijk

(rij-r

o)(

ik-

o)

where ksb,ijk is the stretch-bend force constant for

the bond between

atoms i and j with the

bend between atoms

i, j, and k.

Page 28: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

• A non-bonded interaction capturing the preferred distance between atoms

where A and B are constants depending on the atoms. For two hydrogen atoms, A=70.4kCÅ6 and B=6286kCÅ12

Van der Waals

Page 29: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Electrostatics

• If bonds in the molecule are polar, some atoms will have partial electrostatic charges, which attract if opposite and repel otherwise.

where Qi and Qj are the partial atomic charges for i and j

separated by distance rij , is the dielectric constant of the solute, and k is a unitsconstant (k=2086 kcal/mol)

Page 30: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Torsional energy

• Torsion is the energy needed to rotate about

bonds. Only relevant to single bonds, since

others are too stiff to rotate at all

Etor = ½ktor,1 (1 - cos ) + ½ktor,2 (1 - 2cos )

+ ½ktor,3 (1 - 3cos )

where is the dihedral angle

around the bond, and ktor,1, ktor,2

and ktor,3 are constants for one-,

two- and three-fold barriers.

energy of 3-fold torsional

barrier in ethane

Page 31: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Energy minimization

• Given some energy function and initial conditions, we want to find the minimum energy conformation.

• Optimization problem, various methods:

– Steepest descent

– Conjugate gradient descent

– Newton-Raphson

• Various programs: Charmm, Amber are two most widely used (and packaged)

Page 32: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Time steps

Need time steps of roughly 1/10 the period of the smallest time

scale of interest, or about a femtosecond (10-15s). A million

computational steps per nanosecond of simulation...

Page 33: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Issues in Molecular Mechanics

• Solvation models: water & salt are very important to molecular behaviour. Must model as many water atoms as protein atoms.

• Initial conditions: velocity & position

• Equilibration: simulated heating and cooling

• Chaos: sensitivity to initial conditions, and statistical characterization of states

• Computational issues (e.g. parallelization)

Page 34: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Molecular Dynamics

• Molecules, especially proteins, are not static.

– Dynamics can be important to function

• Trajectories, not just minimum energy state.

– MM ignores kinetic energy, does only potential energy

– MD takes same force model, but calculates F=ma and

calculates velocities of all atoms (as well as positions)

Page 35: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Docking

• Computation to assess binding affinity

• Looks for conformational and electrostatic "fit"

between proteins and other molecules e.g.

inhibitors

• Optimization again: what position and

orientation of the two molecules minimizes

energy?

• Large computations, since there are many

possible positions to check, and the energy for

each position may involve many atoms

Page 36: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Virtual Screening

• Docking small ligands to proteins is a way to find

potential drugs. Industrially important

• A small region of interest (pharmacophore) can

be identified, reducing computation

• Empirical scoring functions are not universal

• Various search methods:

– Rigid provides score for whole ligand (accurate)

– Flexible breaks ligands into pieces and docks them

individually

Page 37: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Docking example

Benzamidine binding to beta-Trypsin 3ptb,

Page 38: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Macromolecular docking• Docking of proteins to proteins or to DNA

• Important to understanding

macromolecular recognition, genetic

regulation, etc.

• Conceptually similar to small molecule

docking, but practically much more difficult

– Score function can't realistically compute

energies

– Use either shape complementarities alone or

some kind of mean field approximation

Page 39: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Docking Resources

• AutoDock http://www.scripps.edu/pub/olson-

web/doc/autodock/

• FlexX http://www.biosolveit.de/FlexX/ and

commercially at http://www.tripos.com

• Dock

http://www.cmpharm.ucsf.edu/kuntz/dock.html

• 3D-Dock http://www.bmm.icnet.uk/docking/

which uses an unusual “Fourier correlation”

method and is aimed at protein-protein

interactions

Page 40: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Lab Exercise-1

Install:

• MDL chime

• RasMol

• SwissPDBviewer

• Cn3D

Explore few protein/DNA structures

Page 41: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Lab exercise-2

• Download sequence file for S. cerevisiae endoplasmic reticulum mannosidase

• Generate a homology model using SWISS-model server http://www.expasy.ch/swissmod/

• Download the template structure from www.rcsb.org

• Compare the model and template structures

• Repeat the exercise for other protein sequences of your choice

Page 42: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM

Page 43: Homology modeling - sta.uwi.edu · •Homology modeling is an extrapolation of protein structure for a target sequence using the known 3D structure of similar sequence as a template

1/27/2010 5:59 PM