rna structure prediction rna structure basics the rna ‘rules’ programs and predictions bio520...

30
RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions O520 Bioinformatics Jim Lund Assigned reading: Ch. 6 from Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, 3rd Ed. by Baxevanis and Ouellette.

Upload: margaretmargaret-scott

Post on 18-Jan-2016

225 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

RNA Structure Prediction

RNA Structure Basics

The RNA ‘Rules’

Programs and Predictions

BIO520 Bioinformatics Jim Lund

Assigned reading: Ch. 6 from Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins, 3rd Ed. by Baxevanis and Ouellette.

Page 2: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

RNA classes

• mRNA - messenger RNA.• tRNA - transfer RNA, small (~80 bases) sequences

which bring amino acids to the ribosome.• rRNA - ribosomal RNA, RNA + proteins =

ribosome.• viral RNA (ssRNA, dsRNA virii)• miRNA: translational/transcriptional gene silencing.• snoRNA, snRNA: splicing, RNA bp modification• Transfer-Messenger RNAs (tmRNA), Small

cytosolic RNAs (scRNA), Guide RNAs (gRNA)• and more…

Page 3: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

RNA structures

• 1°– Sequence (and modifications)

• 2°– Base pairing

• 3°– Overall Structure, non Watson-Crick

pairs– Experimental structures: tRNA,

ribosome

Page 4: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

RNA Tertiary Structure, tRNA

Anticodon Loop

3’(aminoacyl) endCCA

Page 5: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Yeast Phenylalanine tRNA, 1.93A

Page 6: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Yeast Phenylalanine tRNA, 1.93A

Page 7: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

rRNA small subunit, X. laevis

Page 8: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

2° RNA structures

• Watson-Crick pairing -> helices

• Loop regions– Hairpin loops

– Internal loops

– Bulge loops

– Multibranch loops

Page 9: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

RNA Modifications

Covalent Modifications-especially tRNAtRNA– rUrT, rrT, r, rD, rD, rS4U

– rC 3-CH3-C, 5-CH3-C

– rA I, 6-CH3-A, 6-isopentenyl-A

– rG 7-CH3-G, Q, Y

Nucleosides Nucleotides 1999 Jun-Jul;18(6-7):1579-81

Page 10: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

RNA Base pairing

• G-C triple hydrogen bond• A -U double hydrogen bond• G-U single hydrogen bond

Page 11: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

RNA structure energetics

• The number of GC versus AU and GU base pairs.– Higher energy bonds form more stable structures.

• Number of base pairs in a stem region.– Longer stems result in more bonds.

• Number of base pairs in a hairpin loop region.– Formation of loops with more than 10 or less than 5

bases requires more energy.

• Number of unpaired bases (interior loops or bulges).– Unpaired bases decrease the stability of the structure.

Page 12: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

2° Structure

5’ 3’ G--C G--C C--GA | U--A G--CA AA A A A

Page 13: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

“The Rules”

• Base Pairs -- Good– G:C better than A:T -- And local sequence

matters!

• Bulges, Loops -- Bad

• Many small interactions---Stable Structure

• Only predict “Canonical Interactions”

Page 14: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Base Pairs/Stacks

A UA U

A=UA=U

Basepair

G = -1.2 kcal/mole

A UU A

A=UU=A

Basepair

G = -1.6 kcal/mole

Page 15: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Base Pairing/Stacking

AAUU

-1.2 CGGC

-3.0

AU or UAUA AU

-1.6 GCCG

-4.3

AG, AC, CA, GAUC, UG, GU, CU

-2.1 GUUG

-0.3

CCGG

-4.8 XG, GXYU, UY

0

Bloomfield, Crothers, Tinoco, Physical Chemistry of Nucleic Acids

Page 16: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Hairpin Loops(GC closure)

N=3 +8

N=4,5 +5

N=6,7 +4

N=8,9 +5

N>=10 6+0.9(ln[N/10])

•Tertiary Interactions!

Page 17: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Internal Loops

G-X-CC-X-G

0

N=2-6 +2

N=7 +3

N>=8 3+0.9(ln[N/7])

5’ 3’ G--C G--C C--G A GG A A C T--A G--C T--A G--C

Page 18: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Single-Strand Bulges

5’ 3’ G--C G--C C--G A |G | A | T--A G--C T--A G--C

N=1 +3

N=2-3 +4

N=4-7 +5

N>=8 6+0.9ln(N/8)

Page 19: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Prediction Programs

• Mfold (M. Zuker)– 2° structure

• RNAstructure/OligoWalk– 2° structure, oligo/RNA target interactions

• alifold– 2° structure constrained by muliple

alignment.

• Pfold– 2° structure guided by rules derived from

known tRNA/rRNA structures

Page 20: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Prediction Programs

• Mfold (GCG)– M. Zuker

• Mfold input to Plotfold– Non-graphic output -G option– Graphics outputs

• SQUIGGLES• mountains• circles• domes• energy plots

Page 21: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Squiggles

1

2040

60

CCA-3’OH

Page 22: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

DOMES, MOUNTAIN, CIRCLES

Page 23: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

MFOLDStructure Family

• Optimal & Suboptimal structures– Can ask for multiple structures

• Energy increment and “window size” increment.

• View individually.

• How variable are the structures?– Energy Plots

Page 24: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

ENERGY PLOT

Page 25: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

P-Num Plot

Page 26: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Prediction Quality

Page 27: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Forces in RNA folds

• Complementary molecular surfaces

• Bridging cations

• Pseudoknotting

• “kinetic traps” in folding– NOT always 2 first!

Annu Rev Biophys Biomol Struct 1999;28:57-73Proc Natl Acad Sci U S A 1998 Sep 29;95(20):11555-60

Page 28: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

RNA Structure Probing

• Physical methods– X-ray diffraction, NMR

• Enzymatic methods– S1, Rnases (find ss and ds regions).

• Chemical modification– DMS…

• Mutagenesis– G:C=>C:C=>C:G

Page 29: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Ribozymes• Naturally occurring

– RNAaseP

– Group I introns

– Group II introns

– snRNA in the splicosome

• Artifical– Engineered/evolved in the lab from natural

ribozymes to have new substrate RNA.

– Cleave mRNA, drug-like action

• miRNA/siRNA– Translational/transcriptional gene silencing

Page 30: RNA Structure Prediction RNA Structure Basics The RNA ‘Rules’ Programs and Predictions BIO520 BioinformaticsJim Lund Assigned reading: Ch. 6 from Bioinformatics:

Published by AAAS

T. A. Lincoln et al., Science 323, 1229 -1232 (2009)

Cross-replicating RNA enzymes