Download - cloning

Ghosh LabUniversity of Arizona

Department of Chemistry

Outline

•Cloning overview

•pDRAW32

• Design

•Gene

•Insert

•Primers

•Further considerations (optimization of the process)

•Transformation

Cloning OverviewFour main steps in cloning:

•Insert synthesis

•Restriction enzyme digest

•Ligation

•Transformation

+Functionalconstruct

Plasmid(vector)

Insert(your gene)

Design OverviewSteps to follow in designing your cloning experiment:

•Design your gene

•Design your insert

•Pick your enzymes

•Check your design

•Recheck your design

Functionalconstruct

All of the important information in one place!

pDRAW32Plasmid maps: pDRAW32

pETDuet GFPuv6104 bp

XbaI - 30 - T'CTAG_ANcoI - 69 - C'CATG_G

BamHI - 106 - G'GATC_CEcoRI - 112 - G'AATT_C

NcoI - 281 - C'CATG_GMscI - 286 - TGG'CCA

NdeI - 345 - CA'TA_TGBsrGI - 391 - T'GTAC_AMluI - 437 - A'CGCG_TBaeI - 456 - ACnnnnGTAyCnnnnnnn_nnnnn'BaeI - 489 - GrTACnnnnGTnnnnnnnnnn_nnnnn'PspXI - 536 - vC'TCGA_GbAvaI - 536 - C'yCGr_GXhoI - 536 - C'TCGA_GBstZ17I - 565 - GTA'TACBamHI - 635 - G'GATC_CMfeI - 675 - C'AATT_GSalI - 718 - G'TCGA_CHincII - 720 - GTy'rACBstBI - 737 - TT'CG_AAEcoICRI - 820 - GAG'CTCSacI - 822 - G_AGCT'CNotI - 834 - GC'GGCC_GCAflII - 847 - C'TTAA_GBsrGI - 874 - T'GTAC_ANdeI - 982 - CA'TA_TGBglII - 989 - A'GATC_TMfeI - 995 - C'AATT_GEcoRV - 1003 - GAT'ATCFseI - 1012 - GG_CCGG'CCBaeI - 1016 - GrTACnnnnGTnnnnnnnnnn_nnnnn'AsiSI - 1021 - GCG_AT'CGCPvuI - 1021 - CG_AT'CGZraI - 1028 - GAC'GTCAatII - 1030 - G_ACGT'CAcc65I - 1032 - G'GTAC_CKpnI - 1036 - G_GTAC'CPspXI - 1038 - vC'TCGA_GbAvaI - 1038 - C'yCGr_GXhoI - 1038 - C'TCGA_GBaeI - 1049 - ACnnnnGTAyCnnnnnnn_nnnnn'PflMI - 1085 - CCAn_nnn'nTGGPacI - 1113 - TTA_AT'TAAAvrII - 1117 - C'CTAG_GBlpI - 1135 - GC'TnA_GCBsaAI - 1460 - yAC'GTrDraIII - 1463 - CAC_nnn'GTGAloI - 1499 - GAACnnnnnnTCCnnnnnnn_nnnnn'BsaXI - 1499 - ACnnnnnCTCCnnnnnnn_nnn'BsaXI - 1529 - GGAGnnnnnGTnnnnnnnnn_nnn'AloI - 1531 - GGAnnnnnnGTTCnnnnnnn_nnnnn'PsiI - 1588 - TTA'TAASspI - 1668 - AAT'ATT

AhdI - 1873 - GACnn_n'nnGTCBsaI - 1934 - GGTCTCn'nnnn_

BglI - 1993 - GCCn_nnn'nGGCFspI - 2095 - TGC'GCA

PvuI - 2243 - CG_AT'CGScaI - 2353 - AGT'ACT

XmnI - 2472 - GAAnn'nnTTCBssSI - 2537 - C'ACGA_G

AlwNI - 3069 - CAG_nnn'CTGBssSI - 3305 - C'ACGA_G

PciI - 3478 - A'CATG_T

SapI - 3595 - GCTCTTCn'nnn_

BstZ17I - 3711 - GTA'TACBsaAI - 3730 - yAC'GTr

Tth111I - 3736 - GACn'n_nGTCPfoI - 3835 - T'CCnGG_A

BsmBI - 3837 - CGTCTCn'nnnn_

XmnI - 3924 - GAAnn'nnTTC

Afe I - 4228 - AGC'GCT

AlfI - 4292 - GCAnnnnnnTGCnnnnnnnnnn_nn'

Bpu10I - 4373 - CC'TnA_GC

PpuMI - 4473 - rG'GwC_Cy

AlfI - 4513 - GCAnnnnnnTGCnnnnnnnnnn_nn'

BsaXI - 4674 - ACnnnnnCTCCnnnnnnn_nnn'BsaXI - 4704 - GGAGnnnnnGTnnnnnnnnn_nnn'

BsmBI - 4727 - CGTCTCn'nnnn_

HincII - 4840 - GTy'rACHpaI - 4840 - GTT'AAC

BssHII - 4931 - G'CGCG_C

PspOMI - 5135 - G'GGCC_CApaI - 5139 - G_GGCC'CBstEII - 5160 - G'GTnAC_C

BclI - 5328 - T'GATC_A - dam methylated!MluI - 5342 - A'CGCG_T

BstAPI - 5666 - GCAn_nnn'nTGCPflMI - 5767 - CCAn_nnn'nTGG

PfoI - 5774 - T'CCnGG_AEcoNI - 5810 - CCTnn'n_nnAGG

SphI - 5875 - G_CATG'CAfeI - 5941 - AGC'GCTSgrAI - 6023 - Cr'CCGG_yG

ClaI - 6067 - AT'CG_AT - dam methylated!

GFPuv

AMP

lacI

pDRAW32

You can look at the sequence in detail

•Open reading frames

•Translation

•Restriction sites

•Complementary strand

Design of the Gene

Example, the gene we want:G C D R A S P Y C G

We got this from phage display:ggctgcgacagggcgagcccgtactgcggtG C D R A S P Y C G

Phage sequence

Final sequence for the gene of interest:ggctgcgacagggcgagcccgtactgcggttaaG C D R A S P Y C G *

Add a stop codon

If you are cloning out of a known plasmid, just use the sequence that you have

Design of the Gene

•If you are designing the gene from scratch, keep in mind codon usage

•Not all codons are created equal

•Un-optimized codons could lead to lower expression levels

•The codon usage reflects levels of tRNA available in E. Coli

•Pay attention to the stop codons too (XL1-Blues read through TAG {amber stop codon} 20% of the time)

http://www.bioinformatics.org/sms2/rev_trans.html

http://www.entelechon.com/index.php?id=tools/backtranslation&lang=eng

or preferably…

What if we don’t have the DNA sequence?Design from scratch! (don’t forget about codon usage)E. Coli Codon Usage

UUU F 0.59 UCU S 0.17 UAU Y 0.6 UGU C 0.47UUC F 0.41 UCC S 0.15 UAC Y 0.4 UGC C 0.53UUA L 0.15 UCA S 0.15 UAA * 0.6 UGA * 0.31UUG L 0.13 UCG S 0.13 UAG * 0.09 UGG W 1

CUU L 0.12 CCU P 0.19 CAU H 0.58 CGU R 0.35CUC L 0.1 CCC P 0.13 CAC H 0.42 CGC R 0.34CUA L 0.04 CCA P 0.21 CAA Q 0.34 CGA R 0.07CUG L 0.46 CCG P 0.47 CAG Q 0.66 CGG R 0.12

AUU I 0.49 ACU T 0.19 AAU N 0.51 AGU S 0.16AUC I 0.38 ACC T 0.38 AAC N 0.49 AGC S 0.23AUA I 0.13 ACA T 0.19 AAA K 0.73 AGA R 0.08AUG M 1 ACG T 0.24 AAG K 0.27 AGG R 0.05

GUU V 0.29 GCU A 0.19 GAU D 0.63 GGU G 0.34GUC V 0.2 GCC A 0.26 GAC D 0.37 GGC G 0.36GUA V 0.17 GCA A 0.24 GAA E 0.67 GGA G 0.14GUG V 0.34 GCG A 0.31 GAG E 0.33 GGG G 0.16

•Endonucleases (or restriction enzymes) are enzymes which cut DNA at specific internal recognition sequences

•Compare to exonucleases, which cut from one end

•You must choose restriction sites that are available in the plasmid you are cloning into

•They must not appear in your gene (silent mutation can remove unwanted sites in your designed gene)

Choice of Restriction Sites/Enzymes

Once you have your gene, you need to design a way to get it into your plasmid

•Restriction sites must exist only once in your plasmid

•They must be in the correct position relative to the purification tag

•Restrictions sites usually add extra residues to your gene product; make sure they are compatible with your peptide/protein

•Some restriction sites are sub-optimal for cloning

•Blunt end sites

•dam and dcm methylation-affected enzymes

Really Important Factors to Remember When Choosing Restriction Enzymes

AGCCAG GATCCGGGCTGCAAGCGGTTAAG AATTCGTCGACGTCGACG AATTCTTAACCGCTTCCAGCCCG GATCCTGGCT

GATCCGGGCTGCAAGCGGTTAAG AATTCTTAACCGCTTCCAGCCCG

GATCCTGGCTAGCCAG AATTCGTCGACGTCGACG+

“sticky ends”

AGCCAGAT ATCGGGCTGCAAGCGGTTAACAG CTGGTCGACGTCGACCAG CTGTTAACCGCTTCCAGCCCGAT ATCTGGCT

ATCGGGCTGCAAGCGGTTAACAGCTGTTAACCGCTTCCAGCCCGAT

AGCCAGATATCTGGCT + CTGGTCGACGTCGACCAG

•“Sticky ends”: 5’ or 3’ over-hangs that allow the DNA to anneal even though it is not covalently bound

•Help with the next step: ligation

Most common restriction enzymes

Blunt-end restriction enzymes No sticky ends

Blunt vs Sticky Ends

Digestion

Digestion

dam Methylation

NO

N

N

ON

NHO

P

P

O

O O

O

O

O

MeNO

N

N

ON

NH2

O

P

P

O

O O

O

O

O

Dam methylase

•Dam methylase puts a methyl group on the nitrogen of 6th position of adenosine at the site: GATC

•All of the E. Coli that we use generate DNA with dam methylation

•Some enzymes only cut dam methylated DNA: eg DpnI

•Some enzymes do not cut dam methylated DNA: eg XbaI

http://www.neb.com/nebecomm/tech_reference/restriction_enzymes/dam_dcm_methylases_of_ecoli.asp

dcm Methylation

http://www.neb.com/nebecomm/tech_reference/restriction_enzymes/dam_dcm_methylases_of_ecoli.asp

Dcm methylase

O

P

O

O O

O NN

NH2

O

O

P

O

O O Me

O

P

O

O O

O NN

NH2

O

O

P

O

O O

•Dcm methylase puts a methyl group on the carbon of 5th position of cytidine at the site: CCAGG and CCTGG

•The enzyme we use most that can be affected by dcm methylation is SfiI

•XL1-Blues and BL21s are both Dcm+

•Once you have your restriction enzymes chosen, it is time to design the final complete gene

•The multiple cloning site (or whatever plasmid you are cloning into) should already have the 5’ portion of the gene intact (i.e. RBS, spacer, Met)

• Sequences must be in frame

NcoI BtgI51 CTTTAATAAG GAGATATACC ATGGGCAGCA GCCATCACCA TCATCACCAC M G S S H H H H H H

SacI AscI SbfI SalI NotI BamHI EcoRI EcoICRI BssHII PstI AccI HindIII101AGCCAGGATC CGAATTCGAG CTCGGCGCGC CTGCAGGTCG ACAAGCTTGC S Q D P N S S S A R L Q V D K L A

Design of the Insert

Design of the Insert71 ATGGGCAGCAGCCATCACCATCATCACCAC M G S S H H H H H H SacI AscI SbfI SalI BamHI EcoRI EcoICRI PstI AccI HindIII101AGCCAGGATCCGAATTCGAGCTCGGCGCGCCTGCAGGTCGACAAGCTTGC S Q D P N S S S A R L Q V D K L A

The gene we want:ggctgcgacagggcgagcccgtactgcggttaa G C D R A S P Y C G *

BamHI PstI AGCCAGGATCCGAATTCGAGCTCGGCGCGCCTGCAGGTCGACAAGCTTGC S Q D P N S S S A R L Q V D K L A G C D R A S P Y C G * ggctgcgacagggcgagcccgtactgcggttaa

AGCCAGGATCCGggctgcgacagggcgagcccgtactgcggttaaCTGCAGGTCGACAA

Be aware of the amber stop codon: TAG

Multiple cloning site


Always check and re-check your sequence!

ATGGGCAGCA GCCATCACCA TCATCACCACAGCCAGGATCCGggctgcgacagggcgagcccgtactgcggttaaCTGCAGGTCGACAA

atgggcagcagccatcaccatcatcaccacagccaggatccgggctgcgacagggcgagc M G S S H H H H H H S Q D P G C D R A S ccgtactgcggttaactgcaggtcgacaa P Y C G - L Q V D

Everything looks good: in frame the whole way!

Translate the whole gene

The wrong way to do it:AGCCAGGATCC ggctgcgacagggcgagcccgtactgcggttaaCTGCAGGTCGACAAGCTT

atgggcagcagccatcaccatcatcaccacagccaggatccggctgcgacagggcgagccM G S S H H H H H H S Q D P A A T G R A cgtactgcggttaactgcaggtcgacaagcttR T A V N C R S T S

Frame shifted = garbage!


The gene is just inserted after the restriction site, which is out of frame with the plasmid-encoded start-codon/His-tag

**Some plasmids, for whatever reason, have restriction sites out of frame with the translated

gene**

Finishing Touches

atgggcagcagccatcaccatcatcaccacagccaggatccgggctgcgacagggcgagc M G S S H H H H H H S Q D P G C D R A S ccgtactgcggttaactgcaggtcgacaa P Y C G - L Q V D

•Restriction enzymes need 5’ and 3’ base pairs to cut properly

•NEB has a reference guide for specific enzymes (see link below)

•A good rule of thumb is 6 base pairs after the recognition site

•Inserting a GC “clamp” at the end and beginning of the sequence is also a good idea

http://www.neb.com/nebecomm/tech_reference/restriction_enzymes/cleavage_linearized_vector.asp

gccagccaggatccgggctgcgacagggcgagcccgtactgcggttaactgcaggtcgacgc S Q D P G C D R A S P Y C G - L Q V D

Final gene, polished and ready to go:

http://www.neb.com/nebecomm/tech_reference/restriction_enzymes/cleavage_linearized_vector.asp

Once the insert is designed correctly, the next step is designing primers to order from IDT, based on insert synthesis strategy

Design of the Primers

Three main strategies towards insert synthesis:

•PCR amplification

•Klenow extension of overlapping primers

•Complimentary full-length primers

+Insert

Vector

The most common method of insert synthesis

•Necessitates a pre-existing construct

•Extra restriction sites and/or amino acid residues can be added on each side of the gene

•Internal mutations are more difficult

PCR Amplification of Insert from an Existing Gene

Insert

PCR amplification from overlapping primers

•No pre-existing construct is needed

•PCR products messy, possibly making subsequent rxns difficult

•Good for inserts >150 bp

PCR Synthesis of Insert

F1: 10xF2: 1x

R1: 1xR2: 10x

5’3’

5’ 3’

5’3’

5’ 3’

Full-length insert should still be the major productInsert

Klenow Extension of Overlapping Primers

•Two primers that are complimentary in their 3’ region are designed (overlap 15bp)

•Extended to full length by the Klenow fragment of DNA Polymerase I

•Useful if insert is 50 to 150 bp

Insert

5’3’5’ 3’

5’3’

5’ 3’

KlenowKlenow fragment: retains 3’ to 5’ polymerase activity, but does not have exonuclease activity

•The simplest approach

•Order two primers that compliment each other

•Mix the two primers, heat, and aneal slowly (to ensure proper base-pairing)

•Feasible if the total insert size is < 60 bp

Complimentary Full-Length Primers

Insert5’3’

5’ 3’ Anneal

Designing Primers to Order

Once the insert synthesis technique is decided, primer design is fairly straight-forward

Forward primers:

•Assess necessary overlap and copy the sequence from your designed gene, along with extra 5’ sequence

Reverse primers:

•First, design exactly as if it were a forward primer: Copy necessary overlap and extra 3’ sequence from your designed gene

•Once all this is in place, use pDRAW32 sequence manipulator to calculate the reverse compliment

•Order the pDRAW32 calculated sequence directly

Cloning Out an Existing GeneIn the example mentioned previously, we would normally use full length overlapping primers, but let’s look at the more common case of having a preexisting gene:


tgcggcccagccggccatgggctgcgacagggcgagcccgtactgcggtggaggcggtgctgcagcgc A A Q P A M G C D R A S P Y C G G G G A A A

Preexisting gene:

Goal gene:

gccagccaggatccgggctgcgacagg ccgtactgcggttaactgcaggtcgacgc

Forward Primer: Design of Reverse Primer:

+

Overlap

Extra sequence from gene design


Ordering Primers

Forward primer to order:gccagccaggatccgggctgcgacagg

Reverse primer to order:GCGTCGACCTGCAGTTAACCGCAGTACGG

http://www.idtdna.com/Home/Home.aspxNow we can order the primers:

Design of Reverse Primer: ccgtactgcggttaactgcaggtcgacgc

&

Vectors and Bacteria Strains

Vector Promoter E Coli strains we use

pQE-30 T5 promoter XL1-Blue: mostly good for DNA isolation/phage displayM15(pREP4): tighter regulation of the lac suppressor

pMAL Ptac promoter

pCANTAB-5E Plac promoter

pET-DuetpRSF-Duet

T7 lac promoter(An E. Coli strain with phage T7 RNA polymerase is necessary)

BL-21: Protease deficient, stable to toxic proteins, and contains the T7 RNA polymerase gene

An important thing to think about before you start cloning: What vectors/E Coli should I use?

lac sitePromoter RBS ATG- your gene

lac repressor lac

sitePromoter RBS ATG- your gene

RNA polymerase

X

IPTG (or lactose, etc)

IPTG

lac sitePromoter RBS ATG- your gene

Transcription

mRNA

lac Expression Regulation

•Anti-biotic resistance (working concentration)

•Ampicillin (100g/mL)

•Kanamycin (35g/mL)

•Tetracycline HCl (10g/mL)

•Chloramphenicol (170g/mL in ethanol)

Purification Tags and Selection (Anti-biotic Resistance)

•Purification Tag

•His-tag (nickel agarose resin)

•Maltose Binding Protein (amylose resin)

•Glutathione S-Transferase (glutathione resin)

Digestion of Insert and Vector

•Digest with the same restriction endonucleases

•Optional (recommended) step:

•Treat the plasmid DNA with Antarctic phosphatase

•Decreases the background by stopping self-ligation of singly cut plasmid and background re-ligation

Ligation of the Insert into the Vector

+

•Ligation covalently attaches the vector and the insert via a phosphodiester bond (5’phosphate and 3’ hydroxyl of the next base)

Antarctic Phosphatase and Ligation

http://www.neb.com/nebecomm/products/productM0202.asp

O

P

O

O O

R1O

OH

O

P

O

O O

R2OO

P

O

O O+

O

P

O

O O

R1O

O

O

P

O

O O

R2OO

PO O

•Antarctic Phosphatase cleaves this phosphate, disallowing self-ligation

•The insert still has the 5’ phosphate though

Transformation

•The functional construct is now ready to be transformed into new E. Coli and grown up

•The new DNA isolated from the E. Coli must then be sequenced to make sure that everything worked

•Once the sequence is confirmed, we are ready to go!

Download - cloning

Top Related