(c) 2002, snu biointelligence lab, a computer scientist’s guide to molecular biology...

51
(C) 2002, SNU Biointell igence Lab, http://bi.s nu.ac.kr/ 1 A Computer Scientist’s A Computer Scientist’s Guide to Molecular Guide to Molecular Biology Biology Biointelligence Lab. Interdisciplinary Program in Cognitive Science, Seoul National University 2002. 3. 20. L. Kari et al., Soft Computing 5 (2001) 95-101

Upload: walter-holland

Post on 14-Jan-2016

223 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

1

A Computer Scientist’s Guide to A Computer Scientist’s Guide to Molecular BiologyMolecular Biology

Biointelligence Lab.

Interdisciplinary Program in Cognitive Science,

Seoul National University

2002. 3. 20.

L. Kari et al., Soft Computing 5 (2001) 95-101

Page 2: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

2

OutlineOutline

Introduction The DNA molecule The natural information contents of DNA

Transcription Editing Translation

Adleman’s in vitro experiment DNA as a computational tool

Page 3: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

3

Biocomputing vs. BioinformaticsBiocomputing vs. Bioinformatics

MolecularMolecularBiologyBiology

ComputerComputerScienceScience

Bioinformatics

Molecular computingBiomolecular computingDNA computing

Page 4: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

4

The DNA MoleculeThe DNA Molecule

Page 5: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

5

DNA (DNA (ddeoxyriboeoxyribonnucleic ucleic aacid)cid) Schrödinger (1944): What is Life?

Our genes must essentialy be an aperiodic crystal Watson & Crick (1953): Nature 25: 737-738

Molecular Structure of Nucleic Acids: A structure for deoxyribose nucleic acid Nobel Prize, 1962.

Page 6: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

6

DNA Structure (1) nucleotideDNA Structure (1) nucleotide

Page 7: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

7

DNA Structure (2) ester bondDNA Structure (2) ester bond

Long and unbranched polymers formed by ester bonds between the 5’ phosphate (5’-P) and the 3’ hydroxyl (3’-OH) group of the sugar of the next.

Page 8: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

8

DNA Structure (3) base pairingDNA Structure (3) base pairing

Complimentary base pairing accounts for the Chargaff’s rule (A=T, G=C).

Page 9: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

9

DNA Structure (4) – double helixDNA Structure (4) – double helix

Watson-Crick complement

Page 10: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

10

DNA Structure (5) – conformationDNA Structure (5) – conformation

The B-form is the common natural form, prevailing under physiological conditions of low ionic strength and high degree of hydration.

The Z-form (Zigzag chain) is observed in DNA G-C rich local region.

The A-form is sometimes found in some parts of natural DNA in presence of high concentration of cations or at a lower degree of hydration (<65%).

Page 11: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

11

The Natural Information Content of The Natural Information Content of DNADNA

Page 12: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

12

Central DogmaCentral Dogma

Page 13: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

13

Replicaton (1)Replicaton (1)

Each time a cell divides into two daughter cells, all the DNA molecule must be duplicated.

Duplication of an old DNA molecule into two new DNA molecules is called Replication.

Page 14: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

14

Replicaton (2)Replicaton (2)

During replication, the DNA helix is unraveled and its two strands are separated. An area known as the replication bubble forms and progresses along the molecule in both direction. Then each DNA strand serves as a template for the synthesis of a new complementary strand.

Each daughter DNA molecule is an exact copy of its parent molecule, consisting of one old and one new DNA strand. Thus the replication is semi-conservative

Page 15: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

15

TranscriptionTranscription

Scientists snap first 3-D pictures of the "heart" of the transcription machine.

Page 16: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

16

EditingEditing

Capping

PolyAdenylation

RNA splicing

Page 17: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

17

5’ Capping (1)5’ Capping (1)

There are three cap structures that have been identified:

Cap 0, Cap 1 and Cap 2. The structures all have

the methylated guanine but differ in the extent of methylation of the ribose moiety of the first and second position of the RNA chain.

Page 18: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

18

5’ Capping (2)5’ Capping (2) 3 functions of 5’ cap

protection of the RNA from degradation

translatability transport from the

nucleus to the cytoplasm

Page 19: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

19

Polyadenylation (1)Polyadenylation (1)

Polyadenylation mechanism involves following steps:1. The cutting of the RNA chain at a particular site2. The addition of the poly [A] to the 3' end of the pre-mRNA3. The degradation of the remainder of the RNA transcript

Page 20: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

20

Polyadenylation (2)Polyadenylation (2)

Two functions for

poly [A] on mRNA;

1. Protection

2. Translatability

Page 21: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

21

RNA Splicing (1)RNA Splicing (1)

Page 22: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

22

RNA Splicing (2)RNA Splicing (2)

Page 23: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

23

RNA Splicing (3)RNA Splicing (3)

Page 24: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

24

Translation (1)Translation (1)

Page 25: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

25

Translation (2)Translation (2)

tRNA structure

Page 26: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

26

Translation (3)Translation (3)

Page 27: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

27

Translation (4)Translation (4)

InitiationInitiation

Page 28: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

28

Translation (5)Translation (5) ElongationElongation

Page 29: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

29

Translation (6)Translation (6) TerminationTermination

Page 30: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

30

The First DNA Computing The First DNA Computing MethodMethod

L. M. Adleman, Molecular Computation of Solutions to Combinatorial Problems,

Science, 266:1021-1024, 1994

Page 31: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

31

First special DNA computerFirst special DNA computer

Special problem: given N points, find a path visiting each and every point only once, and starting and ending at a given locations. (Hamiltonian path problem)

Solved with a DNA computer by Leonard Adleman in 1994 for N=7

Basic approach: code each point as an 8 unit DNA string, code each possible path, allow DNA bonding, suppress DNA with incorrect start/end points.

Page 32: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

32

Hamiltonian Path ProblemHamiltonian Path Problem

The Hamiltoian path problem: as the number of cities grows, even supercomputers have difficulty finding the path.

1

0

3

2 5

6

4

Page 33: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

33

Adleman’s Molecular Computer: Adleman’s Molecular Computer: A Brute Force MethodA Brute Force Method

Each city (vertex) is represented by a

different sequence of nucleotides (6

here, but Adleman used 20).

A DNA linker (edge) joining two

city (vertex) strands.

Page 34: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

34

Encoding (Basic Concept)Encoding (Basic Concept)

1

32

AGCT TAGGP1A P1B

TAGG CATGP2A P2B

CGAT CGAAP3A P3B

P1B P3AP1B P2A

10

3

2 56

4

ATCC TACCATCC GCTA

Page 35: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

35

ProcedureProcedureGenerate random paths

through the graph

Keep only those paths that begin with vin and end with vout

If the graph has n vertices, then keep only those paths that enter exactly n vertices

If any paths remain, say “Yes”; otherwise, say “No.”

Hybridization & Ligation

PCR with vin and vout

Gel electrophoresis

Gel electrophoresis &Sequecing

Keep only those paths that enter all of the vertices

of the graph at least once.

Antibody bead separation With vi

Page 36: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

36

AGCTTAGG

ATGGCATG

ATCCTACC

Vertex 1 Vertex 2

Edge 12

Step 1 : Hybridization

AGCTTAGG ATGGCATGATCC TACC

AGCTTAGGATCCTACC

Step 2 : Ligation

AGCTTAGGATGGCATGGAATCCGATGCATGGCTCGAATCC ACGTACCG

Vertex 1

ATGGCATG

Vertex 4

Step 3 : PCR

56 bp 16 bp

Step 4 : Gel Electrophoresis

AGCTTAGGATGGCATGGAATCCGA…TCGAATCC

Bead for vertex 1

Step 5 : Magnetic Bead Affinity Separation

Page 37: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

37

HPP

...

...... ...

ATGATG

ACGACG

TGCTGC

CGACGATAATAAGCAGCA

CGTCGT

......

...

...

......

......

10

3

2 5

6

4

Solution

PCR(Polymerase

Chain Reaction)

ATGTGCTAACGAACG

ACGCGAGCATAAATGTGCCGTACGCGAGCATAAATGTGCCGT

TAAACG

CGACGT

TAAACGGCAACG

...

...

...

...

CGACGTAGCCGT

...

...

...

ACGCGAGCATAAATGTGCCGTACGCGAGCATAAATGTGCCGT

ACGCGTAGCCGT

ACGCGT...

...

...

...

...

ACGGCATAAATGTGCACGCGT

ACGCGAGCATAAATGCGATGCCGT

ACGCGAGCATAAATGTGCCGTACGCGAGCATAAATGTGCCGT

......

.........

ACGCGAGCATAAATGTGCCGTACGCGAGCATAAATGTGCCGT

...

......

...

...

Decoding

LigationEncoding

Gel Electrophoresis

Affinity Column

ACGCGAGCATAAATGTGCACGCGT

ACGCGAGCATAAATGCGATGCACGCGT

ACGCGAGCATAAATGTGCACGCGT

ACGCGAGCATAAATGCGATGCACGCGT

2

01

3 4

5

6

Node 0 : ACG Node 3 : TAANode 1 : CGA Node 4 : ATGNode 2 : GCA Node 5 : TGC

Node 6 : CGT

Page 38: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

38

DNA finds a solution!DNA finds a solution!

A Hamiltonian path with all vertices included is isolated and recovered

Page 39: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

39

DNA as a Computational ToolDNA as a Computational Tool

Page 40: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

40

DNA Memory DNA Memory

A string composed of a series of four types of units (nucleotides), DNA may be viewed as logic memory or gate.

Number System (Base 4):

Nucleotide

A

C

T

G

Complement Nucleotide

DNA bindingprocess

Two strings of DNA are bonded by paired nucleotides A-C and C-G which may be considered as complements. Example:

Number TTACAG has a complement AATGTC

Page 41: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

41

DNA MemoryDNA Memory

DNAmemory strands

a t c g g

t c a t ag c a c t

0 0 0

a t c g g

t c a t a

1 0 1

t a g c c c g t g a

Writing : make DNA sequences

Reading : hybridization and readout

Page 42: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

42

DNA OperatorsDNA Operators

The bio-lab technology.

Hybridization Ligation Polymerase Chain Reaction (PCR) Gel electrophoresis Affinity separation (Bead) Enzymes: restriction enzyme…

Page 43: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

43

Hybridization & LigationHybridization & Ligation

Hybridization base-pairing between two complementary single-strand

molecules to form a double stranded DNA molecule

Ligation Joining DNA fragments together

Solution generation step!

Page 44: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

44

DNA Hybridization & LigationDNA Hybridization & Ligation

CGTACCTTAGGCT

AGCTTAGGATGGCATGG AATCCGATGCATGGC

CGTACCTTAGGCTAGCTTAGGATGGCATGGAATCCGATGCATGGC

CGTACCTTAGGCTAGCTTAGGATGGCATGGAATCCGATGCATGGC

CGTACCTTAGGCT

AGCTTAGGATGGCATGGAATCCGATGCATGGC+

+

Ligation

Hybridization

Dehybridization

Page 45: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

45

PCR (PCR (PPolymerase olymerase CChain hain RReaction)eaction)

Mullis: Nobel Prize (1993)

Amplifies (produces identical copies of) selected dsDNA molecules.

Make 2n copies (n : number of iteration)

Solution filtering or amplification step!

Page 46: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

46

PCRPCR(Polymerase Chain Reaction)(Polymerase Chain Reaction)

Page 47: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

47

Gel electrophoresisGel electrophoresis

Molecular size fraction technique

Detect the specific DNA

Bead SeparationBead Separation

Solution detection or filtering step!

Page 48: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

48

Gel ElectrophoresisGel Electrophoresis

Page 49: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

49

Complementary

Magnetic Beads

Magnet

Bead Separation Bead Separation (1)(1)

Page 50: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

50

Bead Separation Bead Separation (2)(2)

Biotin (Vitamin H)

Page 51: (C) 2002, SNU Biointelligence Lab,  A Computer Scientist’s Guide to Molecular Biology Biointelligence Lab. Interdisciplinary Program

(C) 2002, SNU Biointelligence Lab, http://bi.snu.ac.kr/

51

Restriction enzymeRestriction enzyme

Cut the specific DNA site. Solution detection or filtering step!

A A G C T T

T T C G A A

A

T T C G A

A C G T T

A

OH 3’

3’ OH

5’ P

P 5’EcoRI