introduction to dna computing

55
Introduction to DNA Computing Russell Deaton Comp. Sci. & Engr. The University of Arkansas Fayetteville, AR 72701 [email protected] Junghuei Chen Department of Chem & Biochem University of Delaware Newark, DE 19716 [email protected]

Upload: others

Post on 10-Feb-2022

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to DNA Computing

Introduction to DNA ComputingRussell DeatonComp. Sci. & Engr.The University of ArkansasFayetteville, AR [email protected]

Junghuei ChenDepartment of Chem & BiochemUniversity of DelawareNewark, DE [email protected]

Page 2: Introduction to DNA Computing

What is DNA Computing(DNAC) ?

The use of biological molecules, primarily DNA, DNA analogs, and RNA, for computational purposes.

Page 3: Introduction to DNA Computing

Why Nucleic Acids?� Density (Adleman, Baum):

� DNA: 1 bit per nm3, 1020 molecules� Video: 1 bit per 1012 nm3

� Efficiency (Adleman)� DNA: 1019 ops / J� Supercomputer: 109 ops / J

� Speed (Adleman):� DNA: 1014 ops per s� Supercomputer: 1012 ops per s

Page 4: Introduction to DNA Computing

What makes DNAC possible?� Great advances in molecular biology

� PCR (Polymerase Chain Reaction)� DNA Microarrays� New enzymes and proteins� Better understanding of biological molecules

� Ability to produce massive numbers of DNAmolecules with specified sequence and size

� DNA molecules interact through templatematching reactions

Page 5: Introduction to DNA Computing

What are the basics frommolecular biology that I need to

know to understand DNAcomputing?

Page 6: Introduction to DNA Computing

PHYSICAL STRUCTURE OF DNA

NitrogenousBase

34 Å

MajorGroove

MinorGroove

Central Axis

Sugar-PhosphateBackbone

20 Å5� C

3� OH

3� 0HC 5�

5�

3�

3�

5�

Page 7: Introduction to DNA Computing

INTER-STRAND HYDROGEN BONDING

Adenine Thymine

to Sugar-PhosphateBackbone

to Sugar-PhosphateBackbone

(+) (-)

(+)(-)

Hydrogen Bond

Guanine Cytosine

to Sugar-PhosphateBackbone

to Sugar-PhosphateBackbone

(-) (+)

(+)(-)

(+)(-)

Page 8: Introduction to DNA Computing

STRAND HYBRIDIZATIONA B

a b

A B

ab

b

B

a

A

HEAT

COOL

ba

A B

OR

100° C

Page 9: Introduction to DNA Computing

DNA LIGATION

αααα δδδδ

αααα� δδδδ�

αααα� δδδδ�

Ligase Joins 5' phosphateto 3' hydroxyl

αααα� δδδδ�αααα

δδδδ

Page 10: Introduction to DNA Computing

RESTRICTION ENDONUCLEASES

EcoRI

HindIII

AluI

HaeIII

- OH 3�

5� P -

- P 5�

3� OH -

Page 11: Introduction to DNA Computing

DNA Polymerase

Page 12: Introduction to DNA Computing
Page 13: Introduction to DNA Computing

DNA Sequencing

Page 14: Introduction to DNA Computing

GEL ELECTROPHORESIS - SIZE SORTING

BufferGel

Electrode

Electrode

Samples

Faster

Slower

Page 15: Introduction to DNA Computing

ANTIBODY AFFINITY

CACCATGTGAC

GTGGTACACTG B

PMP

+

Anneal

CACCATGTGAC

GTGGTACACTG B+

CACCATGTGAC

GTGGTACACTG B PMP

Bind

Add oligo withBiotin label

Heat and cool

Add Paramagnetic-Streptavidin

Particles

Isolate with MagnetN

S

Page 16: Introduction to DNA Computing

POLYMERASECHAIN

REACTION

Page 17: Introduction to DNA Computing
Page 18: Introduction to DNA Computing
Page 19: Introduction to DNA Computing

What is a the typicalmethodology?

� Encoding: Map problem instance onto setof biological molecules and molecularbiology protocols

� Molecular Operations: Let molecules reactto form potential solutions

� Extraction/Detection: Use protocols toextract result in molecular form

Page 20: Introduction to DNA Computing

What is an example?

� �Molecular Computation of Solutions toCombinatorial Problems�

� Adleman, Science, v. 266, p. 1021.

Page 21: Introduction to DNA Computing
Page 22: Introduction to DNA Computing

Algorithm� Generate Random Paths through the graph.� Keep only those paths that begin with vin

and end with vout.� If graph has n vertices, then keep only those

paths that enter exactly n vertices.� Keep only those paths that enter all the

vertices at least once.� In any paths remain, say �Yes�; otherwise,

say �No�

Page 23: Introduction to DNA Computing

Encoding0

1

2

�GCATGGCC

�AGCTTAGG

�ATGGCATG

CCGGTCGA�CCGGTACC�

�GCATGGCCAGCTTAGG CCGGTCGA�

�GCATGGCCATGGCATG CCGGTACC�

00 21

Page 24: Introduction to DNA Computing
Page 25: Introduction to DNA Computing

What are the success stories?� Self-Assembling Demonstrated (Winfree,

Seeman, Reif et al)� New Approaches and Protocols Developed

� Surface-based (Wisconsin-Madison, Dimacs II)� Evolutionary Approaches (Wood and Chen,

Gecco-99, DNA-5)� Reversible Logic Gates (Rubin et al., DNA-5)� Whiplash PCR (Hagiya et al., DNA-3)� RNA Computing (Landweber et al., PNAS)

� How do cells and nature compute? (Kariand Landweber, DNA-4)

Page 26: Introduction to DNA Computing

Source: http://seemanlab4.chem.nyu.edu/

Self-Assembly

Page 27: Introduction to DNA Computing

Source: Winfree, DIMACS IV

Page 28: Introduction to DNA Computing
Page 29: Introduction to DNA Computing

Source: http://corninfo.chem.wisc.edu/writings/dnatalk/dna01.html

Page 30: Introduction to DNA Computing

In Vitro Evolution

Page 31: Introduction to DNA Computing

Source: http://www.princeton.edu/~lfl/washpost.html

DNA Computing in Cells

Page 32: Introduction to DNA Computing

DNA Fredkin Gates

Page 33: Introduction to DNA Computing

Whiplash PCR

Source: John Rose, Institute of Physics, University of Tokyo

Page 34: Introduction to DNA Computing

What are the challenges?

� Error: Molecular operations are not perfect.� Reversible and Irreversible Error� Efficiency: How many molecules

contribute?� Encoding problem in molecules is difficult.� Scaling to larger problems� Applications

Page 35: Introduction to DNA Computing

Mismatches

Page 36: Introduction to DNA Computing

DNA Word Design

� Design of DNA Sequences that hybridize asplanned (that is, minimize mismatches)

� Reliability: False Positives and Negatives� Efficiency: Hybridizations that Contribute

to Solution� Hybridizations are Templates for

Subsequent Enzymatic Steps

Page 37: Introduction to DNA Computing

DNA Word Design

� Minimum Distance Codes to PreventHybridization Error

� Distance Measure� Combinatoric (Hamming)� Energetic (Base Stacking Energy)

� Design DNA Words with EvolutionaryAlgorithms

� Good Codes Achievable

Page 38: Introduction to DNA Computing

Code Word

Hybridization

Code Word

Hybridization

Page 39: Introduction to DNA Computing

Base Stacking

Page 40: Introduction to DNA Computing
Page 41: Introduction to DNA Computing

What are the possibleapplications?

� DNAC and Conventional Computers

� DNAC and Evolutionary Computation

� DNAC and Biotechnology

� DNAC and Nanotechnology

Page 42: Introduction to DNA Computing

DNAC and ElectronicComputing

� Solution versus solid state� Individual molecules versus ensembles of

charge carriers� The importance of shape in biological

molecules� Programmability/Evolvability Trade-off

(Conrad)

Page 43: Introduction to DNA Computing

Edna

� Electronic DNA� Virtual Test Tube for Design and

Simulation of DNA Computations� Molecules as Cellular Automata� Solve Adleman and Other Problems� Distributed Edna to Solve Large Problems� New Paradigm

Page 44: Introduction to DNA Computing
Page 45: Introduction to DNA Computing

In Vitro EvolutionaryComputation

� Randomness and Uncertainty Inherent inBiomolecular Reactions

� Never Level of Control like EE over SolidState Devices

� Use Nature�s ToolBox: Enzymes,Reaction/Diffusion, Adaptability, andRobustness

� Evolved, Not Designed

Page 46: Introduction to DNA Computing
Page 47: Introduction to DNA Computing

DNAC and Biotechnology

� �Computationally Inspired Biotechnology�� DNA2DNA �killer app�� Automation of protocols� DNA Word Design (Gene Expression

Chips, Universal DNA chips)� Exquisite Detection of Biomaterials� Bio-engineered Materials

Page 48: Introduction to DNA Computing

Universal DNA Chips

� Universal chip has short oligonucleotidesequences (antitags) attached to solidsupport.

� W-C complement of antitag is tag.� Tag/Antitag pair designed to hybridize

strongly with each other, but not any otherTag/Antitag pair

Page 49: Introduction to DNA Computing

Target ReporterTag

Target Specific

PolymorphismSites

Single Nucleotide Polymorphisms

Page 50: Introduction to DNA Computing

Extension and Enzyme

Page 51: Introduction to DNA Computing

Antitag

Fluorescence

Page 52: Introduction to DNA Computing

Universal Chip Advantages

� Reusable Components

� Repeatable Manufacture

� Avoid problems with chip validation anderrors

� Target Specific in Solution Phase

Page 53: Introduction to DNA Computing

What developments can weexpect in the near-term(2002-...)?� Large libraries of evolved sequences� Iterative improvements in self-assembly,

WPCR, etc�� Increased emphasis on simulation and

modeling� Some impact on molecular biology by DNA

computation

Page 54: Introduction to DNA Computing

What are the long-termprospects?

� Cross-fertilization among evolutionarycomputing, DNA computing, molecularbiology, and computational biology

� Niche uses of DNA computers for problemsthat are difficult for electronic computers

� In Vitro Translation and Transcription

Page 55: Introduction to DNA Computing

Where can I learn more?� Web Sites:

� http://www.wi.leidenuniv.nl/~jdassen/dna.html� http://dope.caltech.edu/winfree/DNA.html� http://www.msci.memphis.edu/~garzonm/bmc.html� (Conrad) http://www.cs.wayne.edu/biolab/index.html

� DIMACS Proceedings: DNA Based Computers I (#27), II (#44), III (#48), IV (Special Issue of Biosystems), V (MIT, June 1999), VI (Leiden, June 2000)� Other: Genetic Programming 1 (Stanford, 1997), Genetic Programming 2 (Wisconsin-Madison, 1998), GECCO-1999,IEEE International Conference on Evolutionary Computation (Indianapolis, 1997)� G. Paun (ed.), Computing with Biomolecules: Theory and Experiment, Springer-Verlag, Singapore 1998.� �DNA Computing: A Review,� Fundamenta Informaticae, vol. 35, pp. 231-245.�M. H. Garzon and R. J. Deaton, �Biomolecular Computing and Programming,� IEEE Transactions on Evolutionary Computation, vol. 3, pp. 236-250, 1999.