computational methods for systems and synthetic...

39
20/10/10 François Fages - MPRI C2-19 1 Computational Methods for Systems and Synthetic Biology François Fages The French National Institute for Research in Computer Science and Control INRIA Paris-Rocquencourt Constraint Programming Group http://contraintes.inria.fr/

Upload: others

Post on 12-Mar-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 1

Computational Methods for Systems and Synthetic Biology

François Fages

The French National Institute for Research in Computer Science and Control

INRIA Paris-Rocquencourt

Constraint Programming Grouphttp://contraintes.inria.fr/

Page 2: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 2

Cell Molecules

" Small molecules: covalent bonds 50-200 kcal/mol

� 70% water

� 1% ions

� 6% amino acids (20), nucleotides (5),

� fats, sugars, ATP, ADP, &

" Macromolecules: hydrogen bonds, ionic, hydrophobic, Waals 1-5 kcal/mol

Stability and bindings determined by the number of weak bonds: 3D shape

� 20% proteins (50-104 amino acids)

� RNA (102-104 nucleotides AGCU)

� DNA (102-106 nucleotides AGCT)

Page 3: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 3

DNA Deoxyribonucleic Acid

1) Primary structure: word over 4 nucleotides

Adenine, Guanine, Cytosine, Thymine

2) Secondary structure:

double helix of pairs A--T and C---G

stabilized by hydrogen bonds

Nobel Prizes Watson and Crick (1956)

Size of DNA = number of pairs

A gene is a sequence of DNA pairs coding for a protein or an RNA having a function in the organism                               

Page 4: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 4

DNA: Genome Size

140 Gb&

15 Gb&

3 Gb&

12 Mb

100 %1 circular5 MbE. Coli (bacteria)

Coding DNAChromosomesGenome sizeSpecies

Artificial life by Craig Venter:fully synthetic bacteria genome (0,39 $/b)implemented in a bacteria without DNA still living and proliferating!

Page 5: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 5

DNA: Genome Size

140 Gb&

15 Gb&

3 Gb&

70 %1612 MbS. Cerevisae (yeast)

100 %1 circular5 MbE. Coli (bacteria)

Coding DNAChromosomesGenome sizeSpecies

Page 6: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 6

DNA: Genome Size

140 Gb&

15 Gb&

15 %20, 233 GbMouse, Human

70 %1612 MbS. Cerevisae (yeast)

100 %1 circular5 MbE. Coli (bacteria)

Coding DNAChromosomesGenome sizeSpecies

3,200,000,000 pairs of nucleotides

single nucleotide polymorphism 1 / 2kb

Page 7: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 7

Genome Size

140 Gb&

1 %815 GbOnion

15 %20, 233 GbMouse, Human

70 %1612 MbS. Cerevisae (yeast)

100 %14 MbE. Coli (bacteria)

Coding DNAChromosomesGenome sizeSpecies

Page 8: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 8

Genome Size

0.7 %140 GbLungfish

1 %815 GbOnion

15 %20, 233 GbMouse, Human

70 %1612 MbS. Cerevisae (yeast)

100 %14 MbE. Coli (bacteria)

Coding DNAChromosomesGenome sizeSpecies

Page 9: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 9

DNA Replication

1. Separation of the two helices

2. Production of one complementary strand for each copy

(from one or several starting points of replication)

3. Segregation

4. Mitosis (cell division)

Page 10: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 10

Gene Transcription and Translation

" Activation (Inhibition): Nobel prize Jakob and Monod (1965)

transcription factors bind to the regulatory region of the gene

2. Transcription:

RNA polymerase copies the DNA from start to stop positions

into a single stranded pre-mature messenger pRNA

3. (Alternative) splicing:

non coding regions of pRNA are removed giving mature messenger mRNA

4. Translation:

mRNA moves to cytoplasm and binds to ribosome to assemble a protein

Page 11: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 11

Formal Genes: Syntax

" Part of DNA #E2

" Activation

binding of promotion factor #E2-(E2f13-DP12)

" Repression (inhibition) Genes and signals [Ptashne Gann 01]

binding of another molecule #E2-Rep

Page 12: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 12

Transcription and Translation Rules

Activation

#E2 + E2f13­DP12 => #E2­E2f13­DP12Repression

#E2 + Rep => #E2­Rep

                                    Genes and signals [Ptashne Gann 01]

Page 13: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 13

Transcription and Translation Rules

Activation

#E2 + E2f13­DP12 => #E2­E2f13­DP12Repression

#E2 + Rep => #E2­RepTranscription

_ =[#E2­E2F13­DP12]=> pRNAcycA

Page 14: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 14

Transcription and Translation Rules

Activation

#E2 + E2f13­DP12 => #E2­E2f13­DP12Repression

#E2 + Rep => #E2­RepTranscription

_ =[#E2­E2F13­DP12]=> pRNAcycA(Alternative) Splicing

pRNAcycA => mRNAcycA       (pRNAcycA => mRNAcycA2)

Page 15: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 15

Transcription and Translation Rules

Activation

#E2 + E2f13­DP12 => #E2­E2f13­DP12Repression

#E2 + Rep => #E2­RepTranscription

_ =[#E2­E2F13­DP12]=> pRNAcycA(Alternative) Splicing

pRNAcycA => mRNAcycA       (pRNAcycA => mRNAcycA2)   Translation

mRNAcycA => mRNAcycA::cyt                           mRNAcycA::cyt =[ribosome::cyt]=> cycA::cyt(mRNAcycA2::cyt =[ribosome::cyt]=> cycA2::cyt)

Page 16: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 16

Proteins

1) Primary structure: word of n amino acids residues (20n possibilities)

linked with C-N bonds

Page 17: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 17

Proteins

1) Primary structure: word of n amino acids residues (20n possibilities)

linked with C-N bonds

Example: MPRI

Methionine-Proline-Arginine-Isoleucine

Page 18: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 18

Proteins

1) Primary structure: word of n amino acids residues (20n possibilities)

linked with C-N bonds

Example: MPRI

Methionine-Proline-Arginine-Isoleucine

2) Secondary: word of m α−helix, β−strands, random coils,& (3m-10m)

stabilized by hydrogen bonds H---O

Page 19: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 19

Proteins

1) Primary structure: word of n amino acids residues (20n possibilities)

linked with C-N bonds

Example: MPRI

Methionine-Proline-Arginine-Isoleucine

2) Secondary: word of m α−helix, β−strands, random coils,& (3m-10m)

stabilized by hydrogen bonds H---O

3) Tertiary 3D structure: spatial folding

stabilized by hydrophobic interactions

explains the protein interaction capabilities

Page 20: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 20

Formal Proteins: Syntax" Cyclin dependent kinase 1 Cdk1

(free, inactive)

Page 21: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 21

Formal Proteins: Syntax" Cyclin dependent kinase 1 Cdk1

(free, inactive)

" Complex Cdk1-Cyclin B Cdk1–CycB(low activity)

Page 22: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 22

Formal Proteins: Syntax" Cyclin dependent kinase 1 Cdk1

(free, inactive)

" Complex Cdk1-Cyclin B Cdk1–CycB(low activity)

" Phosphorylated form Cdk1~{thr161}­CycBat site threonine 161

(high activity)

Page 23: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 23

Formal Proteins" Cyclin dependent kinase 1 Cdk1

(free, inactive)

" Complex Cdk1-Cyclin B Cdk1–CycB(low activity)

" Phosphorylated form Cdk1~{thr161}­CycBat site threonine 161

(high activity)

�Mitosis-Promoting Factor�

phosphorylates actin in microtubules nuclear division

Page 24: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 24

Elementary Rule Schemas

" Complexation: A + B => A-B. Decomplexation A-B => A + B.

cdk1+cycB => cdk1–cycB

Page 25: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 25

Elementary Rule Schemas

" Complexation: A + B => A-B. Decomplexation A-B => A + B.

cdk1+cycB => cdk1–cycB

" Phosphorylation: A =[C]=> A~{p}. Dephosphorylation A~{p} =[C]=> A.

Cdk1­CycB =[Myt1]=> Cdk1~{thr161}­CycBCdk1~{thr14,tyr15}­CycB =[Cdc25~{Nterm}]=> Cdk1­CycB

Page 26: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 26

Elementary Rule Schemas

" Complexation: A + B => A-B. Decomplexation A-B => A + B.

cdk1+cycB => cdk1–cycB

" Phosphorylation: A =[C]=> A~{p}. Dephosphorylation A~{p} =[C]=> A.

Cdk1­CycB =[Myt1]=> Cdk1~{thr161}­CycBCdk1~{thr14,tyr15}­CycB =[Cdc25~{Nterm}]=> Cdk1­CycB

" Synthesis: _ =[C]=> A.  Degradation: A =[C]=> _. 

_=[#E2­E2f13­Dp12]=>cycA   cycE =[@UbiPro]=> _ (not for cycE­cdk2 which is stable)

Page 27: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 27

Elementary Rule Schemas

" Complexation: A + B => A-B. Decomplexation A-B => A + B.

cdk1+cycB => cdk1–cycB

" Phosphorylation: A =[C]=> A~{p}. Dephosphorylation A~{p} =[C]=> A.

Cdk1­CycB =[Myt1]=> Cdk1~{thr161}­CycBCdk1~{thr14,tyr15}­CycB =[Cdc25~{Nterm}]=> Cdk1­CycB

" Synthesis: _ =[C]=> A.  Degradation: A =[C]=> _. 

_=[#E2­E2f13­Dp12]=>cycA   cycE =[@UbiPro]=> _ (not for cycE­cdk2 which is stable)

" Transport: A::L1 => A::L2.

Cdk1~{p}­CycB::cytoplasm=>Cdk1~{p}­CycB::nucleus

Page 28: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 28

Graphical Representation

Hypergraph of reactions represented by a bipartite graph (S,R,A) with vertices for species S and for reactions R (Petri net representation)

A+B => A-B

Systems Biology Graphical Notation (SBGN)

Page 29: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 29

Biocham Syntax of Objects Entities E == name | #name | E­E | E~{p1,…,pn}  

name of molecular compound or gene binding site

­ : binding operator for protein complexes, gene binding sites, &

Associative and commutative.

~{…}: modification operator for phosphorylated sites, &

Set of modified sites (Associative, Commutative, Idempotent).

Induce a congruence on objects

Objects O == E | E::location

location: symbolic compartment (nucleus, cytoplasm, membrane, & )

dynamic volume

Page 30: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 30

Biocham Syntax of RulesSolutions S ::=   _ | O + S | i*O + S

+ : solution operator (Associative, Commutative, Neutral element _)

Rules R ::=   S => S | kinetic­expression for R

Abbreviations for catalytic reactions: A =[C]=> B stands for A+C => B+C reversible reactions: A <=> B stands for A=>B and B=>A,

Syntax compatible with the Systems Biology Markup Language SBML

Import/export exchange format for reaction models in XML

Biomodels.net: repository of 241 curated SBML models of cell processes

Page 31: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 31

Biocham Models

A Biocham model is a finite set of Biocham rules

Questions:

" Decidability properties ?

" Which type of chemical reactions cannot be represented ?

" Which type of locations cannot be represented ?

Page 32: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 32

Kinetic Expressions

Mass action law kinetics:

k*[A] for A => B

k*[A]*[B] for A+B => C

k*[A]^m*[B]^n for m*A + n*B => R

&

Michaelis Menten kinetics:

Vm*[A]/(Km+[A]) for A => B

Hill kinetics:

Vm*[A]^n/(Km+[A]^n) for A => B

Guldberg and Waage, 1864

Page 33: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 33

Kinetic Rate Constants

" Complexation: probability of reaction upon collision (specificity, affinity)

position of matching surfaces

" Decomplexation: total energy of all bonds

(giving dissociation rates)

" Diffusion speeds (small molecules>substrates>enzymes& )

Average travel in one random walk: 1 μm in 1s, 2μm in 4s, 10μm in 100s

" For one enzyme:

500000 random collisions per second with a substrate concentration of 10-5

50000 random collisions per second with a substrate concentration of 10-6

Page 34: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 34

Semantics of Rule-based Models

Reaction rule k*[A]*[B] for A+B => C" Differential Semantics: concentrations

� Ordinary Differential Equations dA/dt = -k*[A]*[B]

dB/dt = -k*[A]*[B]

dC/dt = k*[A]*[B]

� Hybrid automata (for kinetics with conditional expressions)

Page 35: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 35

Semantics of Rule-based Models

Reaction rule k*[A]*[B] for A+B => C" Differential Semantics: concentrations

� Ordinary Differential Equations dA/dt = -k*[A]*[B]

dB/dt = -k*[A]*[B]

dC/dt = k*[A]*[B]

� Hybrid automata (for kinetics with conditional expressions)

" Stochastic Semantics: numbers of molecules

� Continuous time Markov chain A, B p A--, B--, C++

Page 36: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 36

Semantics of Rule-based Models

Reaction rule k*[A]*[B] for A+B => C" Differential Semantics: concentrations

� Ordinary Differential Equations dA/dt = -k*[A]*[B]

dB/dt = -k*[A]*[B]

dC/dt = k*[A]*[B]

� Hybrid automata (for kinetics with conditional expressions)

" Stochastic Semantics: numbers of molecules

� Continuous time Markov chain A, B p A--, B--, C++

" Boolean Semantics: presence-absence of molecules

� Asynchronous Transition System A, B C, A/¬A, B/¬B

Page 37: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 37

Hierarchy of Semantics

Stochastic model Differential model

Discrete model

abstraction

concretization

Boolean model

Theory of abstract Interpretation

Abstractions as Galois connections [Cousot Cousot POPL� 77]

[Fages Soliman CMSB� 06,TCS� 07]

Syntactical

model

Page 38: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 38

Regulation Graphs as Abstractions

Discrete model

abstraction

concretization

Boolean model

Syntactical

model[Fages Soliman CMSB’06]

Syntactic regulation graph

(pos/neg influences w.r.t.

the stoichiometric coef.

in rules)

Thm. Same graphs for

monotonic kinetics

Jacobian regulation graph

(pos/neg influences w.r.t.

the sign of the coefficients)

Stochastic model Differential model

Page 39: Computational Methods for Systems and Synthetic Biologycontraintes.inria.fr/~fages/BioTeaching/J1.pdf · Systems and Synthetic Biology François Fages The French National Institute

20/10/10 François Fages ­ MPRI C2­19 39

Regulation Graph Circuit Analyses

Discrete model

abstraction

concretization

Boolean model

Syntactical

model

Jacobian circuit analysis

Discrete circuit analysis

Boolean circuit analysisabstraction

abstraction

abstraction

Thm. Positive (resp. negative) circuits are a necessary condition for multistationarity (resp. oscillations)

[Thomas 81] [Snoussi 89] [Soulé 03] [Remy Ruet Thieffry 05] [Richard 07]

Stochastic model Differential model