computational methods for systems and synthetic...
TRANSCRIPT
20/10/10 François Fages MPRI C219 1
Computational Methods for Systems and Synthetic Biology
François Fages
The French National Institute for Research in Computer Science and Control
INRIA Paris-Rocquencourt
Constraint Programming Grouphttp://contraintes.inria.fr/
20/10/10 François Fages MPRI C219 2
Cell Molecules
" Small molecules: covalent bonds 50-200 kcal/mol
� 70% water
� 1% ions
� 6% amino acids (20), nucleotides (5),
� fats, sugars, ATP, ADP, &
" Macromolecules: hydrogen bonds, ionic, hydrophobic, Waals 1-5 kcal/mol
Stability and bindings determined by the number of weak bonds: 3D shape
� 20% proteins (50-104 amino acids)
� RNA (102-104 nucleotides AGCU)
� DNA (102-106 nucleotides AGCT)
20/10/10 François Fages MPRI C219 3
DNA Deoxyribonucleic Acid
1) Primary structure: word over 4 nucleotides
Adenine, Guanine, Cytosine, Thymine
2) Secondary structure:
double helix of pairs A--T and C---G
stabilized by hydrogen bonds
Nobel Prizes Watson and Crick (1956)
Size of DNA = number of pairs
A gene is a sequence of DNA pairs coding for a protein or an RNA having a function in the organism
20/10/10 François Fages MPRI C219 4
DNA: Genome Size
140 Gb&
15 Gb&
3 Gb&
12 Mb
100 %1 circular5 MbE. Coli (bacteria)
Coding DNAChromosomesGenome sizeSpecies
Artificial life by Craig Venter:fully synthetic bacteria genome (0,39 $/b)implemented in a bacteria without DNA still living and proliferating!
20/10/10 François Fages MPRI C219 5
DNA: Genome Size
140 Gb&
15 Gb&
3 Gb&
70 %1612 MbS. Cerevisae (yeast)
100 %1 circular5 MbE. Coli (bacteria)
Coding DNAChromosomesGenome sizeSpecies
20/10/10 François Fages MPRI C219 6
DNA: Genome Size
140 Gb&
15 Gb&
15 %20, 233 GbMouse, Human
70 %1612 MbS. Cerevisae (yeast)
100 %1 circular5 MbE. Coli (bacteria)
Coding DNAChromosomesGenome sizeSpecies
3,200,000,000 pairs of nucleotides
single nucleotide polymorphism 1 / 2kb
20/10/10 François Fages MPRI C219 7
Genome Size
140 Gb&
1 %815 GbOnion
15 %20, 233 GbMouse, Human
70 %1612 MbS. Cerevisae (yeast)
100 %14 MbE. Coli (bacteria)
Coding DNAChromosomesGenome sizeSpecies
20/10/10 François Fages MPRI C219 8
Genome Size
0.7 %140 GbLungfish
1 %815 GbOnion
15 %20, 233 GbMouse, Human
70 %1612 MbS. Cerevisae (yeast)
100 %14 MbE. Coli (bacteria)
Coding DNAChromosomesGenome sizeSpecies
20/10/10 François Fages MPRI C219 9
DNA Replication
1. Separation of the two helices
2. Production of one complementary strand for each copy
(from one or several starting points of replication)
3. Segregation
4. Mitosis (cell division)
20/10/10 François Fages MPRI C219 10
Gene Transcription and Translation
" Activation (Inhibition): Nobel prize Jakob and Monod (1965)
transcription factors bind to the regulatory region of the gene
2. Transcription:
RNA polymerase copies the DNA from start to stop positions
into a single stranded pre-mature messenger pRNA
3. (Alternative) splicing:
non coding regions of pRNA are removed giving mature messenger mRNA
4. Translation:
mRNA moves to cytoplasm and binds to ribosome to assemble a protein
20/10/10 François Fages MPRI C219 11
Formal Genes: Syntax
" Part of DNA #E2
" Activation
binding of promotion factor #E2-(E2f13-DP12)
" Repression (inhibition) Genes and signals [Ptashne Gann 01]
binding of another molecule #E2-Rep
20/10/10 François Fages MPRI C219 12
Transcription and Translation Rules
Activation
#E2 + E2f13DP12 => #E2E2f13DP12Repression
#E2 + Rep => #E2Rep
Genes and signals [Ptashne Gann 01]
20/10/10 François Fages MPRI C219 13
Transcription and Translation Rules
Activation
#E2 + E2f13DP12 => #E2E2f13DP12Repression
#E2 + Rep => #E2RepTranscription
_ =[#E2E2F13DP12]=> pRNAcycA
20/10/10 François Fages MPRI C219 14
Transcription and Translation Rules
Activation
#E2 + E2f13DP12 => #E2E2f13DP12Repression
#E2 + Rep => #E2RepTranscription
_ =[#E2E2F13DP12]=> pRNAcycA(Alternative) Splicing
pRNAcycA => mRNAcycA (pRNAcycA => mRNAcycA2)
20/10/10 François Fages MPRI C219 15
Transcription and Translation Rules
Activation
#E2 + E2f13DP12 => #E2E2f13DP12Repression
#E2 + Rep => #E2RepTranscription
_ =[#E2E2F13DP12]=> pRNAcycA(Alternative) Splicing
pRNAcycA => mRNAcycA (pRNAcycA => mRNAcycA2) Translation
mRNAcycA => mRNAcycA::cyt mRNAcycA::cyt =[ribosome::cyt]=> cycA::cyt(mRNAcycA2::cyt =[ribosome::cyt]=> cycA2::cyt)
20/10/10 François Fages MPRI C219 16
Proteins
1) Primary structure: word of n amino acids residues (20n possibilities)
linked with C-N bonds
20/10/10 François Fages MPRI C219 17
Proteins
1) Primary structure: word of n amino acids residues (20n possibilities)
linked with C-N bonds
Example: MPRI
Methionine-Proline-Arginine-Isoleucine
20/10/10 François Fages MPRI C219 18
Proteins
1) Primary structure: word of n amino acids residues (20n possibilities)
linked with C-N bonds
Example: MPRI
Methionine-Proline-Arginine-Isoleucine
2) Secondary: word of m α−helix, β−strands, random coils,& (3m-10m)
stabilized by hydrogen bonds H---O
20/10/10 François Fages MPRI C219 19
Proteins
1) Primary structure: word of n amino acids residues (20n possibilities)
linked with C-N bonds
Example: MPRI
Methionine-Proline-Arginine-Isoleucine
2) Secondary: word of m α−helix, β−strands, random coils,& (3m-10m)
stabilized by hydrogen bonds H---O
3) Tertiary 3D structure: spatial folding
stabilized by hydrophobic interactions
explains the protein interaction capabilities
20/10/10 François Fages MPRI C219 20
Formal Proteins: Syntax" Cyclin dependent kinase 1 Cdk1
(free, inactive)
20/10/10 François Fages MPRI C219 21
Formal Proteins: Syntax" Cyclin dependent kinase 1 Cdk1
(free, inactive)
" Complex Cdk1-Cyclin B Cdk1–CycB(low activity)
20/10/10 François Fages MPRI C219 22
Formal Proteins: Syntax" Cyclin dependent kinase 1 Cdk1
(free, inactive)
" Complex Cdk1-Cyclin B Cdk1–CycB(low activity)
" Phosphorylated form Cdk1~{thr161}CycBat site threonine 161
(high activity)
20/10/10 François Fages MPRI C219 23
Formal Proteins" Cyclin dependent kinase 1 Cdk1
(free, inactive)
" Complex Cdk1-Cyclin B Cdk1–CycB(low activity)
" Phosphorylated form Cdk1~{thr161}CycBat site threonine 161
(high activity)
�Mitosis-Promoting Factor�
phosphorylates actin in microtubules nuclear division
20/10/10 François Fages MPRI C219 24
Elementary Rule Schemas
" Complexation: A + B => A-B. Decomplexation A-B => A + B.
cdk1+cycB => cdk1–cycB
20/10/10 François Fages MPRI C219 25
Elementary Rule Schemas
" Complexation: A + B => A-B. Decomplexation A-B => A + B.
cdk1+cycB => cdk1–cycB
" Phosphorylation: A =[C]=> A~{p}. Dephosphorylation A~{p} =[C]=> A.
Cdk1CycB =[Myt1]=> Cdk1~{thr161}CycBCdk1~{thr14,tyr15}CycB =[Cdc25~{Nterm}]=> Cdk1CycB
20/10/10 François Fages MPRI C219 26
Elementary Rule Schemas
" Complexation: A + B => A-B. Decomplexation A-B => A + B.
cdk1+cycB => cdk1–cycB
" Phosphorylation: A =[C]=> A~{p}. Dephosphorylation A~{p} =[C]=> A.
Cdk1CycB =[Myt1]=> Cdk1~{thr161}CycBCdk1~{thr14,tyr15}CycB =[Cdc25~{Nterm}]=> Cdk1CycB
" Synthesis: _ =[C]=> A. Degradation: A =[C]=> _.
_=[#E2E2f13Dp12]=>cycA cycE =[@UbiPro]=> _ (not for cycEcdk2 which is stable)
20/10/10 François Fages MPRI C219 27
Elementary Rule Schemas
" Complexation: A + B => A-B. Decomplexation A-B => A + B.
cdk1+cycB => cdk1–cycB
" Phosphorylation: A =[C]=> A~{p}. Dephosphorylation A~{p} =[C]=> A.
Cdk1CycB =[Myt1]=> Cdk1~{thr161}CycBCdk1~{thr14,tyr15}CycB =[Cdc25~{Nterm}]=> Cdk1CycB
" Synthesis: _ =[C]=> A. Degradation: A =[C]=> _.
_=[#E2E2f13Dp12]=>cycA cycE =[@UbiPro]=> _ (not for cycEcdk2 which is stable)
" Transport: A::L1 => A::L2.
Cdk1~{p}CycB::cytoplasm=>Cdk1~{p}CycB::nucleus
20/10/10 François Fages MPRI C219 28
Graphical Representation
Hypergraph of reactions represented by a bipartite graph (S,R,A) with vertices for species S and for reactions R (Petri net representation)
A+B => A-B
Systems Biology Graphical Notation (SBGN)
20/10/10 François Fages MPRI C219 29
Biocham Syntax of Objects Entities E == name | #name | EE | E~{p1,…,pn}
name of molecular compound or gene binding site
: binding operator for protein complexes, gene binding sites, &
Associative and commutative.
~{…}: modification operator for phosphorylated sites, &
Set of modified sites (Associative, Commutative, Idempotent).
Induce a congruence on objects
Objects O == E | E::location
location: symbolic compartment (nucleus, cytoplasm, membrane, & )
dynamic volume
20/10/10 François Fages MPRI C219 30
Biocham Syntax of RulesSolutions S ::= _ | O + S | i*O + S
+ : solution operator (Associative, Commutative, Neutral element _)
Rules R ::= S => S | kineticexpression for R
Abbreviations for catalytic reactions: A =[C]=> B stands for A+C => B+C reversible reactions: A <=> B stands for A=>B and B=>A,
Syntax compatible with the Systems Biology Markup Language SBML
Import/export exchange format for reaction models in XML
Biomodels.net: repository of 241 curated SBML models of cell processes
20/10/10 François Fages MPRI C219 31
Biocham Models
A Biocham model is a finite set of Biocham rules
Questions:
" Decidability properties ?
" Which type of chemical reactions cannot be represented ?
" Which type of locations cannot be represented ?
20/10/10 François Fages MPRI C219 32
Kinetic Expressions
Mass action law kinetics:
k*[A] for A => B
k*[A]*[B] for A+B => C
k*[A]^m*[B]^n for m*A + n*B => R
&
Michaelis Menten kinetics:
Vm*[A]/(Km+[A]) for A => B
Hill kinetics:
Vm*[A]^n/(Km+[A]^n) for A => B
Guldberg and Waage, 1864
20/10/10 François Fages MPRI C219 33
Kinetic Rate Constants
" Complexation: probability of reaction upon collision (specificity, affinity)
position of matching surfaces
" Decomplexation: total energy of all bonds
(giving dissociation rates)
" Diffusion speeds (small molecules>substrates>enzymes& )
Average travel in one random walk: 1 μm in 1s, 2μm in 4s, 10μm in 100s
" For one enzyme:
500000 random collisions per second with a substrate concentration of 10-5
50000 random collisions per second with a substrate concentration of 10-6
20/10/10 François Fages MPRI C219 34
Semantics of Rule-based Models
Reaction rule k*[A]*[B] for A+B => C" Differential Semantics: concentrations
� Ordinary Differential Equations dA/dt = -k*[A]*[B]
dB/dt = -k*[A]*[B]
dC/dt = k*[A]*[B]
� Hybrid automata (for kinetics with conditional expressions)
20/10/10 François Fages MPRI C219 35
Semantics of Rule-based Models
Reaction rule k*[A]*[B] for A+B => C" Differential Semantics: concentrations
� Ordinary Differential Equations dA/dt = -k*[A]*[B]
dB/dt = -k*[A]*[B]
dC/dt = k*[A]*[B]
� Hybrid automata (for kinetics with conditional expressions)
" Stochastic Semantics: numbers of molecules
� Continuous time Markov chain A, B p A--, B--, C++
20/10/10 François Fages MPRI C219 36
Semantics of Rule-based Models
Reaction rule k*[A]*[B] for A+B => C" Differential Semantics: concentrations
� Ordinary Differential Equations dA/dt = -k*[A]*[B]
dB/dt = -k*[A]*[B]
dC/dt = k*[A]*[B]
� Hybrid automata (for kinetics with conditional expressions)
" Stochastic Semantics: numbers of molecules
� Continuous time Markov chain A, B p A--, B--, C++
" Boolean Semantics: presence-absence of molecules
� Asynchronous Transition System A, B C, A/¬A, B/¬B
20/10/10 François Fages MPRI C219 37
Hierarchy of Semantics
Stochastic model Differential model
Discrete model
abstraction
concretization
Boolean model
Theory of abstract Interpretation
Abstractions as Galois connections [Cousot Cousot POPL� 77]
[Fages Soliman CMSB� 06,TCS� 07]
Syntactical
model
20/10/10 François Fages MPRI C219 38
Regulation Graphs as Abstractions
Discrete model
abstraction
concretization
Boolean model
Syntactical
model[Fages Soliman CMSB’06]
Syntactic regulation graph
(pos/neg influences w.r.t.
the stoichiometric coef.
in rules)
Thm. Same graphs for
monotonic kinetics
Jacobian regulation graph
(pos/neg influences w.r.t.
the sign of the coefficients)
Stochastic model Differential model
20/10/10 François Fages MPRI C219 39
Regulation Graph Circuit Analyses
Discrete model
abstraction
concretization
Boolean model
Syntactical
model
Jacobian circuit analysis
Discrete circuit analysis
Boolean circuit analysisabstraction
abstraction
abstraction
Thm. Positive (resp. negative) circuits are a necessary condition for multistationarity (resp. oscillations)
[Thomas 81] [Snoussi 89] [Soulé 03] [Remy Ruet Thieffry 05] [Richard 07]
Stochastic model Differential model