flux balance analysis

5
Flux balance analysis: A geometric perspective Kieran Smallbone a,b, , Evangelos Simeonidis a,c a Manchester Centre for Integrative Systems Biology, Manchester Interdisciplinary Biocentre, 131 Princess Street, Manchester M17DN, UK b School of Mathematics, University of Manchester, Oxford Road, Manchester M13 9PL, UK c School of Chemical Engineering and Analytical Science, University of Manchester, Oxford Road, Manchester M13 9PL, UK article info Article history: Received 20 October 2008 Accepted 13 January 2009 Available online 7 February 2009 Keywords: Systems biology Metabolism Constraint-based modelling abstract Advances in the field of bioinformatics have led to reconstruction of genome-scale networks for a number of key organisms. The application of physicochemical constraints to these stoichiometric networks allows researchers, through methods such as flux balance analysis, to highlight key sets of reactions necessary to achieve particular objectives. The key benefits of constraint-based analysis lie in the minimal knowledge required to infer systemic properties. However, network degeneracy leads to a large number of flux distributions that satisfy any objective; moreover, these distributions may be dominated by biologically irrelevant internal cycles. By examining the geometry underlying the problem, we define two methods for finding a unique solution within the space of all possible flux distributions; such a solution contains no internal cycles, and is representative of the space as a whole. The first method draws on typical geometric knowledge, but cannot be applied to large networks because of the high computational complexity of the problem. Thus a second method, an iteration of linear programs which scales easily to the genome scale, is defined. The algorithm is run on four recent genome-scale models, and unique flux solutions are found. The algorithm set out here will allow researchers in flux balance analysis to exchange typical solutions to their models in a reproducible format. Moreover, having found a single solution, statistical analyses such as correlations may be performed. & 2009 Elsevier Ltd. All rights reserved. 1. Introduction Recent advances in genome sequencing techniques and bioinformatic analyses have led to an explosion of systems-wide biological data. In turn the reconstruction of genome-scale networks for micro-organisms has become possible. Whilst the first stoichiometric model of E. coli was limited to the central metabolic pathways (Varma and Palsson, 1993), the most recent reported model is much more comprehensive, consisting of 2077 reactions and 1039 metabolites (Feist et al., 2007). Reaction networks for S. cerevisiae have been similarly expanded through incorporation of more genes and their corresponding metabolites a recent consensus model consists of 1761 reactions and 1168 metabolites (Herrga ˚ rd et al., 2008). Genome-scale stoichiometric models for other micro-organisms (Kim et al., 2008) and even H. sapiens (Duarte et al., 2007) have been developed. The ability to analyse, interpret and ultimately predict cellular behaviour has been a long sought-after goal. The genome sequencing projects are defining the molecular components within the cell, and describing the integrated function of these molecular components will be a challenging task (Edwards and Palsson, 2000). Ideally, one would like to use kinetic modelling to characterize fully the mechanics of each enzymatic reaction, in terms of how changes in metabolite concentrations affect local reaction rates. However, a considerable amount of data is required to parameterize even a small mechanistic model; the deter- mination of such parameters is costly and time-consuming, and moreover many may be difficult or impossible to determine experimentally. Instead, genome-scale metabolic modelling has relied on constraint-based analysis (Beard et al., 2002; Covert et al., 2003; Kim et al., 2008; Price et al., 2004), which uses physicochemical constraints such as mass balance, energy balance, and flux limitations to describe the potential behaviour of an organism. In particular, flux balance analysis (FBA) (Kauffman et al., 2003) highlights the most effective and efficient paths through the network in order to achieve a particular objective function, such as the maximization of biomass or ATP production. The key benefit of FBA and similar techniques lies in the minimal amount of biological knowledge and data required to make quantitative inferences about network behaviour (Bonarius et al., 1997). In general there is degeneracy in stoichiometric networks, leading to an infinite number of flux distributions satisfying the ARTICLE IN PRESS Contents lists available at ScienceDirect journal homepage: www.elsevier.com/locate/yjtbi Journal of Theoretical Biology 0022-5193/$ - see front matter & 2009 Elsevier Ltd. All rights reserved. doi:10.1016/j.jtbi.2009.01.027 Corresponding author at: Manchester Centre for Integrative Systems Biology, Manchester Interdisciplinary Biocentre, 131 Princess Street, Manchester M1 7DN, UK. Tel.: +441613065146; fax: +441613065201. E-mail address: [email protected] (K. Smallbone). Journal of Theoretical Biology 258 (2009) 311–315

Upload: natalia39

Post on 01-Oct-2015

213 views

Category:

Documents


0 download

DESCRIPTION

análisis de flujo

TRANSCRIPT

  • pe

    entre

    UK

    ord R

    Available online 7 February 2009

    Keywords:

    Constraint-based modelling

    bioi

    s. T

    ers,

    reactions necessary to achieve particular objectives. The key benets of constraint-based analysis lie in

    the minimal knowledge required to infer systemic properties. However, network degeneracy leads to a

    sequenn expnstruc

    The ability to analyse, interpret and ultimately predict cellular

    usesergyviour

    through the network in order to achieve a particular objective

    ARTICLE IN PRESS

    Contents lists availabl

    .e

    Journal of Theor

    Journal of Theoretical Biology 258 (2009) 3113151997).Corresponding author at: Manchester Centre for Integrative Systems Biology,

    Manchester Interdisciplinary Biocentre, 131 Princess Street, Manchester M1 7DN,behaviour has been a long sought-after goal. The genomesequencing projects are dening the molecular components

    function, such as the maximization of biomass or ATP production.The key benet of FBA and similar techniques lies in the minimalamount of biological knowledge and data required to makequantitative inferences about network behaviour (Bonarius et al.,

    In general there is degeneracy in stoichiometric networks,leading to an innite number of ux distributions satisfying the

    UK. Tel.: +441613065146; fax: +441613065201.

    E-mail address: [email protected] (K. Smallbone).0022-51

    doi:10.1models for other micro-organisms (Kim et al., 2008) and even H.sapiens (Duarte et al., 2007) have been developed.

    of an organism. In particular, ux balance analysis (FBA) (Kauffmanet al., 2003) highlights the most effective and efcient pathsincorporation of more genes and their corresponding metabolitesa recent consensus model consists of 1761 reactions and 1168metabolites (Herrgard et al., 2008). Genome-scale stoichiometric

    et al., 2003; Kim et al., 2008; Price et al., 2004), whichphysicochemical constraints such as mass balance, enbalance, and ux limitations to describe the potential behanetworks for micro-organisms has become possible. Whilst therst stoichiometric model of E. coli was limited to the centralmetabolic pathways (Varma and Palsson, 1993), the most recentreported model is much more comprehensive, consisting of 2077reactions and 1039 metabolites (Feist et al., 2007). Reactionnetworks for S. cerevisiae have been similarly expanded through

    terms of how changes in metabolite concentrations affect localreaction rates. However, a considerable amount of data is requiredto parameterize even a small mechanistic model; the deter-mination of such parameters is costly and time-consuming, andmoreover many may be difcult or impossible to determineexperimentally. Instead, genome-scale metabolic modelling hasrelied on constraint-based analysis (Beard et al., 2002; Covert1. Introduction

    Recent advances in genomebioinformatic analyses have led to abiological data. In turn the reco93/$ - see front matter & 2009 Elsevier Ltd. A

    016/j.jtbi.2009.01.027dominated by biologically irrelevant internal cycles. By examining the geometry underlying the

    problem, we dene two methods for nding a unique solution within the space of all possible ux

    distributions; such a solution contains no internal cycles, and is representative of the space as a whole.

    The rst method draws on typical geometric knowledge, but cannot be applied to large networks

    because of the high computational complexity of the problem. Thus a second method, an iteration of

    linear programs which scales easily to the genome scale, is dened. The algorithm is run on four recent

    genome-scale models, and unique ux solutions are found. The algorithm set out here will allow

    researchers in ux balance analysis to exchange typical solutions to their models in a reproducible

    format. Moreover, having found a single solution, statistical analyses such as correlations may be

    performed.

    & 2009 Elsevier Ltd. All rights reserved.

    cing techniques andlosion of systems-widetion of genome-scale

    within the cell, and describing the integrated function of thesemolecular components will be a challenging task (Edwards andPalsson, 2000). Ideally, one would like to use kinetic modelling tocharacterize fully the mechanics of each enzymatic reaction, inSystems biology

    Metabolismlarge number of ux distributions that satisfy any objective; moreover, these distributions may beFlux balance analysis: A geometric pers

    Kieran Smallbone a,b,, Evangelos Simeonidis a,c

    a Manchester Centre for Integrative Systems Biology, Manchester Interdisciplinary Biocb School of Mathematics, University of Manchester, Oxford Road, Manchester M13 9PL,c School of Chemical Engineering and Analytical Science, University of Manchester, Oxf

    a r t i c l e i n f o

    Article history:

    Received 20 October 2008

    Accepted 13 January 2009

    a b s t r a c t

    Advances in the eld of

    number of key organism

    networks allows research

    journal homepage: wwwll rights reserved.ctive

    , 131 Princess Street, Manchester M1 7DN, UK

    oad, Manchester M13 9PL, UK

    nformatics have led to reconstruction of genome-scale networks for a

    he application of physicochemical constraints to these stoichiometric

    through methods such as ux balance analysis, to highlight key sets of

    e at ScienceDirect

    lsevier.com/locate/yjtbi

    etical Biology

  • given optimality criteria. It is a great focus of the FBA communityto reduce the size of this optimal ux space, through imposingtighter limits on each ux based, for instance, on measurementsof intracellular uxes with nuclear magnetic resonance, or otheradditional constraints (Kim et al., 2008). Despite use of thesetechniques, the resultant solution remains a space of uxes, ratherthan a unique ux. In a recent paper, Price et al. (2004) do not view

    allow us to perform typical analyses, for example correlating ux

    Indeed the actual non-unique ux solution found will be entirelydependent on the underlying algorithm of ones software. Forexample, one possible solution is given by v% 1;1;1000;

    ARTICLE IN PRESS

    K. Smallbone, E. Simeonidis / Journal of Th312with associated protein levels.In this paper we show that, by examining the geometry

    underpinning the problem, we may nd a well-dened, uniqueux through application of a well-known and fundamentalmathematical theorem. Unfortunately, this method proves im-practicable for large, genome-scale models. Thus, we dene asecond method that gives similar results to the rst, and moreoveris computationally feasible. The algorithm is applied to a range ofexisting genome-scale models.

    2. Methods

    The problems encountered when performing constraint-basedanalysis, and the methods we propose to overcome theseproblems are best described through referral to a simple example.Consider the small metabolic network presented in Fig. 1, whoseuxes we wish to estimate. This may be addressed throughappealing to FBA. This method allows us to identify the optimalpath through the network in order to achieve a particularobjective; quantitative predictions will then hold true if the celloptimizes its growth under the conditions considered. Whenapplying LP to predict ux distributions it is assumed that the cellhas found an optimal solution for survival through naturalselection, where we equate survival with growth (Edwards andPalsson, 2000).

    For our example network, FBA would involve maximizingoutput ux v5 subject to a limited nutrient consumption rate(v1p1, say). The maximal solution to the problem here is clear

    1

    4

    2

    53 BAthis as a problem, stating

    The mathematical notion of equivalent optimal states iscoincident with the biological notion of silent phenotypes.This property distinguishes in silico modelling in biology fromthat in the physicochemical sciences where a single and uniquesolution is sought.

    While it is true that equivalent solutions may be important inrepresenting biological reality, at the same time we also believethat the ability to state a well-dened, single solution that isrepresentative of the space of all possible uxes would be of greatbenet to the modelling community. From a practical perspective,researchers often do quote a single ux that results from theiranalysis; since this is chosen randomly from a large space ofpossible uxes, their results are irreproducible and entirelydependent on the software or algorithm used to solve the linearprogramming (LP) problem. From a scientic perspective, theability to extract a representative solution from the space wouldFig. 1. A small metabolic network that admits multiple solutions: the output uxmay be delivered through any of reactions 2, 3 or 4.1000;1T . Such an answer is unappealingthe result isdominated by an internal cycle between uxes 3 and 4; whilstcells are known to demonstrate such proigacy with regard to ux(Westerhoff et al., 1983), it is unlikely to occur to such an extent.The problemwe shall address through the remainder of this paperis how to extract a solution from the space of ux distributionsthat is both unique (and hence reproducible) and sensible, from abiological perspective.

    2.1. Method one

    The set of all possible solutions to the FBA problem is given bythe equation and constraints

    Av b; vminpvpvmax, (1)

    where

    A N

    f T

    !; b 0

    Z%

    .

    From a geometric perspective, Eq. (1) denes a polyhedron;appealing further to geometric understanding (and using the factthat ba0), the representation theorem (Minkowski, 1910) tells usthat polyhedra such as these may be naturally decomposed as thesum of a convex hull and a pointed cone as shown in Fig. 2.

    v 2Xi

    lixi : liX0;Xi

    li 1( )|{z}

    convex hull

    Xj

    mjyj : mjX0

    8