Evolution of C4 photosynthesis predicted by constraint-based modelling

  1. Mary-Ann Blätke  Is a corresponding author
  2. Andrea Bräutigam
  1. Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Germany
  2. Computational Biology, Faculty of Biology, Bielefeld University, Universitätsstraße, Germany

Abstract

Constraint-based modelling (CBM) is a powerful tool for the analysis of evolutionary trajectories. Evolution, especially evolution in the distant past, is not easily accessible to laboratory experimentation. Modelling can provide a window into evolutionary processes by allowing the examination of selective pressures which lead to particular optimal solutions in the model. To study the evolution of C4 photosynthesis from a ground state of C3 photosynthesis, we initially construct a C3 model. After duplication into two cells to reflect typical C4 leaf architecture, we allow the model to predict the optimal metabolic solution under various conditions. The model thus identifies resource limitation in conjunction with high photorespiratory flux as a selective pressure relevant to the evolution of C4. It also predicts that light availability and distribution play a role in guiding the evolutionary choice of possible decarboxylation enzymes. The data shows evolutionary CBM in eukaryotes predicts molecular evolution with precision.

eLife digest

Virtually all plants use energy from sunlight to convert carbon dioxide and water into oxygen and sugars via a process called photosynthesis. This process has many steps that each rely on different enzymes to drive specific chemical reactions. Most plants use a pathway of enzymes that is referred to as C3 photosynthesis.

Plants absorb carbon dioxide gas from the atmosphere. However, the levels of carbon dioxide in the atmosphere are very low, so this limits the amount of photosynthesis plants can perform. To overcome this problem, some plants have evolved a different type of photosynthesis – called C4 photosynthesis – with a mechanism that increases the levels of carbon dioxide in the cells.

Today, plants that use C4 photosynthesis (so-called ‘C4 plants’) typically grow faster than other plants, especially in warmer climates. This gives C4 plants, such as corn, an advantage over their competitors and also helps them to colonize harsh environments that other plants struggle to thrive in. However, it remains unclear how C4 photosynthesis evolved in some plants living in wet habitats, or why other plants use forms of photosynthesis that are intermediate between C4 and C3 photosynthesis.

C4 photosynthesis uses pathways containing enzymes that are found in all plants; therefore, C4 plants evolved by changing how they used enzymes they already had. To understand how these different enzyme pathways may have evolved, Blätke and Bräutigam used an approach known as constraint-based modelling. The researchers built a mathematical model of C3 photosynthesis and used it to predict the optimal enzyme pathways (for example, pathways involving the fewest enzymes or requiring the least energy) for photosynthesis under particular conditions.

The model predicted that, in addition to shortages in carbon dioxide, shortages in an important plant nutrient known as nitrogen may have driven the evolution of C4 photosynthesis. Furthermore, enzyme pathways that were intermediate between C3 and C4 photosynthesis were predicted to be optimal solutions under particular conditions. Together, the findings of Blätke and Bräutigam may explain why different variations of C4 photosynthesis exist in plants. These findings could be used to breed crops that use the most efficient type of photosynthesis for the conditions they are grown in, leading to better yields.

Introduction

Identifying specific evolutionary trajectories and modelling the outcome of adaptive strategies at the molecular levels is a major challenge in evolutionary systems biology (Papp et al., 2011). The evolution of novel metabolic pathways from existing parts may be predicted using constraint-based modelling (CBM) (Orth et al., 2010). In CBM, selective pressures are coded via the objective functions for which the model is optimised. The factors which constrain evolution are integrated into the models via changes in model inputs or outputs and via flux constraints. We hypothesised that the evolution of the agriculturally important trait of C4 photosynthesis is accessible to CBM.

C4 photosynthesis evolved independently in at least 67 independent origins in the plant kingdom (Scheben et al., 2017) and it allows colonisation of marginal habitats (Sage et al., 2012) and high biomass production in annuals such as crops (Sage, 2004; Edwards et al., 2010). The C4 cycle acts as a biochemical pump which enriches the CO2 concentration at the site of Rubisco to overcome a major limitation of carbon fixation (Sage, 2004). Enrichment is beneficial because Rubisco, the carbon fixation enzyme, can react productively with CO2 and form two molecules of 3-PGA, but it also reacts with O2 and produces 2-phosphoglycolate which requires detoxification by photorespiration (Ogren and Bowes, 1971). The ratio between both reactions is determined by the enzyme specificity towards CO2, by the temperature, and the concentrations of both reactants, which in turn is modulated by stresses such as drought and pathogen load. Evolution of Rubisco itself is constrained since any increase in specificity is paid for by a reduction in speed (Spreitzer and Salvucci, 2002). Lower speeds most likely cause maladaptivity since Rubisco is a comparatively slow enzyme and can comprise up to 50% of the total leaf protein (Ellis, 1979). In the C4 cycle, phosphoenolpyruvate carboxylase affixes CO2 to a C3 acid, phosphoenolpyruvate (PEP), forming a C4 acid, oxaloacetate (OAA). After stabilisation of the resulting C4 acid by reduction to malate or transamination to aspartate, it is transferred to the site of Rubisco and decarboxylated by one of three possible decarboxylation enzymes, NADP-dependent malic enzyme (NADP-ME), NAD-dependent malic enzyme (NAD-ME), or PEP carboxykinase (PEP-CK) (Hatch, 1987; Schlüter et al., 2016b). Species such as corn (Zea mays) (Pick et al., 2011) and great millet (Sorghum bicolor) (Döring et al., 2016) use NADP-ME, species like common millet (Panicum miliaceum) (Hatch, 1987) and African spinach (Gynandropsis gynandra) (Feodorova et al., 2010; Voznesenskaya et al., 2007) use NAD-ME and species such as guinea grass (Panicum maximum) (Bräutigam et al., 2014) use mainly PEP-CK with the evolutionary constraints leading to one or the other enzyme unknown. Mixed forms are only known to occur between a malic enzyme and PEP-CK but not between both malic enzymes (Wang et al., 2014). After decarboxylation, the C3 acid diffuses back to the site of phosphoenolpyruvate carboxylase (PEPC) and is recycled for another C4 cycle by pyruvate phosphate dikinase (PPDK) (Hatch, 1987; Schlüter et al., 2016b). All the enzymes involved in the C4 cycle are also present in C3 plants (Aubry et al., 2011). In its most typical form, this C4 cycle is distributed between different cell types in a leaf in an arrangement called Kranz anatomy (Haberlandt, 1904). Initial carbon fixation by PEPC occurs in the mesophyll cell, the outer layer of photosynthetic tissue. The secondary fixation by Rubisco after decarboxylation occurs in an inner layer of photosynthetic tissue, the bundle sheath which in turn surrounds the veins. Both cells are connected by plasmodesmata which are pores with limited transfer specificity between cells. A model which may test possible carbon fixation pathways at the molecular level thus requires two cell architectures connected by transport processes (Bräutigam and Weber, 2010).

CBM of genome-scale or close to it are well suited to study evolution (summarised in Papp et al., 2011). Evolution of different metabolic modes from a ground state, the metabolism of Escherichia coli, such as glycerol usage (Lewis et al., 2010) or endosymbiotic metabolism (Pál et al., 2006) have been successfully predicted. Metabolic maps of eukaryotic metabolism are of higher complexity compared to bacteria since they require information about intracellular compartmentation and intracellular transport (Duarte, 2004) and may require multicellular approaches. In plants, aspects of complex metabolic pathways, such as the energetics of CAM photosynthesis (Cheung et al., 2014), and fluxes in C3 and C4 metabolism (Boyle and Morgan, 2009; Gomes de Oliveira Dal’Molin et al., 2011; de Oliveira Dal'Molin et al., 2010; Arnold and Nikoloski, 2014; Saha et al., 2011) have been elucidated with genome scale models. The C4 cycle is not predicted by these current C4 models unless the C4 cycle is forced by constraints (Gomes de Oliveira Dal’Molin et al., 2011; Mallmann et al., 2014). In the C4GEM model, the fluxes representing the C4 cycle are a priori constrained to the cell types (Gomes de Oliveira Dal’Molin et al., 2011), and in the Mallmann model, the C4 fluxes are induced by activating flux through PEPC (Mallmann et al., 2014). Models in which specific a priori constraints activated C4 were successfully used to study metabolism under conditions of photosynthesis, photorespiration, and respiration (Saha et al., 2011) and to study N-assimilation under varying conditions (Simons et al., 2013). However, they are incapable of testing under which conditions the pathway may evolve.

Schematic models suggest that the C4 cycle evolves from its ancestral metabolic state C3 photosynthesis along a sequence of stages (summarised in Sage, 2004; Bräutigam and Gowik, 2016). In the presence of tight vein spacing and of photosynthetically active bundle sheath cells (i.e. Kranz anatomy), a key intermediate in which the process of photorespiration is divided between cell types is thought to evolve (Monson, 1999; Sage et al., 2012; Heckmann et al., 2013; Bauwe, 2010). The metabolic fluxes in this intermediate suggest an immediate path towards C4 photosynthesis (Mallmann et al., 2014; Bräutigam and Gowik, 2016). (Heckmann et al., 2013) built a kinetic model in which the complex C4 cycle was represented by a single enzyme, PEPC. Assuming carbon assimilation as a proxy for fitness, the model showed that the evolution from a C3 progenitor species with Kranz-type anatomy towards C4 photosynthesis occurs in modular, individually adaptive steps on a Mount Fuji fitness landscape. It is frequently assumed that evolution of C4 photosynthesis requires water limitation (Bräutigam and Gowik, 2016; Heckmann et al., 2013; Mallmann et al., 2014). However, ecophysiological research showed that C4 can likely evolve in wet habitats (Osborne and Freckleton, 2009; Lundgren and Christin, 2017). CBM presents a possible avenue to study the evolution of C4 photosynthesis including its metabolic complexity in silico.

In this study, we establish a generic two-celled, constraint-based model starting from the Arabidopsis core model (Arnold and Nikoloski, 2014). We test under which conditions and constraints C4 photosynthesis is predicted as the optimal solution. Finally, we test which constraints result in the prediction of the particular C4 modes with their different decarboxylation enzymes. In the process, we demonstrate that evolution is predictable at the molecular level in an eukaryotic system and define the selective pressures and limitations guiding the 'choice' of metabolic flux.

Results

The curated Arabidopsis core model predicts physiological results

Flux balance analysis requires five types of information, the metabolic map of the organism, the input, the output, a set of constraints (i.e. limitations on input, directionality of reactions, forced flux through reactions), and optimisation criteria for the algorithm which approximate the selective pressures the metabolism evolved under. In this context, inputs define the resources that need to be taken up by the metabolic network to fulfil a particular metabolic function, which is related to the outputs, for example the synthesis of metabolites part of the biomass or other specific products. In CBM, the objective is most likely related to the in- and/or outputs.

For reconstruction of the C3 metabolic map we curated the Arabidopsis core model (Arnold and Nikoloski, 2014) manually (Table 1) to represent the metabolism of a mesophyll cell in a mature photosynthetically active leaf of a C3 plant , further on called one-cell model (provided in Figure 1—source data 1). The Arabidopsis core model is a bottom-up-assembled, large-scale model relying solely on Arabidopsis-specific annotations and the inclusion of only manually curated reactions of the primary metabolism. The Arabidopsis core model is accurate with respect to mass and energy conservation, allowing optimal nutrient utilisation and biochemically sound predictions (Arnold and Nikoloski, 2014).

Table 1
Curation of the Arabidopsis core model from Arnold and Nikoloski (2014).
Arabidopsis core modelObservationone-cell modelReference
NADP-dependent malate dehydrogenases in all compartmentscycles through nitrate reductase to interconvert NAD and NADPNAD-dependent malate dehydrogenases in all compartments, NADP-dependent malate dehydrogenase only in chloroplast(Swarbreck et al., 2008)
Cyclic electron flowabsence of cyclic electron flowadded(Shikanai, 2016)
Alternative oxidasemissing alternative routes for electrons to pass the electron transport chain to reduce oxygenadded alternative oxidase reactions to the chloroplast and mitochondria(Vishwakarma et al., 2015)
Alanine transferaseNo alanine transferase in cytosol Alanine transferaseadded(Liepman and Olsen, 2003)
Transport chloroplastno maltose transporter by MEX1added(Linka and Weber, 2010)
no glucose transporter by MEX1 and pGlcT MEX1added
no unidirectional transport of ATP, ADP, AMP by BT-likeadded
no Mal/OAA, Mal/Pyr, and Mal/Glu exchange by DiTsadded
no folate transporter by FBT and FOLT1added
Transport Mitochondriano Mal/OAA, Cit/iCit, Mal/KG exchange by DTCadded(Linka and Weber, 2010)
no H+ importer by UCPs importadded
no OAA/Pi exchange by DIC1-3added
no ATP/Pi exchange by APCsadded
no NAD/ADP and NAD/AMP exchange by NDT2added
no ThPP/ATP exchange by TPCsadded
no Asp/Glu by AGCsadded
no uncoupled Ala exchangeadded
Transport peroxisomemissing NAD/NADH, NAD/ADP, NAD/AMP exchange by PXNadded(Linka and Weber, 2010)
no ATP/ADP and ATP/AMP exchange by PNCsadded
H+ sinks/sourcesH+ sinks/source reaction for the cytosol and futile transport cycles introduced by H+ -coupled transport reactionsH+ sinks/source reaction added for each compartment
ATPase stoichiometryFalse H+/ATP ratios for the plastidal and mitochondrial ATP synthaseH+/ATP ratio set to 3 : 1 (chloroplast) and 4:1 (mitochondria)(Petersen et al., 2012; Turina et al., 2016)
Alanine/aspartate transferaseno direct conversion of alanine and aspartateadded to cytosol, chloroplast and mitochondria(Schultz and Coruzzi, 1995; Duff et al., 2012)

For the inputs, we considered a photoautotrophic growth scenario with a fixed CO2 uptake of about 20 μmol/(m2s) (Lacher, 2003). Light, sulphates, and phosphate are freely available. Due to the observation that nitrate is the main source (80%) of nitrogen in leaves in many species (Macduff and Bakken, 2003), we set nitrate as the sole nitrogen source. If both ammonia and nitrate are allowed, the model will inevitably predict the physiologically incorrect sole use of ammonia since fewer reactions and less energy are required to convert it into glutamate, the universal amino group currency in plants. Water and oxygen can be freely exchanged with the environment in both directions.

To compute the output, we assume a mature fully differentiated and photosynthetically active leaf, which is optimised for the synthesis and export of sucrose and amino acids to the phloem under minimal metabolic effort. Following the examples of models in bacteria, many plant models use a biomass function which assumes that the leaf is required to build itself (de Oliveira Dal'Molin et al., 2010; Arnold and Nikoloski, 2014; Saha et al., 2011) using photoautotrophic that is (Arnold and Nikoloski, 2014) or heterotrophic that is (Cheung et al., 2014) energy and molecule supply. In plants, however, leaves transition from a sink phase in which they build themselves from metabolites delivered by the phloem to a source phase in which they produce metabolites for other organs including sink leaves (Turgeon, 1989). The composition of Arabidopsis phloem exudate (Wilkinson and Douglas, 2003) was used to constrain the relative proportions of the 18 amino acids and the ratio of sucrose : total amino acids (2.2 : 1). To account for daily carbon storage as starch for export during the night, we assume that half of the assimilated carbon is stored in the one-cell model. We explicitly account for maintenance costs by the use of a generic ATPase and use the measured ATP costs for protein degradation and synthesis of a mature Arabidopsis leaf (Li et al., 2017) as a constraint. We initially assume a low photorespiratory flux according to the ambient CO2 and O2 partial pressures considering no heat, drought, salt or osmotic stress which may alter the ratio towards higher flux towards the oxygenation reaction.

To develop a largely unconstrained model and detect possible errors in the metabolic map, we initially kept the model unconstrained with regard to fixed fluxes, flux ratios, and reaction directions. Different model iterations were run in (re-)design, simulate, validate cycles against known physiology with errors sequentially eliminated and a minimal set of constraints required for a C3 model recapitulating extant plant metabolism determined. After each change, the CBM predicted all fluxes which were output as a table and manually examined (for example see Figure 1—source data 2).

The initial FBA resulted in carbon fixation by enzymes such as the malic enzymes which, in reality, are constrained by the kinetics of the enzymes towards decarboxylation. All decarboxylation reactions were made unidirectional towards decarboxylation to prevent erroneous carbon fixation in the flux distribution. The next iteration of FBA predicted loops through nitrate reductases which ultimately converted NADH to NADPH. We traced this loop to an error in the initial model, in which malate dehydrogenases in the cytosol and mitochondrion were NADP-dependent instead of NAD-dependent. After correction of the co-factor in the one-cell model, the loops through nitrate reductases were no longer observed. Another iteration predicted excessive flux through the mitochondrial membrane where multiple metabolites were exchanged and identified missing transport processes as the likely reason. Based on Linka and Weber (2010), we added known fluxes across the mitochondrial and plastidic envelope membranes which remedied the excessive fluxes in the solution. The chloroplastic ADP/ATP carrier protein is constrained to zero flux since its mutant is only affected during the night but not if light is available (Reiser et al., 2004).

The obtained flux distribution still contained excessive fluxes through multiple transport proteins across internal membranes which ultimately transferred protons between the organelles and the cytosol. Since for most if not all transport proteins the precise protonation state of metabolites during transport is unknown and hence cannot be correctly integrated into the model, we allowed protons to appear and disappear as needed in all compartments. This provision precludes conclusions about the energetics of membrane transport. ATP generation occurred in a distorted way distributed across different organelles which were traced to the H+ consumption of the ATPases in mitochondria and chloroplasts. The stoichiometry was altered to to 3:1 (chloroplast) and 4:1 (mitochondria) (Petersen et al., 2012; Turina et al., 2016). We assume no flux for the chloroplastic NADPH dehydrogenase and plastoquinol oxidase because (Josse et al., 2000; Yamamoto et al., 2011) have shown that their effect on photosynthesis is minor.

In preparation for modelling the C4 cycle, we ensured that all reactions known to occur in C4 (i.e. malate/pyruvate exchange, likely via DiT2 in maize [Weissmann et al., 2016], possibly promiscuous amino transferases [Duff et al., 2012]) are present in the one-cell model, since (Aubry et al., 2011) showed that all genes encoding enzymes and transporters underlying the C4 metabolism are already present in the genome of C3 plants. We integrated cyclic electron flow (Shikanai, 2016) and alternative oxidases in the mitochondria (Vishwakarma et al., 2015), since both have been hypothesised to be important during the evolution and/or execution of the C4 cycle. Models and analysis workflows provided as jupyter notebooks (Thomas et al., 2016) are available as supplementary material or can be accessed on GitHub https://github.com/ma-blaetke/CBM_C3_C4_Metabolism (Blätke, 2019; copy archived at https://github.com/elifesciences-publications/CBM_C3_C4_Metabolism).

The one-cell model comprises in total 413 metabolites and 572 reactions, whereof 139 are internal transporters, 90 are export and eight import reactions (see also below), which are involved in 59 subsystems. Figure 1 provides an overview of the primary subsystems according to Arnold and Nikoloski (2014).

Figure 1 with 2 supplements see all
Schematic representation of the primary subsystems in the one-cell model and the used input/output constraints; adapted from Arnold and Nikoloski (2014).
Figure 1—source data 1

SBML code of the one-cell model.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig1-data1-v2.sbml
Figure 1—source data 2

Complete flux solution of the one-cell model.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig1-data2-v2.csv
Figure 1—source code 1

Jupyter notebook - Predicted fluxes of C3 metabolism.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig1-code1-v2.ipynb
Figure 1—source code 2

Jupyter notebook- Effect of the CO2 uptake rate on C3 metabolism.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig1-code2-v2.ipynb
Figure 1—source code 3

Jupyter notebook - Effect of the PPFD on C3 metabolism.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig1-code3-v2.ipynb

The one-cell model requires a photosynthetic photon flux density (PPFD) of 193.7 μmol/(m2s) (Table 2). The one-cell model takes up the maximal amount of CO2 to produce the maximum amount of phloem sap, as well as 0.8 μmol/(m2s) of NO3- and 18.2 μmol/(m2s) of H2O. According to the assumed ratio of sucrose and amino acids in the phloem sap, the flux of sucrose predicted by the model is 0.5 μmol/(m2s) and of amino acids 0.3 μmol/(m2s). The rate of oxygen supply by the network is 20.9 μmol/(m2s). Part of the complete flux table is displayed in Table 2; the full table is available, see Figure 1—source data 2. The flux table of all reactions did not display circular fluxes, and the reactions were within expected physiological ranges (Figure 1—source data 2).

Table 2
Input/output fluxes of one-cell model in comparison to physiological observations.
Molecular SpeciesFlux [µmol/(m2/s)]Physiological Range [µmol/(m2/s)]Reference
(i) Inputs
Photons193.7100 - 400Bailey et al. (2001)
CO22020Lacher (2003)
NO3-0.50.11 - 0.18Kiba et al. (2012)
H2O18.2-
(ii) Outputs
O220.916.5Sun et al. (1999)
Amino Acids0.3-
Sucrose/Starch0.8-
  1. Note: CO2 has one carbon per molecule while Sucrose has 12. Starch is configured to have the same number of carbons compared to sucrose while amino acids on average have 5.5 carbons.

The CO2 uptake rate and the phloem sap output have a positive linear relationship, see Figure 1—figure supplement 1(A). The same is true for the correlation of the PPFD and phloem sap output in the range of 100 μmol/(m2s)–200 μmol/(m2s), see Figure 1—figure supplement 1(B). Above 200 μmol/(m2s), the CO2 uptake rate acts as a limiting factor restricting the increase of phloem sap production. If either the PPFD or the CO2 uptake rate is zero, the phloem sap cannot be produced, compare Figure 1—figure supplement 1(A) and (B). Most of the metabolic processes use ATP/ADP as main energy equivalent (60%), followed by NADP/NADPH (37.5%) and NAD/NADH (2.4%), see Figure 1—figure supplement 2(D). Nearly all ATP is produced by the light reactions (97.2%) and consumed by the reductive pentose phosphate cycle (94.1%), see Figure 1—figure supplement 2(A). The oxidative phosphorylation produces only (1%) of ATP. In proportion, the maintenance cost for protein synthesis and degradation makeup 28% of the respiratory ATP produced by the oxidative phosphorylation (Figure 1—figure supplement 2(E)). Similarly, nearly all NADPH is produced by the light reaction (98.9%), which is consumed by the reductive pentose-phosphate cycle (98.3%) as well (Figure 1—figure supplement 2(B)). The canonical glycolysis and photorespiration produce nearly equal amounts of NADH, 45% and 47.7%, significantly less NADH is produced through the pyruvate dehydrogenase activity 6.85%. Nitrate assimilation (45%), glutamate biosynthesis (47.7%), glyoxylate cycle (21.6%) and alternative respiration (11.8%) consume the produced NADH (Figure 1—figure supplement 2(C)).

A C4 cycle is predicted under resource limitation

To rebuild the characteristic physiology of C4 leaves, we duplicated the one-cell model and connected the two network copies by bi-directional transport of cytosolic metabolites including amino acids, sugars, single phosphorylated sugars, mono-/di-/tri-carboxylic acids, glyceric acids, glycolate, glycerate, glyceraldehyde-3-phosphate, di-hydroxyacetone-phosphate and CO2, see Materials and methods for details. Since CBM is limited to static model analysis, we introduced two Rubisco populations in the bundle sheath network to approximate CO2 concentration-dependent changes in the oxygenation : carboxylation ratio of Rubisco (vRBO/vRBC) itself. We kept the native constrained Rubisco population that is forced to undertake oxygenation reactions and added a CCM-dependent Rubisco population which can only carboxylate ribulose 1,5-bisphosphate. The CCM-dependent Rubisco population is only able to use CO2 produced by the bundle sheath network but not environmental CO2 released by the mesophyll. C4 plants have a higher CO2 consumption and thus, an increased CO2 uptake of 40 μmol/(m2s) was allowed (Leakey et al., 2006). All other constraints and the objective of the one-cell model are maintained in the two-cell model, see Figure 2.

Schematic representation of the primary subsystems in the two-cell model and the used input/output constraints; adapted from Arnold and Nikoloski (2014).

Initially, we optimised for the classical objective function of minimal total flux through the metabolic network at different levels of photorespiration. These different levels of photorespiration integrate changes to external CO2 concentration and stomatal opening status which is governed by plant water status and biotic interactions. From the complete flux distribution, we extracted fluxes of PEPC and PPDK, the decarboxylation enzymes, Rubisco and metabolite transporter between the two cells to ascertain the presence of a C4 cycle, see Figure 3 and Figure 3—figure supplement 1. At low photorespiratory levels, flux through PEPC is barely detectable (Figure 3(A)). If photorespiration increases to moderate levels, flux through PEPC can be predicted and increases to 40 μmol/(m2s), that is all CO2 is funnelled through PEPC, for high photorespiratory fluxes. Concomitant with flux through PEPC, the activity of the decarboxylation enzymes changes (Figure 3(B)). At low to intermediate levels of photorespiratory flux, glycine decarboxylase complex activity is predicted to shuttle CO2 to the bundle sheath at up to 4.7 μmol/(m2s). Decarboxylation of C4 acids is initially mostly mediated by PEP-CK and is largely taken over by NADP-ME at high fluxes through photorespiration. Flux through NAD-ME is very low under all photorespiration levels. The decarboxylation enzymes dictate flux through the different Rubiscos in the model (Figure 3(C)). At low photorespiratory flux, both the Rubiscos in mesophyll and bundle sheath are active. Only very little flux occurs through the CCM-dependent Rubisco, which is a result of the glycine decarboxylase (Figure 3(B)). With increasing photorespiratory flux, this flux through glycine decarboxylase increases (Figure 3(B)) and therefore, total Rubisco activity exceeds the carbon intake flux (Figure 3(C)). Carbon fixation switches to the CCM-dependent Rubisco with increasing flux through PEPC (Figure 3(A)) and the classic C4 cycle decarboxylation enzymes (Figure 3(B)). Flux through PPDK mostly reflects flux through PEPC (Figure 3(D)). The transport fluxes between the cells change with changing photosynthetic mode (Figure 3(E and F)).

Figure 3 with 1 supplement see all
Effect of oxygenation : carboxylation ratio on the major steps in C4 cycle, including (A) activity of phosphoenolpyruvate carboxylase (PEPC), (B) metabolite transport to the bundle sheath, (C) activity of Rubisco, (D) activity of the decarboxylation enzymes, (E) metabolite transport to the mesophyll, and (F) activity of pyruvate phosphate dikinase (PPDK).
Figure 3—source code 1

Jupyter notebook - Analysing the effect of oxygenation : carboxylation ratio on the emergence of the C4 cycle.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig3-code1-v2.ipynb

At low rates of photorespiration when PEPC is barely active, the only flux towards the bundle sheath is CO2 diffusion (Figure 3(E)) with no fluxes towards the mesophyll (Figure 3(F)). In the intermediate phase glycolate and glycerate are predicted to be transported and a low-level C4 cycle dependent on the transport of aspartate, malate, PEP and alanine operates (Figure 3(E) and (F)). In case of high photorespiratory rates, the exchange between mesophyll and bundle sheath is mainly carried by malate and pyruvate (Figure 3(E) and (F)). Flux through PPDK (Figure 3(D)) is lower than flux through PEPC (Figure 3(A)) at the intermediate stage (Figure 3(F)). Evolution of C4 photosynthesis with NADP-ME as the major decarboxylation enzyme is predicted if the photorespiratory flux is high and model optimised for minimal total flux, in other words, resource limitation.

C4 modes with different decarboxylation enzymes result from different set of constraints

Among the known independent evolutionary events leading to C4 photosynthesis, 20 are towards NAD-ME while 21 occurred towards NADP-ME (Sage, 2004). PEP-CK is dominant or at least co-dominant only in Panicum maximum (Bräutigam et al., 2014), Alloteropsis semialata semialata (Christin et al., 2012), and in the Chloridoideae (Sage, 2004). To analyse whether the predicted evolution of the C4 cycle is independent of a particular decarboxylation enzyme, we performed three separate experiments, where only one decarboxylation enzyme can be active at a time. The other decarboxylation enzymes were de-activated by constraining the reaction flux to zero resulting in three different predictions, one for each decarboxylation enzyme. The flux distributions obtained under the assumption of oxygenation : carboxylation ratio of 1 : 3 and minimisation of photorespiration as an additional objective predicts the emergence of a C4 cycle for each known decarboxylation enzyme. To visualise the possible C4 fluxes, the flux distribution for candidate C4 cycle enzymes was extracted from each of the three predictions and visualised as arc width and color (Figure 4). While the flux distribution in the mesophyll is identical for three predicted C4 cycles of the decarboxylation enzymes, it is diverse in the bundle sheath due to the different localisation of the decarboxylation and related transport processes, see Figure 4. The flux distribution does not completely mimic the variation in transfer acids known from laboratory experiments (Hatch, 1987) since all of the decarboxylation enzymes use the malate/pyruvate shuttle. In the case of NAD-ME and PEP-CK, the two-cell model also predicts a supplementary flux through the aspartate/alanine shuttle. We tested whether transfer acids other than malate and pyruvate are feasible and explored the near-optimal space. To this end, the model predictions are repeated, allowing deviation from the optimal solution and the changes recorded. Deviations from the optimal solution are visualised as error bars (Figure 5). Performing a flux variability analysis (FVA) and allowing the minimal total flux to differ by 1.5%, predicts that for most metabolites which are transferred between mesophyll and bundle sheath, the variability is similar for all three decarboxylation types. For the NAD-ME and PEP-CK types, changes in the near-optimal space were observed for the transfer acids malate, aspartate, pyruvate and alanine. Minor differences were present for triose phosphates and phosphoglycerates as well as for PEP. For the NADP-ME type, FVA identifies only minor variation (Figure 5). In the case of NAD-ME but not in the case of NADP-ME the activity of the malate/pyruvate shuttle can be taken over by the aspartate/alanine shuttle and partly taken over in case of PEP-CK, see Figure 5. The aspartate/alanine shuttle is thus only a near-optimal solution when the model and by proxy evolutionary constraints are resource efficiency and minimal photorespiration.

Flux maps illustrating the effect of the C4 mode.

(A) NADP-ME, (B) PEP-CK, (C) NAD-ME. (Arc width and colour are set relative to flux values in flux, grey arcs - no flux).

Figure 4—source code 1

Jupyter notebook - Effect of C4 mode on the emergence of the C4 cycle.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig4-code1-v2.ipynb
Flux variability analysis of metabolite exchange with 1.5% deviation of the total flux minimum.

The upper bar defines the maximum exchange flux, while the lower bar defines the minimum exchange flux, points indicate the value of the original flux solution under minimal metabolic effort constraint. Positive flux values correspond to the transport direction from mesophyll to bundle sheath, negative values to the transport direction from bundle sheath to mesophyll, see also Figure 4—source code 1.

To analyse the effect of other conditions on the particular C4 state, we apply the minimisation of photorespiration as an additional objective to minimal total flux. Since NAD-ME and PEP-CK type plants use amino acids as transfer acids in nature, nitrogen availability has been tagged as a possible evolutionary constraint that selects for decarboxylation by NAD-ME or PEP-CK. When nitrate uptake was limiting, the optimal solution to the model predicted overall reduced flux towards the phloem output (Figure 6—figure supplement 1) but reactions were predicted to occur in the same proportions as predicted for unlimited nitrate uptake. Flux through NADP-ME and supplementary flux through PEP-CK dropped proportionally, since restricting nitrogen limits the export of all metabolites from the system and reduced CO2 uptake is observed (Figure 6—figure supplement 1). Similarly, limiting water or CO2 uptake into the model resulted in overall reduced flux towards the phloem output (Figure 6—figure supplement 1) but reactions were predicted to occur in the same proportions as predicted for unlimited uptake.

Given that C4 plants sometimes optimise light availability to the bundle sheath (Bellasio and Lundgren, 2016) we next explored light availability and light distribution. The model prediction is re-run with changes in the constraints, and the resulting tables of fluxes are queried for CO2 uptake and fluxes through the decarboxylation enzymes. In the experiment, we varied the total PPFD between 0 μmol/(m2s) to 1000 μmol/(m2s) and photon distribution in the range 0.1PPFDB / PPFDM2, see Figure 6. Under light limitation, if the total PPFD is lower than 400 μmol/(m2s) , the CO2 uptake rate is reduced, leading to a decreased activity of the decarboxylation enzymes (Figure 6(A)). PEP-CK is used in the optimal solutions active under light-limiting conditions (Figure 6(B)). Under limiting light conditions, photon distribution with a higher proportion in the bundle sheath shifts decarboxylation towards NADP-ME but only to up to 26%. Under non-limiting conditions, the distribution of light availability determines the optimal decarboxylation enzyme. NADP-ME is the preferred decarboxylation enzyme with supplemental contributions by PEP-CK if light availability is near the threshold of 400 μmol/(m2s) or if at least twice as many photons are absorbed by the mesophyll. Excess light availability and a higher proportion of photons reaching the bundle sheath leads to optimal solutions which favour PEP-CK as the decarboxylation enzyme. In the case of very high light availability and an abrupt shift towards the bundle sheath, NAD-ME becomes the optimal solution (Figure 6(B)). NAD-ME is the least favourable enzyme overall, only low activity is predicted under extreme light conditions, where the bundle sheath absorbs equal or more photons than the mesophyll (Figure 6(B)). PEP-CK complements the activity of NADP-ME and NAD-ME to 100% in many conditions, meaning the two-cell model also predicts the co-existence of PEP-CK/NADP-ME and PEP-CK/NAD-ME mode, while the flux distribution indicates no parallel use of NAD-ME and NADP-ME, compare Figure 6(B).

Figure 6 with 1 supplement see all
Effect of light on the C4 mode.

(A) CO2 uptake rate in dependence of the total PPFD, (B) Heat-maps illustrating the activity of the decarboxylation enzymes PEP-CK, NADP-ME, and NAD-ME relative to the CO2 uptake rate in dependence of the total PPFD and the photon distribution among mesophyll and bundle sheath.

Figure 6—source code 1

Jupyter notebook - Effect of light on the C4 mode.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig6-code1-v2.ipynb
Figure 6—source code 2

Jupyter notebook - Effect of NO3- limitation on the C4 mode.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig6-code2-v2.ipynb
Figure 6—source code 3

Jupyter notebook - Effect of H2O limitation on the C4 mode.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig6-code3-v2.ipynb
Figure 6—source code 4

Jupyter notebook - Effect of CO2 limitation on the C4 mode.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig6-code4-v2.ipynb
Figure 6—source code 5

Jupyter notebook - Effect of malate : aspartate transport ratio on the C4 mode.

https://cdn.elifesciences.org/articles/49305/elife-49305-fig6-code5-v2.ipynb

Finally, we assumed that intercellular transport capacity for charged metabolites might be different between species. Assuming a fixed transport ratio between aspartate and malate (Figure 6—figure supplement 1D) introduces a shift in the C4 state. Higher proportions of malate exchange foster the use of NADP-ME (Figure 6—figure supplement 1D). In contrast, higher portions of aspartate exchange foster the use of PEP-CK (Figure 6—figure supplement 1D).

Discussion

Evolutionary CBM can suggest the molecular outcomes of past evolutionary events if models are parametrised with objective functions representing possible selective pressures. In the case of C4 photosynthesis, more than sixty independent evolutionary origins represent metabolic types characterised by their decarboxylation enzyme. The selective pressure which drives evolution towards one or the other flux are unknown and were tested using CBM.

One-cell model reflects C3 plant physiology

To analyse evolution towards C4 photosynthesis based on C3 metabolism, a CBM of C3 metabolism is required (Figure 1). Design, simulation, validation cycles used current knowledge about plant biochemistry (Heldt, 2015) to identify possible errors in the metabolic map required for modelling. Even after error correction (Table 1), a significant problem remained, namely excessive fluxes to balance protons in all compartments. This observation leads to the realisation that the biochemical knowledge about transport reactions does not extend to the protonation state of the substrates, which affects all eukaryotic CBM efforts. In plants, predominantly export and vacuolar transport reactions are directly or indirectly coupled with proton gradients to energise transport (Bush, 1993; Neuhaus, 2007). For chloroplasts and mitochondria, proton-coupled transport reactions have been described but may couple different metabolite transporters together rather than energising them (Furumoto et al., 2011). Introducing proton sinks in all compartments solves the immediate modelling problem. However, intracellular transport reactions and their energetic costs are no longer correctly assessed by the model. Despite this band-aid fix which will be required for all eukaryotic constraint-based models which include proton-coupled transport reactions, the curated one-cell model correctly predicts energy usage and its distribution (Figure 1—figure supplement 2 and Li et al., 2017). This indicates that in models which exclude vacuolar transport and energised export reactions, energy calculations remain likely within the correct order of magnitude. Overall, our one-cell model operates within parameters expected for a C3 plant: The predicted PPFD lies within the range of light intensities used for normal growth condition of Arabidopsis thaliana, which varies between 100 μmol/(m2s)–200 μmol/(m2s), see Table 2. The gross rate of O2 evolution for a PPFD of 200 μmol/(m2s) is estimated to be 16.5 μmol/(m2s) in the literature (Sun et al., 1999), which is in close proximity to the predicted flux of the one-cell model, see Table 2. For the amount of respiratory ATP that is used for maintenance, (Li et al., 2017) predicted an even lower proportion of energy 16%, see Figure 1—figure supplement 2. The model’s flux map is in accordance with known C3 plant physiology (Heldt, 2015), and its input and output parameters match expected values (Figure 2(B)). The current model excludes specialised metabolism since the output function focuses solely on substances exported through the phloem in a mature leaf. If the model were to be used to study biotic interactions in the future, the addition of specialised metabolism in the metabolic map and a new output function would be required.

The two-cell model predicts a C4 cycle if photorespiration is present

Most evolutionary concepts about C4 photosynthesis assume that selective pressure drives pathway evolution due to photorespiration and carbon limitation (Heckmann et al., 2013). Most extant C4 species occupy dry and arid niches (Edwards et al., 2010), even more, the period of C4 plant evolution was accompanied with an increased oxygen concentration in the atmosphere (Sage, 2004). Therefore, it is frequently assumed that carbon limitation by excessive photorespiration drives the evolution of C4 photosynthesis. Yet, in most habitats plants are limited by nutrients other than carbon (Agren et al., 2012; Körner, 2015). Ecophysiological analyses also show that C4 can evolve in non-arid habitats (Liu and Osborne, 2015; Lundgren and Christin, 2017; Osborne and Freckleton, 2009). To resolve this apparent contradiction, we tested whether resource limitation may also lead to the evolution of a C4 cycle. We optimised the model approximating resource limitation via an objective function for total minimal flux at different photorespiratory levels. Indeed, with increasing photorespiration, the optimisation for resource efficiency leads to the emergence of the C4 cycle as the optimal solution. Balancing the resource cost of photorespiration against the resource cost of the C4 cycle, the model predicts that N limitation may have facilitated C4 evolution given high levels of photorespiration. Other possible selective pressures such as biotic interactions can currently not be tested using the model since specialised metabolism is not included in the metabolic map or the output function. Extant C4 species have higher C : N ratios reflecting the N-savings the operational C4 cycle enables (Sage et al., 1987). The photorespiratory pump using glycine decarboxylase based CO2 enrichment also emerges from the model, showing that C2 photosynthesis is also predicted under simple resource limitation. Indeed N-savings have been reported from C2 plants compared with their C3 sister lineages (Schlüter et al., 2016a). Simply minimising photorespiration as the objective function also yields C4 photosynthesis as the optimal solution. Hence, two alternatively or parallelly acting selective pressures towards C4 photosynthesis, limitation in C and/or N, are identified by the model. In both cases, the model correctly predicts the C4 cycle of carboxylation and decarboxylation and the C2 photorespiratory pump as observed in extant plants. The evolution of C4 photosynthesis in response to multiple selective pressures underscores its adaptive value and potential for agriculture. Intermediacy also evolves indicating that it, too, is likely an added value trait which could be pursued by breeding and engineering efforts.

The optimal solutions for the metabolic flux patterns predict an intermediate stage in which CO2 transport via photorespiratory intermediates glycolate and glycerate (Figure 3(E) and (F)) and decarboxylation by glycine decarboxylase complex (Figure 3(B)) is essential. All of the models of C4 evolution (Monson, 1999; Bauwe, 2010; Sage et al., 2012; Heckmann et al., 2013; Williams et al., 2013) predict that the establishment of a photorespiratory CO2 pump is an essential intermediate step towards the C4 cycle. The photorespiratory CO2 pump, also known as C2 photosynthesis, relocates the photorespiratory CO2 release to the bundle sheath cells. Plants using the photorespiratory CO2 pump are often termed C3-C4 intermediates owing to their physiological properties (Sage et al., 2012). Displaying the flux solution in Figure 3 on a metabolic map in Figure 3—figure supplement 1 clearly illustrates that increasing photorespiratory flux through Rubisco drives the two-cell metabolic model from C3 to C4 metabolism by passing the C3-C4 intermediate state. On the C3-C4 trajectory, the activity of Rubisco is shifted from the mesophyll to the bundle sheath, as well as from the constrained to the CCM-dependent Rubisco population as a consequence of the increased costs of photorespiration under increased pO2:pCO2 ratio, see Equation 5. The increase of the oxygenation rate in the photorespiration constraint drives the reprogramming of the metabolism to avoid oxygenation by establishing the C4 cycle. Therefore, our analysis recovers the evolutionary C3-C4 trajectory and confirms the emergence of a photorespiratory CO2 pump as an essential step during the C4 evolution also under optimisation for resources (Heckmann et al., 2013). The model may also provide a reason for why some plant species have halted their evolution in this intermediary phase (Scheben et al., 2017). Under the conditions of resource limitations and intermediate photorespiration, the model predicts intermediacy as the optimal solution. In a very narrow corridor of conditions, no further changes are required to reach optimality and the model thus predicts that a small number of species may remain intermediate.

Two-cell model realises different C4 states

Since the model predicts C4 metabolism without specific constraints, different input and reaction constraints can be tested for their influence on the molecular nature of the C4 cycle. This approach may identify the selective pressure and boundaries limiting evolution. Initial optimisation without additional constraints or input limitations predict a C4 cycle based on decarboxylation by NADP-ME (Figure 3 and Figure 3—figure supplement 1(A)). This prediction recapitulates intuition; the NADP-ME based C4 cycle is considered the 'most straight forward' incarnation of C4 photosynthesis, it is always explained first in textbooks and is a major focus of research. The NADP-ME based cycle thus represents the stoichiometrically optimal solution when resource limitation or photorespiration are considered. Once NADP-ME is no longer available via constraint, PEP-CK and NAD-ME become optimal solutions albeit with a prediction of malate and pyruvate as the transfer acids (Figure 6). The FVA identified aspartate and alanine as slightly less optimal solutions (Figure 5). Since in vivo this slightly less optimal solution has evolved in all NAD-ME origins tested to date, kinetic rather than stoichiometric reasons suggest themselves for the use of aspartate and alanine (Bräutigam et al., 2018).

Light is a potential evolutionary driver for the different C4 states

Since all extant C3 species and therefore also the ancestors of all C4 species contain all decarboxylation enzymes (Aubry et al., 2011), it is unlikely that unavailability of an enzyme is the reason for the evolution of different decarboxylation enzymes in different origins (Sage, 2004). Stochastic processes during evolution, that is up-regulation of particular enzyme concentrations via changes in expression and therefore elements cis to the gene (Bräutigam and Gowik, 2016), may have played a role in determining which C4 cycle evolved. Alternatively, environmental determinants may have contributed to the evolution of different C4 cycles. Physiological experiments have pointed to a connection between nitrogen use efficiency and type of decarboxylation enzyme (Pinto et al., 2016). Hence the variation in nitrogen input to the model was tested for their influence on optimal solutions with regard to decarboxylation enzymes. Input limitation of nitrogen, water as a metabolite, and CO2 limited the output of the system but did not change the optimal solution concerning decarboxylation Figure 6—figure supplement 1 making it an unlikely candidate as the cause. Differences in nitrogen use is possibly a consequence of decarboxylation type.

In some grasses, light penetrable cells overlay the vascular bundle leading to different light availability (summarised in Bellasio and Lundgren, 2016 and Karabourniotis et al., 2000) and hence light availability and distribution were tested (Figure 6(B)). Changes in light input and distribution of light input between mesophyll and bundle sheath indeed altered the optimal solutions (Figure 6(B)). The changes in the solution can be traced to the energy status of the plant cells. For very high light intensities, the alternative oxidases in the mitochondria are used to dissipate the energy and hence a path towards NAD-ME is paved. Under light limitation, the C4 cycle requires high efficiency and hence PEP-CK which, at least in part allows energy conservation by using PEP rather than pyruvate as the returning C4 acid, is favoured. Interestingly, the sensitivity of different species towards environmental changes in light is influenced by the decarboxylation enzyme present (Sonawane et al., 2018). NADP-ME species are less compromised compared to NAD-ME species by shade possibly reflecting an evolutionary remnant as NAD-ME is predicted to emerge only in high light conditions. PEP-CK is more energy efficient compared to malic enzyme based decarboxylation which requires PEP recycling by PPDK at the cost of two molecules of ATP (Figure 3(D)). Notably, two C4 plants known to rely on PEP-CK P. maximum and A. semialata (African accessions) are shade plants which grow in the understory (Lundgren and Christin, 2017). PEP-CK can be co-active with NADP-ME and NAD-ME (Figure 6(B)). This co-use of PEP-CK with a malic enzyme has been shown in C4 plants (Pick et al., 2011; Wingler et al., 1999) and explained as an adaptation to different energy availability and changes in light conditions (Pick et al., 2011; Bellasio and Griffiths, 2014). Dominant use of PEP-CK in the absence of malic enzyme activity as suggested (Figure 3(B), Figure 3—figure supplement 1 and Figure 4) is rare in vivo (Ueno and Sentoku, 2006) but observed in P. maximum and in A. semialata. While the model predictions are in line with ecological observations, we cannot exclude that kinetic constraints (i.e. [Bräutigam et al., 2018]) may also explain why a stoichiometrically optimal solution such as the NADP-ME cycle is not favoured in nature where NADP-ME and NAD-ME species evolve in nearly equal proportions (Sage, 2004).

Conclusion

CBM of photosynthetically active plant cells revealed a major knowledge gap impeding CBM, namely the unknown protonation state of most transport substrates during intracellular transport processes. When photoautotrophic metabolism was optimised in a single cell for minimal metabolic flux and therefore, optimal resource use, C3 photosynthetic metabolism was predicted as the optimal solution. Under low photorespiratory conditions, a two-celled model which contains a CCM-dependent Rubisco optimised for resource use, still predicts C3 photosynthesis. However, under medium to high photorespiratory conditions, a molecularly correct C4 cycle emerged as the optimal solution under resource limitation and photorespiration reduction as objective functions which points to resource limitation as an additional driver of C4 evolution. Light and light distribution was the environmental variable governing the choice of decarboxylation enzymes. Modelling compartmented eukaryotic cells correctly predicts the evolutionary trajectories leading to extant C4 photosynthetic plant species.

Materials and methods

Flux Balance Analysis

Request a detailed protocol

Flux balance analysis (FBA) is a CBM approach (Orth et al., 2010) to investigate the steady-state behaviour of a metabolic network defined by its stoichiometric matrix S. By employing linear programming, FBA allows computing an optimised flux distribution that minimises and/or maximises the synthesis and/or consumption rate of one specific metabolite or a combination of various metabolites. Next to the steady-state assumption and stoichiometric matrix S, FBA relies on the definition of the reaction directionality and reversibility, denoted by the lower bound vmin and upper bound vmax , as well as the definition of an objective function z. The objective function z defines a flux distribution v, with respect to an objective c.

(1) min/maxzFBA=cTvs.t.Sv=0vminvvmax

The degeneracy problem, the possible existence of alternate optimal solutions, is one of the major issues of constraint-based optimisation, such as FBA (Mahadevan and Schilling, 2003). To avoid this problem, we use the parsimonious version of FBA (pFBA) (Lewis et al., 2010). This approach incorporates the flux parsimony as a constraint to find the solution with the minimum absolute flux value among the alternative optima, which is in agreement with the assumption that the cell is evolutionary optimised to allocate a minimum amount of resources to achieve its objective.

(2) min/maxzpFBA=|vi|s.t.Sv=0vminvvmaxcTv=zFBA

All FBA experiments in this study employ pFBA and are performed using the cobrapy module in a python 2.7 environment run on a personal computer (macOS Sierra, 4 GHz Intel Core i7, 32 GB 1867 MHz DDR3). All FBA experiments are available as jupyter notebooks in the supplementary material and can also be accessed and executed from the GitHub repository https://github.com/ma-blaetke/CBM_C3_C4_Metabolism (Blätke, 2019; copy archived at https://github.com/elifesciences-publications/CBM_C3_C4_Metabolism).

Generic model for C3 metabolism

Metabolic model

Request a detailed protocol

The generic model representing the metabolism of a mesophyll cell of a mature photosynthetically active C3 leaf, further on called one-cell model, is based on the Arabidopsis core model (Arnold and Nikoloski, 2014). The model is compartmentalised into cytosol (c), chloroplast (h), mitochondria (m), and peroxisome (p). Each reaction in the Arabidopsis core model (Arnold and Nikoloski, 2014) was compared with the corresponding entry in AraCyc (Mueller et al., 2003). Based on the given information, we corrected co-factors, gene associations, enzyme commission numbers and reversibility (information from BRENDA [Schomburg et al., 2002] were included). The gene associations and their GO terms (Ashburner et al., 2000) of the cellular components were used to correct the location of reactions. Major additions to the model are the cyclic electron flow (Shikanai, 2016), alternative oxidases in mitochondria and chloroplast (Vishwakarma et al., 2015), as well as several transport processes between the compartments and the cytosol (Linka and Weber, 2010). NAD-dependent dehydrogenase to oxidise malate is present in all compartments (Gietl, 1992; Berkemeyer et al., 1998), which excludes the interconversion of NAD and NADP by cycles through the nitrate reductase present in the Arabidopsis core model. Correctly defining the protonation state of the metabolites in the various cellular compartments is a general drawback of metabolic models due to the lack of knowledge in that area. This issue mainly affects biochemical reactions and transport reactions involving protons. We added a sink/source reaction for protons in the form:

(3) H_{x}x=c,h,m,p

to all compartments to prevent futile fluxes of protons and other metabolites coupled through the proton transport. The curated one-cell model is provided in Figure 1—source data 1.

Import

Request a detailed protocol

As in Arnold and Nikoloski (2014), we assume photoautotrophic growth conditions. Only the import of light, water, CO2, inorganic phosphate (Pi), nitrate/ammonium, and sulphates/hydrogen sulphide is allowed, compare Table 3. More specifically, we do only allow for nitrate uptake, since it is the main source (80%) of nitrogen in leaves (Macduff and Bakken, 2003). The CO2 uptake is limited to 20 μmol/(m2s) (Lacher, 2003). Therefore, the carbon input constrains the model.

Table 3
Flux boundary constraints of Im-/export reactions
Input (Reaction ID)Flux [μmol/(m2s)]
Lower boundUpper bound
Photons (Im_hnu)0inf
C02 (Im_CO2)020
NO3- (Im_NO3)0inf
NH4+ (Im_NH4)00
SO42- (Im_SO4)0inf
H2S (Im_H2S)0inf
Pi0inf
H2O (Im_H2O)-infinf
O2 (Im_O2)-infinf
Amino Acids (Ex_AA)0inf
Surcose (Ex_Suc)0inf
Starch (Ex_starch)0inf
Other export reactions00
  1. -inf/inf is approximated by −106 / 106

Export

Request a detailed protocol

In contrast to Arnold and Nikoloski (2014), we focus on mature, fully differentiated and photosynthetic active leaves supporting the growth of the plant through the export of nutrients in the phloem sap, mainly sucrose and amino acids. An output reaction for sucrose Ex_Suc is already included in the model. An additional export reaction Ex_AA represents the relative proportion of 18 amino acids in the phloem sap of Arabidopsis as stoichiometric coefficients in accordance to experimentally measured data from Wilkinson and Douglas (2003). The ratio of exported sucrose : total amino acid is estimated to be 2.2 : 1 (Wilkinson and Douglas, 2003). This ratio is included as a flux ratio constraint of the reactions Ex_Suc and Ex_AA. Furthermore, it is known that the export of sucrose and the formation of starch is approximately the same (Stitt and Zeeman, 2012), which is reflected by the flux ratio constraint vEx_Suc:vEx_starch = 1:1. The model allows for the export of water and oxygen. The flux of all other export reactions is set to 0, see Table 3 for a summary.

Additional Constraints

Request a detailed protocol

We explicitly include the maintenance costs in our model to cover the amounts of ATP that is used to degradation and re-synthesis proteins for each compartment. (Li et al., 2017) specifies the ATP costs for protein degradation and synthesis of each compartment of a mature Arabidopsis leaf. Based on the given data, we were able to calculate the flux rates to constrain the maintenance reactions in each compartment (Table 4).

Table 4
Maintenance costs by compartment
CompartmentFlux [μmol/(m2s)]
cytosol0.0427
chloroplast0.1527
mitochondria0.0091
peroxisome0.0076

The one-cell model contains maintenance reactions only for the cytsol (NGAM_c), chloroplast (NGAM_h) and mitochondria (NGAM_m) in the form:

(4) ATP_{x}+H2O_{x}ADP_{x}+H_{x}+Pi_{x}x=c,h,m

An equivalent maintenance reaction cannot be formulated for the peroxisome since in the one-cell model ATP/ADP are not included as peroxisomal metabolites. The flux through the maintenance reactions is fixed to the determined maintenance costs given in Table 4. The peroxisomal maintenance costs are added to the cytosolic maintenance costs.

The CO2 and O2 partial pressures determine the ratio of the oxygenation : carboxylation rate of Rubisco (given by reactions RBO_h and RBC_h) and can be described by the mathematical expression:

(5) vRBO_hvRBC_h=1SRpO2pCO2,

where SR specifies the ability of Rubisco to bind CO2 over O2. In the case of a mature leave and ambient CO2 and O2 partial pressures in temperate regions with adequate water supply, the ratio vRBOh/vRBCh is fixed and is predicted to be 10%, which is encoded by an additional flux ratio constraint.

We assume no flux for the chloroplastic NADPH dehydrogenase (iCitDHNADP_h) and plastoquinol oxidase (AOX4_h) because (Josse et al., 2000) and (Yamamoto et al., 2011) have shown that their effect on the photosynthesis is minor.

Objective

Request a detailed protocol

In accordance with the assumption of mature, fully differentiated and photosynthetic active leaf, the model’s objective is to maximise the phloem sap output defined by reactions Ex_Suc and Ex_AA. Additionally, we assume that the involved plant cells put only a minimal metabolic effort, in the form of energy and resources, into the production of phloem sap as possible. This assumption is in correspondence with minimising the nitrogen investment by reducing the number of enzymes that are active in a metabolic network. Therefore, we perform a parsimonious FBA to minimise the total flux.

For enhanced compliance with the recent standards of the systems biology community, the one-cell model is encoded in SBML level 3. Meta-information on subsystems, publications, cross-references are provided as evidence code in the form of MIRIAM URI’s. FBA related information, gene association rules, charge and formula of a species element are encoded using the Flux Balance Constraints package developed for SBML level 3. All fluxes in the model are consistently defined as μmol/(m2s).

Generic model for C4 metabolism

Metabolic model

Request a detailed protocol

The generic model of C4 metabolism, short two-cell model, comprises two copies of the one-cell model to represent one mesophyll and one bundle sheath cell. Reactions and metabolites belonging to the metabolic network of the mesophyll are indicated with the prefix [M], whereas the prefix for the bundle sheath is [B]. The separate mesophyll and bundle sheath networks are connected via reversible transport reactions of the cytosolic metabolites indicated with the prefix [MB], Figure 2. The C4 evolution not only confined Rubisco to the bundle sheath cells, the CO2 concentrating mechanism steadily supplies Rubisco with CO2 in such a way that the oxygenation rate is negligible. Therefore, the bundle sheath network is equipped with two Rubisco populations. The native Rubisco population binds external CO2 and adheres to forced oxygenation : carboxylation ratios, where the optimised evolutionary population binds only internal CO2 and the carboxylation occurs independently of the oxygenation. External CO2 is defined as [B]_CO2_ex_{c,h} supplied by the mesophyll network. Internal CO2 given by [B]_CO2_{c,h,m} originates from reactions in the bundle sheath network producing CO2. External CO2in the bundle sheath network is only allowed to move to the chloroplast [B]_Tr_CO2h_Ex and to react with Rubisco [B]_RBC_h_Ex. The differentiation of two Rubisco populations binding either external or internal CO2 approximates the concentration-dependent shift of the oxygenation : carboxylation ratio.

Imports

Request a detailed protocol

As for the one-cell model, we assume photoautotrophic growth conditions, see Table 3. During C4 evolution the CO2 assimilation became more efficient allowing higher CO2 assimilation rates. Zea mays achieves up to 40 μmol/(m2s) ([M]_Im_CO2) (Rozema, 1993). We assume that the CO2 uptake from the environment by the bundle sheath has to be bridged by the mesophyll. Therefore, the input flux of [B]_Im_CO2 is set to zero.

Exports

Request a detailed protocol

The outputs of the one-cell model are transferred to the mesophyll and bundle sheath network, as well as the corresponding flux ratios, see Table 3.

Additional Constraints

Request a detailed protocol

The ATP costs for cell maintenance in the genC3 model are assigned to both cell types in the two-cell model. Due to declining CO2 concentrations over evolutionary time and/or adverse conditions which close the stromata, the oxygenation : carboxylation ratio of the native Rubisco population in the bundle sheath and the mesophyll is increased and can be predicted as 1 : 3, the corresponding flux ratios are adapted accordingly. Furthermore, we assume that the total photon uptake in the mesophyll and bundle sheath is in the range of 0 μmol/(m2s)to 1000 μmol/(m2s). Since they are more central in the leaf, the photon uptake by the bundle sheath must be equal or less compared to the mesophyll. The mesophyll and bundle sheath networks are connected by a range of cytosolic transport metabolites including amino acids, sugars (glucose, fructose, sucrose, trehalose, ribose), single phosphorylated sugar (glucose-6-phosphate, glucose-1-phosphate, fructose-6-phosphate, sucrose-6-phosphate), mono-/di-/tri-carboxylic acids (phosphoenolpyruvate, pyruvate, citrate, cis-aconitate, isocitrate, α-ketoglutarate, succinate, fumarate, malate), glyceric acids (2-Phosphoglycerate, 3-Phosphoglycerate), glycolate, glycerate, glyceraldehyde-3-phosphate, di-hydroxyacetone-phosphate and CO2. Nucleotides, NAD/NADH, NADP/NADPH, pyrophosphate, inorganic phosphate are not considered as transport metabolites. Oxaloacetate has been excluded as transport metabolite since concentrations of oxaloacetate are very low in vivo and it is reasonably unstable in aqueous solutions. Other small molecules that can be imported by the bundle sheath from the environment, as well as protons and HCO3-, are not exchanged between the two cell types.

Objective

Request a detailed protocol

The maximisation of the phloem sap output through the bundle sheath and the minimisation of the metabolic effort are kept as objectives in the two-cell model.

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files. We provide jupyter notebooks as documentation for all the in silico experiments using constraint-based modelling and additional python code for Figure 1, 3, 4, 6, as well as the metabolic network used as source data for Figure 1 which can be accessed and executed from the GitHub repository https://github.com/ma-blaetke/CBM_C3_C4_Metabolism (copy archived at https://github.com/elifesciences-publications/CBM_C3_C4_Metabolism).

References

    1. Bauwe H
    (2010)
    C4 Photosynthesis and Related CO2 Concentrating Mechanisms
    Photorespiration: The bridge to C4 photosynthesis, C4 Photosynthesis and Related CO2 Concentrating Mechanisms, The Netherlands, Springer.
    1. Bräutigam A
    2. Weber APM
    (2010)
    C4 Photosynthesis and Related CO2 Concentrating Mechanisms
    Transport processes: Connecting the reactions of C4 photosynthesis, C4 Photosynthesis and Related CO2 Concentrating Mechanisms, The Netherlands, Springer.
  1. Book
    1. Haberlandt GFJ
    (1904)
    Physiologische Pflanzenanatomie
    Engelmann V. erlagvonW, editors. Macmillan and Company.
  2. Book
    1. Heldt HW
    (2015)
    Piechulla B. Pflanzenbiochemie
    Berlin Heidelberg: Springer.
  3. Book
    1. Lacher W
    (2003)
    Physiological Plant Ecology: Ecophysiology and Stress Physiology of Functional Groups
    Heidelberg: Springer.
    1. Monson RK
    (1999)
    C4 Plant Biology
    The Origins of C4 Genes and Evolutionary Pattern in the C4 Metabolic Phenotype, C4 Plant Biology, Elsevier.
    1. Papp B
    2. Szappanos B
    3. Notebaart RA
    (2011)
    Methods in Molecular Biology
    Use of genome-scale metabolic models in evolutionary systems biology, Methods in Molecular Biology, Humana Press.
    1. Simons M
    2. Misra A
    3. Sriram G
    (2013)
    Methods in Molecular Biology
    Genome-scale models of plant metabolism, Methods in Molecular Biology, Humana Press.
  4. Conference
    1. Thomas K
    2. Benjamin R-K
    3. Fernando P
    4. Brian G
    5. Matthias B
    6. Jonathan F
    7. Kyle K
    8. Jessica H
    9. Jason G
    10. Sylvain C
    (2016)
    Jupyter notebooks: a publishing format for reproducible computational workflows
    Stand Alone. 2016; 0(Positioning and Power in Academic Publishing: Players, Agents and Agendas).
    1. Turgeon R
    (1989) The Sink-Source transition in leaves
    Annual Review of Plant Physiology and Plant Molecular Biology 40:119–138.
    https://doi.org/10.1146/annurev.pp.40.060189.001003

Decision letter

  1. Daniel J Kliebenstein
    Reviewing Editor; University of California, Davis, United States
  2. Christian S Hardtke
    Senior Editor; University of Lausanne, Switzerland
  3. Gavin Conant
    Reviewer; North Carolina State University, United States
  4. Amy Marshall-colon
    Reviewer; University of Illinois

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The environmental and evolutionary forces pushing the transition between different types of photosynthesis in plants has been a question of long-standing interest. In this work, the authors utilized computational modelling of plant metabolism in conjunction with the environmental parameters to test this question. They identify a surprisingly limited set environmental parameters and metabolic features that are sufficient to explain how these transitions occur and why.

Decision letter after peer review:

Thank you for submitting your article "Evolution of C4 photosynthesis predicted by constraint-based modelling" for consideration by eLife. Your article has been reviewed by three peer reviewers, one of whom is a member of our Board of Reviewing Editors, and the evaluation has been overseen by Christian Hardtke as the Senior Editor. The following individuals involved in review of your submission have agreed to reveal their identity: Gavin Conant (Reviewer #2); Amy Marshall-Colon (Reviewer #3).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

The main requests for clarity by the reviewers revolve around these three general areas.

1) Commenting on how this model may or may not fit to reality. This includes issues of assumptions inherent to the CBM model, flux issues in Arabidopsis that are lineage specific, and how existing physiological measures could be used to support the modeling.

2) Noted issues with the Jupyter Notebook.

3) Making the writing more accessible in how the data shows the claims and what are hypothesis generated by the models that need testing.

Reviewer #1:

The authors use a flux based optimization approach to investigate the C3/C4 transitions and the different C4 possibilities. Overall, this is a very interesting manuscript. As an editorial note, as a general biologist, I could understand the main conclusions due to the explicit statements making these conclusions. It would help however to walk the general biologists through the results and figures more to help them understand how the data makes these conclusions. I was able to divine this from the data but it did take effort that a general biologist may not put into the work.

I do have one point of clarification. The model is built on the Arabidopsis flux model but I was unsure if this is the generic flux model or the one that incorporates specialized metabolism. It might make some sense to comment on how using the Arabidopsis C3 map as a resource from the flux model given that Arabidopsis has very different sulfate/sulfur fluxes in comparison to non-Brassicaceous plants. This links directly to sulfur-amino acids and glutathione metabolism due to the heavy requirement for glucosinolate metabolism as shown by the work of Conant, Pires and colleagues. At the very least some discussion about how large carbon sinks not in the model may or may not change the conclusions would be helpful especially to guide future modelling efforts.

Reviewer #2:

In this manuscript, the authors develop a constraint-based metabolic model (CBM) of C4 photosynthesis using known C3 metabolic models with additional constraints. They compare the C3 and C4 models to see if various evolutionary pressures can drive a metabolic shift from C3 to C4 photosynthesis. They find that selection for minimizing metabolic turnover will drive metabolism total the C4 solution with differing light conditions controlling the appearance of different classes of C4.

I enjoyed reading this manuscript, and I commend the authors on their approach to an important and complex topic. However, I think that the manuscript could use improvement in two key areas.

First, while the authors discuss at several points how their models are consistent with physiological data, these comparisons appear to be vague and qualitative. For instance, in the ninth paragraph of the subsection “The curated Arabidopsis core model predicts physiological results”, the authors give many numerical predictions of their model, but no comparative values from real plants. Obviously, many model parameters are not measurable (hence the value of the model). But if the model is reproducing physiology, the authors should clearly specify how and with what level of error. Note also that there are advanced approaches for fitting models such as these to real data on gene expression levels, trying to make models that more closely follow observed physiology (1). The authors certainly need not apply them, but the text (c.f., especially the Abstract) as written rather implies that CBMs have not been successfully used with eukaryotes, which is not true. (Also note that "eukaryote" is not synonymous with "multicellular" (2)).

My second concern is that the authors do not make enough of an effort to guide the reader through the potential confounds of CBM for problems such as this one. For instance, in the last paragraph of the subsection “C4 modes with different decarboxylation enzymes result from different set of constraints”, the authors reject nitrogen-limitation as a driver of C4 evolution. However, if compounds that are composed of a good deal of nitrogen are used as catalysts in a CBM, the model will not predict a need for high levels of nitrogen because those compounds are regenerated. I believe that the various C4 carbon shuttles function in exactly this way, and a CBM that seeks only to synthesize sugars and amino acids and has no biomass component will not predict an increased nitrogen requirement for C4 even if that requirement exists, because the compounds requiring that nitrogen cycle in the model. To be clear, I suspect the authors are correct that light is a more likely driver of C4 evolution than nitrogen limitation. But I am not convinced the CBM proves this.

Likewise, CBMs "overpredict" solutions to metabolic problems because they are not kinetic. Hence, the fact that the base 2-cell model does not differentiate between the C4 classes does not mean that there are not kinetic differences that are evolutionarily important.

References:

1) Shlomi T, Cabili MN, Herrgard MJ, Palsson BO, and Ruppin E (2008) Network-based prediction of human tissue-specific metabolism. Nature Biotechnology 26(9):1003-1010.

2) Duarte NC, Herrgård MJ, and Palsson BØ (2004) Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Research 14:1298-1309.

Reviewer #3:

The manuscript by Blätke and Brautigam provides a novel method using constraint based modeling to predict the selective pressures that resulted in C4 metabolism. The take-away from this study is that C4 metabolism emerged from increasing photorespiration with concomitant resource efficiency, and that light is a driver for different C4 states. The model presented here overcomes some of the limitations of previous flux models of C3 and C4 metabolism, and makes an important contribution to the field. However, the main take-aways of this study are not easy for the reader to pull out as they are buried within the manuscript. This very interesting manuscript could be improved by streamlining the text and more clearly summarizing the main results either in the section titles or as a couple of sentences at the end of each Results section. Much of the Discussion is redundant to the results, so can be shortened for clarity. Likewise, the Discussion would be more interesting if the authors took a deeper dive into the potential outcomes and applicability of their results; how can the information revealed in this study guide future efforts for engineering C4 metabolism into C3 crops? Are there potential transcriptional regulatory mechanisms that could be investigated to evolve C4 metabolism in C3 crops? It is also unclear if the authors altered exogenous CO2 concentration in the model; this should either be more clearly explained how it was done and the results, or why it wasn't done. Finally, it is wonderful that the authors include their Jupyter Notebooks and source data for the reader; however, we identified a few issues that made it difficult to run the code. First, there is no Readme file to follow.

Second, the code would not run with the current installations of the Cobra and Escher packages; this was resolved by changing cobra.io.sbml3.read_sbml_model() to cobra.io.sbml.read_sbml_model() in the load_sbml_model.py file as cobra.io.sbml3 does not appear to exist in the current version of the cobra package. The escher package returned a syntax error from the plots.py file in the initialization step of the jupyter notebook files. In order to run the files, we commented out (removed) any uses of the escher package in the code.

[Editors' note: further revisions were requested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "Evolution of C4 photosynthesis predicted by constraint-based modelling" for further consideration by eLife. Your revised article has been reviewed by three peer reviewers, including Dan Kliebenstein as the Reviewing Editor, and the evaluation has been overseen by Christian Hardtke as the Senior Editor.

The manuscript has been improved but there are some remaining issues that need to be addressed before final acceptance, as outlined below.

There are a few suggestions where in a couple of places the writing could be tightened to improve the accessibility for the general reader.

Reviewer #1:

The authors have nicely addressed my previous concerns and I have nothing further to add.

Reviewer #2:

In this manuscript, the authors develop a constraint-based metabolic model (CBM) of C4 photosynthesis using known C3 metabolic models with additional constraints. They compare the C3 and C4 models to see if various evolutionary pressures can drive a metabolic shift from C3 to C4 photosynthesis.

The authors have addressed most of my concerns, but I still find the description of the effects of nitrogen, water and CO2 limitations confusing. The authors write "the optimal solution to the model predicted the same behavior" and "resulted in the same optimal solution as unlimited uptake with the differences of proportionally lower flux overall."

A quick reading of these statements makes one believe that the optimal solution is unchanged with N, H2O or CO2 limitation, which is common with FBA when a metabolite is not truly limiting. I think the authors' point is actually that the proportional fluxes are the same but with different magnitudes, but neither the text nor Figure 6 nor Figure 6—figure supplement 1 seem to me to be crystal clear on this point.

Introduction, second paragraph: "prohibitive" is a slightly odd word to use here. Lower speeds are very likely maladaptive, but are not lethal for growth.

Subsection “The curated Arabidopsis core model predicts physiological results”, first paragraph: I would add a few words on what these inputs and outputs are.

Reviewer #3:

The authors addressed each of our comments to various extents. We appreciate the clear headings that provide the reader with the main findings of each section. However, we still find the summary at the end of each section to be lacking. That said, the authors did a good job of streamlining the section on the C4 cycle under resource limitation to reveal the key insights. This helped to reduce the redundancy of the Discussion section. Importantly, the authors corrected the Jupyter Notebooks issue such that now they can easily be run. The conclusion paragraph still needs work to achieve clarity and a logical flow of ideas. For example, the second sentence seems out of place. Otherwise we are satisfied with the revised version.

https://doi.org/10.7554/eLife.49305.sa1

Author response

Reviewer #1:

The authors use a flux based optimization approach to investigate the C3/C4 transitions and the different C4 possibilities. Overall, this is a very interesting manuscript. As an editorial note, as a general biologist, I could understand the main conclusions due to the explicit statements making these conclusions.

Reviewer 3 asked for streamlining the manuscript, and indeed, some sections between the Results and Discussion were repetitive. We have excised those Discussion elements from the Results and hope that the clarity has not suffered.

It would help however to walk the general biologists through the results and figures more to help them understand how the data makes these conclusions. I was able to divine this from the data but it did take effort that a general biologist may not put into the work.

Thank you for this comment. We have now added an introductory sentence for each figure that explains how the visualization is generated and why. In essence, all figures and tables are partial visualizations of the flux distribution predicted by the model under different constraints.

I do have one point of clarification. The model is built on the Arabidopsis flux model but I was unsure if this is the generic flux model or the one that incorporates specialized metabolism. It might make some sense to comment on how using the Arabidopsis C3 map as a resource from the flux model given that Arabidopsis has very different sulfate/sulfur fluxes in comparison to non-Brassicaceous plants. This links directly to sulfur-amino acids and glutathione metabolism due to the heavy requirement for glucosinolate metabolism as shown by the work of Conant, Pires and colleagues.

The Arabidopsis core model from Arnold and Nikoloski, 2014, describes only the primary metabolism. We added two sentences to make explicit the absence of specialized/secondary metabolism. Our curation process based on the Arabidopsis core model did not add reactions part of the specialized metabolism. The duplicated two-cell model we are using to investigate C4 also contains no additional specialized metabolic routes; only transport reactions have been added.

At the very least some discussion about how large carbon sinks not in the model may or may not change the conclusions would be helpful especially to guide future modelling efforts.

We agree that the current model is unable to integrate biotic interactions since it lacks specialized metabolism in both metabolic map and output function. We have inserted one sentence each in the discussion of the results of the single-cell model and the two-cell model to make this absence explicit for the reader.

Reviewer #2:

[…] I enjoyed reading this manuscript, and I commend the authors on their approach to an important and complex topic. However, I think that the manuscript could use improvement in two key areas.

First, while the authors discuss at several points how their models are consistent with physiological data, these comparisons appear to be vague and qualitative. For instance, in the ninth paragraph of the subsection “The curated Arabidopsis core model predicts physiological results”, the authors give many numerical predictions of their model, but no comparative values from real plants. Obviously, many model parameters are not measurable (hence the value of the model). But if the model is reproducing physiology, the authors should clearly specify how and with what level of error.

Thank you for making this point. The original plant model on which the single-cell model was based contained multiple reactions which lead to flux distributions through enzymes and transporters not known to carry large fluxes in plants. This included circular fluxes across membranes for simple transport processes since transport reactions were missing. All of these were corrected, leading to a state more in agreement with the physiological state and the biochemistry according to current textbooks.

To our knowledge, MFA in plant leaves is currently limited to the Calvin Benson Bassham cycle (https://www.pnas.org/content/pnas/111/47/16967.full.pdf), and hence it is presently not possible for us to benchmark the model predictions against a known flux distribution.

Hence, we limit our comparison to output from the model and make the comparison explicit in a table in Figure 1B, which includes the model predictions and the measured values. A more detailed interpretation and comparison to experimental measurements can be found in the Discussion section, for the particular case “One cell model reflects C3 plant physiology”. We now also include a flux distribution table. Using this table, one can check the fluxes against one’s own expectations of plant metabolism without having to re-run the model.

Note also that there are advanced approaches for fitting models such as these to real data on gene expression levels, trying to make models that more closely follow observed physiology (1). The authors certainly need not apply them, but the text (c.f., especially the Abstract) as written rather implies that CBMs have not been successfully used with eukaryotes, which is not true. (Also note that "eukaryote" is not synonymous with "multicellular" (2)).

We did not attend to leave the impression that CBM has not already been successfully applied to eukaryotic/multicellular systems. We instead aimed to say that we see potential in the applications of CBMs when it comes to systems more complex than single-celled bacteria. According to the comment of reviewer 3, we have rewritten the Abstract and rephrased the respective part in the new Abstract and in the Introduction.

My second concern is that the authors do not make enough of an effort to guide the reader through the potential confounds of CBM for problems such as this one. For instance, in the last paragraph of the subsection “C4 modes with different decarboxylation enzymes result from different set of constraints”, the authors reject nitrogen-limitation as a driver of C4 evolution. However, if compounds that are composed of a good deal of nitrogen are used as catalysts in a CBM, the model will not predict a need for high levels of nitrogen because those compounds are regenerated. I believe that the various C4 carbon shuttles function in exactly this way, and a CBM that seeks only to synthesize sugars and amino acids and has no biomass component will not predict an increased nitrogen requirement for C4 even if that requirement exists, because the compounds requiring that nitrogen cycle in the model. To be clear, I suspect the authors are correct that light is a more likely driver of C4 evolution than nitrogen limitation. But I am not convinced the CBM proves this.

We obviously were not clear enough in our writing. Indeed, the model predicts C4 evolution if we optimize for minimal total flux. To us, this indicates that C4 is predicted to evolve under conditions where resources (i.e. N) are limited. The C:N ratios of C4 plants and intermediates support this prediction.

Light and its distribution “only” influences which decarboxylation path is favoured, while limited N uptake does not influence the decarboxylation path. Reviewer 3 criticized that the manuscript was not written clearly enough. The Results section was edited to clarify this issue in the text.

Likewise, CBMs "overpredict" solutions to metabolic problems because they are not kinetic. Hence, the fact that the base 2-cell model does not differentiate between the C4 classes does not mean that there are not kinetic differences that are evolutionarily important.

We agree that kinetic constraints are likely also critical. To avoid the impression that CBM provides a final solution, we have inserted a sentence in the Discussion, which points out the critical role of kinetic constraints.

References:

1) Shlomi T, Cabili MN, Herrgard MJ, Palsson BO, and Ruppin E (2008) Network-based prediction of human tissue-specific metabolism. Nature Biotechnology 26(9):1003-1010.

2) Duarte NC, Herrgård MJ, and Palsson BØ (2004) Reconstruction and validation of Saccharomyces cerevisiae iND750, a fully compartmentalized genome-scale metabolic model. Genome Research 14:1298-1309.

Reviewer #3:

The manuscript by Blätke and Brautigam provides a novel method using constraint based modeling to predict the selective pressures that resulted in C4 metabolism. The take-away from this study is that C4 metabolism emerged from increasing photorespiration with concomitant resource efficiency, and that light is a driver for different C4 states. The model presented here overcomes some of the limitations of previous flux models of C3 and C4 metabolism, and makes an important contribution to the field. However, the main take-aways of this study are not easy for the reader to pull out as they are buried within the manuscript. This very interesting manuscript could be improved by streamlining the text and more clearly summarizing the main results either in the section titles or as a couple of sentences at the end of each Results section.

We have removed the intermittent partial discussion in favour of single sentences at the end of each section. Section headers were also changed.

Much of the Discussion is redundant to the Results, so can be shortened for clarity.

Thank you for pointing this out. We have removed the intermittent discussion from the Results and limited interpretation in the Results to single sentences at the end of each section.

Likewise, the Discussion would be more interesting if the authors took a deeper dive into the potential outcomes and applicability of their results; how can the information revealed in this study guide future efforts for engineering C4 metabolism into C3 crops? Are there potential transcriptional regulatory mechanisms that could be investigated to evolve C4 metabolism in C3 crops?

Thank you for pointing out this oversight. We have now inserted into the Discussion an evaluation of C4 and intermediacy as breeding and engineering targets for agriculture.

With regard to the transcriptional regulation, we think CBM cannot reveal potential transcriptional regulatory mechanisms. In Mallmann et al., 2014, and in the review Bräutigam and Gowik, 2016, we point out that evolution most likely occurs in cis in the C4 cycle genes; in Kühlahoglu et al. we have proposed that the C4 genes are likely hooked into the photosynthetic regulon.

It is also unclear if the authors altered exogenous CO2 concentration in the model; this should either be more clearly explained how it was done and the results, or why it wasn't done.

The CO2 concentration is reflected in changes to photorespiration. There are multiple factors (i.e. external CO2 concentration and stomatal opening, which in turn is influenced by plant water status and biotic interactions) which alter CO2 availability in the plant. Hence we opted to converge them in altered photorespiratory flux. We now added a sentence to make this fact explicit.

Finally, it is wonderful that the authors include their Jupyter Notebooks and source data for the reader; however, we identified a few issues that made it difficult to run the code. First, there is no Readme file to follow.

Second, the code would not run with the current installations of the Cobra and Escher packages; this was resolved by changing cobra.io.sbml3.read_sbml_model() to cobra.io.sbml.read_sbml_model() in the load_sbml_model.py file as cobra.io.sbml3 does not appear to exist in the current version of the cobra package. The escher package returned a syntax error from the plots.py file in the initialization step of the jupyter notebook files. In order to run the files, we commented out (removed) any uses of the escher package in the code.

Thank you for putting in the effort to run the code and point out the deficiencies in the documentation. We updated the file “READM.md” in the GitHub repository to link listed figure supplements in chronological order to the corresponding jupyter notebooks. In addition, we added a file “environment.yml” which contains detailed information about the used module versions to avoid any version conflicts. With the addition of the “environment.yml” the jupyter notebooks can now easily be executed at “https://mybinder.org/”, links are provided in the “READM.md”. Please be aware, that we provide the GitHub repository as zipped folder, but it is more convenient to view and access files in the GitHub repository itself:

https://github.com/ma-blaetke/CBM_C3_C4_Metabolism

We added a short comment about the GitHub Repository to the end of the first Results section and to the “Materials and methods”.

[Editors' note: further revisions were requested prior to acceptance, as described below.]

Reviewer #2:

[…] The authors have addressed most of my concerns, but I still find the description of the effects of nitrogen, water and CO2 limitations confusing. The authors write "the optimal solution to the model predicted the same behavior" and "resulted in the same optimal solution as unlimited uptake with the differences of proportionally lower flux overall."

A quick reading of these statements makes one believe that the optimal solution is unchanged with N, H2O or CO2 limitation, which is common with FBA when a metabolite is not truly limiting. I think the authors' point is actually that the proportional fluxes are the same but with different magnitudes, but neither the text nor Figure 6 nor Figure 6—figure supplement 1 seem to me to be crystal clear on this point.

We have edited the text passage to reflect our results better and now clearly state that the solutions result in reduced flux but proportionally identical solutions. We have also edited the figure legend of the figure in question to clarify the point further.

Introduction, second paragraph: "prohibitive" is a slightly odd word to use here. Lower speeds are very likely maladaptive, but are not lethal for growth.

Subsection “The curated Arabidopsis core model predicts physiological results”, first paragraph: I would add a few words on what these inputs and outputs are.

We also corrected the typos and wording as suggested by the reviewers. As suggested by reviewer #2, we also added a short explanation on inputs and outs after introducing the basic set up of flux balance analysis.

Reviewer #3:

The authors addressed each of our comments to various extents. We appreciate the clear headings that provide the reader with the main findings of each section. However, we still find the summary at the end of each section to be lacking. That said, the authors did a good job of streamlining the section on the C4 cycle under resource limitation to reveal the key insights. This helped to reduce the redundancy of the Discussion section. Importantly, the authors corrected the Jupyter Notebooks issue such that now they can easily be run. The conclusion paragraph still needs work to achieve clarity and a logical flow of ideas. For example, the second sentence seems out of place. Otherwise we are satisfied with the revised version.

The concern of reviewer #3 with the conclusion statement was addressed by rephrasing to improve the flow.

https://doi.org/10.7554/eLife.49305.sa2

Article and author information

Author details

  1. Mary-Ann Blätke

    Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Tübingen, Germany
    Contribution
    Resources, Data curation, Software, Formal analysis, Investigation, Visualization, Methodology, Writing—original draft
    For correspondence
    blaetke@ipk-gatersleben.de
    Competing interests
    No competing interests declared
    ORCID icon "This ORCID iD identifies the author of this article:" 0000-0002-4790-7377
  2. Andrea Bräutigam

    1. Leibniz Institute of Plant Genetics and Crop Plant Research (IPK), Tübingen, Germany
    2. Computational Biology, Faculty of Biology, Bielefeld University, Universitätsstraße, Bielefeld, Germany
    Contribution
    Conceptualization, Supervision, Validation, Writing—original draft
    Competing interests
    No competing interests declared

Funding

The authors declare that there was no funding for this work.

Acknowledgements

We like to thank Udo Gowik (Carl von Ossietzky University Oldenburg, Germany) and Urte Schlüter (Heinrich-Heine-University Düsseldorf, Germany) for critically revising this manuscript.

Senior Editor

  1. Christian S Hardtke, University of Lausanne, Switzerland

Reviewing Editor

  1. Daniel J Kliebenstein, University of California, Davis, United States

Reviewers

  1. Gavin Conant, North Carolina State University, United States
  2. Amy Marshall-colon, University of Illinois

Version history

  1. Received: June 13, 2019
  2. Accepted: November 8, 2019
  3. Accepted Manuscript published: December 4, 2019 (version 1)
  4. Version of Record published: December 10, 2019 (version 2)
  5. Version of Record updated: June 22, 2021 (version 3)

Copyright

© 2019, Blätke and Bräutigam

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

  • 7,788
    Page views
  • 591
    Downloads
  • 31
    Citations

Article citation count generated by polling the highest count across the following sources: Crossref, PubMed Central, Scopus.

Download links

A two-part list of links to download the article, or parts of the article, in various formats.

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

  1. Mary-Ann Blätke
  2. Andrea Bräutigam
(2019)
Evolution of C4 photosynthesis predicted by constraint-based modelling
eLife 8:e49305.
https://doi.org/10.7554/eLife.49305

Further reading

    1. Biochemistry and Chemical Biology
    2. Computational and Systems Biology
    Ian R Outhwaite, Sukrit Singh ... Markus A Seeliger
    Research Article

    Kinase inhibitors are successful therapeutics in the treatment of cancers and autoimmune diseases and are useful tools in biomedical research. However, the high sequence and structural conservation of the catalytic kinase domain complicates the development of selective kinase inhibitors. Inhibition of off-target kinases makes it difficult to study the mechanism of inhibitors in biological systems. Current efforts focus on the development of inhibitors with improved selectivity. Here, we present an alternative solution to this problem by combining inhibitors with divergent off-target effects. We develop a multicompound-multitarget scoring (MMS) method that combines inhibitors to maximize target inhibition and to minimize off-target inhibition. Additionally, this framework enables optimization of inhibitor combinations for multiple on-targets. Using MMS with published kinase inhibitor datasets we determine potent inhibitor combinations for target kinases with better selectivity than the most selective single inhibitor and validate the predicted effect and selectivity of inhibitor combinations using in vitro and in cellulo techniques. MMS greatly enhances selectivity in rational multitargeting applications. The MMS framework is generalizable to other non-kinase biological targets where compound selectivity is a challenge and diverse compound libraries are available.

    1. Computational and Systems Biology
    2. Physics of Living Systems
    Jennifer K Briggs, Anne Gresch ... Richard KP Benninger
    Research Article

    Diabetes is caused by the inability of electrically coupled, functionally heterogeneous -cells within the pancreatic islet to provide adequate insulin secretion. Functional networks have been used to represent synchronized oscillatory [Ca2+] dynamics and to study -cell subpopulations, which play an important role in driving islet function. The mechanism by which highly synchronized -cell subpopulations drive islet function is unclear. We used experimental and computational techniques to investigate the relationship between functional networks, structural (gap-junction) networks, and intrinsic -cell dynamics in slow and fast oscillating islets. Highly synchronized subpopulations in the functional network were differentiated by intrinsic dynamics, including metabolic activity and KATP channel conductance, more than structural coupling. Consistent with this, intrinsic dynamics were more predictive of high synchronization in the islet functional network as compared to high levels of structural coupling. Finally, dysfunction of gap junctions, which can occur in diabetes, caused decreases in the efficiency and clustering of the functional network. These results indicate that intrinsic dynamics rather than structure drive connections in the functional network and highly synchronized subpopulations, but gap junctions are still essential for overall network efficiency. These findings deepen our interpretation of functional networks and the formation of functional sub-populations in dynamic tissues such as the islet.