Integrative Approaches to Enhance Understanding of Plant Metabolic Pathway Structure and Regulation1

Page created by Juanita Morales
 
CONTINUE READING
Integrative Approaches to Enhance Understanding of Plant Metabolic Pathway Structure and Regulation1
Update on Integrative Studies

Integrative Approaches to Enhance Understanding of
Plant Metabolic Pathway Structure and Regulation1

Takayuki Tohge*, Federico Scossa, and Alisdair R. Fernie
Max-Planck-Institute of Molecular Plant Physiology, 14476 Potsdam-Golm, Germany (T.T., A.R.F.); and
Consiglio per la Ricerca e Analisi dell’Economia Agraria, Centro di Ricerca per la Frutticoltura, 00134 Rome,
Italy (F.S.)

                                                                                                                                                         Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
Huge insight into molecular mechanisms and biological network coordination have been achieved following the application of
various profiling technologies. Our knowledge of how the different molecular entities of the cell interact with one another
suggests that, nevertheless, integration of data from different techniques could drive a more comprehensive understanding of
the data emanating from different techniques. Here, we provide an overview of how such data integration is being used to aid
the understanding of metabolic pathway structure and regulation. We choose to focus on the pairwise integration of large-scale
metabolite data with that of the transcriptomic, proteomics, whole-genome sequence, growth- and yield-associated phenotypes,
and archival functional genomic data sets. In doing so, we attempt to provide an update on approaches that integrate data
obtained at different levels to reach a better understanding of either single gene function or metabolic pathway structure and
regulation within the context of a broader biological process.

   The diversity of metabolites in the plant kingdom is                       utility in enhancing our understanding of enzyme
staggering: a commonly quoted estimate is that plants                         mechanisms (and their regulation) and about the in
produce somewhere in the order of 200,000 unique                              vivo functions of enzymes, respectively. To give just a
chemical structures (Dixon and Strack, 2003; Yonekura-                        couple of recent examples from organic acid metabo-
Sakakibara and Saito, 2009; Tohge et al., 2014). Of these,                    lism, a detailed study of the effect of phosphorylation of
only a relatively small subset will be abundant in any                        phosphoenolpyruvate carboxylase reveals an important
given tissue or any one species (Fernie, 2007); however,                      anaplerotic control point in developing castor bean
certain species have evolved a particularly rich meta-                        (Ricinus communis) endosperm (Hill et al., 2014), while
bolic diversity, presumably in response to environ-                           the enzyme pyruvate orthophosphate dikinase was
mental features of their habitat (for examples, see                           recently demonstrated to represent a second gateway
Futuyma and Agrawal, 2009; Moore et al., 2014; Li et al.,                     for organic acids into the gluconeogenic pathway in
2015). Given these facts, it is unsurprising that our                         Arabidopsis (Arabidopsis thaliana; Eastmond et al., 2015).
current understanding of the metabolic structure of a                         We aim to provide examples from both primary and
large number of pathways remains fragmentary; not to                          secondary metabolism and to illustrate the power of
mention our current views of regulatory mechanisms                            such approaches both in (1) gene functional annotation
underlying metabolite accumulation, which cover, at                           and (2) enhancing our understanding of the systems-
best, a very limited fraction of the metabolic network.                       level response to cellular circumstances. We will addi-
This statement is especially true for the highly special-                     tionally discuss recent studies combining genome
ized pathways of secondary metabolism, although a                             sequence data with metabolomics in order to highlight
number of gaps still remain to be filled also concerning                       the utility of such approaches in metabolic quantitative
important sectors of plant primary metabolism. As                             loci analyses. Finally, we will detail insight that can be
detailed in other Update articles within this issue, the                      obtained from fusing archived data that can be down-
adoption of various broad-scale profiling technolo-                            loaded from databases with experimental data gener-
gies to assess the gene, transcript, protein, and small                       ated de novo. Given that, as documented previously
molecule complement of the cell has started to mine                           (Fernie and Stitt, 2012), a number of complicating fac-
this metabolic complexity. Additionally, the same ap-                         tors still exist when attempting such analyses, we will
proaches have also started to shed light on the evolu-                        discuss these on an approach-by-approach basis.
tion of gene and metabolite regulatory networks across
the plant kingdom. In addition to large-scale profiling
approaches, classical reductionist biochemistry and
reverse genetic approaches retain, in any case, great                         INTEGRATING METABOLITE AND
                                                                              TRANSCRIPTOME DATA
  1
    This work was supported by the Max Planck Society (to T.T. and               The earliest integrative approaches with relevance to
A.R.F.) and an Alexander von Humboldt grant (to T.T.).                        plant metabolism featured the combination of data
  * Address correspondence to tohge@mpimp-golm.mpg.de.                        from transcript and metabolite profiling (Urbanczyk-
  www.plantphysiol.org/cgi/doi/10.1104/pp.15.01006                            Wochniak et al., 2003; Achnine et al., 2005; Tohge
Plant PhysiologyÒ, November 2015, Vol. 169, pp. 1499–1511, www.plantphysiol.org Ó 2015 American Society of Plant Biologists. All Rights Reserved. 1499
Integrative Approaches to Enhance Understanding of Plant Metabolic Pathway Structure and Regulation1
Tohge et al.

et al., 2005). Such studies were initially restricted to      metabolites across tuber development, irrespective of
model species for which ESTs or oligonucleotides were         whether the transcript was associated with the meta-
available; early transcriptomics approaches relied in         bolic pathway under question or not (Urbanczyk-
fact on differential hybridization of complementary           Wochniak et al., 2003).
DNA samples to known sequences immobilized on                    This approach was indeed able to identify some
solid supports. The advent of next-generation se-             transcripts that exhibited very high correlation with the
quencing technologies, however, has removed this              expression of certain genes and, as such, proved effec-
barrier, and far more exotic species are beginning to be      tive in identifying a number of candidate genes for
studied using this approach (Góngora-Castillo et al.,         biofortification. By corollary, the same approach can be,
2012; Gechev et al., 2013; Li et al., 2015). Two basic        and indeed has been, used to elucidate the variation in
questions are commonly addressed by combining tran-           gene-to-metabolite networks following short- and long-
script and metabolite data. The first concerns whether a       term nutritional stresses in Arabidopsis (Hirai et al.,

                                                                                                                                  Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
gene functions within a given metabolic pathway.              2004) or to identify metabolic regulators of gene ex-
When a better characterization of the pathway is achieved,    pression (Hirai et al., 2007). Cryptoxanthin, for exam-
it becomes fundamental to investigate also the extent         ple, was identified as highly correlating with a broad
of transcriptional control (except in some cases, for ex-     number of genes across diverse environmental condi-
ample, regulation by posttranscriptional modifications         tions in Arabidopsis (Hannah et al., 2010), and the or-
of the enzyme and positive/negative feedback reg-             ganic acid malate was putatively identified (Carrari
ulation by substrates/products) under various physio-         et al., 2006) and subsequently confirmed (Centeno et al.,
logical conditions and how it is distributed across the       2011) to be important in mediating the ripening process
various enzymatic steps.                                      in tomato (Solanum lycopersicum). Such current studies
   Initial observations about the role of differential gene   are all examples of the guilt-by-association approach,
expression in tuning the synthesis of metabolites date        which in essence postulates biological entities as being
back to the 1990s. Some specific pathways, such as             functionally related if they exhibit strong correlation or
hormone, glucosinolate, and flavonoid biosynthesis,            coresponse across a wide range of cellular circum-
were the initial focus of these investigations. For ex-       stances. The power of this approach is that, given that it
ample, differential mechanisms of gene expression             does not rely on a priori pathway knowledge, it can
helped clarify in Arabidopsis the involvement of two          have great utility in identifying novel metabolic inte-
different nitrilase genes in regulating the synthesis of      gration and/or novel regulatory mechanisms (Hirai
auxin (Bartling et al., 1994). Similarly, the contributions   et al., 2007; Tohge et al., 2007; Yonekura-Sakakibara
of gene duplication and inducible gene expression             et al., 2008; Tohge and Fernie, 2010). However, a
(differential activation of subsets of biosynthetic genes)    drawback of the approach is that, in the absence of
were shown to impact the amount and the composition           subsequent rounds of experimentation, it is difficult to
of glucosinolates (Kliebenstein et al., 2001). An addi-       gain any insight into the mechanistic links underlying
tional early evidence of the role of specific transcript       the observed behavior, given that correlation between
accumulation on a metabolic phenotype came from the           biological entities does not always imply causation or
elucidation of the role that different regulation mecha-      the existence of functional links (Sweetlove and Fernie,
nisms affecting Trp synthase a and b had on the               2005; Sweetlove et al., 2008; Stitt, 2013). In this regard, it
amount of 2,4-dihydroxy-7-methoxy-1,4-benzoxazin-3-           becomes imperative to validate the outputs of coex-
one, a natural pesticide synthesized in maize (Zea mays)      pression analyses with follow-up approaches in order
leaves (Melanson et al., 1997). Another example of the        to prove the existence of putative functional links. Ar-
coordination between transcripts and metabolite accu-         guably, the greatest advances made to date following
mulation came from the analysis of maize anthers,             approaches to integrate transcript and metabolite data
where a strong correlation was found between the ex-          have been achieved in gene annotation and the struc-
pression of a structural gene (flavanone 3-hydroxylase)        tural elucidation of plant intermediary and secondary
and the appearance of specific flavonols (mainly quer-          metabolism.
cetin and kaempferol; Deboo et al., 1995). These same            Two early studies of particular note are those from
approaches have also been used to select a number of          the Saito and Dixon laboratories investigating Arabi-
candidate genes involved in the biosynthesis of cap-          dopsis anthocyanin and Medicago truncatula triterpene
saicinoids, a group of vanillylamides conferring pun-         metabolism, respectively (Achnine et al., 2005; Tohge
gency to hot peppers (Capsicum spp.). In this case, the       et al., 2005). In the case of the anthocyanin pathway,
comparison between sweet and hot pepper varieties             prior to the study of Tohge et al. (2005), no late bio-
facilitated the identification of some placenta-specific,       synthetic genes involved in anthocyanin decoration
differentially expressed genes that were directly corre-      steps had been identified in Arabidopsis, although all
lated with the accumulation of capsaicinoids (Curry           early biosynthetic genes have been characterized by
et al., 1999). The examples cited above laid the foun-        visible phenotype screening. A combination of tran-
dation for large-scale studies using the parallel analysis    script and metabolite profiling on a Production of An-
of transcripts and metabolites. One of the first examples      thocyanin Pigment1 activation-tagged line alongside
of this approach focused on the identification of tran-        validatory experiments involving both heterologously
scripts strongly correlated with the abundance of given       expressed enzymes and knockout mutants resulted in
1500                                                                                              Plant Physiol. Vol. 169, 2015
Integrative Studies of Metabolism

the identification of five genes and the identification of         be active under specific conditions. Occasionally,
up to 11 anthocyanins. Such confirmatory experiments             however, they can also provide more mechanistic in-
are essential in order to unequivocally assign gene             formation. One prominent example of this is the de-
function. The combination of reverse genetic strategies         tailed analysis of several transgenic Arabidopsis lines
with the characterization of enzyme activity when the           with altered flavonoid levels via transcriptomic and
gene is expressed in a heterologous system remains the          metabolomics analyses, including hormone analysis,
gold standard for the molecular identification of novel          which revealed that the overaccumulation of fla-
enzyme-catalyzed reactions (Tohge et al., 2005; Luo             vonoids exhibiting strong oxidative capacity in vitro
et al., 2007; Yonekura-Sakakibara et al., 2012). Subse-         also confers oxidative stress and drought tolerance
quent follow-up studies have identified some six genes           (Nakabayashi et al., 2014; Nakabayashi and Saito,
associated with flavonol metabolism, and some 24                 2015). In addition, a range of developmental processes
compounds (among 35 compounds found) of this class              have been followed at high resolution by a combination

                                                                                                                               Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
have now been identified in Arabidopsis (Tohge et al.,           of transcript and metabolite profiles. Such studies are
2007; Yonekura-Sakakibara et al., 2007, 2008, 2014;             dominated by studies of fruit ripening (Zamboni et al.,
Nakabayashi et al., 2009; Tohge and Fernie, 2010; Saito         2010; Lin et al., 2015; Vallarino et al., 2015) and leaf
et al., 2013; Fig. 1). While the expansion of the charac-       development (Pick et al., 2011; Wang et al., 2014);
terized triterpenoid metabolism in M. truncatula is not         however, they are not limited to these processes, with
quite so impressive, the study of Achnine et al. (2005)         studies also covering the development of various or-
allowed the functional annotation of 30 different sa-           gans, lignin deposition, and the establishment of
ponins, and currently, over 70 metabolites of this com-         arbuscular mycorrhizal symbiosis (Vanholme et al.,
pound class have been identified in M. truncatula                2013; Laparre et al., 2014; Nakamura et al., 2014; Wang
(Pollier et al., 2011; Gholami et al., 2014; Watson et al.,     et al., 2014). In this regard, these approaches prove
2015). The utility of this approach is at its greatest for      informative in clarifying the relative importance of
the relatively unchartered pathways of specialized              seemingly redundant pathways of biosynthesis and the
metabolism; however, it is worth noting that slight             degradation of specific metabolites or may also help
variations on this strategy independently identified the         to define the role of those primary metabolites (e.g.
gene encoding plant Thr aldolase (Fernie et al., 2004;          g-aminobutyrate) for which a signaling role was hy-
Jander et al., 2004) in Arabidopsis and 2,4-dihydroxy-7-        pothesized (Batushansky et al., 2014). For example,
methoxy-1,4-benzoxazin-3-one glucoside methyltrans-             ascorbate biosynthesis, which is one of the well-studied
ferase in maize (Meihls et al., 2013). A decade later, the      metabolisms in several higher plants, especially in
number of species and pathways for which this ap-               Arabidopsis (Wheeler et al., 1998; Gatzek et al., 2002;
proach has been adopted has expanded massively to               Laing et al., 2004; Conklin et al., 2006; Dowdle et al.,
include several crops and medicinal plants. Strategies          2007), has been revealed as the dominant route of
combining transcript and metabolite profiling have               ascorbate biosynthesis during ripening in tomato
proved effective in elucidating the structure of several        (Carrari et al., 2006). Another example could be found in
metabolic pathways involved in the synthesis of pri-            the elucidation of the arogenate pathway as an alterna-
mary metabolites, flavonoids, terpenoids, and alkaloids          tive route for Phe biosynthesis (Dal Cin et al., 2011). A
(Osorio et al., 2011, 2012; Shelton et al., 2012; Lin et al.,   similar approach in Arabidopsis, based on feeding
2015).                                                          studies and coexpression analysis, allowed an alterna-
   On a broader level, the combination of transcript and        tive pathway to be proposed for Lys degradation in
metabolite profiling has commonly been used for                  dark-induced senescent leaves (Araújo et al., 2010).
multilayered descriptions of plant responses, partic-              However, despite the fact that these examples illus-
ularly those to abiotic stress (Gibon et al., 2006;             trate that combined transcriptome/metabolome stud-
Maruyama-Nakashita et al., 2006; Kusano et al., 2011;           ies provide increases in our understanding of the
Gechev et al., 2013; Bielecka et al., 2014; Nakabayashi         regulation of metabolic networks, we contend that they
et al., 2014). In this vein, a number of studies have been      remain at their most powerful in gene functional an-
carried out that assess the combined transcript and             notation and in the elucidation of species- and/or
metabolite responses to water stress, temperature               tissue-specific metabolic pathway structures.
stress, light stress, and limitations of nutrient supply
(Urano et al., 2009; Caldana et al., 2011; Kusano et al.,
2011; Nakabayashi et al., 2014). Such studies, while by         INTEGRATING METABOLITE AND PROTEOME/
nature descriptive, can afford insight into global met-         ENZYME ACTIVITY DATA
abolic variations under certain conditions as well as
identify which pathways are under tight and which are              Less commonly used to date than combined tran-
under loose transcriptional control. Given the highly           scriptome and metabolome analyses are combined
interconnected nature and nonlinearity of metabolic             proteome and metabolome analyses. They are addi-
pathways in the global network structure, and even in           tionally largely used in a manner analogous to the more
the absence of flux profiling data, the integration of            descriptive studies reviewed above. That said, consid-
transcriptomics with wide metabolic profiling can, in            erable insight into metabolic network structure as well
any case, narrow down which metabolic steps could               as into general aspects of metabolic regulation have
Plant Physiol. Vol. 169, 2015                                                                                           1501
Tohge et al.

                                                                                                                                                              Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
         Figure 1. Current model of flavonol/anthocyanin biosynthesis in Arabidopsis. Colors are as follows: blue, early biosynthetic genes;
         green, flavonol-specific biosynthetic genes; and purple, anthocyanin-specific biosynthetic genes. CHS, Chalcone synthase, At5g13930; CHI,
         chalcone isomerase, At3g55120; CHIL1, At5g05270; F3H, flavanone-3-hydroxylase, At3g51240; F39H, flavonoid 39-hydroxylase, At5g07990;
         DFR, dihydroflavonol reductase, At5g42800; ANS, anthocyanidin synthase, At4g22880; F3GlcT, flavonoid-3-O-glucosyltransferase,
         UGT78D2, At5g17050; A5GlcT, anthocyanin-5-O-glucosyltransferase, UGT75C1, At4g14090; A3Glc299XylT, anthocyanin-3-O-
         glucoside-299-O-xylosyltransferase, UGT79B1, At5g54060; A5Glc69999MalT, anthocyanin-5-O-glucoside-69999-O-malonyltransferase,
         At3g29590; A3Glc699pCouT, anthocyanin-3-O-glucoside-699-O-p-coumaroyltransferase, At1g03940, At1g03495; A3Glc299XylSinT,
         anthocyanin-3-O-(299-O-xylosyl)-glucoside-6999-O-sinapoyltransferase, At2g23000; A3Glc699pCouT, anthocyanin-3-O-(699-
         O-coumaroylglucoside-O-glucosyltransferase, At4g27830; FLS1, flavonol synthase, At5g08640; F3RhaT, flavonol-3-O-rhamnosyltransferase,
         UGT78D1, At1g30530; F3AraT, flavonol-3-O-arabinosyltransferase, UGT78D3, At5g17030; F7RhaT, flavonol 7-O-rhamnosyltransferase,
         UGT89C1, At1g06000; F7GlcT, flavonol 7-O-glucosyltransferase, UGT73C6, At2g36790; OMT1, O-methyltransferase, At5g54160.

been gained in this manner. Here, we will describe eight                         parallel to enzyme data (and transcriptomics data)
studies that illustrate how the integration of proteomic                         across varying diurnal cycles in wild-type and a
and metabolomic data sets has been used to inform                                starchless mutant of Arabidopsis, revealing that rapid
our understanding of systems regulation. In the first                             changes in transcripts are integrated over time to gen-
of these examples, metabolite data were studied in                               erate essentially stable changes in many sectors of
1502                                                                                                                          Plant Physiol. Vol. 169, 2015
Integrative Studies of Metabolism

metabolism (Gibon et al., 2006). The same group went           Tohge et al., 2014); we contend here that an additional
on to apply this approach to tomato fruit development          reason to explain this (partial) lack of concordance in
and natural variance in Arabidopsis. In tomato, enzyme         the integrative approaches involving metabolism could
profiles were sufficiently characteristic to allow stages        lie within the incomplete annotation of most genomes,
of development and cultivars and the wild species to be        including those of model organisms. However, we be-
distinguished, but comparison of enzyme activity and           lieve the most likely reason to be the lack of linear
metabolites revealed remarkably little connectivity            relationship between genes, their protein products, and
between the developmental changes of enzyme and                metabolites and, secondly, the fact that most genomes,
metabolite levels, suggesting the operation of post-           even those of model organisms, remain incompletely
translational modification mechanisms (Steinhauser              annotated. Despite this serious drawback, we hope
et al., 2010). In Arabidopsis, they documented highly          to illustrate in this section that the integration of
coordinated changes between enzyme activities, par-            metabolomics and genomic data can be incredibly

                                                                                                                               Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
ticularly within those of the Calvin-Benson cycle, as          powerful in understanding natural variation in metab-
well as significant correlations in specific metabolite          olism and its regulation.
pairs and between starch and growth. On the other                 Whole-genome sequences are available for more than
hand, few correlations, and thus low overall connec-           100 plant species (including microalgae; Tohge et al.,
tivity, were observed between enzyme activities and            2014); this massive acceleration afforded by next-
metabolite levels (Sulpice et al., 2010), but strong links     generation technologies cannot currently be matched
were seen between starch levels and growth, which we           by metabolomics, especially if high-quality species-
describe below. In an alternative approach, proteomic          optimized approaches are adopted (Fukushima et al.,
and metabolic data were used merely to extend the              2014). The KNApSAcK database, which is one of the
range of molecular entities in order to demonstrate that       largest curated compendia of phytochemicals, contains
fascicular and extrafascicular phloem are isolated from        over 700 compounds for early sequenced plants like
one another and divergent in function (Zhang et al.,           Arabidopsis and rice (Oryza sativa) but no entries for
2010). A similar approach was taken to identify root as        recently sequenced species such as goatgrass (Aegilops
the major organ involved in alkaloid biosynthesis in           tauschii) and wild tobacco (Nicotiana tomentosiformis).
Macleaya spp. (Zeng et al., 2013). Three further studies       In this section, we will describe insight gained from
of note are more similar to that of Gibon et al. (2006) in     combining metabolomic data with genome sequences
that they use a combination of proteomics and meta-            in three different case studies: (1) a simple comparison
bolomics as a means to define the complex response of           of a reference genome with metabolomics data; (2) a
the cell to varying circumstances, be they iron nutrition      comparison of natural allelic and metabolic variance;
in Arabidopsis (Sudre et al., 2013), the drought re-           and (3) integrating genome sequence data into quanti-
sponse in maize xylem (Alvarez et al., 2008), or heat          tative genetics approaches. The first of these has been
stress acclimation in the model alga Chlamydomonas             covered in considerable detail recently (Fukushima
reinhardtii (Hemme et al., 2014). The fact that many of        et al., 2014; Tohge et al., 2014), so we will only briefly
these studies were published in the last 2 years reflects       describe it here. The starting point is to perform
the growing uptake of such strategies. That said, in our       genome-wide ortholog searches using functionally an-
opinion, it remains an underexploited research ap-             notated genes; best practice is to use cross-species
proach to date.                                                cluster-based BLAST searches such as those housed in
                                                               the PLAZA database (Proost et al., 2009) or, in the case of
                                                               photosynthetic microbes, pico-PLAZA (Vandepoele et al.,
INTEGRATING METABOLITE AND GENOME DATA
                                                               2013). Illustrations of how such analyses have been
                                                               performed for central, shikimate, phenylpropanoid,
   Given that the advent of metabolomics more or less          terpenoid, alkaloid, and glucosinolate metabolism have
paralleled the release of the first plant genome, the           been presented (Hofberger et al., 2013; Tohge et al.,
integration of metabolomics and whole-genome se-               2013a, 2013b, 2014; Cavalcanti et al., 2014; Boutanaev
quence data is perhaps unsurprising. The true potential        et al., 2015). Thereafter, comparison of these gene in-
of this approach has been realized only within the last        ventories with metabolite profiles of the species under
few years; we will not describe it again in detail, given      evaluation allows the construction of putative meta-
that it is discussed in a previous correspondence in           bolic pathway structures that can be further tested via
Plant Physiology (Fernie and Stitt, 2012). Suffice it to say,   reverse genetics or heterologous expression, as de-
there are considerable complexities in such combina-           scribed in “Integrating Metabolite and Transcript Data”
tions; tellingly, early studies aimed at computational         above. Important insights into pathway evolution can
prediction of the size of the Escherichia coli metabolome      be gained from such approaches, as illustrated by the
estimated a complement of approximately 750 metab-             recent cross-kingdom comparison of ascorbate biosyn-
olites, while subsequent experimental approaches have          thesis (Wheeler et al., 2015).
revealed many metabolites that were not computed                  The second case study, that of evaluating allelic and
from the genome (van der Werf et al., 2007). Several           metabolic variance across natural diversity, is similar in
potential reasons could be put forward to explain this         scope yet far more targeted than genome-wide associ-
discrepancy (for review, see Fernie and Stitt, 2012;           ation studies, which we describe below. The majority of
Plant Physiol. Vol. 169, 2015                                                                                           1503
Tohge et al.

recent examples of its utility come from the analysis of     availability of the sequences of both parental genomes
wild species tomato; however, it is important to note        (Bolger et al., 2014) narrowed down the origin of the
that the approach itself is essentially just a modification   metabolic variation to specific genetic polymorphisms
of that adopted over decades in the cloning of natural       in some selected metabolic quantitative trait loci
color mutants (Fernie and Klee, 2011). In the last few       (Quadrana et al., 2014; Alseekh et al., 2015). The inte-
years, understanding of primary as well as secondary         gration between genotypic and metabolic variance can
and cuticular cell wall metabolism has been enhanced         be, and has actually been applied, also on large col-
considerably via this approach (Schauer et al., 2005;        lections of unrelated individuals (metabolite-based
Matas et al., 2011; Kim et al., 2012, 2014; Koenig et al.,   genome-wide association studies): as in the case of
2013), albeit the greatest insight into the latter was       biparental populations, also with this strategy, several
ultimately elucidated via the use of an introgression line   cases of polymorphological variants of genomic se-
population, as described below. In essence, this ap-         quences have been identified and related to metabolic

                                                                                                                                Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
proach starts with the identification of metabolic vari-      variation. These two approaches, based either on bi-
ance within a population of ecotypes, cultivars, or          parental populations or on large collections of natural
similarly related species and attempts to link this with     accessions, have been used in Arabidopsis and crop
allelic diversity or gene duplication, as has been           species (maize, rice, wheat [Triticum aestivum], and fruit
achieved for acyl-sugar metabolites (Schilmiller et al.,     trees; Gong et al., 2013; Li et al., 2013; Wen et al., 2014;
2015), terpenes (Matsuba et al., 2013), and isoprenoids      Matsuda et al., 2015; for review, see Luo, 2015). The
(Kang et al., 2014), or even with the presence or absence    boon that new sequences will provide, especially from
of genes, as described recently for methylated flavo-         wild relatives or locally adapted varieties, will be rep-
noids of glandular trichomes (Kim et al., 2014). The         resented by the possibility to dissect the genetic basis of
preceding list documents the success of this approach;       metabolite variation, with a view to introgress benefi-
until recently, however, it was constrained by the limits    cial traits in crop improvement.
of our a priori knowledge, which is needed in order to
select the candidate genes in which we search for allelic
variance. The development of RNA sequencing tech-
                                                             INTEGRATING METABOLITE AND
nologies means that we are no longer limited by the
                                                             PHYSIOLOGICAL DATA
amount of sequence data; a potential hurdle to these
integrative approaches, however, can still be present           While the above examples concentrate on the inte-
when comparing highly genetically divergent individ-         gration of various types of profiling data with one
uals, since the number of genetic polymorphisms is           another in order to advance our understanding of
too great to evaluate one by one. For this reason, the       metabolic pathway structure and/or metabolic regu-
quantitative trait loci approach is a powerful alternative   lation, relatively few studies have attempted to cor-
method of associating phenotypes to their underlying         relate metabolite content with physiological data,
genetic variance. The use of such approaches in plant        including growth and yield (for review, see Stitt et al.,
metabolism has been the subject of several recent            2010; Carreno-Quintero et al., 2013). One of the earliest
comprehensive reviews (Kliebenstein, 2009; Scossa            studies to do so was the above-described metabolic
et al., 2015); however, we will provide a couple of          quantitative trait loci analysis of the S. pennellii intro-
examples of their utility for advancing the under-           gression lines, in which yield-associated plant traits
standing of metabolite accumulation and metabolic            were measured alongside primary metabolite content
regulation.                                                  of the fruit (Schauer et al., 2006). In this study, network
  The fruit of tomato, as the model species for ripen-       analysis based on cartographic modeling algorithms
ing of fleshy fruits, has been the subject of combined        developed by Guimerà and Nunes Amaral (2005)
large-scale genomic, physiological, and metabolic in-        identified that yield-associated traits were positively
vestigations, often making use of specific biparental         correlated to a range of previously defined signal me-
populations or large sets of unrelated individuals, in       tabolites, compounds that have signaling as well as
an attempt to understand the causal variants of the          metabolic functions, including Suc, hexose, and inositol
metabolic variations (Schauer et al., 2005; Lin et al.,      phosphates, Pro, and g-aminobutyrate. In addition, this
2014; Sauvage et al., 2014). In particular, the use of a     study indicated that the harvest index (i.e. the ratio of
population of introgression lines, obtained from the         harvestable product to total biomass) negatively cor-
cross between tomato and Solanum pennellii (a wild           related with the content of the vast majority of amino
tomato species), has greatly aided the identification of      acids. This relationship was confirmed in an indepen-
quantitative trait loci for a large number of physio-        dent population and following experiments that artifi-
logical and metabolic traits. Profiling data of primary       cially altered the fruit load per truss (Do et al., 2010).
and secondary metabolites in this population were            However, as would perhaps be anticipated, subsequent
collected over several years (along with some classi-        evaluation of the relationship between growth and
cal yield-related traits), revealing more than 1,500         secondary metabolite content revealed far less correla-
metabolic quantitative trait loci affecting the levels       tion (Alseekh et al., 2015). Using essentially the same
of several sugars, amino acids, organic acids, vita-         approach in an Arabidopsis recombinant inbred line
mins, phenylpropanoids, and glycoalkaloids. The              population, Meyer et al. (2007) found that, although no
1504                                                                                            Plant Physiol. Vol. 169, 2015
Integrative Studies of Metabolism

single metabolite exhibited a very high correlation with      natural environments, the accumulation of capsaicinoids
biomass, canonical correlation analysis in which the          in populations of Capsicum chacoense (a wild pepper
data of a linear combination of metabolites allowed the       species) is inversely correlated with seed set; these
improvement of this correlation by a factor of 10, thus       metabolites, however, have a defensive role in highly
defined a metabolic signature of growth. Intriguingly,         humid environments, where their accumulation deters
the hexose phosphates Glc-6-P and Fru-6-P as well as          the attack of phytopathogenic fungi. Across a geo-
Suc were among the 20 top metabolites contributing            graphical gradient of decreasing rainfall (with a grad-
to this signature. When similar approaches were ap-           ual decreasing pressure of the pathogens, which thrive
plied to maize, strong genome-wide association links          only in humid environments), the accumulation of
were found between coumaric and caffeic acids and             capsaicinoids also decreases in Capsicum spp. popula-
cinnamoyl-CoA reductase, while these precursors also          tions, while seed set, on the other hand, increases. This
significantly correlated with lignin content plant height      study is an example of the combination of targeted

                                                                                                                              Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
and dry matter yield, presenting another example of the       metabolic approaches with population ecology in dis-
narrowing of the genotype-phenotype gap of complex            secting the basis of natural polymorphic traits (Haak
agronomic traits (Riedelsheimer et al., 2012). The same       et al., 2012). Further studies that address the concept of
group revealed that models based on data obtained for         metabolism and growth tradeoffs have used reciprocal
130 metabolites gave highly accurate predictions of           crosses to assess the contribution of the organellar ge-
agronomic traits and suggested that combined metab-           nome to the processes and came to the conclusion that
olite, genomic, and agronomic phenotyping represents          there is far greater diversity in defense chemistry than
an important screening tool for the identification of          primary metabolism (Joseph et al., 2013, 2015). The in-
parental lines for the creation of superior hybrid crops      terrogation of such tradeoffs is only possible via the
(Riedelsheimer et al., 2012).                                 integrated approach described here and appears to be
   Returning to Arabidopsis, evaluation of the variation      very powerful; as such, we would expect considerable
of growth, metabolite levels, and enzyme activities was       advances in our understanding of this phenomenon to
also carried out across 94 accessions, revealing that         be gained following its application. Not just the last
biomass correlated negatively with many metabolites,          three studies but all of the above studies have been
including starch and protein and to a much lesser extent      published within the last 6 years, reflecting the fact
Suc (Sulpice et al., 2009). However, further experiments      that such analyses are in their infancy. Given the
in which 97 accessions grown in near-optimal carbon           recognized complexity of the metabolism-to-growth
and nitrogen supply, restricted carbon supply, and re-        interactions, a comprehensive understanding of the
stricted nitrogen supply and analyzed for biomass and         intricate networks that coordinate this interface is
54 metabolic traits revealed that robust prediction of        likely some time off. That said, as the above examples
phenotypic traits (biomass, starch, and protein) is most      illustrate, the integration of growth data into metab-
effective (and reliable) when metabolite data (upon           olite profiling data as well as that of simpler physio-
which predictions are based) are collected from the           logical processes such as photosynthetic or respiratory
same growth environments (Meyer et al., 2007; Sulpice         rates (Florez-Sarasa et al., 2012) has already presented
et al., 2009; Korn et al., 2010; Steinfath et al., 2010).     a number of key findings.
Clearly, attempting to predict biomass, for example,
from metabolic profiles collected in a different growth
environment generally yields fewer (and weaker) cor-          POSTGENOMIC INTEGRATION OF DATABASE-
relations (Sulpice et al., 2013). Therefore, the prediction   HOUSED RESEARCH WITH NOVEL EXPERIMENTS
of biomass across a range of conditions would bet-
ter require condition-specific measurement of meta-               The examples described above rely on the integration
bolic traits to take account of environment-dependent         of data obtained in parallel using different experimen-
changes of the underlying networks (Sulpice et al.,           tal approaches. While such approaches are ideal for
2013). Data from this study were subsequently ana-            addressing a number of questions, particularly those
lyzed with respect to the tradeoffs between metabolism        concerning the temporal aspects underlying dynamic
and growth, specifically comparing increasing size with        responses to a systems perturbation, the integration
increasing protein concentration, demonstrating that          of novel experimental data with different types of
accessions with high metabolic efficiency lie closer to        archived data can also prove highly informative, pro-
the Pareto performance frontier (the optimal solution         viding an appropriate amount of caution is used in
for the two contending tasks) and hence exhibit in-           interpreting the results. Here, we will provide several
creased metabolic plasticity (Kleessen et al., 2014). A       examples illustrative of such approaches, which largely
related study addressing an ecological tradeoff between       fit into two major types of approaches: (1) those using
secondary metabolism and fitness relates to the accu-          correlative approaches and (2) those using genome-
mulation of capsaicinoids in the placenta of pepper           scale stoichiometric models. The first study we will
fruits (Capsicum spp.). Capsaicinoids constitute a class      describe fits into the former category, being an attempt
of vanillylamides derived from Phe; they accumulate           to define the storage metabolome of the vacuole (Tohge
in ripening pepper fruit and are responsible for the          et al., 2011; Fig. 2). In this research, a combination of
pungency sensation occurring upon ingestion. In               gas chromatography-mass spectrometry and Fourier
Plant Physiol. Vol. 169, 2015                                                                                          1505
Tohge et al.

                                                                        previously described transporter proteins as well as to
                                                                        highlight the dynamic nature of the storage metabolome.
                                                                        The coexpression approach has also been combined
                                                                        with metabolic profiling in the annotation of plasma
                                                                        membrane lignin and plastidial glycolate/glycerate
                                                                        and bile acid transporters (Gigolashvili et al., 2009;
                                                                        Sawada et al., 2009; Alejandro et al., 2012) as well as
                                                                        a multitude of cell wall-associated proteins (Persson
                                                                        et al., 2005). Moreover, this approach has also been used
                                                                        to identify process, as opposed to pathway-specific, pro-
                                                                        teins, identifying proteins involved in dark-induced
                                                                        senescence (Araújo et al., 2011) and in the response to

                                                                                                                                         Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
                                                                        UV-B irradiance (Kusano et al., 2011).
                                                                           The other type of examples we would like to discuss
                                                                        are based on the integration of transcriptomic and
                                                                        metabolomics level genome-scale models (Töpfer et al.,
                                                                        2014). In the first of these studies, microarray data from
                                                                        Arabidopsis exposed to eight different light and tem-
                                                                        perature conditions (data published in Caldana et al.,
                                                                        2011) were integrated into a genome-scale model
                                                                        (Mintz-Oron et al., 2012). Before discussing the out-
                                                                        come of this integration, we first digress to provide a
                                                                        brief description of how genome-scale models are
Figure 2. Schematic overview of an integrative approach using me-       generated. Essentially, a genome-scale model corre-
tabolite profiling of storage metabolite and membrane proteomic data.   sponds metabolic genes with metabolic pathways in a
Example of network: barley vacuole network from Tohge et al. (2011).
                                                                        manner whereby a stoichiometrically balanced meta-
                                                                        bolic network is generated, which corresponds to all
                                                                        gene functions annotated for that organism. Such
transform mass spectrometry was used to detect and                      models were originally published for microbes at the
quantify some 59 primary metabolites and 200 sec-                       turn of the century (Edwards and Palsson, 2000), with
ondary metabolites (defined on the basis of strong                       many models for plants species being subsequently
chemical formulae predictions) in either silicon oil-                   generated, including the model species Arabidopsis as
purified barley (Hordeum vulgare) vacuoles or the pro-                   well as crop species such as rice and maize (for review,
toplasts from which these were derived. Of the 259                      see Simons et al., 2014). Returning to the superimposi-
putative metabolites, 12 were exclusively detected in                   tion of experimental data on the model, the addition of
the vacuole, 34 were exclusively in the protoplast, and                 transcriptomic data was able to predict flux capacities
213 were common to both samples. At the quantitative                    and statistically assess whether these vary under the
level, the difference between vacuole and protoplast                    experimental conditions tested. Moreover, this study
was yet more striking, with secondary metabolites be-                   introduced the concepts of metabolic sustainers and
ing differentially abundant between the two sample                      modulators, with the former being metabolic functions
types. As a next step to predict the underlying cytosolic-              that are differentially up-regulated with respect to the
vacuolar transporters, tonoplast proteins predicted to                  null model whereas the latter are differentially down-
have a transport function were evaluated within the                     regulated in order to control a certain flux and, there-
context of the metabolic profiling data. Specifically, 88                 fore, modulate affected processes (Töpfer et al., 2013).
proteins reported to be tonoplast proteins in barley                    In a follow-up study, predictions made from the inte-
(Endler et al., 2006) were evaluated after conversion to                gration of transcriptomics were complemented with
Affymetrix probe identifiers and coexpression analysis                   metabolomics data from the same experiment. In doing
of the resultant 128 probe sets was carried out using                   so, the authors were able to bridge flux-centric and
PlaNet for barley (Mutwil et al., 2011; http://aranet.                  metabolomics-centric approaches and, in so doing,
mpimp-golm.mpg.de). Coexpressed networks of these                       demonstrate that, under certain conditions, metabolites
probes separated into 13 subgroups, with the most                       serving as pathway substrates in pathways defined as
dense cluster being highly correlated with aromatic                     either modulators or sustainers display lower temporal
amino acid-related genes and the second most dense                      variation with respect to all other metabolites (Töpfer
cluster including several vacuolar ATP synthase pro-                    et al., 2013). These findings are thus in concordance
teins and tricarboxylic acid cycle-related genes. In                    with theories of network rigidity and pathway robust-
addition, clear associations were found between the                     ness (Stephanopoulos and Vallino, 1991; Rontein et al.,
expression of transport proteins and that of pathways                   2002; Williams et al., 2008). Furthermore, considerable
of flavonoid and mugineic acid synthesis as well                         evidence suggests that the levels of specific metabolites,
as storage protein functions (Tohge et al., 2011). This                 such as Ala, pyruvate, 2-oxoglutarate, Gln, and sper-
study was thus able to putatively assign function to                    midine, are exceptionally stable across a massive range
1506                                                                                                     Plant Physiol. Vol. 169, 2015
Integrative Studies of Metabolism

of cellular circumstances (Geigenberger, 2003; Stitt                          Alejandro S, Lee Y, Tohge T, Sudre D, Osorio S, Park J, Bovet L, Lee Y,
                                                                                 Geldner N, Fernie AR, et al (2012) AtABCG29 is a monolignol trans-
and Fernie, 2003). They also are in keeping with                                 porter involved in lignin biosynthesis. Curr Biol 22: 1207–1212
observations that the levels of metabolites such as Ser                       Alseekh S, Tohge T, Wendenberg R, Scossa F, Omranian N, Li J, Kleessen
coordinately control the levels of expression of genes                           S, Giavalisco P, Pleban T, Mueller-Roeber B, et al (2015) Identification
encoding multiple steps of the pathways to which they,                           and mode of inheritance of quantitative trait loci for secondary metab-
themselves, belong (Timm et al., 2013). The high sta-                            olite abundance in tomato. Plant Cell 27: 485–512
                                                                              Alvarez S, Marsh EL, Schroeder SG, Schachtman DP (2008) Metabolomic
bility of these metabolites is in keeping with their re-                         and proteomic changes in the xylem sap of maize under drought. Plant
quirement across a range of different stresses. It also                          Cell Environ 31: 325–340
highlights the fact that the robust metabolites may well                      Araújo WL, Ishizaki K, Nunes-Nesi A, Larson TR, Tohge T, Krahnert I,
be the most biologically relevant for metabolic regula-                          Witt S, Obata T, Schauer N, Graham IA, et al (2010) Identification of
                                                                                 the 2-hydroxyglutarate and isovaleryl-CoA dehydrogenases as alterna-
tion; this is an important point, since it is at odds with                       tive electron donors linking lysine catabolism to the electron transport
the manner in which the majority of the metabolomics                             chain of Arabidopsis mitochondria. Plant Cell 22: 1549–1563

                                                                                                                                                              Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
community assesses their data. This observation addi-                         Araújo WL, Ishizaki K, Nunes-Nesi A, Tohge T, Larson TR, Krahnert I,
tionally highlights the potential difficulties and chal-                          Balbo I, Witt S, Dörmann P, Graham IA, et al (2011) Analysis of a range
lenges in interpreting data from a single level of the                           of catabolic mutants provides evidence that phytanoyl-coenzyme A
                                                                                 does not act as a substrate of the electron-transfer flavoprotein/electron-
cellular hierarchy and thus provides further grounds                             transfer flavoprotein:ubiquinone oxidoreductase complex in Arabidopsis
for integrated models.                                                           during dark-induced senescence. Plant Physiol 157: 55–69
                                                                              Bartling D, Seedorf M, Schmidt RC, Weiler EW (1994) Molecular char-
                                                                                 acterization of two cloned nitrilases from Arabidopsis thaliana: key
                                                                                 enzymes in biosynthesis of the plant hormone indole-3-acetic acid. Proc
CURRENT AND FUTURE CHALLENGES IN
                                                                                 Natl Acad Sci USA 91: 6021–6025
DATA INTEGRATION                                                              Batushansky A, Kirma M, Grillich N, Toubiana D, Pham PA, Balbo I,
                                                                                 Fromm H, Galili G, Fernie AR, Fait A (2014) Combined transcriptomics
   The above sections document that integrative ap-                              and metabolomics of Arabidopsis thaliana seedlings exposed to exoge-
proaches to further our understanding of metabolism                              nous GABA suggest its role in plants is predominantly metabolic. Mol
have proven very successful over the last decade or so,                          Plant 7: 1065–1068
particularly when linked with genetic and/or envi-                            Bielecka M, Watanabe M, Morcuende R, Scheible WR, Hawkesford MJ,
                                                                                 Hesse H, Hoefgen R (2014) Transcriptome and metabolome analysis of
ronmental experiments. To date, the approaches taken                             plant sulfate starvation and resupply provides novel information on
have been relatively straightforward and have gener-                             transcriptional regulation of metabolism associated with sulfur, nitro-
ally not been performed at a high level of spatial reso-                         gen and phosphorus nutritional responses in Arabidopsis. Front Plant
lution. Several methods currently exist to obtain data                           Sci 5: 805
from all of the methods described here at the tissue,                         Bolger A, Scossa F, Bolger ME, Lanz C, Maumus F, Tohge T, Quesneville
                                                                                 H, Alseekh S, Sørensen I, Lichtenstein G, et al (2014) The genome of
cellular, and even subcellular levels (Aharoni and                               the stress-tolerant wild tomato species Solanum pennellii. Nat Genet 46:
Brandizzi, 2012); while still technically challenging, it                        1034–1038
seems conceivable that such methods could provide                             Boutanaev AM, Moses T, Zi J, Nelson DR, Mugford ST, Peters RJ,
data required to better understand the cell specializa-                          Osbourn A (2015) Investigation of terpene diversification across mul-
                                                                                 tiple sequenced plant genomes. Proc Natl Acad Sci USA 112: E81–E88
tion of metabolism. In addition, methods to gain accu-                        Caldana C, Degenkolbe T, Cuadros-Inostroza A, Klie S, Sulpice R, Leisse
rate metabolic flux estimates following 13CO2 labeling                            A, Steinhauser D, Fernie AR, Willmitzer L, Hannah MA (2011) High-
have recently been established (Young et al., 2008;                              density kinetic analysis of the metabolomic and transcriptomic response
Szecowka et al., 2013; Ma et al., 2014) but are not yet                          of Arabidopsis to eight environmental conditions. Plant J 67: 869–884
fully integrated with protein or transcript data. How-                        Carrari F, Baxter C, Usadel B, Urbanczyk-Wochniak E, Zanor MI, Nunes-
                                                                                 Nesi A, Nikiforova V, Centero D, Ratzka A, Pauly M, et al (2006)
ever, it is important to note that such experiments, al-                         Integrated analysis of metabolite and transcript levels reveals the
beit using [13C]Glc as a precursor, have already been                            metabolic shifts that underlie tomato fruit development and highlight
carried out in in vitro-cultivated Brassica napus em-                            regulatory aspects of metabolic network behavior. Plant Physiol 142:
bryos, providing considerable insight into the systems-                          1380–1396
                                                                              Carreno-Quintero N, Bouwmeester HJ, Keurentjes JJB (2013) Genetic
level regulation of this organ (Schwender et al., 2015). It                      analysis of metabolome-phenotype interactions: from model to crop
additionally seems highly likely that future research                            species. Trends Genet 29: 41–50
will draw more heavily on archived genomics data than                         Cavalcanti JH, Esteves-Ferreira AA, Quinhones CG, Pereira-Lima IA,
it has to date; thus, the continued availability and                             Nunes-Nesi A, Fernie AR, Araújo WL (2014) Evolution and functional
quality-control curation of such data sets are imperative                        implications of the tricarboxylic acid cycle as revealed by phylogenetic
                                                                                 analysis. Genome Biol Evol 6: 2830–2848
if we are going to fully exploit their value.                                 Centeno DC, Osorio S, Nunes-Nesi A, Bertolo ALF, Carneiro RT, Araújo
Received July 7, 2015; accepted September 10, 2015; published September 14,      WL, Steinhauser MC, Michalska J, Rohrmann J, Geigenberger P, et al
2015.                                                                            (2011) Malate plays a crucial role in starch metabolism, ripening, and
                                                                                 soluble solid content of tomato fruit and affects postharvest softening.
                                                                                 Plant Cell 23: 162–184
LITERATURE CITED                                                              Conklin PL, Gatzek S, Wheeler GL, Dowdle J, Raymond MJ, Rolinski S,
                                                                                 Isupov M, Littlechild JA, Smirnoff N (2006) Arabidopsis thaliana VTC4
Achnine L, Huhman DV, Farag MA, Sumner LW, Blount JW, Dixon RA                   encodes L-galactose-1-P phosphatase, a plant ascorbic acid biosynthetic
  (2005) Genomics-based selection and functional characterization of             enzyme. J Biol Chem 281: 15662–15670
  triterpene glycosyltransferases from the model legume Medicago              Curry J, Aluru M, Mendoza M, Nevarez J, Melendrez M, O’Connell MA
  truncatula. Plant J 41: 875–887                                                (1999) Transcripts for possible capsaicinoid biosynthetic genes are dif-
Aharoni A, Brandizzi F (2012) High-resolution measurements in plant bi-          ferentially accumulated in pungent and non-pungent Capsicum spp.
  ology. Plant J 70: 1–4                                                         Plant Sci 148: 47–57

Plant Physiol. Vol. 169, 2015                                                                                                                         1507
Tohge et al.

Dal Cin V, Tieman DM, Tohge T, McQuinn R, de Vos RCH, Osorio S,                 Góngora-Castillo E, Childs KL, Fedewa G, Hamilton JP, Liscombe DK,
   Schmelz EA, Taylor MG, Smits-Kroon MT, Schuurink RC, et al (2011)               Magallanes-Lundback M, Mandadi KK, Nims E, Runguphan W,
   Identification of genes in the phenylalanine metabolic pathway by ec-            Vaillancourt B, et al (2012) Development of transcriptomic resources for
   topic expression of a MYB transcription factor in tomato fruit. Plant Cell      interrogating the biosynthesis of monoterpene indole alkaloids in me-
   23: 2738–2753                                                                   dicinal plant species. PLoS One 7: e52506
Deboo GB, Albertsen MC, Taylor LP (1995) Flavanone 3-hydroxylase                Guimerà R, Nunes Amaral LA (2005) Functional cartography of complex
   transcripts and flavonol accumulation are temporally coordinate in               metabolic networks. Nature 433: 895–900
   maize anthers. Plant J 7: 703–713                                            Haak DC, McGinnis LA, Levey DJ, Tewksbury JJ (2012) Why are not all
Dixon RA, Strack D (2003) Phytochemistry meets genome analysis, and                chilies hot? A trade-off limits pungency. Proc Biol Sci 279: 2012–2017
   beyond. Phytochemistry 62: 815–816                                           Hannah MA, Caldana C, Steinhauser D, Balbo I, Fernie AR, Willmitzer L
Do PT, Prudent M, Sulpice R, Causse M, Fernie AR (2010) The influence of            (2010) Combined transcript and metabolite profiling of Arabidopsis
   fruit load on the tomato pericarp metabolome in a Solanum chmielewskii          grown under widely variant growth conditions facilitates the identifi-
   introgression line population. Plant Physiol 154: 1128–1142                     cation of novel metabolite-mediated regulation of gene expression. Plant
Dowdle J, Ishikawa T, Gatzek S, Rolinski S, Smirnoff N (2007) Two genes            Physiol 152: 2120–2129
   in Arabidopsis thaliana encoding GDP-L-galactose phosphorylase are           Hemme D, Veyel D, Mühlhaus T, Sommer F, Jüppner J, Unger AK,

                                                                                                                                                               Downloaded from https://academic.oup.com/plphys/article/169/3/1499/6113889 by guest on 28 January 2022
   required for ascorbate biosynthesis and seedling viability. Plant J 52:         Sandmann M, Fehrle I, Schönfelder S, Steup M, et al (2014) Systems-
   673–689                                                                         wide analysis of acclimation responses to long-term heat stress and re-
Eastmond PJ, Astley HM, Parsley K, Aubry S, Williams BP, Menard GN,                covery in the photosynthetic model organism Chlamydomonas reinhardtii.
   Craddock CP, Nunes-Nesi A, Fernie AR, Hibberd JM (2015) Arabi-                  Plant Cell 26: 4270–4297
   dopsis uses two gluconeogenic gateways for organic acids to fuel             Hill AT, Ying S, Plaxton WC (2014) Phosphorylation of bacterial-type
   seedling establishment. Nat Commun 6: 6659                                      phosphoenolpyruvate carboxylase by a Ca2+-dependent protein kinase
Edwards JS, Palsson BO (2000) The Escherichia coli MG1655 in silico                suggests a link between Ca2+ signalling and anaplerotic pathway control
   metabolic genotype: its definition, characteristics, and capabilities. Proc      in developing castor oil seeds. Biochem J 458: 109–118
   Natl Acad Sci USA 97: 5528–5533                                              Hirai MY, Sugiyama K, Sawada Y, Tohge T, Obayashi T, Suzuki A, Araki
Endler A, Meyer S, Schelbert S, Schneider T, Weschke W, Peters SW,                 R, Sakurai N, Suzuki H, Aoki K, et al (2007) Omics-based identification
   Keller F, Baginsky S, Martinoia E, Schmidt UG (2006) Identification of           of Arabidopsis Myb transcription factors regulating aliphatic gluco-
   a vacuolar sucrose transporter in barley and Arabidopsis mesophyll              sinolate biosynthesis. Proc Natl Acad Sci USA 104: 6478–6483
   cells by a tonoplast proteomic approach. Plant Physiol 141: 196–207          Hirai MY, Yano M, Goodenowe DB, Kanaya S, Kimura T, Awazuhara M,
Fernie AR (2007) The future of metabolic phytochemistry: larger numbers            Arita M, Fujiwara T, Saito K (2004) Integration of transcriptomics and
   of metabolites, higher resolution, greater understanding. Phytochemis-          metabolomics for understanding of global responses to nutritional
   try 68: 2861–2880                                                               stresses in Arabidopsis thaliana. Proc Natl Acad Sci USA 101: 10205–
Fernie AR, Klee HJ (2011) The use of natural genetic diversity in the un-          10210
   derstanding of metabolic organization and regulation. Front Plant Sci 2:     Hofberger JA, Lyons E, Edger PP, Pires JC, Schranz ME (2013) Whole
   59                                                                              genome and tandem duplicate retention facilitated glucosinolate path-
Fernie AR, Stitt M (2012) On the discordance of metabolomics with pro-             way diversification in the mustard family. Genome Biol Evol 5: 2155–
   teomics and transcriptomics: coping with increasing complexity in logic,        2173
   chemistry, and network interactions. Plant Physiol 158: 1139–1145            Jander G, Norris SR, Joshi V, Fraga M, Rugg A, Yu S, Li L, Last RL (2004)
Fernie AR, Trethewey RN, Krotzky AJ, Willmitzer L (2004) Metabolite                Application of a high-throughput HPLC-MS/MS assay to Arabidopsis
   profiling: from diagnostics to systems biology. Nat Rev Mol Cell Biol 5:         mutant screening; evidence that threonine aldolase plays a role in seed
   763–769                                                                         nutritional quality. Plant J 39: 465–475
Florez-Sarasa I, Araújo WL, Wallström SV, Rasmusson AG, Fernie AR,              Joseph B, Corwin JA, Kliebenstein DJ (2015) Genetic variation in the
   Ribas-Carbo M (2012) Light-responsive metabolite and transcript levels          nuclear and organellar genomes modulates stochastic variation in the
   are maintained following a dark-adaptation period in leaves of Arabi-           metabolome, growth, and defense. PLoS Genet 11: e1004779
   dopsis thaliana. New Phytol 195: 136–148                                     Joseph B, Corwin JA, Züst T, Li B, Iravani M, Schaepman-Strub G,
Fukushima A, Kanaya S, Nishida K (2014) Integrated network analysis                Turnbull LA, Kliebenstein DJ (2013) Hierarchical nuclear and cyto-
   and effective tools in plant systems biology. Front Plant Sci 5: 598            plasmic genetic architectures for plant growth and defense within Ara-
Futuyma DJ, Agrawal AA (2009) Macroevolution and the biological di-                bidopsis. Plant Cell 25: 1929–1945
   versity of plants and herbivores. Proc Natl Acad Sci USA 106: 18054–         Kang JH, Gonzales-Vigil E, Matsuba Y, Pichersky E, Barry CS (2014)
   18061                                                                           Determination of residues responsible for substrate and product speci-
Gatzek S, Wheeler GL, Smirnoff N (2002) Antisense suppression of                   ficity of Solanum habrochaites short-chain cis-prenyltransferases. Plant
   L-galactose dehydrogenase in Arabidopsis thaliana provides evidence             Physiol 164: 80–91
   for its role in ascorbate synthesis and reveals light modulated              Kim J, Kang K, Gonzales-Vigil E, Shi F, Jones AD, Barry CS, Last RL
   L-galactose synthesis. Plant J 30: 541–553                                      (2012) Striking natural diversity in glandular trichome acylsugar com-
Gechev TS, Benina M, Obata T, Tohge T, Sujeeth N, Minkov I, Hille J,               position is shaped by variation at the Acyltransferase2 locus in the wild
   Temanni MR, Marriott AS, Bergström E, et al (2013) Molecular                    tomato Solanum habrochaites. Plant Physiol 160: 1854–1870
   mechanisms of desiccation tolerance in the resurrection glacial relic        Kim J, Matsuba Y, Ning J, Schilmiller AL, Hammar D, Jones AD,
   Haberlea rhodopensis. Cell Mol Life Sci 70: 689–709                             Pichersky E, Last RL (2014) Analysis of natural and induced variation in
Geigenberger P (2003) Response of plant metabolism to too little oxygen.           tomato glandular trichome flavonoids identifies a gene not present in
   Curr Opin Plant Biol 6: 247–256                                                 the reference genome. Plant Cell 26: 3272–3285
Gholami A, De Geyter N, Pollier J, Goormachtig S, Goossens A (2014)             Kleessen S, Laitinen R, Fusari CM, Antonio C, Sulpice R, Fernie AR, Stitt
   Natural product biosynthesis in Medicago species. Nat Prod Rep 31:              M, Nikoloski Z (2014) Metabolic efficiency underpins performance
   356–380                                                                         trade-offs in growth of Arabidopsis thaliana. Nat Commun 5: 3537
Gibon Y, Usadel B, Blaesing OE, Kamlage B, Hoehne M, Trethewey R,               Kliebenstein D (2009) Advancing genetic theory and application by met-
   Stitt M (2006) Integration of metabolite with transcript and enzyme             abolic quantitative trait loci analysis. Plant Cell 21: 1637–1646
   activity profiling during diurnal cycles in Arabidopsis rosettes. Genome      Kliebenstein DJ, Lambrix VM, Reichelt M, Gershenzon J, Mitchell-Olds
   Biol 7: R76                                                                     T (2001) Gene duplication in the diversification of secondary metabo-
Gigolashvili T, Yatusevich R, Rollwitz I, Humphry M, Gershenzon J,                 lism: tandem 2-oxoglutarate-dependent dioxygenases control gluco-
   Flügge UI (2009) The plastidic bile acid transporter 5 is required for          sinolate biosynthesis in Arabidopsis. Plant Cell 13: 681–693
   the biosynthesis of methionine-derived glucosinolates in Arabidopsis         Koenig D, Jiménez-Gómez JM, Kimura S, Fulop D, Chitwood DH,
   thaliana. Plant Cell 21: 1813–1829                                              Headland LR, Kumar R, Covington MF, Devisetty UK, Tat AV, et al
Gong L, Chen W, Gao Y, Liu X, Zhang H, Xu C, Yu S, Zhang Q, Luo J                  (2013) Comparative transcriptomics reveals patterns of selection in
   (2013) Genetic analysis of the metabolome exemplified using a rice               domesticated and wild tomato. Proc Natl Acad Sci USA 110: E2655–
   population. Proc Natl Acad Sci USA 110: 20320–20325                             E2662

1508                                                                                                                         Plant Physiol. Vol. 169, 2015
You can also read