A genome scan of diversifying selection in Ophiocordyceps zombie ant fungi suggests a role for enterotoxins in co evolution and host specificity

Page created by Henry Higgins
 
CONTINUE READING
A genome scan of diversifying selection in Ophiocordyceps zombie ant fungi suggests a role for enterotoxins in co evolution and host specificity
Received: 8 September 2017      |   Accepted: 13 July 2018

DOI: 10.1111/mec.14813

ORIGINAL ARTICLE

A genome scan of diversifying selection in Ophiocordyceps
zombie‐ant fungi suggests a role for enterotoxins in
co‐evolution and host specificity

Noppol Kobmoo1,2                        | Duangdao Wichadakul3,4 | Nuntanat Arnamnart2 |
Ricardo C. Rodríguez De La Vega1 | Janet J. Luangsa-ard2 | Tatiana Giraud1

1
  Ecologie Systématique Evolution,
Université Paris-Sud, CNRS, AgroParisTech,           Abstract
Université Paris-Saclay, Orsay, France               Identification of the genes underlying adaptation sheds light on the biological func-
2
 National Center for Genetic Engineering
                                                     tions targeted by natural selection. Searches for footprints of positive selection, in the
and Biotechnology (BIOTEC), National
Science and Development Agency (NSTDA),              form of rapid amino acid substitutions, and the identification of species‐specific genes
Klhong Luang, Thailand
                                                     have proved to be powerful approaches to identifying the genes involved in host spe-
3
 Chulalongkorn University Big Data
Analytics and IoT Center (CUBIC),                    cialization in plant‐pathogenic fungi. We used an evolutionary comparative genomic
Department of Computer Engineering,                  approach to identify genes underlying host adaptation in the ant‐infecting genus
Faculty of Engineering, Chulalongkorn
University, Bangkok, Thailand                        Ophiocordyceps, which manipulates ant behaviour. A comparison of the predicted
4
 Center of Excellence in Systems Biology,            genes in the genomes of species from three species complexes—O. unilateralis,
Faculty of Medicine, Chulalongkorn
                                                     O. australis and O. subramanianii—revealed an enrichment in pathogenesis‐associated
University, Bangkok, Thailand
                                                     functions, including heat‐labile enterotoxins, among species‐specific genes. Further-
Correspondence
                                                     more, these genes were overrepresented among those displaying significant footprints
Noppol Kobmoo, BIOTEC, NSTDA, Thailand
Science Park, Khlong Neung, Khlong Luang,            of positive selection. Other categories of genes suspected to be important for viru-
12120 Pathum Thani, Thailand.
                                                     lence and pathogenicity in entomopathogenic fungi (e.g., chitinases, lipases, proteases,
Email: noppol.kob@biotec.or.th
                                                     core secondary metabolism genes) were much less represented, although a few candi-
Funding information
                                                     date genes were found to evolve under positive selection. An analysis including
H2020 Marie Skłodowska-Curie Actions,
Grant/Award Number: 655278; Thailand                 orthologs from other entomopathogenic fungi in a broader context showed that posi-
Research Fund, Grant/Award Number:
                                                     tive selection on enterotoxins was specific to the ant‐infecting genus Ophiocordyceps.
TRG5780162
                                                     Together with previous studies reporting the overexpression of an enterotoxin during
                                                     behavioural manipulation in diseased ants, our findings suggest that heat‐labile
                                                     enterotoxins are important effectors in host adaptation and co‐evolution in the Ophio-
                                                     cordyceps entomopathogenic fungi.

                                                     KEYWORDS
                                                     adaptation, enterotoxins, host specificity, Ophiocordyceps, positive selection

1 | INTRODUCTION                                                                  of the copies (Ohno, 1970; Zhang, Zhang, & Rosenberg, 2002). Gene
                                                                                  losses can also be adaptive (Juárez‐Vázquez et al., 2017), particularly
The identification of genes underlying adaptation is a major goal in              in pathogens, as the absence of a molecule recognized by the host
evolutionary biology, as it can shed light on the biological functions            may enable the pathogen to colonize its host without triggering a
targeted by natural selection and the genetic mechanisms generating               response from the host immune system (Albalat & Cañestro, 2016;
new, adaptive variants. Innovation may be generated during evolution              Ghanbarnia et al., 2015; Rouxel & Balesdent, 2017). Gene duplications
by gene duplication followed by rapid amino acid substitutions in one             and losses result in the presence of species‐specific genes, which are

Molecular Ecology. 2018;1–17.                                  wileyonlinelibrary.com/journal/mec                      © 2018 John Wiley & Sons Ltd   |   1
2   |                                                                                                                                  KOBMOO    ET AL.

often overrepresented among the genes involved in adaptation               models in which to study the genomics of host specialization and
(Gladieux et al., 2014; Lespinet, Wolf, Koonin, & Aravind, 2002; Zhou      co‐evolution. Specialist Metarhizium strains and species generally
et al., 2015). Adaptation may also occur through positive selection,       have fewer genes, with notably fewer genes encoding host‐killing
with rapid amino acid substitutions, typically detected as higher rates    toxins (Wang, Leclerque, Pava‐Ripoll, Fang, & St. Leger, 2009), but
of nonsynonymous substitutions (dN) than of synonymous substitu-           more genes evolving under positive selection than generalists (Hu
tions (dS) among orthologous genes of closely related species (Ina,        et al., 2014). However, it remains unclear how adaptation has
1996; Kimura, 1983). Comparisons of dN/dS ratios to neutral expecta-       shaped the genomes of closely related fungal entomopathogens
tions therefore also constitute a powerful approach to identifying         specializing on different hosts.
genes under recurrent positive selection (Yang & Nielsen, 1998; Yang,         We therefore tried to identify genes involved in host speci-
Nielsen, Goldman, & Krabbe Pedersen, 2000; Yang, Wong, & Nielsen,          ficity in three complexes of closely related species from the genus
2005).                                                                     Ophiocordyceps (Hypocreales, Ascomycota): O. unilateralis sensu lato,
    Pathogens are particularly interesting models for investigations       O. subramanianii s.l. and O. australis s. l. One of the key features
of the genomic mechanisms of adaptation, as they are locked in an          of these pathogens is their ability to manipulate their hosts to
arms race with their hosts, leading to continuous, rapid evolution         promote their own dispersal. Infected ants, often described as
(Anderson et al., 2010; Kurtz, Schulenburg, & Reusch, 2016). Identifi-     “zombie ants,” leave their nests and develop erratic behaviour,
cation of the genes underlying host‐specific adaptations in patho-         wandering alone into vegetation and then biting into a leaf
gens improves our fundamental understanding of natural selection           located at a precise height and orientation optimal for subsequent
and evolution, but it also has more applied implications, shedding         fungal dispersal just before they die. Fungal spores produced from
light on major epidemics and disease emergence in plants and ani-          the diseased ant are thus dispersed farther, from a height (de
mals (Möller & Stukenbrock, 2017).                                         Bekker, Ohm, Evans, Brachmann, & Hughes, 2017; Hughes et al.,
    Fungi are the principal pathogens of plants (Anderson et al.,          2011, 2016; Pontoppidan, Himaman, Hywel‐Jones, Boomsma, &
2004), and they also represent threats to the health of many animals       Hughes, 2009). Ophiocordyceps unilateralis s.l. is a highly diverse
(Fisher et al., 2012; Sexton & Howlett, 2006). Many studies have           complex of pathogenic cryptic species specific to formicine ants. It
searched for genes under positive selection as a means of identifying      is distributed worldwide, and many species occur together in sym-
genes and functions involved in the species‐specific adaptation of         patry while displaying strong host specificity (Araújo, Evans,
fungal pathogens of plants (Aguileta, Refrégier, Yockteng, Fournier,       Kepler, & Hughes, 2018; Evans, Elliot, & Hughes, 2011; Kobmoo,
& Giraud, 2009; Möller & Stukenbrock, 2017). For example, in the           Mongkolsamrit,    Tasanathai,      Thanakitpipattana,   &    Luangsa‐Ard,
Microbotryum and Botrytis fungal plant pathogens, many such genes          2012; Kobmoo et al., 2015). Ants develop erratic behaviour only
have been identified through comparative transcriptomics studies as        when infected with their specific pathogen species (de Bekker et
being under recurrent positive selection and they were involved in         al., 2014; Sakolrak, Blatrix, Sangwanit, Arnamnart, & Kobmoo,
biological processes important for the recognition and cell signalling     2018). The taxonomy and phylogeny of the other ant‐manipulating
between the host and the pathogen (Aguileta et al., 2010, 2012).           Ophiocordyceps species complexes have been studied in less detail,
More recent, next‐generation sequencing made it possible to per-           but host specificity is also considered to be the rule for these
form genomewide scans in plant‐pathogenic fungi, resulting in the          other taxa (Araújo et al., 2018).
identification of an array of effectors under positive selection              We conducted a comparative genomic study of ant‐infecting
(Badouin et al., 2017; Poppe, Dorsheimer, Happel, & Stukenbrock,           Ophiocordyceps species, with the aim of identifying genes underlying
2015; Schirrmann et al., 2018; Stukenbrock et al., 2011; Wicker et         host specificity by searching for species‐specific genes and genes
al., 2013) and species‐ or lineages‐specific genes underlying adapta-      evolving under positive selection. We sequenced the genomes of
tions (Baroncelli et al., 2016; Hartmann, Rodríguez de la Vega, Bran-      two closely related species of the O. unilateralis complex from Thai-
denburg, Carpentier, & Giraud, 2018).                                      land: O. camponoti-leonardi and O. camponoti-saundersi, specific to
    By contrast, far fewer such studies have been performed on             the ants Colobopsis leonardi and C. saundersi, respectively. We also
entomopathogenic fungi (Wang & Wang, 2017), despite the impor-             improved the available genome assembly of another species of this
tance of identifying genes underlying host‐specific adaptation for the     complex,   O. polyrhachis-furcata,    specific   to   Polyrhachis   furcata
use of these fungi as biological control agents against insect pests in    (Wichadakul et al., 2015), and used the published genomes of other
agriculture (Wang & Feng, 2014). Furthermore, an understanding of          ant‐infecting Ophiocordyceps species (de Bekker et al., 2017): one
host specificity and evolution in these insect pathogens is of funda-      genome of each of two species of O. unilateralis s.l., O. kimflemingiae
mental interest in its own right, particularly for fungi able to manipu-   from the United States infecting Camponotus castaneus (Araújo et al.,
late the behaviour of the insect host for their own benefit, as in the     2017; de Bekker et al., 2015) and O. camponoti-rufipedis from Brazil
“zombie‐ant”    phenomenon.     Most    genomic    studies   on   ento-    specific to C. rufipes (Araújo et al., 2018; Evans et al., 2011); one
mopathogenic fungi have focused on species with agricultural appli-        genome of O. subramanianii s.l. from a ponerine ant in Ghana; one
cations such as Beauveria bassiana and Metarhizium anisopliae (Gao         genome of each of two strains of O. australis s.l. found on different
et al., 2011; Hu et al., 2014; Pattemore et al., 2014). However, these     ponerine ant species, from Ghana and Brazil, probably belonging to
species have broad host ranges and may not, therefore, be the best         different cryptic species (de Bekker et al., 2017).
KOBMOO       ET AL.                                                                                                                                 |   3

      The entomopathogenic fungi of the order Hypocreales are                  (Wichadakul et al., 2015); we aimed to improve the existing refer-
known to infect their host by penetrating the cuticle (Boomsma, Jen-           ence genome, but the original strain BCC54312 could not be grown
sen, Meyling, & Eilenberg, 2014). This process requires an array of            from the culture collection. We therefore collected three additional
proteinases, lipases and chitinases. The acquisition of nutrients from         samples of this species (strains NK275ss‐12, NK142ss and NK294ss‐
the host requires proteases and glycoside hydrolases, including tre-           20), in 2013 and 2014, from the same site as the reference strain, in
halases in particular, as trehalose is a major carbon source present in        Khao Yai National Park, Nakhon Ratchasima Province. The collected
the insect haemolymph (Thompson, 2003). Secondary metabolites,                 samples were isolated and grown as described by Wongsa, Tasana-
including toxins, help to combat the host immune system and even-              tai, Watts, and Hywel‐Jones (2005). We complied with the Nagoya
tually kill the insect (Ortiz‐Urquiza, Riveiro‐Miranda, Santiago‐Álvarez,      protocols on access and benefit‐sharing, by obtaining authorization
& Quesada‐Moraga, 2010; Schrank & Vainstein, 2010). Ophiocordy-                from the Department of National Parks, Wildlife and Plant Conserva-
ceps polyrhachis-furcata has a more extensive family of genes encod-           tion (DNP) at the Ministry of Natural Resources and Environment of
ing      putative     heat‐labile   enterotoxins   than   other   specialist   Thailand for all strain collections. After two to three months of
entomopathogenic fungi (Wichadakul et al., 2015), and some of                  growth on Grace Insect Cell Medium (Sigma‐Aldrich), the mycelia
these genes are expressed during host‐specific behavioural manipula-           and spores were harvested and DNA was extracted with the
tion. Heat‐labile enterotoxins may, therefore, act as neuromodulators          NucleoSpin® Soil kit (Macherey‐Nagel). The long incubation period is
(de Bekker et al., 2015). We hypothesized that enterotoxin‐coding              due to the fact that O. unilateralis species in Thailand are very fastid-
genes would be under recurrent positive selection in ant‐manipulat-            ious to grow, requiring few steps of enlarging the culture scale to a
ing Ophiocordyceps fungi, as they have probably been involved in co‐           sufficient level for DNA extraction. Genomic libraries were con-
evolution with the host and in host‐specific adaptation. Small pro-            structed (150‐bp paired‐end reads) for sequencing with an Illumina
teins secreted by fungal pathogens are often involved in interactions          HiSeq3000 machine at the GenoToul platform (Toulouse, France).
with the host (Barrett & Heil, 2012; Rafiqi, Ellis, Ludowici, Hardham,
& Dodds, 2012). We therefore conducted genome scans for positive
                                                                               2.2 | Read pretreatment, de novo assembly and
selection and focused on the heat‐labile enterotoxin gene family and
                                                                               improvement of the reference genome
small secreted proteins. We conducted formal tests for positive
selection (statistical comparisons of models of evolution with and             The raw reads were trimmed to remove adapters and low‐quality
without diversifying selection). As such tests detect only highly              bases from their ends (q < 20). Duplicate reads were removed using
recurrent and rapid positive selection, we also investigated the 5%            Picard Tools MarkDuplicate. The reference genomes for O. cam-
of genes with the highest dN/dS values. High dN/dS values, even if             ponoti-leonardi and O. camponoti-saundersi were assembled de novo
below 1, may be indicative of positive selection at a few sites in the         with SPAdes (Bankevich et al., 2012), which progressively integrates
protein, although they may also result from relaxed selection. In sev-         k‐mers of increasing size. The k‐mer sizes used were 21, 33, 55, 77,
eral classes of genes thought to be important for virulence and                99, 119 and 127 for NK405ss‐6, and 21, 33, 55, 77, 99 and 115 for
pathogenicity in entomopathogenic fungi (e.g., chitinases, lipases,            NK511ss‐8. The appropriate maximum k‐mer sizes were estimated
proteases, small secreted proteins), only a few genes showed signs             with Kmergenie (Chikhi & Medvedev, 2014).
of selection or species specificity. By contrast, we found that heat‐             The reads obtained for the new O. polyrhachis-furcata samples
labile enterotoxins were overrepresented among both the species‐               were used to fill gaps in the existing reference genome of this spe-
specific genes and the genes with significant footprints of positive           cies with GapFiller (Boetzer et al., 2012), which mapped the reads
selection. An analysis including enterotoxin‐encoding genes from               onto the reference sequence over the regions flanking the gaps and
other entomopathogenic fungi (Hypocreales), that do not manipulate             identified a consensus between reads overlapping the gaps. In total,
host behaviour, showed that positive selection was specific to the             175 of 3,915 gaps were closed (identifying around 1.6 Mb from a
ant‐infecting genus Ophiocordyceps. These findings suggest that                total gap length of 2.4 Mb in the reference genome).
heat‐labile enterotoxins are important effectors involved in host
adaptation and co‐evolution in entomopathogenic Ophiocordyceps
                                                                               2.3 | Gene prediction and functional annotation
fungi.
                                                                               Gene prediction was based exclusively on scaffolds of more than
                                                                               1 kb in length and involved a two‐round approach based on MAKER
2 | MATERIALS AND METHODS
                                                                               (Cantarel et al., 2008). Gene sets were initially predicted with
                                                                               CEGMA (Parra, Bradnam, & Korf, 2007) and GeneMark‐ES (Lom-
2.1 | Sampling and sequencing
                                                                               sadze, Ter‐Hovhannisyan, Chernoff, & Borodovsky, 2005) and were
In 2015, we collected a sample of O. camponoti-leonardi (strain                then used as inputs into MAKER for the first round of prediction.
NK511ss‐8) from Kalayaniwattana district, in Chiang Mai province in            The predicted proteins and transcripts identified in previous studies
Thailand, and a sample of O. camponoti-saundersi (strain NK405ss‐6)            on O. polyrhachis-furcata (Wichadakul et al., 2015) were also used as
from the Phu Kiew National Park, in Chaiyaphum province. We used               a training set for MAKER. The predicted gene set from this first
the reference genome of O. polyrhachis-furcata (strain BCC54312)               round was then fed into SNAP (Korf, 2004) and Augustus (Keller,
4   |                                                                                                                                 KOBMOO   ET AL.

Kollmar, Stanke, & Waack, 2011). The output of these two tools was         2016). Fisher's exact tests were used to compare gene counts
then fed back into MAKER for a second round of prediction.                 between paralogous species‐specific or complex‐specific groups and
    The predicted proteins were annotated with InterProScan 5              the whole gene set for the species or complex, respectively.
(Jones et al., 2014), which also associated the protein domains               Sequences within all orthologous groups were aligned with       MACSE

detected with sequences in the Pfam (Finn et al., 2016) and KEGG           (Ranwez, Harispe, Delsuc, & Douzery, 2011) for further analyses. A
(Kanehisa, Sato, Kawashima, Furumichi, & Tanabe, 2016; Ogata et            phylogenetic tree with bootstrap support was constructed according
al., 1999) databases and with Gene Ontology (GO) terms. Small              to the GTRCAT model under         RAXML-HPC   v8.1.5 (Stamatakis, 2014),
secreted proteins (SSPs) were identified as proteins of 10) ratios, were discarded. The functions overrepresented among
and on gene positions on scaffolds. SMGCs were predicted with the          the 5% of genes with the highest dN/dS ratios were inferred by an
fungal version of antiSMASH (Weber et al., 2015). SMGC homology            analysis of enrichment in GO terms. A mean dN/dS>1 for a given
across species was inferred with BiG‐SCAPE (Navarros‐Munõz J.,             gene indicates positive selection, whereas high dN/dS values below
https://git.wageningenur.nl/medema-group/BiG-SCAPE/wikis/home),            1 can be due to positive selection on a small number of sites within
which classified SMGCs into families based on Jaccard similarity           the protein or to relaxed selection.
indices between clusters. RepeatMasker was used to predict repeti-            We also formally tested for positive selection by performing site‐
tive elements for the three species from Thailand.                         model likelihood ratio tests (LRTs) with the     CODEML   program imple-
                                                                           mented in   PAML   v.4.8a (Yang, 2007), excluding gaps and ambiguous
                                                                           sites and using trees inferred under GTRCAT model from respective
2.4 | Orthology and phylogenomics
                                                                           orthologous groups.   CODEML   estimates the parameter omega (ω = dN/
In addition to the predicted proteins from the de novo assembled           dS) by maximum‐likelihood methods, allowing variation between
and improved genomes of O. unilateralis species from Thailand, we          sites. While the pairwise measures above only approximate synony-
also included in our analyses the predicted proteins of other ant‐         mous and nonsynonymous rates, likelihood ratio tests (LRTs) statisti-
infecting Ophiocordyceps fungi specific to different ant species and       cally compare two models of evolution, one in which ω < 1 (null
originating from different geographic areas (de Bekker et al., 2017).      model) at all sites and another in which ω > 1 at some sites (alterna-
We used the available genomes from two additional O. unilateralis          tive hypothesis of positive selection); LRTs thus indicate whether a
s.l. species (O. kimflemingiae from the United States and O. cam-          model with positive selection is more likely than a model without
ponoti-rufipedis from Brazil), from two cryptic species of O. australis    positive selection. We compared the M7 (beta distribution of ω) and
s.l., from Ghana and Brazil (de Bekker et al., 2017), and from O. sub-     M8 (beta distribution of ω with a proportion of sites with ω > 1;
ramanianii s.l., also from Ghana. The predicted proteins correspond-       Nielsen & Yang, 1998; Yang et al., 2000) models, and the M8a (simi-
ing to all these genomes were subjected to Blast comparisons with          lar to M8 but with a category of sites evolving with ω = 1) and M8
                                                           −5
each other, with a significance threshold e‐value of 1e . The Blast        (Swanson, Nielsen, & Yang, 2003) models in LRTs. Only genes with a
results were used as input for orthAgogue (Ekseth, Kuiper, & Miro-         p‐value below 0.05 after false‐discovery rate (FDR) correction were
nov, 2014), a tool for the rapid inference of orthologous groups with      considered significant. The M7 vs. M8 test is known to lack robust-
the Markov clustering algorithm (MCL, Dongen, 2000). This algo-            ness when the probability mass is located around ω = 1, in which
rithm recovers species‐specific paralogous groups, with genes from a       case this test gives a high proportion of false positives; under these
given species considered to be more closely related to each other          conditions, the M8a vs. M8 test is preferred (Swanson et al., 2003).
than to any other gene in any other species. The functional annota-        We ensured the robustness of our results by considering only genes
tions obtained for O. polyrhachis-furcata were transferred to the          in which significant evolution under positive selection was detected
other species for gene copies in the same orthologous group. Spe-          in both tests. We checked for enrichment in particular GO terms
cies‐specific paralogous genes were annotated as described above.          among the genes evolving under positive selection.
We analysed GO term enrichment among species‐ or complex‐speci-               We also investigated whether genes encoding heat‐labile entero-
fic paralogs, with the   TOPGO   package in   R   (Alexa & Rahnenfuhrer,   toxins evolved under positive selection specifically in ant‐infecting
KOBMOO      ET AL.                                                                                                                                        |   5

Ophiocordyceps and not in other Hypocrealean fungi. We therefore                     mate‐pair libraries (Wichadakul et al., 2015) (Table 1). These gen-
downloaded predicted gene sequences from other Hypocrealean                          omes are markedly larger than those reported for O. kimflemingiae
fungi that were annotated as putative heat‐labile enterotoxins from                  (OKi: 23.91 Mb), O. camponoti-rufipedis (OCR: 21.91 Mb), O. australis
the Ensembl Genome database (Herrero et al., 2016). Putative heat‐                   s.l. from Brazil (OAB: 23.32 Mb) and from Ghana (OAG: 22.19 Mb),
labile enterotoxin genes were retrieved for 14 entomopathogenic                      and O. subramanianii s.l. (OSS: 32.31 Mb), but all these previously
fungi (one strain per species) (Supporting Information Table S1):                    published genomes were more fragmented than our assemblies
Metarhizium anisopliae ARSEF23 (24 genes), M. acridum CQMa 102                       (Table 1).
(three genes) (Pattemore et al., 2014); M. album ARSEF1941 (12                           Despite the differences in genome size, the numbers of predicted
genes),   M. brunneum         ARSEF3297        (32    genes),    M. guizhouense      genes were of a same order of magnitude across species (Table 1),
ARSEF977 (32 genes), M. majus ARSEF297 (32 genes) (Hu et al.,                        although the number of predicted genes was nevertheless largest for
2014);    M. rileyi    RCEF4871       (three    genes),   Isaria    fumosorosea      OSS. For the three species from Thailand, OPF had the largest num-
ARSEF2679 (five genes), Aschersonia aleyrodis RCEF2490 (14 genes),                   ber of predicted genes, probably because the protein and transcript
Cordyceps       confragosa     RCEF1005        (six   genes),      C. brongniartii   training set used for prediction came from this species. The number
RCEF3172 (30 genes) (Shang et al., 2016), Cordyceps militaris CM01                   of SSPs was similar between the three Thai species. The number of
(one gene, Zheng et al., 2011), Beauveria bassiana ARSEF2860 (six                    genes with assigned Pfam domains or InterPro classification and the
genes, Xiao et al., 2012); and Ophiocordyceps sinensis Co18 (13                      complete predicted gene sets obtained by core eukaryotic genes
genes, Xia et al., 2017). We also included putative heat‐labile entero-              mapping (CEGMA) were also very similar in the three species (~95%:
toxin sequences from two nematode‐killing fungi: Purpureocillium                     Table 1), but smaller than those for species from the New World
lilacinum PLBJ‐1 (two genes, Wang et al., 2016) and Pochonia                         (~99%).
chlamydosporia        170    (four   genes).   Orthologs        between     these
sequences and the putative enterotoxins of O. unilatealis species
                                                                                     3.2 | Orthology and phylogenomics
studied here were identified. The occurrence of clade‐specific posi-
tive selection in O. unilateralis was assessed with branch‐model LRTs                The genomes used in this study were sequenced from individuals
in PAML (Yang, 1998; Yang & Nielsen, 1998) and with the BUSTED                       belonging to one of the three species complexes: O. unilateralis s.l.,
test, an alignment‐wide test of episodic positive selection (Murrell et              O. australis s.l. and O. subramanianii s.l. Most of the genes were com-
al., 2015). Both these tests are log‐likelihood ratio tests comparing a              mon to all three complexes (Figure 1a): 8,554 orthologous groups
model in which positive selection is allowed in the foreground                       were retrieved, 5,718 of which were common to all complexes. For
branches (i.e., the clade of interest) to the null model in which posi-              orthologous groups present in only one of the three complexes (Sup-
tive selection is not allowed. The branch model (Yang & Nielsen,                     porting Information Table S2), pathogenesis (GO:0009405) was the
1998), as implemented in PALM, detects positive selection by allow-                  function displaying the most significant enrichment in all complexes
ing a candidate clade to have a dN/dS ratio higher than those of the                 (Bonferroni‐corrected p‐values: 2e−10 for O. unilateralis s.l., 0.016 for
other branches (background branches) without taking into account                     O. australis s.l., 3.4e−5 for O. subramanianii s.l.), mostly due to the
variation between sites or allowing variation between branches of                    presence of genes encoding putative heat‐labile enterotoxins in
the same category. By contrast, BUSTED is a stochastic test using                    these complex‐specific genes. Complex‐specific genes were also
information from all sites and branches; it is therefore considered to               found to be enriched in interspecies interactions and multi‐organism
have greater statistical power (Murrell et al., 2015).                               process functions.
                                                                                         Within each species complex, most of the genes were common
                                                                                     to several species (Figure 1b,c). The function pathogenesis was
3 | RESULTS
                                                                                     found to be overrepresented among species‐specific genes (Support-
                                                                                     ing Information Tables S3 and S4), due to the presence of genes
3.1 | General genome features
                                                                                     encoding heat‐labile enterotoxins, and SSPs (Tables 2 and 3). In par-
The three reference genomes of closely related species sequenced                     ticular, we detected an overrepresentation of SSPs among the genes
here differed considerably in size, O. camponoti-saundersi (OCS)                     unique to O. kimflemingiae (p‐value = 0.003) relative to O. unilateralis
being the largest (49.26 Mb), followed by O. polyrhachis-furcata                     s.l. complex, and among the genes unique to O. australis from Brazil
(OPF) (43.25 Mb) and O. camponoti-leonardi (OCL) (37.91 Mb). These                   relative to O. australis s.l. complex (p‐value = 0.001). None of these
differences probably partly reflect methodological differences as the                species‐specific SSPs had a predicted function, suggesting an expan-
OPF genome is an improved version of a genome sequenced with a                       sion of rapidly evolving families of effectors (Kim et al., 2016).
different technology (454 pyrosequencing combined with Illumina                          There were 4,651 orthologous groups common to all eight gen-
mate‐pair sequencing, Wichadakul et al., 2015). OCL and OCS were                     omes. We used a subset of 4,014 single‐copy orthologous groups
sequenced and assembled with the same methodology, so the                            common to all species to construct a phylogenetic tree (Figure 2). This
observed differences probably reflect genuine differences in genome                  tree recovered the expected relationships between the sibling species
size. OCS also had more scaffolds (1700) than OCL (531). OPF had                     from Thailand, with O. polyrhachis-furcata being the most closely
fewer scaffolds and larger contigs, due to the use of variable‐size                  related to O. camponoti-leonardi (Kobmoo et al., 2012, 2015); the
6   |                                                                                                                                  KOBMOO    ET AL.

T A B L E 1 Genome summary statistics for the ant‐infecting Ophiocordyceps species used in this study
                                          OPF             OCL            OCS             OKi         OCR          OAB          OAG         OSS
 Species (sample name)                    (BCC54312)a     (NK511ss‐8)    (NK405ss‐6)     (SC16a)b    (Map‐16)b    (Map‐64)b    (1348a)b    (1346)b
 Genome size in Mb (scaffolds >1 kb)      43.25           37.91          49.26           23.91       21.90        23.32        22.19       32.30
 Number of scaffolds (>1 kb)              68              531            1,700           1,64        2,204        595          2,296       3,395
 Largest scaffold (kb)                    5,272.94        574.15         755.06          167.40      146.68       427.81       117.86      138.81
 N50 (kb)                                 2,974.013       139.47         102.43          26.91       23.06        111.99       17.42       17.59
 GC content (%)                           45.03           45.88          40.13           55.92       56.1         53.13        53.48       60.35
 Number of Ns per 100 kb                  5,426.84        11.32          15.22           739.17      13.02        403.43       554.75      376.08
 Number of protein‐coding genes           8,988           7,059          6,970           8,629       7,621        8,174        7,995       11,275
 Number of exons per gene                 3.57            3.00           2.98            3.00        2.00         2.00         2.00        2.00
 Exon length (median)                     146             303            303             220         273          268          290         266
 Core eukaryotic gene mapping             95.56           95.16          95.97           99.13       98.69        99.13        98.25       98.47
  (CEGMA) completeness (%)
 Repetitive content (% of the genome)     5.23            5.41           5.65            6.83        6.59         2.87         2.45        4.06
 Number of genes with SignalP             716             811            761             914         840          802          681         1,064
 Number of small secreted proteins        270             252            239             373         802          776          648         1,027
  (SSPs)

Notes. OAB: Ophiocordyceps australis from Brazil; OAG: Ophiocordyceps australis from Ghana; OCL: Ophiocordyceps camponoti-leonardi; OCR: Ophiocordy-
ceps camponoti‐rufipedis; OCS: Ophiocordyceps camponoti-saundersi; OKi: Ophiocordyceps kimflemingiae; OPF: Ophiocordyceps polyrhachis-furcata; OSS:
Ophiocordyceps subramanianii. aImproved from Wichadakul et al. (2015). bTaken from de Bekker et al. (2017).

                                                                                                       F I G U R E 1 Inference of orthologous
                                                                                                       groups: Venn diagram showing the number
                                                                                                       of orthologous groups common to and
                                                                                                       specific to species complexes and species
                                                                                                       a. between the three ant‐infecting
                                                                                                       Ophiocordyceps species complexes used in
                                                                                                       this study; (b) between the species in the
                                                                                                       O. unilateralis complex
                                                                                                       (OPF = O. polyrhachis-furcata,
                                                                                                       OCL = O. camponoti-leonardi,
                                                                                                       OCS = O. camponoti-saundersi,
                                                                                                       OKi = O. kimflemingiae,
                                                                                                       OCR = O. camponoti-rufipedis), (c) between
                                                                                                       the species in the O. australis complex
                                                                                                       (OAG = O. australis from Ghana,
                                                                                                       OAB = O. australis from Brazil)

species from the Americas, O. kimflemingiae and O. camponoti-rufipe-
                                                                            3.3 | Variation of dN/dS across genomes and
dis, clustered together but were separate from those from Thailand,
                                                                            putative functions
corresponding to the separation between the Old and New Worlds
observed in a previous study (Evans, Araújo, Halfeld, & Hughes, 2018).      The median pairwise dN/dS ratio was 0.081, indicating that most sin-
The two O. australis s.l. species were grouped together and formed,         gle‐copy orthologs evolved under strong purifying selection (Fig-
with O. subramanianii, an outgroup to the O. unilateralis complex.          ure 3a). No orthologous group had dN/dS > 1 (Supporting
KOBMOO     ET AL.                                                                                                                                   |   7

T A B L E 2 Characteristics of orthologous groups specific to different species among the Ophiocordyceps unilateralis sensu lato complex
                                                   Number of
                                                   predicted genes
                                Number of          (number of genes                                                    Species‐specific SSPs/genes,
                                specific           with Pfam                                                           whole‐genome SSPs/genes
                                orthologous        domains/InterPro                                                    (p‐value for enrichment analysis
 Species                        groups             classification)       Enriched functions (GO term; FDR p‐value)     of SSPs)
 O. polyrhachis-furcata          61                519 (25)              Pathogenesis (GO:0009405;0.0054)              18/519, 270/7678 (0.60)
                                                                         Interspecies interaction between organisms
                                                                          (GO:0044419; 0.0021)
                                                                         Multiorganism process (GO:0051704; 0.0021)
 O. camponoti-leonardi            9                   9 (8)              ‐                                             ‐
 O. camponoti-saundersi           7                   9 (6)              ‐                                             ‐
 O. kimflemingiae               169                185 (83)              ‐                                             17/185,373/7457(0.003)
 O. camponoti-rufipedis         103                122 (49)              Pathogenesis (GO:0009405;0.023)               17/122,802/6868(0.396)
                                                                         Interspecies interaction between organisms
                                                                          (GO:0044419; 0.023)
                                                                         Multiorganism process (GO:0051704; 0.023)

Note. SSPs: small secreted proteins.

T A B L E 3 Characteristics of orthologous groups specific to different species among the Ophiocordyceps australis sensu lato complex
                                              Number of
                          Number of           predicted genes                                                          Species‐specific SSPs/genes,
                          specific            (number of genes                                                         whole‐genome SSPs/genes
                          orthologous         with Pfam domains/                                                       (p‐value for enrichment analysis
 Species                  groups              InterPro classification)   Enriched functions (GO term; FDR p‐value)     of SSPs)
 O. australis Ghana       150                 173 (88)                   ‐                                             14/173,648/7414 (0.892)
 O. australis Brazil      339                 356 (201)                  Pathogenesis (GO:0009405; 1.07e−5)            66/356,776/7558 (0.001)
                                                                          Interspecies interaction between organisms
                                                                          (GO:0044419; 1.07e−5)
                                                                         Multiorganism process (GO:0051704; 1.07e−5)

Note. SSPs: small secreted proteins.

F I G U R E 2 The best maximum‐
likelihood tree based on 4,014 single‐copy
orthologous groups with bootstrap
supports. The horizontal scale bar
represents the branch length based on
substitution rates                                                                        0.09
8    |                                                                                                                             KOBMOO    ET AL.

    (a)                                             (b)

F I G U R E 3 Distribution of pairwise nonsynonymous‐to‐synonymous substitution ratios (dN/dS) for the genes in all single‐copy orthologous
groups with at least four species of ant‐infecting Ophiocordyceps represented. (a) Whole‐genome dN/dS distributions, (b) Boxplots of pairwise
dN/dS values for the whole genome (small secreted protein‐coding genes or SSPs, in blue, vs. non‐SSPs, in red) and between different
categories of genes suspected a priori to be involved in pathogenesis and virulence, that is, with the putative functions of enterotoxins, core
proteins of secondary metabolism (SM), lipases, proteases (including subtilisin‐like, trypsin and aspartyl proteases) and trehalases. The dotted
line represents the mean dN/dS value for the whole genome (0.145)

Information Table S5). We investigated the putative functions of the       higher than that of other genes (t test: Bonferroni‐corrected
5% of genes with the highest dN/dS values (297 genes) (Supporting          p‐value = 0.885) (Figure 3b). The genes encoding putative SSPs had
Information Table S5), even if these ratios were below 1, as this          a significantly higher dN/dS ratio than non‐SSP genes (t test:
could be indicative of positive selection at a small number of sites in    p‐value = 2.2e‐16) (Figure 3b). This suggests that a higher proportion
the protein, although relaxed selection cannot be excluded for dN/dS       of genes may evolve under positive selection among SSP‐encoding
KOBMOO    ET AL.                                                                                                                                |   9

T A B L E 4 Results of the gene ontology (GO) term enrichment analyses for the genes with significant likelihood ratio test (LRT) results for
positive selection in both the M7 vs. M8 (Nielsen & Yang, 1998) and M8a vs. M8 (Swanson et al., 2003) comparisons
 GO Category                         GO.ID                     Term                                                                     p‐value
 Biological process                  GO:0009405                Pathogenesis                                                             0.033
                                     GO:0044419                Interspecies interaction between organisms                               0.033
                                     GO:0051704                Multiorganism process                                                    0.033
 Molecular function                  GO:0090729                Toxin activity                                                           5.7e‐4
                                     GO:0005524                ATP Binding                                                              0.013
                                     GO:0032559                Adenyl ribonucleotide binding                                            0.013
                                     GO:0030554                Adenyl nucleotide binding                                                0.013
                                     GO:0016301                Kinase activity                                                          0.013
                                     GO:0000166                Nucleotide binding                                                       0.013
                                     GO:1901265                Nucleoside phosphate binding                                             0.013
                                     GO:0016772                Transferase activity, transferring phosphorus‐containing group           0.013
                                     GO:0036094                Small molecule binding                                                   0.013
                                     GO:0016773                Phosphotransferase activity, alcohol group as an acceptor                0.013
 Cellular compartment                GO:0005615                Extracellular space                                                      3.7e‐4
                                     GO:0044421                Extracellular region part                                                5.3e‐4
                                     GO:0005576                Extracellular region                                                     5.3e‐4

F I G U R E 4 Percentages of genes in
various functional categories for which
likelihood ratio tests (LRTs) for positive
selection (M7 vs. M8 and M8a vs. M8)
yielded significant results (false‐discovery
rate‐corrected p‐value < 0.05).
Proteases = subtilisin, trypsin and aspartyl
proteases, SM = core genes of secondary
metabolites. The total number of genes in
each category is indicated above the bars

high‐energy molecule (ATP) to a substrate and are involved in vari-        essential for pathogen growth and survival and, thus, for pathogene-
ous cellular processes. The proportion of kinases evolving under pos-      sis and virulence (Lee et al., 2016).
itive selection was lower than that of heat‐labile enterotoxins                  The functions relating to hydrolytic enzymes important for
(Figure 4), but the numbers of kinases and heat‐labile enterotoxins        pathogenesis (glycoside hydrolases, lipases, proteases) were not
evolving under positive selection were similar, and these two func-        overrepresented among the genes evolving under positive selection.
tions were overrepresented among the genes evolving under positive         Indeed, the proportions of genes in these families found to be under
selection. Most of these kinases were annotated as protein kinases,        positive selection were markedly smaller than those for heat‐labile
histidine kinases and phosphatidylinositol 3 and 4‐kinases. These          enterotoxins (Figure 4). Nevertheless, several of the genes from
families of kinases are well known to be involved in cell signalling,      these gene families were found to evolve under positive selection in
10   |                                                                                                                             KOBMOO     ET AL.

model tests (Supporting Information Table S7) and can be considered        cytochrome b5‐like haem/steroid‐binding domain. The first of these
good candidates for involvement in co‐evolution and host specificity.      genes was shown to be associated with iron uptake in yeast (Roman,
One to three of the 11 chitinases (GH18) presented significant foot-       Dancis, Anderson, & Klausner, 1993), whereas the product of the
prints of positive selection depending on the evolution model con-         second mediates iron‐free electron transfer. The third of these genes
sidered, and significant p‐values were obtained in all tests for one of    may encode a nitrate reductase or sulphite oxidase, both of which
these enzymes. Chitinases are involved in the degradation of the           are involved in nitrogen assimilation. These putative neuromodula-
insect cuticle, a major component of the insect exoskeleton, and in        tors thus seem to be involved in host resource utilization. The neu-
the degradation and remodelling of fungal cell walls (Adams, 2004;         rological   disorder   displayed   by   zombie   ants   infected   with
Langner & Göhre, 2016). Other GH families converging to various            Ophiocordyceps may result from the pathogen outcompeting the host
functions, such as cellulase, glucanase, glucosidase and galactosidase     for iron and nitrogen.
(e.g., GH5, GH16, GH47, GH76), also included a few genes display-
ing significant tests of positive selection (Supporting Information
                                                                           3.5 | Positive selection of heat‐labile enterotoxin
Table S7). Neither of the two trehalases (GH37), which are thought
                                                                           genes specific to the ant‐manipulating O. unilateralis
to play important roles in nutrient acquisition within the host body,
                                                                           species complex
displayed significant signs of positive selection.
     Zero to four of the nine subtilisin‐like (MEROPS family S08 and       The above results and those of previous studies (de Bekker et al.,
S53) and trypsinlike proteases (MEROPS family S01), which are con-         2015; Wichadakul et al., 2015) suggest that heat‐labile enterotoxin
sidered to act as cuticle‐degrading proteases, presented significant       genes are candidate genes for host‐specific adaptation. We there-
footprints of positive selection, depending on the evolution model         fore investigated whether the positive selection detected above
considered. Zero to one of 16 putative aspartyl proteases (MEROPS          was specific to the ant‐infecting Ophiocordyceps species or general
family A01) was found to evolve under positive selection following         to Hypocrealean entomopathogenic and nematode‐killing fungi.
different models. However, none of these proteases yielded signifi-        Thirty‐six orthologous groups of heat‐labile enterotoxin genes were
cant results in both tests (Figure 4). Two to six of the 39 putative       inferred for a group of 16 Hypocrealean entomopathogenic and
lipases yielded significant p‐values in positive selection tests, and      nematode‐killing fungi in addition to our eight focal species (Sup-
only one yielded significant p‐values in both tests (Figure 4; Support-    porting Information Table S1); 22 of these orthologous groups
ing Information Table S7).                                                 included at least one sequence from the ant‐infecting Ophiocordy-
     One to four of the seven core genes of secondary metabolites          ceps, and 10 (42%) of these groups included only sequences from
displayed significant signatures of positive selection, depending on       the ant‐infecting Ophiocordyceps species. We further analysed the
the evolution model considered (Supporting Information Table S7).          only group (ORTHAgEnt13) common to at least four of the ant‐
The only gene to yield significant p‐values in both tests (orthologous     infecting Ophiocordyceps species considered and sequences recov-
group ORTHAg2248, Supporting Information Table S7) encoded a               ered from other species from Hypocreales, for which both site‐
polyketide synthase (PKS)‐like protein with a beta‐ketoacyl synthase       model LRTs for positive selection were significant. This group
domain. Beta‐ketoacyl synthase is involved in fatty acid biosynthesis      included five sequences each from an O. unilateralis species. In a
and has been shown to be involved in the production of polyketide          maximum‐likelihood tree, all the O. unilateralis sequences were
antibiotics in fungi (Beck, Ripka, Siegner, Schiltz, & Schweizer, 1990).   located on the same branch (Figure 6). The PAML branch‐model
The gene encoding this enzyme is part of a secondary metabolic             LRTs indicated that this gene was evolving under positive selec-
gene cluster that is highly syntenic across the species of the O. uni-     tion specifically in the O. unilateralis clade (p‐values < 0.001). The
lateralis complex, but located in different clusters in O. australis and   branch at the base and the internal branches of the O. unilateralis
in O. subramanianii (Figure 5).                                            clade therefore had significantly higher dN/dS ratios than the
     We also investigated whether the genes previously identified as       other branches (Figure 6). The BUSTED test, which is similar to
encoding possible “neuromodulators” (de Bekker et al., 2017), based        PAML branch tests but considered more powerful, also gave a sig-
on their overexpression during the manipulation of ant behaviour,          nificant result (p‐value = 6.16e‐14).
showed signs of positive selection. In total, 12 to 41 of these genes
yielded significant results in tests for positive selection (Supporting
                                                                           4 | DISCUSSION
Information Table S8). Five genes yielded significant results in both
tests. These genes encoded a short‐chain dehydrogenase, a DNA
                                                                           4.1 | Enterotoxin genes as major candidate genes
mismatch repair protein (MutC), a DNA replication factor, an ATPase
                                                                           underlying host adaptation
and a protein with no functional annotation. Seven other genes
yielded results only in the M8a vs. M8 test, which is more robust          Comparative genomic studies of closely related species of fungal
than the M7 vs. M8 test. These seven genes included oxidoreduc-            pathogens have shown that, in general, genes involved in adaptation,
tases clearly involved in metabolic reactions: a protein with a ferric‐    particularly those involved in virulence and pathogenicity, are spe-
reductase transmembrane‐like domain, a flavodoxin oxidoreductase           cies‐specific, highly divergent and/or under diversifying selection, as
and an oxidoreductase binding to a molybdopterin cofactor with a           a result of the arms race between host and pathogen or
KOBMOO    ET AL.                                                                                                                             |   11

F I G U R E 5 Homology of putative secondary metabolic gene clusters (SMGCs) with the core gene under positive selection according to log‐
likelihood ratio tests (M7 vs. M8 models and M8a vs. M8 models). The dashed lines indicate orthology between the putative polyketide
synthase (PKS)‐like core gene. The phylogenetic tree was inferred from Jaccard similarity indices between alignments of common gene
domains within families. OCS = Ophiocordyceps camponoti-saundersi, OCL = O. camponoti-leonardi, OPF = O. polyrhachis-furcata,
OKi = O. kimflemingiae, OCR = O. camponoti-rufipedis, OAG = O. australis from Ghana, OAB = O. australis from Brazil, OSS = O. subramanianii

specialization on new hosts (Ghanbarnia et al., 2015; Huang, Si,               differences between species, suggesting the occurrence of diversi-
Deng, Li, & Yang, 2014; Plissonneau et al., 2017; Stukenbrock et al.,          fying selection, which was confirmed by formal tests comparing
2011). We therefore used an evolutionary comparative genomic                   models with and without positive selection. Furthermore, in the
approach for identifying genes underlying host adaptation in ant‐              cases in which orthologs of enterotoxin genes were found in
infecting Ophiocordyceps from three species complexes (O. unilater-            other entomopathogenic fungi, we inferred that positive selection
alis s.l., O. australis s.l. and O. subramanianii s.l.). Genome comparisons    was specific to the ant‐infecting Ophiocordyceps clade. These find-
showed that species complex‐specific genes were enriched in genes              ings support the view that heat‐labile enterotoxins are effectors
associated with the function pathogenesis which included genes                 involved in host adaptation, as previously suggested based on
encoding heat‐labile enterotoxins. The species‐specific genes were             observations of enterotoxin overexpression during manipulation of
also enriched in this function. However, most species‐specific genes           the behaviour of the diseased ants (de Bekker et al., 2015) and of
lacked functional annotation, perhaps due to their rapid evolution as          the species‐specific nature of behavioural manipulation (de Bekker
part of the arms race between pathogen and host, resulting in                  et al., 2014; Sakolrak et al., 2018). The proximal mechanisms via
homology no longer being detectable. Most of the small secreted                which enterotoxins act during infection and the manipulation of
proteins (SSPs), in particular, lacked predicted functions, but these          host behaviour remain unclear, but it has been suggested that
proteins were particularly abundant among the species‐specific                 these molecules interfere with the chemical communication of
genes. SSPs may act as effectors in pathogenicity, but the validation          social insects; bacterial enterotoxins have been shown to affect
of their function requires further studies.                                    pheromone production in boll weevils (Wiygul & Sikorowski, 1986,
   Heat‐labile enterotoxin genes were also overrepresented in the              1991). Alterations to chemical communication may contribute to
orthologous        groups   with   the   highest   rates   of   amino   acid   the modification of behaviour in infected ant hosts.
12   |                                                                                                                               KOBMOO    ET AL.

F I G U R E 6 The best RAxML tree based on the GTRCAT model for the orthologous group ORTHAgEnt13 of putative heat‐labile
enterotoxins in entomopathogenic and nematode‐killing fungi of the order Hypocreales. The numbers above the nodes are bootstrap supports.
The numbers below the branches are the ratios of nonsynonymous‐to‐synonymous substitution rates (dN/dS)

                                                                           (Boomsma et al., 2014; Ortiz‐Urquiza & Keyhani, 2013; Wang, Fang,
4.2 | Minor role of the cuticle in exerting selective
                                                                           Wang, & St. Leger, 2011; Wang & St. Leger, 2005). Nevertheless, as
pressure leading to diversifying selection
                                                                           the fungi in the three ant‐infecting complexes considered here are
Hypocrealean entomopathogenic fungi are known to infect their              all pathogens of formicine and ponerine ants, our findings do not
insect hosts by penetrating the cuticle from the outside (Boomsma          rule out diversifying selection occurring across larger phylogenetic
et al., 2014). An array of hydrolytic enzymes, including chitinases,       scales. These enzymes may be highly conserved among pathogens of
lipases and proteases, is required to break through the insect cuticle     formicine and ponerine ants, providing a common arsenal for attack-
(Ortiz‐Urquiza & Keyhani, 2013). Chitins are major constituents not        ing taxonomically related ants. There may also be constraints in the
only of insect cuticles, but also of fungal cell walls (Langner & Göhre,   host or the fungus preventing rapid co‐evolution through changes to
2016) while lipids are a major component of the epicuticle waxy            these molecules.
layer (Jarrold, Moore, Potter, & Charnley, 2007; Pedrini, Ortiz‐
Urquiza, Huarte‐Bonnet, Zhang, & Keyhani, 2013). Proteases are
                                                                           4.3 | Utilization of host resources
important for the penetration of the cuticle by fungi and have been
shown to be virulence factors for the infection of insect hosts (Shah,     Once inside the host, the pathogen requires other hydrolases for
Wang, & Butt, 2005). Subtilisin proteases have been shown to play a        carbon assimilation. Efficient nutrient uptake from the host allows
particularly important role in regulating insect host specificity          optimal proliferation of the fungus within its host and ultimately
through the differential expressions of specific isoforms (Bye &           leads to insect death (Luo, Qin, Pei, & Keyhani, 2014). It has, there-
Charnley, 2008; Mondal, Baksi, Koris, & Vatai, 2016). We therefore         fore, been suggested that host resource utilization is crucial for host
hypothesized that the genes encoding chitinases, proteases and             specificity (Gillespie, Bailey, Cobb, & Vilcinskas, 2000). Trehalases, in
lipases might have evolved under diversifying selection. However,          particular, probably play an important role in this respect. Indeed,
we found footprints of positive selection for only a few putative          the fly pathogen Entomophthora muscae (Entomophthorales) carries
genes encoding these enzymes in the ant‐infecting Ophiocordyceps           more trehalase‐encoding genes in its genome than its close relative,
species. This challenges the widely accepted view that the insect          the generalist Conidiobolus coronatus, which is a nonobligate patho-
cuticle, as a major barrier to infections, exerts a strong selective       gen (De Fine Licht, Jensen, & Eilenberg, 2017). We identified two
pressure on entomopathogenic fungi, leading to different host ranges       trehalases with no positive selection signature as conserved across
KOBMOO    ET AL.                                                                                                                               |   13

all species. Other glycoside hydrolases and lipases may be involved        Ferrara, & Perrone, 2013). We detected significant footprints of pos-
in breaking down primary carbon sources (Ortiz‐Urquiza & Keyhani,          itive selection in some of the core genes of secondary metabolites.
2013; Schrank & Vainstein, 2010). However, the evidence for posi-          The most notable case concerned a PKS‐like function involved in
tive selection is less robust for these enzymes. Thus, diversifying        lipid biosynthesis. Lipids have been shown to be involved in patho-
selection in ant‐pathogenic Ophiocordyceps fungi probably acts less        physiological processes in pathogenic fungi, but the role of the lipid
strongly on the function of carbon assimilation than on enterotoxins.      signalling network in host‐specific pathogenicity remains to be deter-
Again, there may be constraints preventing the rapid evolution of          mined (Singh & Poeta, 2011). Kinases are also known to participate
host cuticle or fungal hydrolase and lipase functions.                     in lipid signalling pathways, and the kinases with significant foot-
   Nitrogen also plays a key role in the proliferation of ento-            prints of positive selection identified included phosphatidylinositol 3
mopathogenic fungi (Luo et al., 2014). However, our results suggest        and 4‐kinases. The phosphorylated form of phosphatidylinositol plays
that initial nutrient acquisition via proteinases is not under strong      an important role in lipid and cell signalling (Funaki, Katagiri, Inukai,
diversifying selection. Genes evolving under positive selection were       Kikuchi, & Asano, 2000). Lipid metabolism thus seemed to be subject
not enriched in functions related to the assimilation of nitrogen or       to diversifying selection, although to a much lesser extent than heat‐
amino acid synthesis.                                                      labile enterotoxins.
   In addition to carbon and nitrogen, iron uptake is also crucial for
pathogen success (Bairwa, Hee Jung, & Kronstad, 2017; Haas, 2012;
Sutak, Lesuisse, Tacherzy, & Richardson, 2008). The candidate neu-         5 | CONCLUSIONS
romodulator genes found to be under positive selection included
iron‐related oxidoreductases. In particular, one of the proteins identi-   We focused on three ant‐infecting species complexes from the
fied had a ferric‐reductase transmembrane domain, and another was          genus Ophiocordyceps, including closely related species. Complex‐
a flavodoxin oxidoreductase. Proteins with ferric‐reductase trans-         and species‐specific genes were found to be enriched in genes for
membrane domains have been shown to be crucial for ferric iron             heat‐labile enterotoxins, and this gene family was found to be evolv-
uptake in yeast (Roman et al., 1993), whereas flavodoxin is an iron‐       ing under positive selection to a greater extent than other candidate
free electron‐transfer protein facilitating a range of metabolic reac-     gene families. Our results thus suggest that the specific adaptation
tions in the absence of iron. Specialist entomopathogens kill their        and co‐evolution of specialist species in the ant‐infecting Ophiocordy-
hosts more slowly than generalists (Boomsma et al., 2014). In such a       ceps fungi to their hosts is dependent on selection occurring within
context, ant‐specific Ophiocordyceps might be expected to have             the body of the host rather than during cuticle penetration. By con-
developed strategies for hijacking resources from the host. The effi-      trast, we detected little positive selection on lipases, proteases or
cient acquisition of iron and an ability to divert its use may be the      chitinases, although we did identify a few interesting candidate
key to outcompeting the host during infection.                             genes from these groups. Comparative genomic studies of ento-
                                                                           mopathogenic fungi remain scarce, and the few studies that have
                                                                           been performed have focused exclusively on species of agricultural
4.4 | The role of kinases and signal transduction
                                                                           or medical interest. The findings of this study improve our under-
Kinase enzymes are widely recognized as participating in various cel-      standing of the mechanisms of fungal adaptation to insect hosts, and
lular processes, crucial to growth and survival (Lee et al., 2016). The    future studies on fungal pathogens associated with other groups of
genes under positive selection in the ant‐infecting Ophiocordyceps         insects should provide more general insight into the adaptation of
were enriched in kinase‐related functions. Most were clearly related       entomopathogenic fungi and a more documented comparison with
to signal transduction, which plays a crucial role in interactions         the mechanisms of adaptation in fungal pathogens of plants. The
between hosts and pathogens (Bahia, Satoskar, & Dussurget, 2018).          insect innate immune response seems to be much more specific than
Pathogens sense and respond to environmental stimuli, including the        that in plants, suggesting a certain level of acquired immune
expression of virulence factor regulatory systems, in the hostile con-     response (Cooper & Eleftherianos, 2017). Fungal pathogens of
ditions of the host immune system. As extremely specialized patho-         insects would be expected to display extensive expansions and con-
gens, ant‐infecting Ophiocordyceps fungi must fine‐tune their              tractions of gene families, as observed in plant pathogens, but the
responses in the host body.                                                target functions may be different. Additional insight gleaned from
                                                                           entomopathogenic fungi would help to improve our general under-
                                                                           standing of the mechanisms of adaptive evolution in eukaryotes.
4.5 | Importance of lipid metabolism
Many entomopathogenic fungi are also thought to deploy a plethora
                                                                           ACKNOWLEDGEMENTS
of metabolites and toxins within the bodies of their hosts (Schrank &
Vainstein, 2010; Singh, Son, & Lee, 2016). The nature of these mole-       This work was supported by the Marie Sklodowska Curie Action No
cules probably differs between groups of insect‐pathogenic fungi           655278 and Thailand Research Fund (TRF) Young Scientist Grant
and remains to be precisely determined, but the principal molecules        (TRG5780162) to NK. We would like to thank Alodie Snirc for
include polyketides (PKs) and nonribosomal peptides (NRPs) (Gallo,         advice concerning DNA extraction, Antoine Branca for suggestions
14   |                                                                                                                                          KOBMOO     ET AL.

about bioinformatic protocols, Rayan Chikhi for training in genome                Badouin, H., Gladieux, P., Gouzy, J., Siguenza, S., Aguileta, G., Snirc, A.,
assembly, Jérome Collemare and Jorge C. Navarro‐Muñoz for their                       … Giraud, T. (2017). Widespread selective sweeps throughout the
                                                                                      genome of model plant pathogenic fungi and identification of effec-
guidance on using antiSMASH and BiG‐SCAPE, and Suchada
                                                                                      tor candidates. Molecular Ecology, 26, 2041–2062. https://doi.org/10.
Mongkholsamrit and Kanoksri Tasanathai for the organization of                        1111/mec.13976
sampling trips. We also would like to sincerely thank Clarissa de                 Bahia, D., Satoskar, A., & Dussurget, O. (2018). Cell signalling in host‐
Bekker and David P. Hughes for kindly sharing their data on the                       pathogen interactions: The host point of view. Frontiers in Immunol-
                                                                                      ogy, 9, 1–4. https://doi.org/10.3389/fimmu.2018.00221
candidate neuromodulators.
                                                                                  Bairwa, G., Hee Jung, W., & Kronstad, J. (2017). Iron acquisition in fungal
                                                                                      pathogens of human. Metallomics: Integrated Biometal Science, 9(3),
                                                                                      215–227. https://doi.org/10.1039/c6mt00301j
AUTHOR CONTRIBUTION
                                                                                  Bankevich, A., Nurk, S., Antipov, D., Gurevich, A. A., Dvorkin, M., Kulikov,
N.K., J.J.L. and T.G. designed the study. N.K. and N.A. conducted                     A. S., … Pevzner, P. A. (2012). SPAdes: A new genome assembly
                                                                                      algorithm and its applications to single‐cell sequencing. Journal of
sampling and DNA extraction. N.K., D.W. and RCRSLV analysed
                                                                                      Computational Biology, 19(5), 455–477. https://doi.org/10.1089/cmb.
sequencing and comparative genomic data. N.K. and T.G. wrote the                      2012.0021
manuscript, with contributions from all the authors.                              Baroncelli, R., Amby, D. B., Zapparata, A., Sarrocco, S., Vannacci, G., Le
                                                                                      Floch, G., … Thon, M. R. (2016). Gene family expansions and con-
                                                                                      tractions are associated with host range in plant pathogens of the
DATA ACCESSIBILITY                                                                    genus Colletotrichum. BMC Genomics, 17(555), 1–17. https://doi.org/
                                                                                      10.1186/s12864-016-2917-6
The de novo assemblies of Ophiocordyceps camponoti-leonardi (NCBI                 Barrett, L. G., & Heil, M. (2012). Unifying concepts and mechanisms in
Biosample     SAMN07662903)          and   O. camponoti-saundersi       (NCBI         the specificity of plant‐enemy interactions. Trends in Plant Science, 17
Biosample SAMN07662932) have been deposited with the NCBI as                          (5), 282–292. https://doi.org/10.1016/j.tplants.2012.02.009
                                                                                  Beck, J., Ripka, S., Siegner, A., Schiltz, E., & Schweizer, E. (1990). The
whole‐genome       shotgun     (WGS)     projects   with    Accession     nos.
                                                                                      multifunctional 6‐methylsalicylic acid synthase gene of Penicillium pat-
PDHP00000000 and PDHQ00000000, respectively. The O. polyrha-                          ulum. Its gene structure relative to that of other polyketide synthases.
chis-furcata genome was updated based on the improved assembly                        European Journal of Biochemistry, 192(2), 487–498. https://doi.org/10.
from this study (LKCN00000000.2).                                                     1111/j.1432-1033.1990.tb19252.x
                                                                                  de Bekker, C., Ohm, R. A., Evans, H. C., Brachmann, A., & Hughes, D. P.
                                                                                      (2017). Ant‐infecting Ophiocordyceps genomes reveal a high diversity
REFERENCES
                                                                                      of potential behavioral manipulation genes and a possible major role
Adams, D. J. (2004). Fungal cell wall chitinases and glucanases. Microbiol-           for enterotoxins. Scientific Reports, 7, 12508. https://doi.org/10.
    ogy, 150(7), 2029–2035. https://doi.org/10.1099/mic.0.26980-0                     1038/s41598-017-12863-w
Aguileta, G., Lengelle, J., Chiapello, H., Giraud, T., Viaud, M., Fournier, E.,   de Bekker, C., Ohm, R. A., Loreto, R. G., Sebastian, A., Albert, I., Merrow,
    … Gout, L. (2012). Genes under positive selection in a model plant                M., … Hughes, D. P. (2015). Gene expression during zombie ant bit-
    pathogenic fungus, Botrytis. Infection, Genetics and Evolution, 12(5),            ing behavior reflects the complexity underlying fungal parasitic
    987–996. https://doi.org/10.1016/j.meegid.2012.02.012                             behavioral manipulation. BMC Genomics, 16(620), 1–23. https://doi.
Aguileta, G., Lengelle, J., Marthey, S., Chiapello, H., Rodolphe, F., Gen-            org/10.1186/s12864-015-1812-x
    drault, A., … Giraud, T. (2010). Finding candidate genes under posi-          de Bekker, C., Quevillon, L. E., Smith, P. B., Fleming, K. R., Ghosh, D., Pat-
    tive selection in non‐model species: Examples of genes involved in                terson, A. D., & Hughes, D. P. (2014). Species‐specific ant brain
    host specialization in pathogens. Molecular Ecology, 19(2), 292–306.              manipulation by a specialized fungal parasite. BMC Evolutionary Biol-
    https://doi.org/10.1111/j.1365-294X.2009.04454.x                                  ogy, 14(166), 1–12. https://doi.org/10.1186/s12862-014-0166-3
Aguileta, G., Refrégier, G., Yockteng, R., Fournier, E., & Giraud, T. (2009).     Boetzer, M., Pirovano, W., Zerbino, D., Birney, E., Simpson, J., & Wong,
    Rapidly evolving genes in pathogens: Methods for detecting positive               K. (2012). Toward almost closed genomes with GapFiller. Genome
    selection and examples among fungi, bacteria, viruses and protists.               Biology, 13(6), R56. https://doi.org/10.1186/gb-2012-13-6-r56
    Infection, Genetics and Evolution, 9(4), 656–670. https://doi.org/10.         Boomsma, J. J., Jensen, A. B., Meyling, N. V., & Eilenberg, J. (2014). Evo-
    1016/j.meegid.2009.03.010                                                         lutionary interaction networks of insect pathogenic fungi. Annual
Albalat, R., & Cañestro, C. (2016). Evolution by gene loss. Nature Reviews            Review of Entomology, 59, 467–485. https://doi.org/10.1146/annure
    Genetics, 17(7), 379–391. https://doi.org/10.1038/nrg.2016.39                     v-ento-011613-162054
Alexa, A., & Rahnenfuhrer, J. (2016). topGO: Enrichment Analysis for Gene         Bye, N. J., & Charnley, A. K. (2008). Regulation of cuticle‐degrading sub-
    Ontology. R package version 2.28.0.                                               tilisin proteases from the entomopathogenic fungi, Lecanicillium spp:
Anderson, P. K., Cunningham, A. A., Patel, N. G., Morales, F. J., Epstein,            Implications for host specificity. Archives of Microbiology, 189(1), 81–
    P. R., & Daszak, P. (2004). Emerging infectious diseases of plants:               92. https://doi.org/10.1007/s00203-007-0296-8
    Pathogen pollution, climate change and agrotechnology drivers.                Cantarel, B. L., Korf, I., Robb, S. M. C., Parra, G., Ross, E., Moore, B., …
    Trends in Ecology and Evolution, 19(10), 535–544. https://doi.org/10.             Yandell, M. (2008). MAKER: An easy‐to‐use annotation pipeline
    1016/j.tree.2004.07.021                                                           designed for emerging model organism genomes. Genome Research,
Anderson, J. P., Gleason, C. A., Foley, R. C., Thrall, P. H., Burdon, J. B., &        18, 188–196. https://doi.org/10.1101/gr.6743907
    Singh, K. B. (2010). Plants versus pathogens: An evolutionary arms            Chikhi, R., & Medvedev, P. (2014). Informed and automated k‐mer size
    race. Functional Plant Biology, 37(6), 499–512. https://doi.org/10.               selection for genome assembly. Bioinformatics, 30(1), 31–37.
    1071/FP09304                                                                      https://doi.org/10.1093/bioinformatics/btt310
Araújo, J. P. M., Evans, H. C., Kepler, R., & Hughes, D. P. (2018). Zombie‐       Cooper, D., & Eleftherianos, I. (2017). Memory and specificity in the
    ant fungi across continents: 15 new species and new combinations                  insect immune system: Current perspectives and future challenges.
    with Ophiocordyceps. I. Myrmecophilous hirsutelloid species. Studies in           Frontiers in Immunology, 8(539), 1–6. https://doi.org/10.3389/fimmu.
    Mycology, 90, 119–160.                                                            2017.00539
You can also read