Combined effects of genotype and childhood adversity shape variability of DNA methylation across age
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
Czamara et al. Translational Psychiatry (2021)11:88 https://doi.org/10.1038/s41398-020-01147-z Translational Psychiatry ARTICLE Open Access Combined effects of genotype and childhood adversity shape variability of DNA methylation across age Darina Czamara 1, Elleke Tissink 2, Johanna Tuhkanen3, Jade Martins 1, Yvonne Awaloff4, Amanda J. Drake 5 , Batbayar Khulan5, Aarno Palotie6, Sibylle M. Winter7, Charles B. Nemeroff8, W. Edward Craighead 9, Boadie W. Dunlop 9, Helen S. Mayberg9,10, Becky Kinkead 9, Sanjay J. Mathew 11, Dan V. Iosifescu10,12, Thomas C. Neylan13, Christine M. Heim 14, Jari Lahti 3,15, Johan G. Eriksson16,17,18,19, Katri Räikkönen3, Kerry J. Ressler 20, Nadine Provençal21,22 and Elisabeth B. Binder 1,9 Abstract Lasting effects of adversity, such as exposure to childhood adversity (CA) on disease risk, may be embedded via epigenetic mechanisms but findings from human studies investigating the main effects of such exposure on epigenetic measures, including DNA methylation (DNAm), are inconsistent. Studies in perinatal tissues indicate that variability of DNAm at birth is best explained by the joint effects of genotype and prenatal environment. Here, we extend these analyses to postnatal stressors. We investigated the contribution of CA, cis genotype (G), and their additive (G + CA) and interactive (G × CA) effects to DNAm variability in blood or saliva from five independent cohorts 1234567890():,; 1234567890():,; 1234567890():,; 1234567890():,; with a total sample size of 1074 ranging in age from childhood to late adulthood. Of these, 541 were exposed to CA, which was assessed retrospectively using self-reports or verified through social services and registries. For the majority of sites (over 50%) in the adult cohorts, variability in DNAm was best explained by G + CA or G × CA but almost never by CA alone. Across ages and tissues, 1672 DNAm sites showed consistency of the best model in all five cohorts, with G × CA interactions explaining most variance. The consistent G × CA sites mapped to genes enriched in brain-specific transcripts and Gene Ontology terms related to development and synaptic function. Interaction of CA with genotypes showed the strongest contribution to DNAm variability, with stable effects across cohorts in functionally relevant genes. This underscores the importance of including genotype in studies investigating the impact of environmental factors on epigenetic marks. Introduction later in life1–4. Exposure to CA is not only associated with Childhood adversity (CA), including child abuse and disease risk, but also with a number of lasting biological neglect, is a major risk factors for the development of and physiological changes, including alterations in brain stress-related psychiatric and other medical disorders structure, function, and connectivity5, stress response6, and immune function7. DNA methylation (DNAm) has been proposed as a Correspondence: Darina Czamara (darina@psych.mpg.de) or biological process by which early-life adversity may have Elisabeth B. Binder (binder@psych.mpg.de) 1 Department of Translational Research in Psychiatry, Max Planck Institute of lasting effects on gene transcription providing a molecular Psychiatry, Kraepelinstr. 2-10, 80804 Munich, Germany 2 mechanism for how early environment could influence Department of Complex Trait Genetics, Center for Neurogenomics and health outcomes later in life8,9. A number of studies have Cognitive Research, Amsterdam Neuroscience, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands investigated DNAm changes with exposure to CA in Full list of author information is available at the end of the article © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
Czamara et al. Translational Psychiatry (2021)11:88 Page 2 of 11 peripheral tissues, such as saliva or blood, either using social services. This enabled us to test for the stability of candidate gene approaches or genome-wide DNAm stu- G and CA effects with age as well as across different dies (EWAS). Overall, while there is some evidence for the types of assessment of exposure. association between CA and altered patterns of DNAm, results for individual DNAm targets remain inconsistent10. Methods The majority of autosomal CpGs (about 80%) are not Samples variable across tissues and individuals11,12, leaving only Five independent cohorts were included in our analysis: about 20% of CpG sites that may contribute to differences GRADY, PReDICT, U19, BerlinLCS, and HBCS. All in phenotypes and health13. These variable CpGs are of subjects (or their legal guardians) gave written informed specific interest as they are enriched for functionally consent and ethical approval was given by the Institu- relevant genomic regions, associated with effects on gene tional Review Board or Ethical Committee of each site expression12. In contrast to CA, genetic factors have been participating in every study. Register linkage has been shown to have replicable influences on DNAm variability. conducted with permission from the register authority The impact of genetic variation, especially of single- (HBCS: the Finnish National Archives). nucleotide polymorphisms (SNPs), on DNAm in different The GRADY cohort consisted of 309 participants who tissues, has been investigated in many studies and a large were recruited as part of the GRADY Trauma Project at number of methylation quantitative trait loci (SNPs sig- the Grady Memorial Hospital in Atlanta, Georgia18. All nificantly associated with DNAm status14) have been participants come from an urban population with low discovered which are relatively stable throughout the life socioeconomic status and are characterized by high pre- course15. valence and severity of trauma over lifetime19–21. Environmental factors and genetic factors may thus act The PReDICT cohort consisted of 363 treatment-naive in concert to influence DNAm, however only a few studies patients who met criteria for current major depressive have investigated the joint effects of environment and disorder. All participants were recruited at three Atlanta genotype on DNAm variability. In the context of the sites associated with the Emory University School of influence of prenatal environments on DNAm at birth, Medicine, Department of Psychiatry and Behavioral Teh et al.16 as well as our group17 reported that combined Sciences22. effects of genotype and prenatal environment explain The U19 cohort consisted of 78 nonmedicated women most of the variance in umbilical cord and cord blood recruited at four academic sites in the USA (Emory DNAm. In fact, environment alone was almost never the University, Icahn School of Medicine at Mount Sinai, strongest driver, rather additive or interactive effects of Baylor College of Medicine, University of California San genotype (G) and environment (E) explained DNAm Francisco/San Francisco Veterans Affairs Medical Cen- variability best in the majority of CpGs. This may be ter). All U19 participants were untreated and had to fulfill specific of prenatal environments, where there is less time criteria for post-traumatic stress disorder (PTSD) for at for exposure. least 3 months23. Here, we aimed to expand the analysis of combined G The BerlinLCS cohort consisted of 173 children, who and E effects to a postnatal stressor (CA). We examined were recruited via child care centers, child and youth if, similar to our results in neonates, combined effects social services, child psychiatric departments, or pedia- are also stronger drivers of DNAm variation later in life tricians. Children were followed for 2.5 years with and if the proportion of explained variance varies with extensive psychometric and biological assessments. In time to exposure, i.e., whether effects of CA measured in addition, DNA from saliva samples was collected at five childhood are qualitatively or quantitatively different time points over the course of the study (every 6 months). than when measured later in life. For this purpose, we Cases were victims of one or more of the following: systematically tested main effects of CA (E = CA) and physical abuse, physical neglect, and/or emotional mal- genotype located in a 1 MB window of the CpG (G) on treatment (MT) requiring intervention by social services. DNAm as well as their additive (G + CA) and multi- The Helsinki Birth Cohort Study (HBCS)24–26 consisted plicative effects (G × CA). For each tested CpG site, we of 77 men who were evacuated to Sweden or Denmark sought the model that explained most of the DNAm unaccompanied by their parents during the Second World variability. We explored this in five independent cohorts War according to the Finnish National Archives’ regis- with a total of 1074 individuals, of whom 541 were ter27. The controls were 74 men who were not evacuated, exposed to CA. The five cohorts ranged in age from and who were matched to cases for birth year and father’s early childhood (3–5 years of age) to elderly individuals occupational status. These men donated blood for DNA (mean age of 64 years) with both retrospective self- samples in a clinical study in 2001–2004. Information on reports of CA and verified exposures by registries or these five cohorts is summarized in Table 1.
Czamara et al. Translational Psychiatry (2021)11:88 Page 3 of 11 Table 1 Demographic overview of cohorts. Cohort Na Mean age (SD) Sex (male) Ethnicity Assessment of CA CA Na (%) Tissue Methylation array GRADY 309 42.08 (12.92) 25.56% African American Self-report 148 (47.90%) Whole blood 450K PReDICT 363 39.83 (11.50) 39.67% Mixed Self-report 164 (45.18%) Whole blood 450K U19 78 39.27 (12.10) 0.00% Mixed Self-report 66 (84.62%) Whole blood 450K BerlinLCS 173 4.23 (0.79) 52.60% Caucasian Documented 86 (49.71%) Saliva EPIC HBCS 151 63.5 (2.8) 100% Finnish Documented 77 (51.00%) Whole blood 450K In GRADY, PReDICT, and U19 CA refers to moderate-to-severe ranges of CTQ scores for either sexual, physical, or emotional abuse (GRADY n = 77, PReDICT n = 79, U19 n = 14) or to moderate-to-severe scores in at least two abuse groups (GRADY n = 71, PReDICT n = 85, U19 n = 52), in BerlinLCS CA refers to maltreatment and in HBCS to evacuation and separation from parents in World War II. SD standard deviation, CA childhood adversity. a N = sample size. DNAm data (2%), significant deviation from the Hardy–Weinberg Methylation450K BeadChips in GRADY, PReDICT, U19, equilibrium (HWE, p < 10−5), or a low minor allele fre- and HBCS and by the Infinium MethylationEPIC BeadChip quency (MAF < 5%) were excluded from further analyses. for BerlinLCS (for this study we focused on the baseline Afterward, additional SNPs were imputed using IMPUTE methylation levels). Beta values were normalized using v239, the 1000 Genomes phase III sample served as functional normalization28,29. Batch effects were removed reference panel40. Imputed SNPs with a low information using ComBat30 with the sva package31. Subsequently, all content metric ( 95%, call rate > 95% for individuals and tion33. Saliva cell counts for the BerlinLCS cohort were markers (99% for markers with MAF < 5%), MAF > 1%, computed according to Smith et al.34. Smoking scores in and HWE p > 10−06. Moreover, heterozygosity, sex check, each cohort were calculated as described by Elliott and relatedness checks were performed and any dis- et al.35,36. For the BerlinLCS cohort, we computed a pre- crepancies were removed. Imputed genotype probabilities natal smoking exposure according to Richmond et al.37. were converted into best-guessed genotypes using a threshold of 0.90. SNPs were pruned to a reduced subset Genotype data of approximately independent SNPs using a repeated DNA isolation and SNP genotyping sliding window (window size of 100 kb, 5 kb shift at the In all cohorts, except BerlinLCS, DNA was isolated from end of each step) procedure with a pairwise SNP R2 blood samples (GRADY: using either the ArchivePure threshold of 0.238. DNA Blood Kit (5 Prime, Gaithersburg, MD, USA) or E.Z. N.A. Mag-Bind Blood DNA Kit (Omega Bio-tek, Nor- Environmental data cross, GA, USA), U19: using the PerkinElmer Chemagic Self-reported childhood trauma: childhood trauma 360 extraction robot). In BerlinLCS, DNA was isolated questionnaire (CTQ) from saliva samples. Genome-wide SNP genotyping was The CTQ is a psychometrically validated assessment of performed using Illumina OmniQuad (GRADY), Huma- physical, sexual, and emotional child abuse and neglect, nOmniExpress BeadChips (PReDICT and U19), Illumina using 28 self-report items41. Participants in GRADY, GSA-24 v2.0 BeadChips (BerlinLCS), and Illumina 610k PReDICT, and U19 were classified into three groups chips (HBCS, modified Illumina 610k chip by the Well- based on established cutoff scores for moderate-to-severe come Trust Sanger Institute, Cambridge, UK). exposure levels for each type of childhood abuse (CA; ≥10 for physical abuse, ≥8 for sexual abuse, ≥13 for emotional Quality control and imputation abuse)41. The first group of participants scored in the Quality control was performed in PLINK38 indepen- none-to-mild range for sexual, emotional, and physical dently in all cohorts. Samples with low genotyping rate abuse and was classified as negative for exposure, the
Czamara et al. Translational Psychiatry (2021)11:88 Page 4 of 11 second group scored in the moderate-to-severe range for Explaining variability of VMPs either sexual, physical, or emotional abuse, and the third To assess to what extent genotype, environment (CA), group scored moderate-to-severe in at least two abuse genotype and environment, as well as genotype–environment groups. interaction contributed to variation in VMPs, four different linear regression models (1–4) were tested for each over- Verified childhood trauma lapping VMP in each cohort to identify the model with the For BerlinLCS, maltreated (physical abuse, physical largest adjusted R2. neglect, or emotional MT) children were recruited via (1) Environment model (CA): VMP ~ covariates + CA. child welfare offices, child and youth social services, child (2) Genotype model (G): VMP ~ covariates + Gi. psychiatric departments, or pediatricians and corrobora- (3) Additive model (G + CA): VMP ~ covariates + Gi tion/details of MT exposure was obtained by caretaker + CA. report. Assessment and coding of maltreatment was based (4) Interaction model (G × CA): VMP ~ covariates + on42 Of the 173 children, 86 were victims of MT. The 87 Gi + CA + Gi × CA. children in the control group presented with no MT, and VMP represents the uncorrected beta value of the no other significant stressors as assessed with the pre- identified variable CpG site described above. For models school age psychiatric assessment43. (2–4), Gi is a SNP-genotype coded by the minor allele count (0, 1, 2); all pruned SNPs in a cis window of ±1 MB Childhood adversities in the HBCS around the VMP were tested sequentially and the SNP In the HBCS sample that was used for this study, 77 that presented with the largest adjusted R2 was selected individuals had been evacuated to Sweden or Denmark for further analysis. Covariates are the DNAm con- unaccompanied by their parents during the Second World founders and cohort-specific covariates described in the War according to the Finnish National Archives’ register. preceding section. In models (3) and (4) all possible SNP This experience was used as CA in comparison to the and CA combinations were tested other 74 individuals of the sample, who had not been The model with the largest adjusted R2 value, explaining evacuated. the most variance across (1)–(4), was chosen as the best model for that VMP. Statistical analyses A work flow of the general procedure is depicted in All statistical analyses were performed in each cohort Supplementary Fig. 1. independently using R version 3.5.2. Mapping VMPs to genomic regions Identification and characterization of overlapping variable VMPs were mapped to their genomic location using the R- methylated CpGs packages minfi28 and ChIPseeker44 and to their correspond- In order to correct DNAm levels for known con- ing ChromHMM states based on histone ChiP-Seq peaks founders, these were regressed out of the beta values from the Roadmap Epigenomics project derived for blood using linear regression separately for each cohort. Cov- cells (http://egg2.wustl.edu/roadmap/data/byFileType/peaks/ ariates were defined as sex, age, blood cell counts33, or consolidated/broadPeak/). saliva cell counts for BerlinLCS34, smoking scores, and Enrichment tests were performed using Fisher’s tests. genotype principal components to account for population The significance levels were set using Bonferroni correc- stratification (GRADY: the first two PCs, PReDICT, and tion according to the number of performed tests. U19 and BerlinLCS: the first five PCs, HBCS: the first three PCs). Using residuals from these models, the med- Gene-set enrichment analysis ian absolute deviation (MAD) was estimated per CpG as a VMP sites were mapped to their closest genes using the robust measure of DNAm variability within each cohort. matchGenes function in the R-package bumphunter45. The MAD score is preferred for this purpose as it is not Gene-set enrichments were tested using FUMA’s GEN- driven by outliers. We tested the 80th, 85th, 90th, and E2FUNC v1.3.546 setting the FDR adjusted p values for 95th percentile as cutoffs of the MAD score. The 80th enrichment to 0.05 and considering Gene Ontology (GO) percentile cutoff resulted in 45,962 variably methylated terms as well as tissue-specific transcripts derived from probes (VMPs) overlapping among GRADY, PReDICT, GTEx v647. A minimum number of ten genes had to and U19. These VMPs were selected for initial analysis overlap with the specific gene set. We compared enrich- and defined as overlapping VMPs. For a sensitivity ana- ments for stable across age G × CA CpGs (variably lysis, we further regressed out cohort-specific covariates methylated CpGs with the best model being G × CA (anxiety score in U19; depression score in GRADY, PRe- across all cohorts, n = 1400) and stable in adults G × CA DICT, and U19; PTSD score in GRADY and U19). CpGs (variably methylated CpGs with the best model Addition of these covariates did not influence the results. being G × CA in the three adult cohorts, but not in the
Czamara et al. Translational Psychiatry (2021)11:88 Page 5 of 11 Fig. 1 Overlapping VMPs between GRADY, PReDICT, and U19. Venn diagram of overlapping VMPs for GRADY, PReDICT, and U19 at a MAD score threshold at the 80th percentile (a). Distribution of the best models explaining variation in DNAm across the three adult cohorts. Percentage of overlapping VMPs (n = 45,962) best explained by G, CA, G + CA, or G × CA in each cohort using the highest adjusted R2 (b). Consistent best models across three adult cohorts. Percentage of overlapping VMPs (n = 45,962) best explained by the same model type (G, CA, G + CA, or G × CA) in at least two cohorts (c). other two cohorts, n = 670). We used the group of effects of cis genotype (G) and CA together. For each inconsistent CpGs (n = 5652) that showed different best cohort, we compared the adjusted R2 of four regression models across all three adult cohorts as control. For the models (CA, G, G + CA, G × CA) to find the model which enrichment, we created ten random subsets of genes best explained DNAm variation in VMPs. The adjusted R2 mapping to these inconsistent CpGs and equal in size to is well suited to determine the most predictive model as it the number of genes mapping to stable across age G × CA adjusts for the number of parameters in the model and CpGs (n = 1123 genes). only increases if the inclusion of these parameters also increases the model fit48. In all cohorts, the majority of Results VMPs was best explained by additive or interactive effects VMPs in adults with self-reported retrospective CA of G and CA (see Fig. 1b and Supplementary Fig. 2 for We first assessed which of the four models (CA, G, G + details) with CA alone being the best model only in very CA, and G × CA) explained most of the DNAm variability few VMPs. in the three adult cohorts (see Table 1) that used the CTQ Over 80% of overlapping VMPs showed a consistent for retrospective assessment of CA. CpGs with a MAD best model across at least two of the three cohorts (see score larger than the 80th MAD percentile and over- Fig. 1c and Supplementary Fig. 3). As we based our results lapping between GRADY, PReDICT, and U19 were on pruned SNPs and as all cohorts had a different ethnic defined as overlapping VMPs (n = 45,962 VMPs, see Fig. background, we matched the consistency based on best 1a). As previously described17, VMPs were enriched for model for the same CpG only and did not require that the distinct genomic features, including intergenic regions same SNPs be included in the model across cohorts. The (p < 2.20 × 10−16, OR = 1.65, Fisher’s test) and enhancers majority of VMPs (43.87%) were consistently best (p < 2. 20 × 10−16, OR = 1.87, Fisher’s test). explained by G × CA models. These results remained We examined whether interindividual differences in stable with inclusion of cohort-specific covariates, DNAm levels of overlapping VMPs were better explained including symptoms severity (see Supplementary Figs. 4 by genotype in cis (defined as 1 MB window around the and 5). For all cohorts, ΔadjR2, i.e., the difference between specific VMP), by CA (E), or by additive or interaction the adjusted R2 of the best models to the adjusted R2 of
Czamara et al. Translational Psychiatry (2021)11:88 Page 6 of 11 Fig. 2 Enrichment of overlapping VMPs with regard to ChromHMM states. Histone mark enrichment for overlapping VMPs with regard to other 450K CpG sites (above panel) (a). Histone mark enrichment for overlapping VMPs (below) with at least two consistent best G, G + CA, or G × CA models against overlapping VMPs with no consistent models. Green color indicates depletion, and red color indicates enrichment. Thick black lines around the rectangles indicate significant enrichment/depletion based on Fisher’s tests and a Bonferroni threshold of p < 4.66 × 10−04. VMPs with at least two consistent best G × CA models were enriched in repressed Polycomb (p = 5.48 × 10−19, OR = 1.19, Fisher’s test) (b). Average distance between SNP and CpG site for consistent best G (left), consistent best G + CA (middle), and consistent G × CA models (right). CpGs with at least two consistent G × CA models presented with a significantly longer distance between SNP and VMP than VMPs with other consistent models (mean absolute distance = 271.83 kb, p < 2.20 × 10−16, Wilcoxon’s test) (c). the next best model, was highest for VMPs where G × CA HBCS, a cohort of 151 elderly individuals, of which 77 had was chosen as best model and significantly larger as been evacuated to Sweden or Denmark during World compared to G and G + CA models (see Supplementary War II. Fig. 6A–C, p < 2.2 × 10−16 for all cohorts, Wilcoxon’s To base the comparison of best models across the test). VMPs with at least two consistent best G × CA developmental trajectory on the same CpG sites in all models were enriched in repressed Polycomb (see Fig. 2a, cohorts, we used the overlap of VMPs identified in b) and presented with a significantly longer distance GRADY/PReDICT/U19 and CpGs available in BerlinLCS between SNP and VMP than VMPs with other consistent as well as in HBCS. This resulted in 36,091 VMPs avail- models (see Fig. 2c). able in all five cohorts. Even with this more restricted set of VMPs, the best models remained combined models of VMPs across the life course and with documented G and CA (see Fig. 3a and Supplementary Fig. 7). adversity There were 1672 VMPs (5.4%) with a consistent best To test if the identified combined effects of genotype model across all five cohorts (see Fig. 3b). Among these and CA are stable across the life course and also observed stable VMPs, 83.73% had G × CA as the best model (n = with documented and not only self-reported adversity, we 1400, “stable across age G × CA CpGs”). used two additional cohorts. The BerlinLCS cohort, In comparison, only 670 VMPs were consistently best consisting of 173 DNAm saliva samples of children aged explained by G × CA across the three adult cohorts but between 3 and 5 years, of which 86 were recruited from neither in the BerlinLCS nor in the HBCS (“stable in social services and other child welfare centers due to MT adults G × CA CpGs”). Both groups of CpGs were sig- or neglect. At the other end of the age spectrum is the nificantly enriched (p < 0.002 for both groups of CpGs) for
Czamara et al. Translational Psychiatry (2021)11:88 Page 7 of 11 Fig. 3 Distribution of the best models explaining variation in DNAm across the five cohorts. Percentage of overlapping VMPs (n = 36,091) best explained by G, CA, G + CA, or G × CA in each cohort using the highest adjusted R2 For HBCS, 49.03% of VMPs presented with best model G × CA, 37.40% with best model G, and 13.55% with best model G + CA, and for BerlinLCS, 61.26% of VMPs presented with best model G × CA, 29.51% with best model G, and 8.75% with G + CA (a). Consistency of best model explaining DNAm variability in VMPs across five cohorts. Percentage of VMPs with the same model explaining variation in DNAm best overlapping between the cohorts stratified by model type (b). eQTM sites49 as compared to all 450K CpGs (based on Stable across age G × CA CpGs were significantly enri- 10,000 randomly drawn CpG sets). ched for 24 GO terms, and stable in adults G × CA CpGs were significantly enriched for 35 GO terms in the bio- Tissue specificity and functions of genes linked to stable logical processes categories (all FDR-corrected p values < G × CA CpGs 0.05). While some of these processes overlapped, stable We annotated each VMP to the closest gene and used across age G × CA CpGs were selectively significantly the list of unique genes to test for differences in gene-set enriched in categories reflecting processes related to enrichment using FUMA45. As a background set, we neuron development and synapse organization (see Sup- mapped the 36,091 VMPs which were used in the analysis plementary Fig. 8). Stable across age G × CA CpGs were across all five cohorts representing 10,308 unique genes. significantly enriched for the cellular component terms We tested gene lists derived from stable across age G × “neuron part” and “neuron projection” and the molecular CA CpGs and stable in adults G × CA CpGs for enrich- function terms “DNA binding transcription factor activ- ment in differentially expressed gene sets across different ity,” “sequence specific DNA binding,” and “sequence tissues using the GTEx database46. The genes mapping to specific double DNA binding” (all FDR-corrected p values stable across age G × CA CpGs (1400 CpGs mapping to < 0.05). Stable in adults G × CA CpGs had no cellular 1123 unique genes) were significantly enriched for genes component or molecular function term significantly specific to brain (FDR-corrected p value = 2.85 × 10−04) enriched. Non-consistent CpGs showed no significant but not to other tissues. This enrichment for brain tran- consistent enrichments for any GO terms. scripts was not observed for genes mapped to stable in The analyses investigating tissue-specific gene expres- adults G × CA CpGs (670 CpGs mapping to 584 unique sion as well as GO terms point to the fact that stable G × genes, FDR-corrected p value = 1.00 × 10−01). As control, CA VMPs could have a distinct functional relevance, we compared these to enrichments from random subsets related to development and brain function. from the list of inconsistent CpGs that showed different best models across all three adult groups (see Fig. 1c, Discussion n = 5652). We randomly picked groups of 1123 genes In this study, we investigated the contributions of (which is the number of genes matching to stable across exposure to CA, genotype in cis as well as their additive age G × CA CpGs) matching to these CpGs. None of these and interactive effects on interindividual variability of subsets showed significant tissue-specific enrichments DNAm in variable CpGs in peripheral tissues. Inde- (see Fig. 4). pendent of the age of the cohort, we observed that
Czamara et al. Translational Psychiatry (2021)11:88 Page 8 of 11 Fig. 4 Enrichment of stable across age G × CA, stable in adults G × CA and non-consistent CpGs for GTEx upregulated gene sets. The x-axis denotes the −log10(p value), and the y-axis the specific tissue type. Significant enrichments (based on FDR correction of 0.05) are depicted in blue, and nonsignificant in gray. For the group of nonconsistent CpGs, we randomly picked ten subsets of genes mapping to nonconsistent CpGs, equal in number to the genes mapping to stable across age G × CA CpGs. None of these subsets presented with significant adjusted p values for enrichment. In the plot, median p values across all subsets are depicted. models combining G and CA best explained DNAm results for the effect of CA alone50,51. Indeed, very few of variability in the majority of CpGs, suggesting that the the overlapping VMPs were best explained by environ- extent of the combined impact on DNAm is similar for ment independent of genotype (
Czamara et al. Translational Psychiatry (2021)11:88 Page 9 of 11 The proportions of best G × CA models which we sufficient power to detect consistent effect directions after identified in our cohorts are analogous to the ~70% of correction for multiple testing. In order to have sufficient variably methylated regions that were shown to be best power to assess specific SNP × CA effects surviving mul- explained by integrated genetic and prenatal environment tiple testing correction larger, ethnically homogenous effects in neonates16,17. Our findings corroborate that cohorts are necessary. All reported interactions are sta- genotype acts as an important moderating influence on tistical interactions and limited to a cis window around main environmental effects on DNAm also in the context the CpG site. Further experiments are required to assess of a postnatal stressor. In fact, G × CA was the model that whether these would also reflect biological/mechanistic best explained DNAm variance in the majority (83%) of interactions. Along the same lines, much larger cohorts CpGs that had the same best model across cohorts. The will be needed to assess potential trans effects. stability of the best model for these CpGs cuts across a Furthermore, strategies to reduce the number of tests, large age range from early childhood to late adulthood, i.e., SNPs, are needed. Possible methods include the pre- across tissue (blood and saliva), different DNAm varia- filtering for functionally relevant SNPs using deep learn- bility thresholds, psychiatric diagnoses, as well as self- ing algorithm such as DeepSEA for instance54, or reported retrospective vs. verified CA. Additional studies experimental approaches such as SNPs disrupting tran- in longitudinal cohorts with repeated measures of DNAm scription factor binding or chromatin structure55. How- in the same individuals are needed to confirm such sta- ever, our results can highlight those CpGs that are most bility across time. influenced by the combination of a genetic variant in cis The VMPs with stable G × CA models mapped to genes and CA in a consistent manner across five cohorts and with distinct functionality. In contrast to VMPs that only hence are environmentally sensitive. While our results showed the G × CA model in all adult cohorts, but not highlight convergent effects of CA across ages, we did not more, the genes mapped to stable G × CA VMPs across have sufficient power to identify effects specific to certain five cohorts were enriched for transcripts specific to the forms of CA or neglect, or related to specific timing of the brain (see Fig. 4) as well as to GO terms related to brain exposure. Our analysis provides a possible framework of development and synapse function. Importantly, G × CA how specific combined effects of genotype and environ- VMPs were also enriched for eQTMs, indicating that any ment on DNAm might be studied in the future. factors influencing variability at these loci will have effects In conclusion, in this study, we show that CA has a on gene transcription. Our samples size was under- larger impact on DNAm in combination with genetic powered to reliably detect consistent effect directions of variation than by itself. Inclusion of information on SNP × CA interactions after correction for multiple test- genetic variation may thus help to uncover impact of ing. For this larger, ethnically homogenous cohorts will be environmental factors on epigenetic measures that would necessary. Nonetheless, our results can highlight those otherwise remain concealed. Such combined approaches CpGs that are most influenced by the combination of a could support to identify gene pathways relevant to risk or genetic variant in cis and CA in a consistent manner resilience following exposure to CA. across age, unmasking an epigenetic signature of CA. Consistent with the previous literature14, cis meQTLs Acknowledgements The authors would like to thank all study participants as well as all involved in were clearly apparent in the five independent cohorts with the HBCS. The authors also acknowledge the Genetics Core of the Wellcome diverse ethnic backgrounds. Although it is known that Trust Clinical Research Facility (Edinburgh, UK) which used the 450K array for Caucasian and African American meQTLs significantly the HBCS samples. The PReDICT study was supported by the following National Institutes of Health grants: P50 MH077083; R01 MH080880; UL1 overlap, 14–45% shows specificity for ethnicity52. In our RR025008; and M01 RR0039. Funding for the U19 study was provided from a analysis, we found converging evidence that G × CA grant from the National Institute of Mental Health, U19 MH069056, with interactions best explain variability of DNAm across dif- additional support from VA CSRD Project ID 09S-NIMH-002. The GRADY study was supported by a grant from the National Institute of Mental Health, R01 ferent ethnicities, but this does not exclude ancestry MH071537-01A1. HBCS has been supported by grants from the British Heart specific effects. To identify such specific interaction Foundation, Academy of Finland, the Finnish Diabetes Research Society, effects, larger samples for each ethnicity are required. Folkhälsan Research Foundation, Novo Nordisk Foundation, Finska Läkaresällskapet, Signe and Ane Gyllenberg Foundation, University of Helsinki, Finally, we want to note the limitations of this study. Ministry of Education, Ahokas Foundation, and the Emil Aaltonen Foundation. First, we restricted our analyses to specific DNAm array The BerlinLCS study was funded by a research grant from the German Federal contents and to potentially functional CpGs, i.e., VMPs, Ministry of Education and Research (BMBF) to CMH and EBB (FKZ 01KR1301B). AJD has received a Scottish Senior Clinical Fellowship (SCD/09). so that we do not reflect every CpG tested on the array. Second, we used the adjusted R2 as main criterion for Author details model fit as we were mainly interested in explaining 1 Department of Translational Research in Psychiatry, Max Planck Institute of variability of DNAm. A variety of other model selection Psychiatry, Kraepelinstr. 2-10, 80804 Munich, Germany. 2Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research, criteria are available53 and which one to choose is an Amsterdam Neuroscience, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, ongoing debate. Third, our analysis does not provide The Netherlands. 3Department of Psychology and Logopedics, Faculty of
Czamara et al. Translational Psychiatry (2021)11:88 Page 10 of 11 Medicine, University of Helsinki, Haartmaninkatu 3, 00014 Helsinki, Finland. consultant for Aptinyx, Greenwich Biosciences, Myriad Neuroscience, Otsuka, 4 Sopra Steria, 80804 Munich, Germany. 5University/British Heart Foundation and Sophren Therapeutics. HSM receives consulting and intellectual property Centre for Cardiovascular Science, Queen’s Medical Research Institute, licensing fees from Abbott Neuromodulation. SJM is supported through the Edinburgh BioQuarter, University of Edinburgh, 47 Little France Crescent, use of facilities and resources at the Michael E. Debakey VA Medical Center, Edinburgh EH16 4TJ, UK. 6Institute for Molecular Medicine Finland (FIMM), Houston, TX, and has served as a consultant to Alkermes, Allergan, Signant University of Helsinki, 00014 Helsinki, Finland. 7Department of Child and Health, Clexio Biosciences, Janssen, Neurocrine, Perception Neurosciences, Adolescent Psychiatry, Charité—Universitätsmedizin Berlin, Campus Virchow, Praxis Precision Medicines, Sage Therapeutics, and Seelos Therapeutics. He has 13353 Berlin, Germany. 8Department of Psychiatry, Dell Medical School, received research support from Biohaven Pharmaceuticals and VistaGen University of Texas at Austin, 1601 Trinity St, Austin, TX 78712, USA. Therapeutics. In the past 5 years, DVI has received consulting fees from 9 Department of Psychiatry and Behavioral Sciences, Emory University School of Alkermes, Axsome, Centers for Psychiatric Excellence, Global Medical Medicine, 12 Executive Park Dr, Atlanta, GA 30329, USA. 10Icahn School of Education, MYnd Analytics (CNS Response), Jazz, Lundbeck, Otsuka, Precision Medicine at Mount Sinai, 1 Gustave L. Levy PI, New York, NY 10029, USA. Neuroscience, Sage, and Sunovion, and has received research support 11 Menninger Department of Psychiatry and Behavioral Sciences, Baylor College (through his academic institutions) from Alkermes, Astra Zeneca, BrainsWay, of Medicine Mental Health Care Line, Michael E. Debakey VA Medical Center, LiteCure, NeoSync, Roche, Shire, and Otsuka. TCN has received study Houston, TX, USA. 12NYU School of Medicine and Nathan Kline Institute, New medication from Corcept Therapeutics and served as a consultant for Jazz York, NY, USA. 13Departments of Psychiatry and Neurology, University of Pharmaceuticals. Over the last 5 years, DVI has received consulting fees from California, San Francisco, CA, USA. 14Charité–Universitätsmedizin Berlin, Axsome, Alkermes, Centers of Psychiatric Excellence, MYnd Analytics (CNS Corporate Member of Freie Universität Berlin, Humboldt-Universität zu Berlin, Response), Jazz, Lundbeck, Precision Neuroscience, Otsuka, and Sunovion, and Berlin Institute of Health (BIH), Institute of Medical Psychology, Luisenstraße 57, has received research support (through his academic institutions) from 10117 Berlin, Germany. 15Turku Institute for Advanced Studies, University of Alkermes, Astra Zeneca, BrainsWay, LiteCure, NeoSync, Roche, and Shire. EBB is Turku, 20500 Turku, Finland. 16Department of General Practice and Primary the coinventor of FKBP5: a novel target for antidepressant therapy, European Health Care, Helsinki University Hospital, University of Helsinki, 00290 Helsinki, Patent no. EP 1687443 B1, and receives a research grant from Böhringer Finland. 17Folkhälsan Research Center, 00250 Helsinki, Finland. 18Department of Ingelheim for a collaboration on functional investigations of FKBP5. Obstetrics and Gynecology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore, Singapore. 19Singapore Institute for Clinical Sciences, Singapore, Singapore. 20Mailman Research Center, 115 Mill St., Publisher’s note Mailstop 339, Belmont, MA 02478, USA. 21Faculty of Health Sciences, Simon Springer Nature remains neutral with regard to jurisdictional claims in Fraser University, 8888 University Drive, Burnaby, BC, Canada. 22BC Children’s published maps and institutional affiliations. Hospital Research Institute, Vancouver, BC, Canada Supplementary Information accompanies this paper at (https://doi.org/ Funding 10.1038/s41398-020-01147-z). Open Access funding enabled and organized by Projekt DEAL. Received: 19 November 2020 Revised: 2 December 2020 Accepted: 4 Data availability December 2020 Due to ethical issues and consent, datasets from PReDICT, U19, BerlinLCS, and HBCS analyzed during the current study are not publicly available. Interested researchers can obtain a deidentified dataset after approval from the respective study boards. Data requests may be subject to further review by the national register authority and the ethical committees (HBCS). The raw methylation data and all related phenotypes for the GRADY cohort have been References deposited into NCBI GEO (GSE72680). 1. Kessler, R. C. et al. Childhood adversities and adult psychopathology in the WHO World Mental Health Surveys. Br. J. Psychiatry 197, 378–385 Conflict of interest (2010). DC, ET, JT, JM, YA, AJD, BK, AP, SMW, BK, DVI, CMH, JL, JGE, KR, KJR, and NP 2. Kosidou, K. et al. Socioeconomic status and risk of psychological distress and report no financial disclosures. CBN: Research/Grants: National Institutes of depression in the Stockholm Public Health Cohort: a population-based study. Health; consulting (last 3 years): Xhale, Takeda, Taisho Pharmaceutical Inc., J. Affect. Disord. 134, 160–167 (2011). Signant Health, Sunovion Pharmaceuticals Inc., Janssen Research & 3. Luby, J. L., Barch, D., Whalen, D., Tillman, R. & Belden, A. Association between Development LLC, Magstim, Inc., Navitor Pharmaceuticals, Inc., Sunovion, TC early life adversity and risk for poor emotional and physical health in ado- MSO, Inc., Intra-Cellular Therapies, Inc., EMA Wellness, Gerson Lehrman Group, lescence: a putative mechanistic neurodevelopmental pathway. JAMA Pediatr. and Acadia Pharmaceuticals; stockholder: Xhale, Celgene, Seattle Genetics, 171, 1168–1175 (2017). AbbVie, OPKO Health, Inc., Antares, BI Gen Holdings, Inc., Corcept Therapeutics, 4. Molnar, B. E., Buka, S. L. & Kessler, R. C. Child sexual abuse and subsequent TC MSO, Inc., Trends in Pharma Development, LLC, and EMA Wellness; scientific psychopathology: results from the National Comorbidity Survey. Am. J. Public advisory boards: American Foundation for Suicide Prevention (AFSP), Brain and Health 91, 753–760 (2001). Behavior Research Foundation, Xhale, Anxiety Disorders Association of America 5. Teicher, M. H. & Khan, A. Childhood maltreatment, cortical and amygdala (ADAA), Skyland Trail, Signant Health, and Laureate Institute for Brain Research morphometry, functional connectivity, laterality, and psychopathology. Child (LIBR), Inc.; board of directors: AFSP, Gratitude America, ADAA, and Xhale Smart, Maltreatment 24, 458–465 (2019). Inc.; income sources or equity of $10,000 or more: American Psychiatric 6. Pesonen, A. K. et al. Childhood separation experience predicts HPA axis hor- Publishing, Xhale, Signant Health, CME Outfitters, Intra-Cellular Therapies, Inc., monal responses in late adulthood: a natural experiment of World War II. Magstim, and EMA Wellness; patents: Method and devices for transdermal Psychoneuroendocrinology 35, 758–767 (2010). delivery of lithium (US 6,375,990B1), Method of assessing antidepressant drug 7. Segerstrom, S. C. & Miller, G. E. Psychological stress and the human immune therapy via transport inhibition of monoamine neurotransmitters by ex vivo system: a meta-analytic study of 30 years of inquiry. Psychol. Bull. 130, 601–630 assay (US 7,148,027B2), and Compounds, Compositions, Methods of Synthesis, (2004). and Methods of Treatment (CRF Receptor Binding Ligand) (US 8,551, 996 B2). 8. Szyf, M. & Bick, J. DNA methylation: a mechanism for embedding early life WEC is a board member of Hugarheill ehf, an Icelandic company dedicated to experiences in the genome. Child Dev. 84, 49–57 (2013). the prevention of depression, and he receives book royalties from John Wiley 9. Gutierrez-Arcelus, M. et al. Passive and active DNA methylation and the & Sons. His research is also supported by the Mary and John Brock Foundation interplay with genetic variation in gene regulation. Elife 2, e00523 (2013). and the Fuqua family foundations. He is a consultant to the George West 10. Cecil, C. A. M., Zhang, Y. & Nolte, T. Childhood maltreatment and DNA Mental Health Foundation and is a member of the Scientific Advisory Board of methylation: a systematic review. Neurosci. Biobehav. Rev. 112, 392–409 the ADAA and the AIM for Mental Health Foundation. BWD has received (2020). research support from Acadia, Aptinyx, Axsome, Compass Pathways, 11. Ziller, M. J. et al. Charting a dynamic DNA methylation landscape of the Intracellular Therapies, Janssen, Sage, and Takeda. He has served as a human genome. Nature 500, 477–481 (2013).
Czamara et al. Translational Psychiatry (2021)11:88 Page 11 of 11 12. Gu, J. et al. Mapping of variable DNA methylation across multiple cell types 33. Houseman, E. A. et al. DNA methylation arrays as surrogate measures of cell defines a dynamic regulatory landscape of the human genome. G3 6, mixture distribution. BMC Bioinform. 13, 86 (2012). 973–986 (2016). 34. Smith, A. K. et al. DNA extracted from saliva for methylation studies of psy- 13. Portela, A. & Esteller, M. Epigenetic modifications and human disease. Nat. chiatric traits: evidence tissue specificity and relatedness to brain. Am. J. Med. Biotechnol. 28, 1057–1068 (2010). Genet. B Neuropsychiatr. Genet. 168B, 36–44 (2015). 14. Gibbs, J. R. et al. Abundant quantitative trait loci exist for DNA methylation and 35. Elliott, H. R. et al. Differences in smoking associated DNA methylation patterns gene expression in human brain. PLoS Genet. 6, e1000952 (2010). in South Asians and Europeans. Clin. Epigenetics 6, 4 (2014). 15. Gaunt, T. R. et al. Systematic identification of genetic influences on methyla- 36. Zeilinger, S. et al. Tobacco smoking leads to extensive genome-wide tion across the human life course. Genome Biol. 17, 61 (2016). changes in DNA methylation. PLoS ONE 8, https://doi.org/10.1371/journal. 16. Teh, A. L. et al. The effect of genotype and in utero environment on inter- pone.0063812 (2013). individual variation in neonate DNA methylomes. Genome Res. 24, 1064–1074 37. Richmond, R. C., Suderman, M., Langdon, R., Relton, C. L. & Davey Smith, G. (2014). DNA methylation as a marker for prenatal smoke exposure in adults. Int J. 17. Czamara, D. et al. Integrated analysis of environmental and genetic influences Epidemiol. 47, 1120–1130 (2018). on cord blood DNA methylation in new-borns. Nat. Commun. 10, 2548 (2019). 38. Purcell, S. et al. PLINK: a tool set for whole-genome association and 18. Davis, R. G., Ressler, K. J., Schwartz, A. C., Stephens, K. J. & Bradley, R. G. population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007). Treatment barriers for low-income, urban African Americans with undiag- 39. Howie, B. N., Donnelly, P. & Marchini, J. A flexible and accurate genotype nosed posttraumatic stress disorder. J. Trauma Stress 21, 218–222 (2008). imputation method for the next generation of genome-wide association 19. Gillespie, C. F. et al. Trauma exposure and stress-related disorders in inner city studies. PLoS Genet. 5, e1000529 (2009). primary care patients. Gen. Hosp. Psychiatry 31, 505–514 (2009). 40. Genomes Project, C. et al. A map of human genome variation from 20. Zannas, A. S. et al. Lifetime stress accelerates epigenetic aging in an urban, population-scale sequencing. Nature 467, 1061–1073 (2010). African American cohort: relevance of glucocorticoid signaling. Genome Biol. 41. Bernstein, D. P. et al. Initial reliability and validity of a new retrospective 16, 266 (2015). measure of child abuse and neglect. Am. J. Psychiatry 151, 1132–1136 (1994). 21. Zannas, A. S. et al. Correction to: lifetime stress accelerates epigenetic aging in 42. Barnett, D., Manly, J. T. & Cicchetti, D. In Child abuse, child development, and an urban, African American cohort: relevance of glucocorticoid signaling. social policy (Ciccetti, D. & Toth, S. L. eds) (Ablex, 1993). Genome Biol. 19, 61 (2018). 43. Egger, H. L. & Angold, A. In Handbook of Infant, Toddler, and Preschool Mental 22. Dunlop, B. W. et al. Predictors of remission in depression to individual and Health Assessment (eds DelCarmen-Wiggins, R. & Carter, A. S.) 223–243 (Oxford combined treatments (PReDICT): study protocol for a randomized controlled University Press, 2004). trial. Trials 13, 106 (2012). 44. Yu, G., Wang, L. G. & He, Q. Y. ChIPseeker: an R/Bioconductor package for ChIP 23. Dunlop, B. W. et al. Corticotropin-releasing factor receptor 1 antagonism is peak annotation, comparison and visualization. Bioinformatics 31, 2382–2383 ineffective for women with posttraumatic stress disorder. Biol. Psychiatry 82, (2015). 866–874 (2017). 45. Jaffe, A. E. et al. Bump hunting to identify differentially methylated regions in 24. Barker, D. J., Osmond, C., Forsen, T. J., Kajantie, E. & Eriksson, J. G. Trajectories of epigenetic epidemiology studies. Int J. Epidemiol. 41, 200–209 (2012). growth among children who have coronary events as adults. N. Engl. J. Med. 46. Watanabe, K., Taskesen, E., van Bochoven, A. & Posthuma, D. Functional 353, 1802–1809 (2005). mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 25. Eriksson, J. G., Osmond, C., Kajantie, E., Forsen, T. J. & Barker, D. J. Patterns of 1826 (2017). growth among children who later develop type 2 diabetes or its risk factors. 47. Consortium, G. T. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. Diabetologia 49, 2853–2858 (2006). 45, 580–585 (2013). 26. Räikkönen, K. et al. Infant growth and hostility in adult life. Psychosom. Med. 70, 48. Theil, H. Economic Forecasts and Policy, 213 (North-Holland Pub. Co., Amster- 306–313 (2008). dam, 1961). 27. Khulan, B. et al. Epigenomic profiling of men exposed to early-life stress 49. Bonder, M. J. et al. Disease variants alter transcription factor levels and reveals DNA methylation differences in association with current mental state. methylation of their binding sites. Nat. Genet. 49, 131–138 (2017). Transl. Psychiatry 4, e448 (2014). 50. Marzi, S. J. et al. Analysis of DNA methylation in young people: limited evi- 28. Aryee, M. J. et al. Minfi: a flexible and comprehensive bioconductor package dence for an association between victimization stress and epigenetic variation for the analysis of infinium DNA methylation microarrays. Bioinformatics 30, in blood. Am. J. Psychiatry 175, 517–529 (2018). 1363–1369 (2014). 51. Lang, J. et al. Adverse childhood experiences, epigenetics and telomere length 29. Fortin, J. P. et al. Functional normalization of 450k methylation array data variation in childhood and beyond: a systematic review of the literature. Eur. improves replication in large cancer studies. Genome Biol. 15, 503 (2014). Child Adolesc. Psychiatry, https://doi.org/10.1007/s00787-019-01329-1 (2019). 30. Johnson, W. E., Li, C. & Rabinovic, A. Adjusting batch effects in microarray 52. Smith, A. K. et al. Methylation quantitative trait loci (meQTLs) are consistently expression data using empirical Bayes methods. Biostatistics 8, 118–127 detected across ancestry, developmental stage, and tissue type. BMC Genom. (2007). 15, 145 (2014). 31. Leek, J. T., Johnson, W. E., Parker, H. S., Jaffe, A. E. & Storey, J. D. The sva package 53. Zhang, P. Inference after variable selection in linear regression models. Bio- for removing batch effects and other unwanted variation in high-throughput metrika 79, 741–746 (1992). experiments. Bioinformatics 28, 882–883 (2012). 54. Zhou, J. & Troyanskaya, O. G. Predicting effects of noncoding variants with 32. Chen, Y. A. et al. Discovery of cross-reactive probes and polymorphic CpGs in deep learning-based sequence model. Nat. Methods 12, 931–934 (2015). the Illumina Infinium HumanMethylation450 microarray. Epigenetics 8, 55. Schoenfelder, S. & Fraser, P. Long-range enhancer-promoter contacts in gene 203–209 (2013). expression control. Nat. Rev. Genet. 20, 437–455 (2019).
You can also read