Regulatory Element by Guanine and Adenine Ligation-Mediated Polymerase Chain Reaction
←
→
Page content transcription
If your browser does not render page correctly, please read the page content below
MOLECULAR AND CELLULAR BIOLOGY, May 1992, p. 2135-2142 Vol. 12, No. 5 0270-7306/92/052135-08$02.00/O Copyright C© 1992, American Society for Microbiology In Vivo Footprinting of the Human o-Globin Locus Upstream Regulatory Element by Guanine and Adenine Ligation- Mediated Polymerase Chain Reaction ERICH C. STRAUSS,"12 NANCY C. ANDREWS,' DOUGLAS R. HIGGS,3 AND STUART H. ORKIN',3* Division of Hematology/Oncology, Children's Hospital and the Dana-Farber Cancer Institute, and Department of Pediatrics, Harvard Medical School,' Harvard-MIT Division of Health Sciences and Technology, 2 and Howard Hughes Medical Institute, 3 Boston, Massachusetts 02115, and MRC Molecular Haematology Unit, Institute of Molecular Medicine, John Radcliffe Hospital, Headington, Oxford, England4 Received 4 November 1991/Accepted 12 February 1992 Downloaded from http://mcb.asm.org/ on March 1, 2021 by guest A major regulatory element required for expression of the human ao-globin genes is located 40 kb upstream of the embryonic C-globin gene. To understand how this and other locus control region (LCR) elements contribute to high-level expression in erythroid cells, we have performed high-resolution, in vivo dimethyl sulfate footprinting. In addition, we have modified the dimethyl sulfate-based ligation-mediated polymerase chain reaction in vivo footprinting procedure to permit the assessment of interactions at guanine and adenine residues, rather than guanines alone. In vivo footprinting of the human ca-LCR element carried on chromosome 16 in a mouse erythroleukemia cell environment revealed protein occupancy at GATA-1, AP-1/NF-E2, and CACC/GGTGG motifs, specific differences compared with in vitro protein binding, and distinct changes in one region upon dimethyl sulfoxide-induced cellular maturation. No protein contacts were detected in nonexpress- ing hepatoma cells. In addition, we have demonstrated that two AP-1 motifs in the a-LCR element which are occupied in vivo bind purified mouse NF-E2 protein in vitro. Our data suggest that three proteins, GATA-1, NF-E2, and unknown CACC/GGTGG factors, are minimally required as DNA-binding proteins for the function of LCR-like elements. The juxtaposition and interaction of these factors with each other, and with accessory proteins not directly in contact with DNA, are likely to account for the relative position independence of the upstream globin regulatory elements. Expression of ox- and ,-like globin genes in developing ,B-globin clusters in thalassemic individuals with deletions erythroid cells is dependent on the integrity of distant, encompassing these upstream regulatory elements provides upstream regulatory elements, termed locus control regions conclusive evidence of their role in vivo (7, 13). (LCRs) (reviewed in reference 28). As first shown in studies The functional activity of LCR elements is thought to be of the human ,-globin gene complex, the LCR corresponds mediated through their interaction with both cell-specific and to a region of chromatin exhibiting erythroid-specific DNase ubiquitous nuclear proteins, perhaps in chromatin regions I hypersensitivity (36). Linkage of the 13-LCR to globin, or associated with nuclear matrix. To investigate the properties unrelated, genes permits high-level, position-independent of LCR elements and the basis of cell-specific gene expres- erythroid expression of the linked gene in transgenic mice or sion, DNA binding of nuclear proteins has been examined in cultured cells (12). Indirect evidence suggests that the LCR vitro (15, 22, 26, 27, 30, 31, 33, 34). These studies have acts in synergy with promoter sequences to enhance tran- revealed multiple binding sites of several types that include scription (2). Competition of individual genes in the 3-globin motifs which bind the erythroid transcription factor GATA-1 cluster for productive interactions with the ,B-LCR may (reviewed in reference 28) (or closely related family mem- contribute to differential regulation of the human fetal and bers [37]), AP-1 and/or the erythroid AP-1 like activity (15, adult globin (-y- and P-globin) genes (5, 8). Within the entire 22, 26, 27, 30, 34) termed NF-E2 (21), and proteins recog- P-LCR, discrete regions of approximately 250 to 500 bp nizing CACC or GGTGG sequences (30, 33). However, in encompassing the individual hypersensitive sites display vitro studies are limited in several respects. They may detect partial activity (30, 31, 34). Interspecies sequence compari- binding to sites that are unavailable in native chromatin and sons demonstrate extraordinary sequence and spatial con- fail to reveal sites which bind proteins that are present in low servation of these P-LCR segments in humans, mice, and concentrations in nuclear extracts or are displaced by more goats (16, 23). Recently, Higgs and colleagues have identi- abundant proteins with overlapping specificities. Further- fied a region located 40 kb upstream of the embryonic more, in vitro analysis is insensitive to chromatin structure. 4-globin gene that appears to serve as an LCR-like element For these reasons, we have used in vivo dimethyl sulfate for the ot-globin cluster (14, 15). Like the 1-LCR, this (DMS) footprinting (9, 25) as a complementary method for element is associated with erythroid-specific DNase I hyper- dissection of LCR elements. In an effort to derive maximal sensitivity, and it is capable of directing high-level a-globin insights from this approach, we have analyzed a 350-bp gene expression in stable erythroid cell lines and transgenic region that appears to contain the major upstream regulatory mice (14, 15). The inactivation of genes within the ax- or activity of the ao-globin cluster (14, 15). In addition, we have modified the ligation-mediated polymerase chain reaction (LMPCR) in vivo footprinting procedure of Mueller and * Corresponding author. Wold (25) to permit the analysis of DNA-protein interactions 2135
2136 STRAUSS ET AL. MOL. CELL. BIOL. at both guanine and adenine residues, rather than at guanines Guanine-specific piperidine cleavage of in vivo-methyl- alone; we term this modified method GA-LMPCR in vivo ated DNA was done according to Maxam and Gilbert (18). footprinting. Piperidine cleavage products were precipitated in 0.3 M GA-LMPCR in vivo footprinting detects occupancy of sodium acetate with 2.5 volumes of ethanol. Trace amounts specific protein binding sites in the a-globin regulatory of piperidine were removed by repeated lyophilizations in a element. The in vivo usage of binding sites differs in detail SpeedVac concentrator. Guanine-specific cleavage products from the pattern of proteins binding to naked DNA in vitro, were resuspended in water at approximately 1 ,ug/ul. and our results highlight a subregion that may reflect local- In vivo footprinting. LMPCR genomic footprinting was ized alterations of chromatin structure upon induction of performed essentially as described by Mueller and Wold erythroid maturation. With our demonstration that the eryth- (25). After exponential amplification of the ligation products, roid factor NF-E2 can bind the AP-1 motifs of the a-globin the 100-pA reaction volume was transferred to a 1.5-ml tube regulatory element, the composite in vitro and in vivo data containing 295 pA of Taq stop buffer (260 mM sodium acetate allow us to infer that GATA-1, NF-E2, and unknown CACC/ [pH 7.0], 10 mM Tris-HCI [pH 7.5], 4 mM EDTA [pH 8.0], GGTGG factor(s) are minimally required as DNA-binding 35 ,ug of tRNA per ml); the solution was extracted once with proteins for function of LCR-like elements. It is likely that phenol-chloroform and precipitated with ethanol. The am- the juxtaposition and interaction of these factors with each Downloaded from http://mcb.asm.org/ on March 1, 2021 by guest plification reaction products were resuspended in 100 pI of other, and with accessory proteins not directly contacting water; 50 ,u was used in a three-cycle labeling reaction. The DNA, account for the functional attributes of the regulatory labeling reaction was processed as described above and elements of the globin LCRs. resuspended in 10 pI of formamide-dye; 1 to 2 pA was applied to denaturing polyacrylamide gels. Gels were dried and MATERIALS AND METHODS exposed to Kodak X-AR film with an intensifying screen at Cell lines and cell culture. The interspecies human/mouse -70°C for 15 to 45 h. somatic cell hybrid line J3-8B was established by the method Oligonucleotide primers. Oligonucleotides were synthe- of Deisseroth and Hendrick (6) as modified by Zeitlin and sized on an Applied Biosystems DNA synthesizer model Weatherall (38). Cells were cultured in Ham's F12 medium 380B and gel purified prior to use. For top- and bottom- supplemented with 15% fetal calf serum; J3-8B cells contain- strand LMPCR genomic footprinting analysis, specific ing human chromosome 16 were selected with methotrexate primer sets were used for the Sequenase extension reaction (10 ,M), adenine (0.1 mM), and thymidine (30 ,M). HepG2 (primer 1), the PCR amplification reaction (primer 2), and the cells were cultured in Dulbecco's modified Eagle's medium labeling reaction (primer 3). Primer sets for top-strand anal- with 10% fetal calf serum. K562 cells (32) were cultured in ysis were as follows: RPMI 1640 medium containing 10% fetal calf serum. S.1: 1, CTTCAGCTCCAGATGAAGAACG Methylation and isolation of genomic DNA. J3-8B cells 2, AACGTATTTACTGTCTGGGTCAGGC were induced by addition of 1.5% dimethyl sulfoxide 3, CTGTCTGGGTCAGGCTTTGCCCCTG (DMSO) 48 h prior to in vivo methylation. In vivo methyla- S.2: 1, TCCAGAAGCAOTGAGTCATG tion of cultured cells with DMS was done by the procedure 2, GAGTCATGGTTGGCCCAGTTATCTG of Becker and Schutz (3). In vitro DMS methylation of 3, GGCCCAGTTATCTGCTCCCTCAAGTG control protein-free DNA was performed according to Primer sets for bottom-strand analysis were as follows: Maxam and Gilbert (18). In vivo-methylated and control AS.i: 1, AAGAGCTCCTTCTGCAACCAT protein-free genomic DNAs were prepared by lysing cells in 2, CCATGATGACTGGGTCAAAGGACAG harvest buffer (200 mM Tris-HCl [pH 7.5], 100 mM EDTA, 3, TGGGTCAAAGGACAGTGCAGGAGGCTC 1% sodium dodecyl sulfate, 0.2 mg of proteinase K per ml) AS.2: 1, CAACCATGACTCAGTGCTTC for 3 to 4 h at 37°C. Sodium perchlorate was added to 1 M; 2, TTCTGGAGGCCAACAGGACTGCT the mixture was extracted four times with phenol-chloro- 3, GGCCAACAGGACTGCTGAGTCATCCTG form, precipitated with 2.5 volumes of ethanol, and resus- NF-E2 binding studies. NF-E2, purified from mouse eryth- pended in 10 mM Tris-HCI (pH 7.5)-i mM EDTA-10 ,g of roleukemic (MEL) cells by affinity chromatography (la), RNase A per ml; digestion of RNA was performed at 4°C for was used in gel shift assays (1) as follows. For each reaction, 15 to 20 h. The solution was extracted twice with phenol- a small aliquot (estimated 5 fmol) of purified NF-E2 was chloroform, precipitated in 0.3 M sodium acetate with 2.5 incubated with 1 ng (2 x 104 dpm) of end-labeled 80-bp DNA volumes of ethanol, and resuspended in 10 mM Tris-HCI (pH fragment containing the 46-bp NF-E2 binding site from 7.5)-i mM EDTA at a final concentration of 1 to 2 mg/ml. DNase-hypersensitive site 2 (HS 2) of the human P-LCR in Base-specific DNA cleavage. The guanine/adenine-specific 10% glycerol-60 mM potassium chloride-20 mM N-2-hy- cleavage of methylated DNA was modified from the G>A droxyethylpiperazine-N'-2-ethanesulfonic acid (HEPES)- chemistry described by Maxam and Gilbert (19). Specifi- KOH (pH 7.9)-6 mM magnesium chloride-1 mM dithiothre- cally, water and 30 ,g of methylated genomic DNA were itol-0.2 mM EDTA-1 mg of bovine serum albumin per ml-50 combined to a volume of 100 ,u, an equal volume of 20 mM ,g of poly(dI-dC) per ml with or without 0.5 ,ug of nonra- sodium phosphate (pH 7.0) was added, and the solution was dioactive oligonucleotide competitor per ml for 20 minutes at mixed and incubated at 90°C for 15 min. The reaction room temperature. The samples were then electrophoresed mixtures were collected by brief centrifugation and trans- at 17 V/cm through a 5% acrylamide (19:1 acrylamide/ ferred to ice; 20 ,u of 1 M sodium hydroxide was added, and bisacrylamide)-0.5 x Tris-borate-EDTA gel at room temper- the solution was mixed well and incubated at 90°C for 30 ature for about 90 min. Oligonucleotide competitors (top min. The reaction mixtures were collected again by brief strand) were as follows: porphobilinogen deaminase (PBGD) centrifugation, neutralized with 1 M HCI, and precipitated in promoter, GATCCTGGGGAACCTGTGCTGAGTCACTG 0.3 M sodium acetate with 2.5 volumes of ethanol. Guanine/ GAGG; mutant 1 PBGD promoter, GATCCTGGGGAACC adenine-specific cleavage products were resuspended in TGTTCTGAGTCACTGGAGG; mutant 2 PBGD promoter, water at approximately 1 ,ug/ul. GATCCTGGGGAACCTGAGCTGAGTCAGTGGAG; and
VOL. 12, 1992 IN VIVO FOOTPRINTING OF THE oa-GLOBIN LCR ELEMENT 2137 A & G ~A G B 2 G Fig. 1B, GA-cleavage chemistry reveals protection of two adenines in J3-8B cells but not in either K562 or HepG2 G G TT cells. We draw two inferences from these data. First, GA- _a T~~~~~~~~~~ - _ cleavage chemistry provides information not obtainable with l- G chemistry alone. Second, K562 cells, though partially AA A GATA 1 -A o GATA erythroid in phenotype, may be inadequate for the detection T T of in vivo footprints at many erythroid regulatory elements. Of the multiple regions that displayed protein occupancy in A c c J3-8B cells, only a few corresponding sites showed detect- G A G able, but less extensive, in vivo footprints in hemin-treated FIG. 1. Analysis of in situ, DNA-protein interactions at the K562 cells; no in vivo footprints were observed in uninduced nonconsensus GATA-1 binding site in the ot-LCR element, using K562 cells. Presumably these differences are attributable to guanine (A) and guanine/adenine (B) LMPCR in vivo footprinting. variability in expression and/or heterogeneity in the cellular Expressing cell lines include in vivo-methylated J3-8B and K562; in commitment of our K562 cell line. The lack of an in vivo vivo-methylated HepG2 cells were used as a nonexpressing control. footprint in HepG2 cells is most consistent with inaccessi- Downloaded from http://mcb.asm.org/ on March 1, 2021 by guest K562 cells were treated with 30 ,uM hemin for 2 days prior to in vivo bility of the ct-LCR element in nonerythroid (non-globin- methylation; J3-8B cells were treated with DMSO as described in expressing) cells and the absence of GATA-1 protein in cells Materials and Methods. The same preparations of methylated DNA of hepatic origin (35, 39). were used for the two experiments. In vivo DMS footprinting of the human a-LCR element. The major functional activity of the ox-LCR element has been localized to a 350-bp region (15). We have examined this o.1, TGGGCCAACCATGACTCAGTGCTTCTG; and a.2, region by GA-LMPCR in vivo footprinting. The analysis was AACAGGACTGCTGAGTCATCCTGTGGG. performed in uninduced and DMSO-induced J3-8B cells. Comparison was made with in vivo-methylated DNA from HepG2 cells and with in vitro-methylated, protein-free RESULTS DNA. The human a-LCR element contains several motifs that GA-LMPCR in vivo footprinting. In vivo footprinting has are bound by proteins present in nuclear extracts of eryth- generally involved the use of DMS, an alkylating agent that roid and nonerythroid cells (15). These include four potential penetrates the nucleus of intact cells to methylate genomic binding sites for the erythroid transcription factor GATA-1, DNA at the N-7 position of guanines and the N-3 position two potential sites for AP-1 and/or the erythroid-factor of adenines (3). Proteins bound at, or adjacent to, these designated NF-E2, and four potential sites for factors that purine residues may either reduce (protect) or increase recognize CACC/GGTGG elements. Using DNase I foot- (enhance) the frequency of DMS methylation in vivo in printing and gel shift analyses, Jarman et al. (15) have comparison with protein-free, control DNA. Since adenines demonstrated in vitro interactions with many of these motifs are methylated less efficiently than guanines (19) and the N-7 in the a-LCR element. GA-LMPCR in vivo footprinting of position of guanines resides in the major groove of DNA, a the region is displayed in Fig. 2 to 4 and summarized in Fig. common site for binding proteins, the reactivity of guanine 5. In nonerythroid HepG2 cells, no discernible footprint was residues has been used exclusively for in vivo footprinting evident throughout the entire oa-LCR. In vivo footprints studies of complex genomes. However, an analysis re- detected in J3-8B cells were unchanged following DMSO- stricted to guanines is inherently limited in its informative- induced erythroid maturation with the single exception of an ness. induced hypersensitivity in a region outside the previously In vivo footprinting may also be limited by heterogeneity recognized binding motifs. Protein occupancy of each motif in the cell population studied (25). Cell heterogeneity with is described below. regard to level of gene expression, stage of differentiation, or GATA elements. Sequences of the general form (T/A)GA cell type may compromise or obscure observable in vivo TA(A/G) (11) bind the abundant, erythroid transcription protein-DNA interactions. factor GATA-1 (10, 35). Of four potential GATA-1 binding Two aspects of our experiments were critical to a com- sites in the ao-LCR element, including a nonconsensus site plete in vivo footprinting analysis of the human a-globin (TGA1T7A), only three display protections or enhancements regulatory element. First, we modified DNA cleavage con- in vivo in J3-8B cells (Fig. 2, 4, and 5). The upstream GATA ditions to permit scoring of adenine as well as guanine site identified by in vitro binding studies (15) is not contacted residue contacts (GA-LMPCR in vivo footprinting). Second, in vivo. we compared footprints of the same chromatin region in AP-1/NF-E2 elements. Two AP-1 consensus sites [TGA(C/ different cellular environments. In this regard, we examined G)TCA] reside in the central portion of the a-LCR element. the human chromosome 16 a-LCR element in K562 cells, Motifs of this variety are bound by a multiplicity of proteins which exhibit erythroid, megakaryocytic, and myeloid prop- in vitro, including an erythroid-restricted factor (NF-E2). erties (17, 32), in MEL cells containing a single human This factor, which was first identified through study of the chromosome 16 (line J3-8B), and in nonerythroid hepatoma erythroid promoter of the PBGD gene (20, 21), appears to (HepG2) cells. mediate enhancer activity of a segment of human P-LCR HS As shown by the example in Fig. 1, the combined use of 2 encompassing an AP-1 dimer motif (22, 26, 27, 33). GA-cleavage chemistry and MEL hybrid cells greatly en- GA-LMPCR in vivo footprinting reveals strong protections hances the power of this technique at elements in which G and enhancements over both AP-1/NF-E2 motifs in the residues are not contacted. With G chemistry alone, no in oa-LCR element (Fig. 2, 4, and 5). Jarman et al. (15) observed vivo footprint is detected in either J3-8B hybrid or noneryth- in vitro footprints in this region in both erythroid and roid HepG2 cells in the vicinity of a potential GATA motif in nonerythroid cells. Although they identified an erythroid- the human oa-LCR element (Fig. 1A). However, as shown in specific gel shift complex corresponding to NF-E2 with use
2138 STRAUSS ET AL. I !.![7. 2 A 3 ~~~~~. 4 _ -O 1 2 B 3 4 ii.,~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~IiGI -U -o CATA -1 NI:-1:2 1 _ 2 CAC3W"lw _ 3 _ 4 MOL. CELL. BIOL. urn, ~ .. .... ." 4~ ~ - ~U - ___ - m _u- ___m 4m -* im a~am~ Downloaded from http://mcb.asm.org/ on March 1, 2021 by guest .J_Al'- I -o1 N g 1-.2 - u~~ .... ~mum ~~I. -> "m j__ u.. _ -.... _ ......- , -o E m Il ~~~~~~~~~~~~~~~~(;T \-J __-_W_-*~~~~ (A ISl'l' iiU - -. -. ATA 'I - oh mm FIG. 2. In vivo DMS footprinting of the top strand of the human a-LCR element, using primer sets S.1 (A and B) and S.2 (C). Lanes: 1, in vitro-methylated protein-free K562 DNA; 2, in vivo-methylated HepG2 DNA; 3, in vivo-methylated J3-8B DNA (uninduced cells); 4, in vivo-methylated J3-8B DNA (DMSO-induced cells). Protections are indicated by open circles; enhancements are represented by closed circles. A solid arrow corresponds to a hypersensitive guanine between the two downstream GATA elements. of human erythroleukemia nuclear extracts, no such gel shift Induced hypersensitivity between two GATA elements. An was seen with MEL extracts, possibly because of unfavor- aim of in vivo footprinting experiments is the identification able experimental conditions. As shown below, we provide of novel protein-DNA interactions that might account for the evidence that both AP-1/NF-E2 motifs are, in fact, suitable relative position independence and potent enhancer activity targets for the binding of murine NF-E2. of LCR regions. Between the two downstream GATA motifs CACC/GGTGG elements. Sequence motifs of this type are contacted in vivo, we observe a discrete region character- widely seen in globin gene promoters and in the 3-LCR. ized by a protection and a weak hypersensitivity on the Within the ,B-LCR HS 3, these motifs are found associated bottom strand and a strong hypersensitivity on the top strand with GATA consensus sites. Although the ubiquitous tran- in either uninduced or induced J3-8B cells (Fig. 2, 4, and 5). scription factor Spl will bind to these sites, other proteins In DMSO-treated cells, the weak hypersensitivity on the also appear to bind in vitro, although none have been bottom strand becomes distinctly pronounced (Fig. 4). To demonstrated to be erythroid specific (33). Of four potential investigate whether this region binds nuclear proteins in binding sites of the CAC/GT class, only one is occupied in vitro, gel shift experiments were performed. An oligonucle- vivo in J3-8B cells (Fig. 2 to 5). An expanded, high- otide probe corresponding to the region between the two resolution analysis of the top strand encompassing this downstream GATA elements was incubated with nuclear element is shown in Fig. 3. In the in vitro analyses of Jarman extracts (1) from NIH 3T3, uninduced MEL, and DMSO- et al. (15), protein binding to this element was not observed; induced MEL cells. Several complexes were detected, and conversely, in vitro binding to sequences upstream of this their binding specificities were determined by competition CAC/GT motif was noted. experiments (not shown). Our findings from this analysis indicate that the in vitro binding specificity of these com- plexes is determined by sequences which are not coincident I 2 3 4 C C; (188) with the in vivo protection and hypersensitivities. Thus, the G protection and hypersensitivity observed in both uninduced A and induced J3-8B cells, as well as the inducible hypersen- sitivity, may reflect local chromatin structure, perhaps re- ___ j G CAC sulting from the interaction of bound GATA-1 molecules or TT their interaction with other proteins that may not directly Ge contact DNA. Similar hypersensitivities have been detected A A (201) upstream of a GATA-binding site in the human 3-globin HS FIG. 3. High-resolution analysis of the top-strand CACCC/GT 2 in a region where the primary sequence is distinct from that element by in vivo DMS footprinting. Lanes: 1, in vitro-methylated of the ot-LCR element (32a). Alternatively, the protection protein-free K562 DNA; 2, in vivo-methylated HepG2 DNA; 3, in and hypersensitivities may represent in vivo protein occu- vivo-methylated J3-8B DNA (uninduced cells); 4, in vivo-methyl- pancy that in vitro binding studies fail to detect, possibly as ated J3-8B DNA (DMSO-induced cells). Protections are indicated a result of low abundance or instability of the relevant by open circles; an enhancement is represented by a closed circle. protein in erythroid nuclear extracts.
VOL. 12, 1992 1 ____~ m rn-- 2 -_"_ _ momnam_m A3 4 _ ICAC _ ~~- CT 1 s . IN VIVO FOOTPRINTING OF THE a-GLOBIN LCR ELEMENT 2 B 4 6*18-^ 3 ... 4 ~~~AP-IL w w -f_ 0 I NF-E2 IO CAC GT CAC GT iCAC GT 1 is 2 C M!RR; M 3 4 = -0 4- 4-- I GATA 1 2139 Downloaded from http://mcb.asm.org/ on March 1, 2021 by guest sow -*. w-s---.~ ~~a. aU -No|-E2 GATA - I GATA-1 I-GATA-1 - FIG. 4. In vivo DMS footprinting of the bottom strand of the human a-LCR element, using primer sets AS.1 (A and B) and AS.2 (C). Lanes: 1, in vitro-methylated protein-free K562 DNA; 2, in vivo-methylated HepG2 DNA; 3, in vivo-methylated J3-8B DNA (uninduced cells); 4, in vivo-methylated J3-8B DNA (DMSO-induced cells). Protections are indicated by open circles; enhancements are represented by closed circles; a protected adenine between the downstream GATA motifs is indicated with an open arrow; an inducible hypersensitivity in this region is indicated with a closed arrow. A GATA-1 +1 56 Binding of NF-E2 to the AP-1/NF-E2 motifs of the a-LCR element. While in vivo footprinting reveals protein occu- CACYGT GATA-1 pancy of the ao-LCR element AP-1 motifs, it provides no *_zoo oo 112 specific information regarding the nature of the interacting proteins. Though these elements bind putative AP-1-related proteins in vitro (15), they may be targets for the erythroid AP-/NF-E2 AP-VNF-E2 factor NF-E2 in vivo. This is of interest in two respects. 168 First, the upstream motif differs at one position from the CCCTGGIC 0 000 0 0 proposed NF-E2 consensus binding site (ACTGAGTCA CAC/GT CACTCAYGT GATA-1 versus GCTGAGTCA). Second, if the AP-1/NF-E2 motifs in 224 the a-LCR element are functionally analogous to those in ,-LCR HS 2, we might anticipate that they would specifi- 00 cally bind NF-E2. GATA-1 To evaluate protein binding to these sites in the a-LCR 0 0 0 element, we have used gel shift analysis with highly purified WryG T=T --_ , 280 mouse NF-E2. In other studies, we have shown that affinity- _ aA _~~~~ purified NF-E2 is distinct from other AP-1 binding activities, is unrelated to known Jun, Fos, or Fra proteins, and corre- lates best with a polypeptide(s) of about 43 kDa (lb). For this B GATA.1 CAr- GT GATA-1 NF-C AB MCr N1.0 GT C GT r GT GATA1 GATAA experiment, we have used the tandem AP-1/NF-E2 sites of In vitro ,-LCR HS 2 as a probe and excess, nonradioactive oligonu- cleotides as specific competitors. As shown in Fig. 6, excess oligonucleotides bearing either of the two AP-1/NF-E2 mo- tifs from the ot-LCR element strongly compete for binding of In vivo _ _ labeled probe to purified NF-E2 (lanes 5 and 6), as does a PBGD promoter oligonucleotide (lane 2). Although two FIG. 5. (A) Summary of altered DMS reactivities in the human synthetic AP-1-like sites (PBGD promoter mutant 1 and ax-LCR element. Protections are indicated by open circles; enhance- mutant 2; lanes 3 and 4) do not compete with the labeled ments are represented by closed circles. Hypersensitive sites be- probe, they are still able to bind AP-1 (not shown). Of tween two GATA-1 binding sites are represented by closed arrows; an associated protection is indicated by an open arrow. (B) Sche- interest, one of the sequences which does not compete matic comparison of in vitro and in vivo footprinting analyses of the (PBGD mutant 2) contains the proposed NF-E2 consensus a-LCR element. Potential binding sites where no footprints were binding sequence (20, 21, 26, 33). This finding agrees with observed are represented by open boxes; protein occupancy of other data that indicate that the NF-E2 binding site is more regulatory motifs is indicated by closed boxes. A protection and two complex than previously appreciated (la). Our in vitro hypersensitivities detected by in vivo footprinting are designated protein binding studies, therefore, demonstrate that purified with open and closed arrows, respectively.
2140 STRAUSS ET AL. MOL. CELL. BIOL. region which has not been shown to bind proteins in vitro. A Comnpetitor: - Y - schematic comparison of the in vitro and in vivo footprinting results is shown in Fig. 5B. In contrast with the P-LCR, which spans several kilobases (12), the majority of the activity of the ot-LCR has been localized to 350 bp (14, 15). Our results suggest that the a-LCR element is remarkably compact: six binding sites in erythroid cells are occupied in vivo within 150 bp. Delineation of all sites or protein contact and developmental changes in chromatin in this small region should guide functional analyses. -}2- IW g" Functional dissection of LCR elements in transgenic mice and cultured cells, in vitro DNase I footprinting and gel shift analyses, and the in vivo footprinting results presented here underscore features in common between the oa-LCR element and active core regions of the 1-LCR. These include multi- plicity of potential binding sites for GATA-1 and association Downloaded from http://mcb.asm.org/ on March 1, 2021 by guest of these sites with AP-1/NF-E2 and CACC/GGTGG motifs. In 3-LCR HS 2, GATA and AP-1/NF-E2 motifs are juxta- posed and conserved among species (16, 23, 24, 31, 34). Multiple GATA and CACC/GGTGG elements are found in association in HS 3 (30). In the a-LCR element, all three types of motifs are present and are occupied in vivo. Two of 7 the sites bound by proteins in vivo do not strictly conform to 1 2 3 4 5 the accepted consensus sequences for GATA-1 and NF-E2 FIG. 6. Binding of purified NF-E2 to both AP-1/NF-E2 sites in binding. An atypical GATA-motif (TGATTA) is occupied in the cx-LCR element. Purified NF-E2 was incubated with a probe vivo (Fig. 1B, 2, and 4) and also bound by GATA-1 in vitro. carrying the ,3-LCR NF-E2 sites in the absence (lanes 1 and 7) and presence (remaining lanes) of excess unlabeled oligonucleotide In other studies, we have found that sequences of this form competitors. The sequences of oligonucleotides are shown in Mate- are high-affinity GATA-1 targets (19a). Whereas the down- rials and Methods. Disappearance of the gel shift complex reflects stream AP-1/NF-E2 site of the ot-LCR element matches the NF-E2 binding by the oligonucleotide. The less intense band seen proposed NF-E2 consensus site (20, 26, 27), the upstream just below the major complex most likely results from binding of differs slightly (ACTGAGTCA versus GCTGAGTCA). partially proteolyzed NF-E2. Nonetheless, highly purified NF-E2 binds both AP-1/NF-E2 motifs of this element (Fig. 6). Given the parallels between the oa-LCR and P-LCR HS 2, it is most likely that NF-E2 is the protein occupying the AP-1/NF-E2 sites of the ot-LCR NF-E2 binds the AP-1/NF-E2 motifs in the ot-LCR element. element in expressing cells. This result is consistent with the emerging view that NF-E2 Globin LCRs are distinguished by their capacity to confer may play an important role in the function of such regulatory relative position-independent expression on linked genes. A elements. goal of current efforts is the delineation of mechanisms responsible for this phenomenon. Several possibilities might DISCUSSION account for the unique properties of these elements. First, a chromatin domain might be opened upon DNA binding of a Of the methods available to analyze mechanisms of gene specific protein, distinct from proteins which interact with control, only in vivo footprinting assesses DNA-protein GATA, AP-1/NF-E2, and CACC/GGTGG motifs. Alterna- interactions in situ. In vivo footprinting may reveal tissue- tively, the factors binding these characterized motifs might specific binding of ubiquitous proteins, discriminate poten- act in concert to open a closed chromatin domain during tial binding sites that are available or inaccessible, monitor hematopoietic differentiation and/or prevent closure during changing profiles of protein occupancy at overlapping DNA development. Altered chromatin structures thus generated elements, and detect active chromatin structures (4, 9, 25, might be revealed by in vivo footprinting as protections or 29). In this study, we have extended the analytical potential enhancements in regions not recognized by nuclear proteins of LMPCR in vivo footprinting by examining DMS methyl- in vitro. The altered methylation pattern detected between ation of adenine as well as guanine residues. Hence, infor- the two downstream GATA elements of the ao-LCR element mation can be obtained regarding all purine-protein contacts. may reflect such changes in chromatin structure, probably The additional resolution and sensitivity afforded by this attributable to interactions of bound GATA-1-, NF-E2-, and modification have been indispensable for detailed analysis of CACC/GGTGG-binding proteins with each other and with a major regulatory element of human globin genes. additional nuclear proteins that do not directly contact Jarman et al. recently reported in vitro studies of protein DNA. These findings may help resolve the paradox that the binding to the 350-bp core of an ao-LCR element by using same repertoire of protein-DNA interactions is evident in DNase I footprinting and gel shift analyses (15). Their study cells poised to undergo terminal differentiation and in cells demonstrated protein binding to various sites on naked which have been induced to differentiate further. Subtle DNA. With GA-LMPCR in vivo footprinting, we have alterations in protein-protein interactions or posttransla- investigated occupancy of these sites on human chromo- tional modifications of preexisting nuclear factors may trans- some 16 in the context of an active erythroid environment. duce signals for differentiation and, in this particular in- We find specific differences in the occupancy of sites in vivo stance, high-level ox-globin gene expression. These changes compared with protein binding in vitro and detect a protec- might activate chromatin without modifying protein-DNA tion, hypersensitivity, and inducible hypersensitivity in a contacts themselves. Remarkably, the DNA-binding sites
VOL. 12, 1992 IN VIVO FOOTPRINTING OF THE ot-GLOBIN LCR ELEMENT 2141 utilized in vivo in LCR elements appear to be the same as ulos. 1990. Beta globin locus activation regions: conservation of those defined previously in globin gene promoters and more organization, structure and function. Proc. Natl. Acad. Sci. local 3' enhancers. USA 87:8207-8211. 17. Lumelsky, N. L., and B. G. Forget. 1991. Negative regulation of globin gene expression during megakaryocytic differentiation of ACKNOWLEDGMENTS a human erythroleukemic cell line. Mol. Cell. Biol. 11:3528- We are indebted to Paul R. Mueller for generously communicating 3536. in vivo footprinting protocols and advice. We are grateful to W. G. 18. Maxam, A., and W. Gilbert. 1980. Sequencing end-labeled DNA Wood and J. Sharpe for establishing the interspecific somatic cell with base-specific chemical cleavages. Methods Enzymol. 65: hybrid line J3-8B. 499-560. This work was supported in part by a grant from the National 19. Maxam, A. M., and W. Gilbert. 1977. A new method for Institutes of Health to S.H.O., a grant from Johnson and Johnson sequencing DNA. Proc. Natl. Acad. Sci. USA 74:560-564. Research Awards to E.C.S. and S.H.O. through the Harvard-MIT 19a.Merika, M., and S. H. Orkin. Unpublished data. Division of Health Sciences and Technology Program, and a 20. Mignotte, V., F. Eleouet, N. Raich, and P.-H. Romeo. 1989. Cis- Merck-AFCR Foundation M.D.-Ph.D. postdoctoral fellowship to and trans-acting elements involved in the regulation of the N.C.A. S.H.O. is an Investigator of the Howard Hughes Medical erythroid promoter of the human porphobilinogen deaminase Institute. gene. Proc. Natl. Acad. Sci. USA 86:6548-6551. Downloaded from http://mcb.asm.org/ on March 1, 2021 by guest 21. Mignotte, V., L. Wall, E. deBoer, F. Grosveld, and P.-H. Romeo. REFERENCES 1989. Two tissue-specific factors bind the erythroid promoter of 1. Andrews, N. C., and D. V. Faller. 1991. A rapid microprepara- the human porphobilinogen deaminase gene. Nucleic Acids tion technique for extraction of DNA-binding proteins from Res. 17:37-54. limiting numbers of mammalian cells. Nucleic Acids Res. 19: 22. Moi, P., and Y. W. Kan. 1990. Synergistic enhancement of 2499. globin gene expression by activator protein-i-like proteins. la.Andrews, N. C., and S. H. Orkin. Unpublished data. Proc. Natl. Acad. Sci. USA 87:9000-9004. 2. Antoniou, M., and F. Grosveld. 1990. P-Globin dominant control 23. Moon, A. M., and T. J. Ley. 1990. Conservation of the primary region interacts differently with distal and proximal promoter structure, organization, and function of the human and mouse elements. Genes Dev. 4:1007-1013. 3-globin locus activating regions. Proc. Natl. Acad. Sci. USA 3. Becker, P. B., and G. Schultz. 1988. Genomic footprinting. 87:7693-7697. Genet. Eng. 10:1-19. 24. Moon, A. M., and T. J. Ley. 1991. Functional properties of the 4. Becker, R. B., S. Ruppert, and G. Schultz. 1987. Genomic P-locus control region in K562 erythroleukemia cells. Blood footprinting reveals cell type-specific DNA binding of ubiqui- 77:2272-2284. tous factors. Cell 51:435-443. 25. Mueller, P. R., and B. Wold. 1989. In vivo footprinting of a 5. Behringer, R. R., T. M. Ryan, R. D. Palmiter, R. L. Brinster, muscle specific enhancer by ligation mediated PCR. Science and T. M. Townes. 1990. Human y- to 3-globin gene switching in 246:780-786. transgenic mice. Genes Dev. 4:380-389. 26. Ney, P. A., B. P. Sorrentino, C. H. Lowrey, and A. W. Nienhuis. 6. Deisseroth, A., and D. Hendrick 1979. Activation of phenotypic 1990. Inducibility of the HS II enhancer depends on binding of expression of human globin genes from nonerythroid cells by an erythroid specific nuclear protein. Nucleic Acids Res. 18: chromosome-dependent transfer to tetraploid mouse erythro- 6011-6017. leukemia cells. Proc. Natl. Acad. Sci. USA 76:2185-2189. 27. Ney, P. A., B. P. Sorrentino, K. T. McDonagh, and A. W. 7. Driscoll, M. C., C. S. Dobkin, and B. P. Alter. 1989. -yb3- Nienhuis. 1990. Tandem AP-1-binding sites within the human Thalassemia due to a de novo mutation deleting the 5' 3-globin ,B-globin dominant control region function as an inducible en- gene activation-region hypersensitive sites. Proc. Natl. Acad. hancer in erythroid cells. Genes Dev. 4:993-1006. Sci. USA 86:7470-7474. 28. Orkin, S. H. 1990. Globin gene regulation and switching: circa 8. Enver, T., N. Raich, A. J. Ebens, T. Papayannopoulou, F. 1990. Cell 63:665-672. Costantini, and G. Stamatoyannopoulos. 1990. Developmental 29. Pfeifer, G. P., and A. D. Riggs. 1991. Chromatin differences regulation of human fetal-to-adult globin gene switching in between active and inactive X chromosomes revealed by ge- transgenic mice. Nature (London) 344:309-313. nomic footprinting of permeabilized cells using DNase I and 9. Ephrussi, A., G. M. Church, S. Tonegawa, and W. Gilbert. 1985. ligation-mediated PCR. Genes Dev. 5:1102-1113. B-lineage specific interactions of an immunoglobulin enhancer 30. Philipsen, S., D. Talbot, P. Fraser, and F. Grosveld. 1990. The with cellular factors in vivo. Science 227:134-140. 3-globin dominant control region: hypersensitive site 2. EMBO 10. Evans, T., and G. Felsenfeld. 1989. The erythroid-specific tran- J. 9:2159-2167. scription factor eryfl: a new finger protein. Cell 58:877-885. 31. Pruzina, S., 0. Hanscombe, D. Whyatt, F. Grosveld, and S. 11. Evans, T., M. Reitman, and G. Felsenfeld. 1988. An erythrocyte- Philipsen. 1991. Hypersensitive site 4 of the human J3 globin specific DNA-binding factor recognizes a regulatory sequence locus control region. Nucleic Acids Res. 19:1413-1419. common to all chicken globin genes. Proc. Natl. Acad. Sci. 32. Rutherford, T. R., J. B. Clegg, and D. J. Weatherall. 1979. K562 USA 85:5976-5980. human leukemic cells synthesise embryonic hemoglobin in 12. Grosveld, F., G. B. van Assendelft, D. R. Greaves, and B. response to hemin. Nature (London) 280:164-165. Kollias. 1987. Position-independent, high-level expression of the 32a.Strauss, E. C., and S. H. Orkin. Unpublished data. human beta-globin gene in transgenic mice. Cell 51:975-985. 33. Talbot, D., and F. Grosveld. 1991. The 5' HS 2 of the globin 13. Hatton, C. S. R., A. 0. M. Wilkie, H. C. Drysdale, W. G. Wood, locus control region enhances transcription through the interac- M. A. Vickers, J. Sharpe, H. Ayyub, I. M. Pretorius, V. J. tion of a multimeric complex binding at two functionally distinct Buckle, and D. R. Higgs. 1990. oa-Thalassemia caused by a large NF-E2 binding sites. EMBO J. 10:1391-1398. (62 kb) deletion upstream of the human a-globin gene cluster. 34. Talbot, D., S. Philipsen, P. Fraser, and F. Grosveld. 1990. Blood 76:221-227. Detailed analysis of the site 3 region of the human f-globin 14. Higgs, D. R., W. G. Wood, A. P. Jarman, J. Sharpe, J. Lida, dominant control region. EMBO J. 9:2169-2178. I.-M. Pretorius, and H. Ayyub. 1990. A major positive regula- 35. Tsai, S. F., D. I. Martin, L. I. Zon, A. D. DAndrea, G. G. Wong, tory region is located far upstream of the human a-globin gene and S. H. Orkin. 1989. Cloning of cDNA for the major DNA- locus. Genes Dev. 4:1588-1601. binding protein of the erythroid lineage through expression in 15. Jarman, A. P., W. G. Wood, J. A. Sharpe, G. Gourdon, H. mammalian cells. Nature (London) 339:446-51. Ayyub, and D. R. Higgs. 1991. Characterization of the major 36. Tuan, D., W. Solomon, Q. Li, and I. M. London. 1985. The regulatory element upstream of the human a-globin gene clus- "beta-like globin" gene domain in human erythroid cells. Proc. ter. Mol. Cell. Biol. 11:4679-4689. Natl. Acad. Sci. USA 82:6384-6388. 16. Li, Q., B. Zhou, P. Powers, T. Enver, and G. Stamatoyannopo- 37. Yamamoto, M., L. J. Ko, M. W. Leonard, H. Beug, S. H. Orkin,
2142 STRAUSS ET AL. MOL. CELL. BIOL. and J. D. Engel. 1990. Activity and tissue-specific expression of Mol. Biol. Med. 1:489-500. the transcription factor NF-El multigene family. Genes Dev. 39. Zon, L. I., S.-F. Tsai, S. Burgess, P. Matsudaira, G. Bruns, and 4:1650-1662. S. H. Orkin. 1989. The major human erythroid DNA-binding 38. Zeitlin, H. C., and D. J. Weatherall. 1983. Selective expression protein (GF-1; NF-E1; Eryf-1): primary sequence and localiza- within the human a-globin gene complex following chromosome tion of the gene to the X chromosome. Proc. Nat. Acad. Sci. dependent transfer into diploid mouse erythroleukaemia cells. USA 87:668-672. Downloaded from http://mcb.asm.org/ on March 1, 2021 by guest
You can also read