©1995 by The American Society for Biochemistry and Molecular Biology, Inc.
Structure of the Hepatic Control Region of the Human Apolipoprotein E/C-I Gene Locus (*)

(Received for publication, July 5, 1995)

Qi Dang (1) (2) David Walker (1) Stacy Taylor (1) Charles Allan (1) (2) Peter Chin (1) Jianglin Fan (1) (2) John Taylor (1) (2) (3)(§)

From the  (1)Gladstone Institute of Cardiovascular Disease, (2)Cardiovascular Research Institute, and (3)Department of Physiology, University of California, San Francisco, California 94141-9100

ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES

ABSTRACT

The specificity of expression in the liver of the human apolipoprotein (apo) E/C-I gene locus is determined by a hepatic control region (HCR) that is located 15 kilobases downstream of the apoE gene. DNase I footprint studies of this sequence using nuclear extracts identified a region of the HCR that is enriched in nuclear protein-binding sites. Nuclease analysis of chromatin revealed liver-specific DNase I-hypersensitive sites that were associated with this region, and additional liver-specific nuclease-sensitive sites associated with the apoE gene were identified. The HCR domain has a limited binding affinity for the nuclear scaffold. The specific domain required for liver expression was tested by ligating subfragments of the HCR to the apoE gene and examining their activity in transgenic mice. A segment of 319 nucleotides that contained several potential regulatory sequences was required for full activity of liver-specific transcription with shorter segments yielding much lower levels of expression in the liver. All constructs that contained a fully active HCR were expressed in approximately a copy-dependent manner, suggesting that transgene expression was independent of integration position. Taken together, the properties of the HCR are consistent with its function as a locus control region for the liver-specific expression of the apoE gene.


INTRODUCTION

Human apolipoprotein (apo)^1 E, a 299-amino-acid glycoprotein of M(r) = 35,000(1) , is a major component of various plasma lipoprotein classes, including chylomicron remnants, very low density lipoproteins, and high density lipoproteins(2, 3) . It facilitates the redistribution of cholesterol from peripheral tissues to the liver and is required for the receptor-mediated uptake of chylomicron remnants(2, 4) . Although apoE is produced by many different tissues, more than 90% of the circulating apoE in human plasma comes from liver hepatocytes(2) . Receptor binding defective variants of apoE having the E2 phenotype are associated with type III hyperlipoproteinemia and premature atherosclerosis(2) . A commonly occurring apoE variant, the E4 allele, has been linked to an enhanced occurrence of the neurofibulary tangles and plaque deposits characteristic of Alzheimer's disease(5, 6, 7) .

The human apoE gene is 3.6 kb in length (8, 9) and is located at the 5` end of a 45-kb cluster of apolipoprotein genes on chromosome 19, all of which have the same transcriptional orientation(10) . The apoC-I gene (4.7 kb) is located 5.3 kb downstream of the apoE gene and a 4.4-kb apoC-I` pseudogene is located 7.5 kb still further downstream. The apoC-II gene (3.3 kb) is found about 16 kb downstream of the apoC-I` pseudogene. Each of these genes contains four exons, with the introns located in similar intragenic positions, suggesting that this gene family evolved from a common ancestor(1, 9) .

Expression of the apoE gene in the liver was shown initially by Simonet et al.(11, 12) to require the presence of a distal downstream tissue-specific enhancer. Subsequent studies by Simonet et al.(12) demonstrated that this hepatic control region (HCR) was located 19 kb downstream of the transcription start site of apoE gene and 9 kb downstream of the beginning of apoC-I gene. The HCR contained the sequences necessary to direct expression of both the apoE and apoC-I genes to hepatocytes with no liver specificity contributed by the promoters of either gene(13) . Any construct that lacked the HCR was not expressed, even at low levels, in the transgenic mouse liver. Full liver-specific activity of the HCR was provided by a 0.77-kb fragment (construct LE6 in (13) ). However, the presence of a previously characterized enhancer element, that lacks tissue specificity, in the promoter of the apoE gene (14, 15) was required for transcriptional activation. These results suggested that interactions of a unique hepatocyte-specific combination of distal elements in the HCR with an activator sequence in the promoter directed the expression of the apoE/C-I/C-II locus in the liver.

The current study extends our earlier findings (13) and characterizes more precisely (16) the features of the HCR that define its activity. We find that a genomic fragment of about 300 nucleotides is required for full levels of HCR activity, that at least three nuclear protein binding sequences are involved, and that the HCR is characterized by prominent liver-specific DNase I hypersensitivity. These results further define the unique qualities of the HCR that determine liver-specific activity.


MATERIALS AND METHODS

Preparation of Nuclear Extracts

Nuclei and nuclear extracts from mouse liver and other tissues were isolated by the method of Gorski et al.(17) with minor modifications as described(15) . HeLa cell nuclear extract was prepared by the method of Dignam et al.(18) .

In Vitro Translation of Transcription Factor

Transcription factors C/EBPalpha, HNF-3alpha, HNF-1, and HNF4 were synthesized with a riboprobe transcription/translation system (Promega, Milwaukee, WI) according to the manufacturer's suggested protocol. C/EBPalpha cDNA (a gift of S. McKnight, Tularik Co., South San Francisco, CA), HNF-3alpha cDNA (a gift of R. Costa, University of Illinois Medical Center, Chicago, IL), HNF1 cDNA (a gift of G. Crabtree, Stanford University, Palo Alto, CA), and HNF4 cDNA (a gift of F. Sladek, University of California, Riverside, CA) were cloned into vectors pBSSK and pGEM-1, respectively, then used as templates.

Preparation of DNA Fragments

DNA Fragments for Footprinting

To generate single end-labeled DNA probes, the 774-bp LE6 HCR fragment (13) was cloned in pBSSK (Stratagene, San Diego, CA). Subfragments of the HCR were excised from the vector by restriction endonucleases that generated 5` protruding ends, and the subfragments were purified by polyacrylamide gel electrophoresis (PAGE). The fragments were end-labeled with [alpha-P]dNTP by the Klenow fragment of DNA polymerase, then purified with NENsorb columns (DuPont NEN), and stored in TE buffer (10 mM Tris-HCl, pH 7.4, and 1 mM EDTA).

Oligonucleotides

Complementary oligonucleotide pairs were synthesized via a nucleic acid synthesis system (Millipore, MA), then annealed in a buffer containing 67 mM Tris-HCl, pH 7.6, 13 mM MgCl(2), 6.7 mM dithiothreitol, 1.3 mM spermidine, and 1.3 mM EDTA. The annealed products were purified by PAGE, then labeled by T4 DNA kinase in the presence of [P]ATP(19) . Oligonucleotide sequences were: 1) TCCCTAAAATGGGCAAACATTGCAAGCAGCAAACAG; 2) TCCCTGCCTGCTGACCTTGGAGCTGGGGCA; 3) TGGGGCAGAGGTCAGAGACCTCTCTGGGCCCATGCCACCTCCAACAT; 4) ACCCCGAAGCCTGTTTCCCCATCTGTACAATG; 5) AATGATAAAGACGCCCATCTGATAGGGTTTTTG; 6) CCAGGCTGGAGTCAGTGACACAATCTCATCTCACCACA; HNF3-alpha, GTTGACTAAGTCAATAATCAGAATCAG(20) ; TF-LE2, AGTCTGTCTTTGACCTTGAGCCC(21) ; HNF-4, CCCTAGGCAAGGTTCATATGGCC(22) ; C/EBPalpha, GACCTTTTGCAATCCTGG(23) ; GATA, TGGCCTTATAACTCCGG(24) ; HNF1, AACAAACTGTCAAATATTAACTAAAGGGAG(25) .

Preparation of DNA Probes

For mapping DNase I-hypersensitive sites, DNA fragments were purified by agarose gel electrophoresis, then labeled with [alpha-P]dCTP by random primers and the Klenow fragment of DNA polymerase as described (19) For mapping nuclear scaffold-associated regions, plasmids containing HCR DNA fragments were digested with restriction enzymes, end-labeled with dCTP and Klenow fragment, and purified by column chromatography (Schleicher & Schuell).

HCR Subfragments for Cloning

Different fragments of the 774-bp HCR (13) were amplified by polymerase chain reaction (PCR), with HindIII sites generated on the 5` end and SpeI sites generated on the 3` ends. The amplification products were digested with HindIII and SpeI, purified by agarose gel electrophoresis, and cloned into pBSSK. The 10-kb HEG1 human genomic fragment(11) , which contains the complete apoE gene sequence together with 5 kb of 5`-flanking and 1.7 kb of 3`-flanking sequences, was inserted adjacent to the 5` ends of the HCR segments. The nucleotide sequence of the HCR segments, as well as the complete HCR, were confirmed by direct DNA sequence analysis using oligonucleotides(19) .

Gel Shift Assays

The assays were carried out essentially as described(15) . Nuclear extract was incubated with 4 µg of poly(dI:dC) at 20 °C for 5 min in 20 µl of nuclear dialysis buffer. Where appropriate, 50 ng of specific oligonucleotide competitors were added to each sample and incubated at 20 °C for 15 min. Then, 0.5 ng of end-labeled oligonucleotide were added to the extract and incubated at 30 °C for 30 min. Samples were resolved by electrophoresis in 5% polyacrylamide gels in 1/2 times TBE buffer (90 mM Tris borate, 2 mM EDTA), then the gel was dried and examined by autoradiography.

Mapping of DNase I-hypersensitive Sites

Nuclei were isolated, as described(17) , from different tissues of transgenic mice bearing the LE1 transgene construct(13) . This construct contained the intact human apoE gene together with 5 kb of 5`-flanking sequence, 1.7 kb of 3`-flanking sequence, and the LE1 HCR-containing fragment (a 3.8-kb SphI/BamHI fragment located 14 kb downstream of the apoE gene that contained the complete HCR activity). The nuclear pellet was resuspended in Buffer A (15 mM Tris-HCl, pH 7.4, 0.15 mM spermine, 0.5 mM spermidine, 80 mM KCl, 2 mM EDTA, 1 mM dithiothreitol, and 0.5 mM phenylmethylsulfonyl fluoride) containing 5% glycerol.

The nuclei were diluted to 10 A/ml with Buffer A (without EDTA) containing 5% glycerol and 5 mM MgCl(2). Then, 0.5-ml aliquots were incubated on ice for 10 min with different amounts of DNase I. The reactions were quenched by addition of 25 µl of 0.5 M EDTA, 25 µl of 5 M NaCl, 5 µl of 10% SDS, 12.5 µl of 1 M Tris, and 10 µl of 10 mg/ml proteinase K and incubation at 50 °C for 1 h. Aliquots were extracted with phenol, and DNA was precipitated with ethanol and then resuspended in 0.1 times TE buffer. DNA was digested with appropriate restriction enzymes, resolved by electrophoresis in a 0.8% agarose gel, and transferred to nylon membranes. The membranes were hybridized to radioactively labeled DNA probes at 42 °C overnight, washed to a final stringency of 0.1 times SSC (0.15 mM NaCl, 15 mM sodium citrate) and 0.1% SDS at 55 °C for 30 min, and then examined by autoradiography.

Rebinding Assay of DNA to Nuclear Scaffolds

Rebinding assays were carried out as described (26) with minor modifications. In brief, nuclei were isolated as described in the preceding section, then washed in 7 volumes of Buffer B (7.5 mM Tris-HCl, pH 7.4, 0.1 mM spermine, 0.25 mM spermidine, 40 mM KCl, 1 mM EDTA, 100 units/ml Trasylol (Calbiochem, La Jolla, CA), and 1% thiodiglycol) and resuspended in the same buffer to a concentration of 29 A/ml. Aliquots (0.5 ml) of the nuclear suspension were incubated at 42 °C for 20 min. Then, 7 ml of Buffer C (5 mM PIPES, pH 7.4, 0.25 mM spermidine, 2 mM EDTA, 2 mM KCl, 0.1% digitonin, 25 mM lithium 3,5-diiodosalicylate) was added, and the mixture was incubated at 20 °C for 5 min. The reaction mixtures were centrifuged at 18,000 times g for 20 min at 4 °C, and the pellets were washed five times in 8 ml of Buffer D (20 mM Tris-HCl, pH 7.4, 20 mM KCl, 70 mM NaCl, 10 mM MgCl(2), 0.05 mM spermine, 0.125 mM spermidine, 0.1% digitonin, and 100 units/ml Trasylol). The nuclear scaffolds were digested in 1 ml of Buffer D by 1000 units/ml of three restriction endonucleases at 37 °C in a water bath with gentle shaking for 2 h. Aliquots containing 2 A were adjusted to 0.2 ml with Buffer D, then 4 ng of radioactively labeled DNA probes together with 2 µg of MspI-digested pBR332 plasmid DNA were added. The reaction mixtures were incubated for 1 h at 37 °C with gentle shaking, then mixed with an equal volume of Buffer D and centrifuged at 7,000 times g for 10 min. The pellets and supernatants were separated and saved. The pellets were washed once with Buffer D, and the wash fluid was discarded. Washed pellets and initial supernatants were digested with proteinase K and extracted with phenol, and the DNA was precipitated by ethanol. The DNA was resolved by electrophoresis in 0.8% agarose gels and then the gels were dried and examined by autoradiography, and the fragments were quantitated by a photodensity scanner.

Generation and Analysis of Transgenic Mice

The apoE gene/HCR constructs were used to generate transgenic mice as described(27) . Newborn pups were weaned at 3 weeks of age, DNA was extracted from tail segments, and transgenic founders were identified by Southern blot analysis(28) . The copy number of the incorporated transgene was determined by quantification of the Southern blots via a Fuji Bas1000 Bio-imaging analyzer (Fuji Photo Film Co., Ltd., Japan) using human genomic DNA as a standard, following hybridization using a human apoE hybridization probe(11) .

Total cellular RNA was isolated from various tissues of transgenic mice and examined by RNase protection analysis as described (29) followed by quantitation via the Fuji Bas1000 Bio-imaging analyzer. The quantity of the total cellular RNA in each reaction was monitored by comparison to the amount of mouse beta-actin measured by the same assay using a mouse beta-actin antisense RNA probe (Ambion, TX). For each construct, at least two, and up to five, independent transgenic lines were analyzed.


RESULTS

Nucleotide Sequence of the HCR

Since it contained all of the regulatory information required for full levels of expression of the apoE gene in the liver, the sequence of the 774-nucleotide length of the LE6 HCR fragment was determined (Fig. 1)(13) . The 3`-terminal portion of the LE6 fragment has an 83% identity in the 3` to 5` orientation in comparison to the corresponding regions of an Alu family consensus sequence(30, 31) . The typical Alu family member is composed of two closely related segments separated by a short A-rich sequence of slightly variable length with a second variable length A-rich sequence at the 3` terminus. In the LE6 fragment of the HCR, nucleotides 462-666 consist of the 3`-terminal two-thirds of an Alu sequence family, and nucleotides 667-774 consists of the middle one-third of a separate Alu sequence family. The 5` portion of the HCR fragment, consisting of 461 nucleotides, does not contain any portion of an Alu sequence family, nor any other repeat family, as evaluated by hybridization to genomic DNA (data not shown).


Figure 1: Sequence analysis of LCR region. Shaded boxes represent footprint regions. Arrows with letters indicate the TGTTTGC motif. Triangles indicate DNase I-hypersensitive sites revealed by in vitro footprinting study. The line above nucleotides 462 to 774 indicates Alu family sequence homology.



Binding of Nuclear Proteins to the HCR

DNase I Footprinting

To examine the binding of nuclear proteins to the HCR region, DNase I footprinting was performed along the entire length of the 774-bp HCR fragment using both mouse liver and cultured HeLa cell nuclear extracts. Six distinct footprints were revealed (Fig. 2), with three footprints clustered near the 5` end of the HCR. Footprints 1, 4, 5, and 6 were protected by nuclear factors that appeared to be somewhat enriched in the liver. Footprints 2 and 3 were protected by factors that were abundant in both HeLa cell and mouse liver nuclear extracts. The protected region for footprint 1 was longer on the 3` strand than on the 5` strand.


Figure 2: DNase I footprinting of HCR. Single end-labeled DNA fragments were incubated with various amounts of nuclear extracts from mouse liver or HeLa cells and digested with different amounts of DNase I. The digested DNA was extracted and resolved on sequencing gels. Four end-labeled fragments were used. Panel A, PstI-AvaII fragment, labeled on 3` end, top strand; panel B, full-length HCR region, labeled on 5` end, bottom strand; panel C, AvaII-XbaI fragment, labeled on 5` end, bottom strand; panel D, full-length HCR region, labeled on 3` end, top strand. Liver = mouse liver nuclear extract; HeLa = HeLa cell nuclear extract; G + A and G = Maxam-Gilbert reaction. Solid bars indicate footprints. Dots represent DNase I-hypersensitive sites, and their sequence locations are indicated.



DNase I-hypersensitive sites were detected at footprint regions as a consequence of nuclear protein binding ( Fig. 1and Fig. 2). These sites were abundant in the region around footprint 1. It is noteworthy that this region has four tandem copies of the TGTTTGC motif and two upstream closely related sequences, indicated by the arrow bars in Fig. 1. In all but one of these sequences, a DNase I-hypersensitive site was present in the TGTTTGT motif between the two G nucleotides at a T adjacent to a G. The most prominent hypersensitive site detected in the 3` strand (nucleotide 114) was located within one of these motifs although an unambiguous footprint was not detected here. The hypersensitive site at nucleotide 156 in footprint 1 was prominent in both DNA strands. Several DNase I hypersensitive sites were detected elsewhere in the HCR fragment where clear footprints were not observed, possibly reflecting weak or unstable protein binding. Some DNase I-hypersensitive sites were detected in the Alu family sequence region. Both footprint 5, adjacent to the Alu sequence, and footprint 6, within the Alu sequence, had nearby hypersensitive sites. Since subsequent studies (described below) showed that this region was not required for HCR activity, the significance of DNase I sensitive sites in the Alu sequence domain remains unclear.

Gel-shift Assays

Protein binding to the footprint sequences was confirmed by gel-shift assays using mouse liver nuclear extracts and double-stranded oligonucleotides corresponding to these regions (Fig. 3). Since some of these sequences contained homologies to known transcription factor binding motifs, the gel-shift assay was used to investigate potential relationships. In addition, known transcription factors associated with enhanced liver expression were synthesized and used in this assay to investigate their potential role in HCR action.


Figure 3: Gel shift assays. Mouse liver nuclear extract or in vitro translated nuclear factors were incubated with end-labeled oligonucleotides that were synthesized according to footprint sequences and DNA binding consensus sequence of nuclear factors TF-LF2 (21) and HNF-4(22) , respectively. Unlabeled oligonucleotides were used as specific competitors. The samples were analyzed by PAGE, and autoradiograms of dried gels are shown. ML = mouse liver.



The radioactively labeled footprint 1 oligonucleotide had prominent gel-shift bands, and low levels of these bands were still detectable in the presence of a 100-fold excess of nonlabeled competitor. The footprint 1 sequence did not bind C/EBP, HNF1, nor HNF4 (data not shown), but it did bind HNF3alpha, yielding a gel-shift band that corresponded to the major band produced by liver extracts (Fig. 3, panel 1).

Footprint 2 contains an identity of eight nucleotides with a sequence in the TF-LF2 element located at nucleotide -480 in the promoter of the rat transferrin gene(21) . In gel-shift assays, an oligonucleotide corresponding to the transferrin regulatory sequence competed effectively for nuclear protein binding by the HCR footprint 2 probe (Fig. 3). This result suggests that the TF-LF2 factor (or factor family) recognizes the HCR footprint 2 sequence. The footprint 2 sequence did not bind C/EBPalpha, HNF1, HNF3alpha, or HNF4 (data not shown).

The 5` portion of footprint 3 has a limited homology to the consensus binding sequence for HNF4(22) ; however, the HNF4 produced by an in vitro transcription/translation system bound very weakly to the footprint 3 oligonucleotide (data not shown). Nevertheless, the oligonucleotide of the footprint 3 sequence was an effective competitor for HNF4 binding to its own consensus high affinity binding sequence (22) . These results suggest that a member of the HNF4 factor family may contribute to generating footprint 3. Although footprint 3 extended over a length of 41 nucleotides, only one major gel-shift band was observed. The footprint 3 sequence did not bind C/EBPalpha, HNF1, or HNF3alpha (data not shown).

Gel-shift assays confirmed the binding of a nuclear protein to the footprint 4 sequence (Fig. 3, panel 4). However, the factors HNF1, HNF3alpha, HNF4, and C/EBP did not bind to this element (data not shown).

Footprint 5 of the HCR contains a GATA motif. This motif is a core sequence in regulatory elements that bind a family of erythroid transcription factors most closely identified with globin gene promoters and the globin locus control region(32, 33, 34, 35) . However, GATA-binding factors also have been identified in nonerythroid tissue (24) . An oligonucleotide corresponding to a high affinity GATA-1-binding site in the beta-globin promoter (24) was used as a competitor in HCR gel-shift assays. The major gel-shift band produced by the footprint 5 oligonucleotide with liver extract was competed by the erythroid GATA-1 oligonucleotide (Fig. 3), indicating a potential role for a GATA binding factor in HCR function.

The sequence of footprint 6 has a limited homology to the consensus binding sequence of C/EBP(23) . Protein produced by a C/EBPalpha expression vector bound to the footprint 6 oligonucleotide, yielding a band in the gel-shift assay that corresponded to one of the liver extract-generated bands (Fig. 3). Thus, it is likely that a member of this transcription family interacts with this sequence.

Mapping of DNase I-hypersensitive Sites in Chromatin

Regulatory sequences from actively transcribed genes are commonly hypersensitive to nuclease cleavage within the chromatin of the nucleus(36) . Therefore, we investigated the presence of DNase I-hypersensitive sites (DHS) in the HCR as well as in the associated apoE gene construct using nuclei from transgenic mice that had been generated with the HEG.LE1 construct(13) . This construct consisted of the complete human apoE gene together with 5 kb of 5`-flanking region and 1.7 kb of 3`-flanking region ligated to a 3.8-kb downstream fragment (LE1) that contained the HCR domain construct illustrated in Fig. 4and Fig. 5. The complete HEG.LE1 construct is expressed at high levels in only the liver, and this expression is directed by the HCR activity of the 0.77-kb LE6 segment (13) .


Figure 4: DNase I sensitivity of human apoE gene with upstream flanking sequence and the LE1 fragment(11) . Nuclei isolated from liver, kidney, or brain of transgenic mice bearing the HEG.LE1 constructs (13) were digested with increasing concentrations of DNase I for 10 min on ice: lanes 1-6 correspond to 0, 48, 100, 200, 800, and 1,600 units/ml DNase I. Genomic DNA from the treated nuclei then was purified and digested with restriction enzymes SacI (panels A, C, and D) and EcoRI (panel B). Samples were separated on 0.8% agarose gel then blotted onto nylon membranes. DNase I hypersensitive sites (DHS) were revealed by hybridization with a radioactively labeled probe (indicated by bars) from the ends of each of the fragments that had been derived from restriction digestions with different restriction enzymes. p indicates the parent bands. p1 and p2 indicate the parent bands due to tail-to-tail and head-to-tail integration of the transgene (present in about 70 copies). Arrows indicate DNase I-hypersensitive sites. Asterisks indicate liver specific DHS. S = SalI, Sa = SacI, E = EcoRI, H = HindIII, Sp = SphI, Ps = PstI, X = XbaI, and B = BamHI.




Figure 5: Summary of hypersensitive sites in HEG.LE1. Arrows indicate DNase I-hypersensitive sites. Asterisks indicate liver specific DHS. S = SalI, Sa = SacI, E = EcoRI, H = HindIII, Sp = SphI, Ps = PstI, X = XbaI, and B = BamHI.



To determine if tissue-specific nuclease cleavage sites were associated with the HEG.LE1 transgene, isolated intact nuclei from the liver, brain, and kidney of transgenic mice were incubated briefly with DNase I. Then, the DNA was extracted and hybridized to probes from different regions of the construct. As shown in Fig. 4, panels A-D, and summarized in Fig. 5, 20 nuclease-sensitive sites were identified in chromatin that were distributed throughout the transgene. Most sites were detected in all tissues examined. However, five sites were liver-specific: one in the proximal promoter, two in the second intron, one in the HCR domain, and one just 3` of the LE6 HCR segment. The most prominent liver-specific DNase I-sensitive sites were DHS 11 in the second intron and DHS 17 in the 5` region of the HCR. DHS 17 is located in the region of footprint 1, a sequence shown to be enriched in nuclease-sensitive sites by the footprinting analysis ( Fig. 1and Fig. 2).

HCR Sequences Required for Liver-specific Expression of Transgenic Mice

To determine the minimum portion of the HCR required to direct expression of the apoE gene to the liver, constructs were prepared in which subfragments of the HCR (Fig. 6) were ligated to the 3` terminus of the HEG1 genomic fragment. They were used to generate transgenic mice, and the expression of two to five independent lines of each construct was examined in various tissues. Previous studies showed that the HEG1 construct ligated to the parent 774-nucleotide LE6 fragment (13) is expressed at high levels in both the liver and the kidney. Expression in the kidney is regulated by positive elements located near the apoE gene promoter and by silencer elements in the second intron and at distal locations near, but distinct from, HCR activity in the LE6 fragment(13) . Since kidney expression is not affected by the LE6 HCR fragment(13) , apoE expression in the kidney served as an internal control in the analysis of transgene activity. The expression of each transgene construct was determined by RNase protection analysis (Fig. 7). The ratio of transgenic human apoE mRNA levels in the liver compared to the number of constructs integrated into the mouse genome (measured by quantitation of Southern blots of transgenic mouse DNA) was determined. This comparison indicated the dependence of transgene expression on construct copy number.


Figure 6: Subfragments of the HCR. Filled ovals with numbers indicate DNA footprint locations, arrows indicate DNase I-hypersensitive sites detected by the DNA footprinting experiments. The subfragments are identified by the LES nomenclature, their locations in the LE6 HCR domain are given, and their liver-specific enhancers activities are indicated by shaded boxes.




Figure 7: Expression pattern of HCR subfragments. RNase protection assays with RNAs from various tissues of transgenic mice are shown. The quality and content of the RNAs were checked by RNase protection assays with a mouse beta-actin probe, with a typical result shown in the right bottom panel. The bands shown here correspond to the expected protected bands for human apoE mRNA and mouse actin mRNA fragments.



As shown in Table 1, subfragments LES2, LES3, and LES7 directed similar levels of expression in the liver. This activity is equivalent to that of the reference LE1 transgenic line that includes the complete HCR domain. These results suggest that the region containing footprints 4, 5, and 6 may not be required for the full transcriptional activity of the HCR and that the Alu sequence domain may not contribute to HCR activity. Further reduction of HCR length to delete the sequences containing footprint 3 (LES12) and footprint 2 (LES11) resulted in a 4-fold to more than 10-fold reduction in HCR activity. A construct having only the 5`-terminal 118 nucleotides (LES14) showed no expression in the liver (data not shown). The LES13 fragment that deleted 72 nucleotides of the 5` portion of the HCR but retained footprints 1, 2, and 3 sequences together with nearly all of the 5` DNase I-hypersensitive sites had low activity in the liver, equivalent to that of the LES12 fragment. The LES6 construct had no detectable liver expression, and a comparison of its activity to that of the LES3 construct, in which deletion of the 5`-terminal 122 nucleotides is the only difference, suggests that this segment is essential to the activity of the HCR. Taken together, these results indicated that the minimum region required for HCR activity includes the footprint 1 sequence and the adjacent DNase I-hypersensitive regions, as contained in the LES11 fragment.



Association of the HCR with the Nuclear Scaffold

The possibility that the HCR may associate with the liver nuclear scaffold was investigated by a rebinding assay(26) . In this assay, the LE6 fragment or a subfragment of LE6 (Fig. 8), mixed together with a construct consisting of the HEG1 apoE gene and plasmid fragments(13) , were radioactively labeled, then incubated with an isolated nuclear scaffold preparation. The DNA fragments that did not bind to the nuclear scaffold were separated subsequently by centrifugation, and the distribution of the bound radioactivity in the pellet (scaffold) and the nonbound radioactivity free in the supernatant fractions were resolved by agarose gel electrophoresis. As shown in Fig. 8, radioactivity near the top of the gel corresponded to the HEG1 apoE gene construct and plasmid DNA, and radioactivity near the bottom of the gel corresponded to the HCR DNA. The 774-nucleotide LE6 HCR fragment and the LES3 subfragment showed the greatest association with the nuclear scaffold in the pellet. In comparison, essentially no LES2 or LES8 (nucleotides 462-774 consisting of the complete Alu family segment) fragments were found in the pellet. Similarly, HEG1 apoE gene fragments showed relatively little binding to the nuclear scaffold. These preliminary findings suggest that the unique sequence portion of the HCR has a low, but specific, affinity to the nuclear scaffold whereas the apoE gene and its proximal flanking regions do not associate significantly with the nuclear scaffold.


Figure 8: Affinity of the HCR for the nuclear scaffold. Plasmids carrying HEG.LE6 and subfragments of LE6 were digested with HindIII and XbaI, then end-labeled with [P]dCTP. The nuclear scaffolds were isolated from ICR mouse liver nuclei, then digested with HindIII, BamHI, and XhoI. Probes and digested scaffolds were incubated at 37 °C for 1 h, then pellets and supernatants were separated by centrifugation. DNA from each fraction was extracted, then resolved by electrophoresis in 1.0% agarose gels. The gels were dried and autoradiographed. Samples were labeled as T = total probe, S = 1/2 supernatant fraction, and P = pellet fraction.




DISCUSSION

Previous studies in our laboratory mapped the liver-specific HCR of the human apoE/C-I locus to a 774-bp fragment (LE6) located about 19 kb downstream of apoE promoter and about 9 kb downstream of apoC-I promoter(13) . The expression of apoE and apoC-I genes in the liver of the transgenic mice were totally dependent on this distal 3` regulatory region. In the absence of the HCR, there was no expression of the apoE gene in the liver. No sequences in the promoter, even extending 30 kb upstream, were found that conferred expression of the apoE gene (or the apoC-I gene) in the liver. The finding that all of the liver expression capability of the apoE gene is contained entirely within the downstream HCR makes this tissue specificity unique among genes that are expressed in the liver. The typical mechanism for the hepatic expression of other genes positive promoter elements that provide low levels of liver expression, with higher levels of transcription in the liver dependent upon distal enhancer sequences. The results presented here have described several distinct properties of the apoE gene HCR that contribute to its unique activity.

Nuclear protein DNase I footprint analysis and gel-shift assays of the LE6 HCR revealed several different protein families that bind to this fragment. Most of these families consist of multiple members, often present together in the same cell type, binding to DNA sequences that are characteristic of a particular family. Thus, members of the HNF3 and HNF4 transcription factor families may influence HCR activity, whereas the potential roles of the GATA-binding and the C/EBP families are unclear. Further study beyond the scope of this preliminary investigation would be required to identify the particular member in each factor family that binds to the HCR domain. The HNF3, HNF4, and C/EBP transcription factor families contribute to the liver-specific expression of many genes (i.e. most of the genes listed in Table 2). In data not shown here, we determined that the frequently used liver transcription factor HNF-1 (generated by in vitro translation of a cDNA expression vector) does not bind to any footprint sequence in the HCR. Furthermore, none of the transcription factors investigated in this study bind to footprint sequences that were previously identified (1, 15) in the apoE gene promoter (data not shown), consistent with its inability to direct gene expression to the liver(13, 14, 15) .



The unique sequence domain at the 5` end of the LE6 fragment (nucleotides 1-461) appears to constitute the functional HCR. The minimum length having full activity is a 319-nucleotide region located at the 5` terminus. This segment contains three major footprints, two of which are liver-specific, and prominent DNase I-hypersensitive character, which is also liver-specific. Analysis of subfragments of the HCR in apoE transgene constructs demonstrates that shorter fragments have a 4-fold or greater reduction in transcriptional activity. Thus, while the footprint 1 element together with adjacent nuclease-sensitive regions can direct liver-specific expression, full activity requires a sequence of about 300 nucleotides.

Three copies of the TGTTTGC motif are found in the footprint 1 sequence, and three more copies are located nearby. Five of these sequences are associated with DNase I-hypersensitive sites, and together they probably account for the prominent liver-specific nuclease hypersensitivity that was detected in whole nuclear chromatin at this location (Fig. 4, panel D, site 17). The highly conserved TGTTTGC motif is found in the promoters and/or enhancers of many genes that are expressed in the liver at moderate to high levels (Table 2). In the human transferrin gene promoter, this motif binds a transcription factor that is specific for single-stranded DNA(37) . It is of interest that the TTTG motif is conserved in the core DNA binding sequence of high mobility group (HMG) proteins(38) . The HMG proteins act by introducing pronounced bending in DNA at the binding site(39) , an action expected to result in enhanced DNase I hypersensitivity. Perhaps, the binding of nuclear proteins to the TGTTTGC motif in the footprint 1 sequence of the HCR may induce a substantial conformational change in the DNA duplex.

Nuclease-sensitive sites are a characteristic feature of transcriptionally active chromatin in eukaryotes (reviewed in (40) ), and they are associated with a wide variety of regulatory elements. We mapped 20 DNase I-hypersensitive sites in whole nuclei that were distributed throughout the HEG1.LE1 transgene, with five of them being specific to the liver. Since HEG1 alone, without an HCR domain, is not expressed in the liver(11) , it was notable to find one liver-specific site in the proximal promoter and two liver-specific DNase I-hypersensitive sites in the second intron. These regions may participate in the hepatic expression of the apoE gene, even though transgene expression data shows that the far downstream HCR is sufficient for directing liver tissue-specific transcription.

The shortest HCR construct analyzed that resulted in full liver-specific activity was the 319-nucleotide LES2 subfragment. A comparison to the 3.8-kb LE1 parent HCR fragment (13) indicates that transgenic mice generated with either construct yield similar apoE mRNA levels per gene copy number (Table 1). The range of 2-70 transgene copies in the six independent lines examined for these two constructs suggests that HCR activity is relatively independent of the position of construct integration into the genome. This property is similar to the action of the locus control region of the human beta-globin gene(41, 42) .

Position-independent expression of exogenous genes in stably transfected cultured cells or transgenic mice has been shown to be a consequence of the presence of nuclear scaffold binding regions(41, 42) . The finding of limited nuclear scaffold binding character in the unique sequence portion of the HCR, but not in other regions of the HEG1 apoE gene fragments, is consistent with the observed HCR action. Perhaps sequences adjacent to this portion of the HCR are required for conferring more stringent position independence.

Our results differ from the recently reported findings of others(16) . They employed nuclear extracts from cultured hepatoma cells to identify two DNase I footprints in a fragment that corresponded to our nucleotides 191-223, but they did not detect footprint 1. They also concluded that a 154-bp fragment (corresponding to our nucleotides 78-231) constituted the liver-specific enhancer. Their use of cultured tumor cells, potential differences in nuclear extract protocols, as well as a transgene test vector with a comparatively short 5`-flanking sequence, may have led to their different conclusions regarding the nature of the HCR.

The results of our current study, taken together with previously published data(11, 12, 13) , argue that a domain of nucleotides 6-325 is required for full HCR activity. The footprint 1 sequence region is essential for HCR function, and it is associated with prominent liver-specific DNase I hypersensitivity. It also seems likely that DNase I nuclease-sensitive sites adjacent to the HCR, within the second intron, and within the proximal promoter region may facilitate HCR activity. In addition, a potential nuclear scaffold attachment capability may contribute to HCR action in directing position independent gene expression in the liver. While these additional elements may contribute to the unique character of the HCR, the required information that determines the liver-specific expression of the apoE gene is contained within the unique combination of elements that constitute the far downstream HCR.


FOOTNOTES

*
This work was supported in part by the National Institutes of Health Grant HL37063 (to J. M. T.) and Training Grant HL 07731 (to Q. D.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore by hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBank(TM)/EMBL Data Bank with accession number(s) U32510[GenBank].

§
To whom correspondence should be addressed: Gladstone Institute of Cardiovascular Disease, P. O. Box 419100, San Francisco, CA 94141-9100. Tel.: 415-826-7500; Fax: 415-285-5632.

(^1)
The abbreviations used are: apo, apolipoprotein; HCR, hepatic control region; HMG, high mobility group; PAGE, polyacrylamide gel electrophoresis; PCR, polymerase chain reaction; PIPES, piperazine-N,N`-bis-2-ethanesulfonic acid; kb, kilobase(s); bp, base pair(s); DHS, DNase I-hypersensitive sites.


ACKNOWLEDGEMENTS

We thank Susannah White for help with manuscript preparation, Gary Howard for editorial review, and John Carroll and Amy Corder for graphics.


REFERENCES

  1. Paik, Y.-K., Chang, D. J., Reardon, C. A., Davies, G. E., Mahley, R. W., and Taylor, J. M. (1985) Proc. Natl. Acad. Sci. U. S. A. 82,3445-3449 [Abstract]
  2. Mahley, R. W., Innerarity, T. L., Rall, S. C., Jr., and Weisgraber, K. H. (1984) J. Lipid Res. 25,1277-1294 [Abstract]
  3. Mahley, R. W. (1988) Science 240,622-630 [Medline] [Order article via Infotrieve]
  4. Brown, M. S., and Goldstein, J. L. (1986) Science 232,34-47 [Medline] [Order article via Infotrieve]
  5. Strittmatter, W. J., Saunders, A. M., Schmechel, D., Pericak-Vance, M., Enghild, J., Salvesen, G. S., and Roses, A. D. (1993) Proc. Natl. Acad. Sci. U. S. A. 90,1977-1981 [Abstract]
  6. Saunders, A. M., Strittmatter, W. J., Schmechel, D., St. George-Hyslop, P. H., Pericak-Vance, M. A., Joo, S. H., Rosi, B. L., Gusella, J. F., Crapper-MacLachlan, D. R., Alberts, M. J., Hulette, C., Crain, B., Goldgaber, D., and Roses, A. D. (1993) Neurology 43,1467-1472 [Abstract]
  7. Saunders, A. M., Schmader, K., Breitner, J. C. S., Benson, M. D., Brown, W. T., Goldfarb, L., Goldgaber, D., Manwaring, M. G., Szymanski, M. H., McCown, N., Dole, K. C., Schmechel, D. E., Strittmatter, W. J., Pericak-Vance, M. A., and Roses, A. D. (1993) Lancet 342,710-711 [Medline] [Order article via Infotrieve]
  8. Olaisen, B., Teisberg, P., and Gedde-Dahl, T., Jr. (1982) Hum. Genet. 62,233-236 [Medline] [Order article via Infotrieve]
  9. Das, H. K., McPherson, J., Bruns, G. A. P., Karathanasis, S. K., and Breslow, J. L. (1985) J. Biol. Chem. 260,6240-6247 [Abstract/Free Full Text]
  10. Lauer, S. J., Walker, D., Elshourbagy, N. A., Reardon, C. A., Levy-Wilson, B., and Taylor, J. M. (1988) J. Biol. Chem. 263,7277-7286 [Abstract/Free Full Text]
  11. Simonet, W. S., Bucay, N., Lauer, S. J., Wirak, D. O., Stevens, M. E., Weisgraber, K. H., Pitas, R. E., and Taylor, J. M. (1990) J. Biol. Chem. 265,10809-10812 [Abstract/Free Full Text]
  12. Simonet, W. S., Bucay, N., Pitas, R. E., Lauer, S. J., and Taylor, J. M. (1991) J. Biol. Chem. 266,8651-8654 [Abstract/Free Full Text]
  13. Simonet, W. S., Bucay, N., Lauer, S. J., and Taylor, J. M. (1993) J. Biol. Chem. 268,8221-8229 [Abstract/Free Full Text]
  14. Paik, Y.-K., Chang, D. J., Reardon, C. A., Walker, D. W., Taxman, E., and Taylor, J. M. (1988) J. Biol. Chem. 263,13340-13349 [Abstract/Free Full Text]
  15. Chang, D. J., Paik, Y.-K., Leren, T. P., Walker, D. W., Howlett, G. J., and Taylor, J. M. (1990) J. Biol. Chem. 265,9496-9504 [Abstract/Free Full Text]
  16. Shachter, N. S., Zhu, Y., Walsh, A., Breslow, J. L., and Smith, J. D. (1993) J. Lipid Res. 34,1699-1707 [Abstract]
  17. Gorski, K., Carneiro, M., and Schibler, U. (1986) Cell 47,767-776 [Medline] [Order article via Infotrieve]
  18. Dignam, J. D., Lebovitz, R. M., and Roeder, R. G. (1983) Nucleic Acids Res. 11,1475-1489 [Abstract]
  19. Maniatis, T., Fritsch, E. F., and Sambrook, J. (1982) Molecular Cloning , 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  20. Costa, R. H., Grayson, D. R., and Darnell, J. E., Jr. (1989) Mol. Cell. Biol. 9,1415-1425 [Medline] [Order article via Infotrieve]
  21. Ochoa, A., Brunel, F., Mendelzon, D., Cohen, G. N., and Zakin, M. M. (1989) Nucleic Acids Res. 17,119-133 [Abstract]
  22. Sladek, F. M., Zhong, W., Lai, E., and Darnell, J. E., Jr. (1990) Genes & Dev. 4,2353-2365
  23. Metzger, S., Leff, T., and Breslow, J. L. (1990) J. Biol. Chem. 265,9978-9983 [Abstract/Free Full Text]
  24. Wilson, D. B., Dorfman, D. M., and Orkin, S. H. (1990) Mol. Cell. Biol. 10,4854-4862 [Medline] [Order article via Infotrieve]
  25. Chow, B. K.-C., Ting, V., Tufaro, F., and MacGillivray, R. T. A. (1991) J. Biol. Chem. 266,18927-18933 [Abstract/Free Full Text]
  26. Amati, B. B., and Gasser, S. M. (1988) Cell 54,967-978 [Medline] [Order article via Infotrieve]
  27. Brinster, R. L., Chen, H. Y., Trumbauer, M. E., Yagle, M. K., and Palmiter, R. D. (1985) Proc. Natl. Acad. Sci. U. S. A. 82,4438-4442 [Abstract]
  28. Wahl, G. M., Meinkoth, J. L., and Kimmel, A. R. (1987) Methods Enzymol. 152,572-581 [Medline] [Order article via Infotrieve]
  29. MacDonald, R. J., Swift, G. H., Przybyla, A. E., and Chirgwin, J. M. (1987) Methods Enzymol. 152,219-227 [Medline] [Order article via Infotrieve]
  30. Schmid, C. W., and Jelinek, W. R. (1982) Science 216,1065-1070 [Medline] [Order article via Infotrieve]
  31. Jurka, J., and Smith, T. (1988) Proc. Natl. Acad. Sci. U. S. A. 85,4775-4778 [Abstract]
  32. Martin, D. I. K., Tsai, S.-F., and Orkin, S. H. (1989) Nature 338,435-438 [CrossRef][Medline] [Order article via Infotrieve]
  33. Nicolis, S., Ronchi, A., Malgaretti, N., Mantovani, R., Giglioni, B., and Ottolenghi, S. (1989) Nucleic Acids Res. 17,5509-5516 [Abstract]
  34. Watt, P., Lamb, P., Squire, L., and Proudfoot, N. (1990) Nucleic Acids Res. 18,1339-1350 [Abstract]
  35. Philipsen, S., Pruzina, S., and Grosveld, F. (1993) EMBO J. 12,1077-1085 [Abstract]
  36. Elgin, S. C. R. (1988) J. Biol. Chem. 263,19259-19262 [Free Full Text]
  37. Brunel, F., Alzari, P. M., Ferrara, P., and Zakin, M. M. (1991) Nucleic Acids Res. 19,5237-5245 [Abstract]
  38. Churchill, M. E. A., Jones, D. N. M., Glaser, T., Hefner, H., Searles, M. A., and Travers, A. A. (1995) EMBO J. 14,1264-1275 [Abstract]
  39. Giese, K., Cox, J., and Grosschedl, R. (1992) Cell 69,185-195 [Medline] [Order article via Infotrieve]
  40. Gasser, S. M., and Laemmli, U. K. (1987) Trends Genet. 3,16-22 [CrossRef]
  41. McKnight, R. A., Shamay, A., Sankaran, L., Wall, R. J., and Hennighausen, L. (1992) Proc. Natl. Acad. Sci. U. S. A. 89,6943-6947 [Abstract]
  42. Phi-Van, L., von Kries, J. P., Ostertag, W., and Strätling, W. H. (1990) Mol. Cell. Biol. 10,2302-2307 [Medline] [Order article via Infotrieve]
  43. Papazafiri, P., Ogami, K., Ramji, D. P., Nicosia, A., Monaci, P., Cladaras, C., and Zannis, V. I. (1991) J. Biol. Chem. 266,5790-5797 [Abstract/Free Full Text]
  44. Paulweber, B., Sandhofer, F., and Levy-Wilson, B. (1993) Mol. Cell. Biol. 13,1534-1546 [Abstract]
  45. Boissier, F., Augé-Gouillou, C., Schaeffer, E., and Zakin, M. M. (1991) J. Biol. Chem. 266,9822-9828 [Abstract/Free Full Text]
  46. Hayashi, Y., Chan, J., Nakabayashi, H., Hashimoto, T., and Tamaoki, T. (1992) J. Biol. Chem. 267,14580-14585 [Abstract/Free Full Text]
  47. Zhang, D.-E., Hoyt, P. R., and Papaconstantinou, J. (1990) J. Biol. Chem. 265,3382-3391 [Abstract/Free Full Text]
  48. Godbout, R., Ingram, R. S., and Tilghman, S. M. (1988) Mol. Cell. Biol. 8,1169-1178 [Medline] [Order article via Infotrieve]
  49. Zaret, K. S., Liu, J.-K., and DiPersio, C. M. (1990) Proc. Natl. Acad. Sci. U. S. A. 87,5469-5473 [Abstract]
  50. Trujillo, M. A., Letovsky, J., Maguire, H. F., Lopez-Cabrera, M., and Siddiqui, A. (1991) Proc. Natl. Acad. Sci. U. S. A. 88,3797-3801 [Abstract]
  51. Grange, T., Roux, J., Rigaud, G., and Pictet, R. (1991) Nucleic Acids Res. 19,131-139 [Abstract]

©1995 by The American Society for Biochemistry and Molecular Biology, Inc.