©1996 by The American Society for Biochemistry and Molecular Biology, Inc.
Identification, Characterization, Cloning, and Expression of Apolipoprotein C-IV, a Novel Sialoglycoprotein of Rabbit Plasma Lipoproteins (*)

(Received for publication, August 10, 1995; and in revised form, November 6, 1995)

Lin-Hua Zhang Leila Kotite Richard J. Havel (§)

From the Cardiovascular Research Institute and Department of Medicine, University of California, San Francisco, California 94143-0130

ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES

ABSTRACT

We have identified and characterized a novel proline- and arginine-rich protein component of lipoproteins, present in up to five sialylated isoforms, in rabbit blood plasma. The pI of the desialylated protein is 5.7. Based upon its N-terminal sequence, a complete cDNA sequence of 555 nucleotides was cloned from rabbit liver.The synthesized protein is predicted to contain 124 amino acids, including a typical signal peptide of 27 residues. The mature protein of 97 amino acids, designated apolipoprotein C-IV, is associated with the lipoproteins of blood plasma, primarily very low density and high density lipoproteins. It contains two potential amphipathic helices characteristic of plasma apolipoproteins and forms discoidal micelles with phosphatidylcholine. Northern analysis shows a single 0.6-kilobase apolipoprotein C-IV mRNA, detected only in the liver, and Southern analysis suggests a single copy gene. Sialylated apolipoprotein C-IV is secreted from transfected mammalian cells. Nucleotide sequence comparisons demonstrate a strong homology to portions of the upstream regions of the mouse and human apolipoprotein C2 genes, within each of which a distinct gene has recently been identified. The nucleotide sequences and the predicted amino acid sequences, as well as corresponding cDNA sequences in the rat and monkey, indicate that the apolipoprotein C4 gene has been highly conserved during mammalian evolution.


INTRODUCTION

Lipoproteins in blood plasma of mammals contain a variety of protein components, most of which are water-soluble proteins, encoded by a gene family characterized by repeating sequences of amphipathic helices containing 11 or 22 amino acids(1) . Those of low molecular mass (approximately 6-12 kDa) are generally called ``C'' apoproteins, of which three are well recognized(2) . These apoproteins are found in triglyceride-rich lipoproteins (chylomicrons and very low density lipoproteins (VLDL)) (^1)and high density lipoproteins (HDL), among which they readily transfer according to the relative concentrations and, presumably, surface areas of the lipoprotein particles(3) . ApoC-I and apoC-II lack carbohydrate, whereas apoC-III has an O-linked carbohydrate moiety containing up to three sialic acid residues(2) . The functions of the C apoproteins are only partly understood. During postprandial lipemia, C apoproteins move from HDL to nascent chylomicrons and are returned to HDL as chylomicrons are metabolized by lipoprotein lipase(4) . ApoC-II is a specific cofactor for the action of lipoprotein lipase upon the ester bond of triacylglycerols and glycerophosphatides of triglyceride-rich lipoproteins(5, 6) . The other C apoproteins can inhibit this action of apoC-II(5, 6) , and all of the C apoproteins can inhibit the clearance of remnants of triglyceride-rich lipoproteins by the liver(7) .

The C apoproteins are most commonly visualized in triglyceride-rich lipoproteins by isoelectric focusing electrophoresis in polyacrylamide gels. During studies of lipoprotein metabolism in the rabbit, we observed a complex pattern of apoproteins in triglyceride-rich lipoproteins(8) , unlike that found in other mammals. This pattern, which was also seen in the C apoproteins separated from other soluble apoproteins by gel permeation chromatography, was found to reflect the presence of a second sialoglycoprotein, in addition to apoC-III. Based upon the N-terminal amino acid sequence, which was unusually rich in proline, we have now cloned the cDNA and expressed this protein which was found to be homologous to that of a recently identified gene in the apoE-apoC1-apoC2 gene cluster on mouse chromosome 7(9) .


MATERIALS AND METHODS

Preparation of Lipoproteins and Apolipoproteins

Blood was obtained from ear veins of unanesthetized rabbits or by aortic puncture of rabbits anesthetized with ketamine/zylazine that had been fed normal chow or chow enriched with 1% cholesterol. Two rabbits fed normal chow were made diabetic 3-5 days before blood was obtained by intravenous injection of alloxan (200 mg/kg)(10) . To assess the acute-phase response of C apoproteins, rabbits were bled 48 h after subcutaneous injection of 0.5 ml of turpentine per kg of body weight (11) or an equal volume of 0.15 M NaCl. Blood was collected into tubes containing disodium EDTA, benzamidine, and phenylmethylsulfonyl fluoride to give final concentrations of 0.5, 0.3, and 0.1 mg/ml, respectively. Plasma was obtained after centrifugation at 2000 rpm for 20 min and subjected to ultracentrifugation at densities of 1.006, 1.063, and 1.21 g/ml to obtain lipoprotein fractions: VLDL, intermediate density lipoproteins and low density lipoproteins (IDL + LDL), and HDL(12) . For purification, lipoprotein fractions were recentrifuged at the upper density limit. To remove sialic residues from apolipoproteins, lipoproteins were incubated with 0.1 unit neuraminidase from Clostridium perfringens (Boehringer Mannheim) per mg of protein in 0.02 M sodium acetate, pH 5.4, for 2 h at 37 °C.

Apolipoproteins were obtained after delipidation of lipoprotein fractions with ethanol:ether (3:1)(13) . C apoproteins were separated from apoVLDL dissolved in 0.015 M Tris, 8 M urea, pH 8.2, by chromatography on Sephadex G-200 (Pharmacia Biotech Inc.)(14) . Individual apolipoproteins were separated by preparative isoelectric focusing electrophoresis as described (15) or by anion exchange chromatography on DEAE-cellulose (DE52, Whatman). For the latter, elution was with a linear gradient of 0.01-0.15 M Tris (pH 8.2) containing 6 M urea. Apolipoproteins and their components were evaluated by one-dimension isoelectric focusing (pH 3.5-7.0)(16) , SDS-polyacrylamide gel electrophoresis(17) , and two-dimensional electrophoresis(18) . The amino acid composition of individual C apoproteins was determined in an amino acid analyzer(19) . Content of cysteine was evaluated after treatment of proteins with performic acid(20) . For amino acid sequencing, samples containing 800 pmol of purified protein were subjected to Edman degradation in an Applied Biosystems 470A gas-phase Sequencer. The phenylthiohydantoin-derivatives were identified and quantified by reverse-phase HPLC with an on-line Applied Biosystems 120A PTH analyzer.

Quantitative Assays

ApoE was quantified by a radioimmunoassay(21) . A similar radioimmunoassay was developed to quantify apoC-IV with desialylated apoC-IV as standard. Antibodies against apoC-IV were raised in a guinea pig. For this purpose, 150 µg of electrophoretically pure desialylated apoC-IV, mixed with an equal volume of complete Freund's adjuvant, were injected intradermally into several sites on the backs of each of two adult guinea pigs, with booster injections of 100 µg after 3 and 6 weeks. Antiserum was obtained 7 weeks after the initial injection. To remove trace immunoreactivity against apoE, the antiserum was recycled through a rabbit apoE affinity column(22) . The purified antiserum showed no detectable immunoreactivity against rabbit apoE, apoC-II, or apoC-III. Content of total cholesterol in plasma and lipoprotein fractions was determined by enzymatic procedures(23) .

Other Analytical Procedures

Western blotting was carried out after transfer of proteins from polyacrylamide gels to nitrocellulose paper (24) with diluted antisera against apoC-IV followed by I-labeled goat anti-guinea pig IgG (Sigma) and autoradiography. Unilamellar vesicles of dimyristoylphosphatidylcholine, before and after incubation with apoproteins, were visualized by electron microscopy after negative staining(25) . The cofactor property of apoproteins for the activity of lipoprotein lipase from bovine milk was assayed as described(6) .

cDNA Cloning

A reverse transcription-polymerase chain reaction (RT-PCR)-based cDNA cloning strategy (26) was used to isolate a cDNA encoding for apoC-IV. Total RNA was isolated from rabbit liver by the method of Chirgwin et al.(27) and Sambrook et al.(28) . Single-stranded cDNA was synthesized with 250 units of SuperScript II RNase H (Life Technologies, Inc.) from 3 µg of total RNA after priming with 30 pmol of a cDNA synthesis primer (CDS primer, 5`CCCCGAATTCTTTTTTTTTTTTTTTTTT(A/G/C)(A/G/C/T)3`)(29) . Each RT product stock was then diluted 500-fold and used in subsequent PCR reactions.

Based upon the N-terminal amino acid sequence of apoC-IV: EPEGtP(n/t)PLPA(p/a)EESRWS(l/t)V (parentheses represent ambiguities, and lower case letters indicate weak signals), degenerate primers were synthesized (Operon Co.) complementary to the first 9 amino acid residues: P1S3, 5`GA(A/G)CCNGA(A/G)GGNACNCCNAA(T/C)CCN(C/T)T3` (N = A, G, C, or T); and the nested primer: P2S2, 5`CCI(C/T)TICCIGCI(C/G)CIGA(A/G)GA(A/G)3`, (I = deoxyinosine) to amino acid residues 8 to 14 as indicated in Fig. 5. The 3`-portion of the apoC-IV cDNA was first screened by three successive rounds of PCR. The first round was asymmetric, with CDS-primed single-stranded cDNA as the template. Amplification was carried out with P1S3 (4 pmol) as primer in 67 mM Tris (pH 8.3), 12 mM (NH(4))(2)SO(4), 2 mM MgCl(2), 17 µg/ml bovine serum albumin, 0.15 mM concentration each of dATP, dGTP, dCTP, and dTTP, and 1 unit of Taq polymerase (Life Technologies, Inc.) in a thermal cycler (Perkin-Elmer) for 30 cycles at 94 °C, 45 s; 50 °C, 60 s; 72 °C, 2 min, with a last extension at 72 °C for 7 min. One µl of the first round PCR product was used in the second round PCR amplification, probed by 8 pmol each of CDS and P1S3 under the same conditions for 30 cycles. Two bands (major, 380 bp; minor, 330 bp) were resolved by electrophoresis in 1.5% agarose gel and purified, and each was used in the third round PCR amplification with 14 pmol each of CDS and P2S2 under the same conditions for 30 cycles. The amplification product from the 380-bp band was 360 bp (about 21 bp shorter as expected for the nested P2S2 primed product). The 330-bp band yielded only unrelated products. The PCR product containing the 360-bp cDNA was treated with 1.5 units of T4-DNA polymerase and 5 units of polynucleotide kinase (Life Technologies, Inc.) (30) and purified by ethanol precipitation before cloning into the SmaI site of Bluescript II KS vector (Stratagene). The resultant cDNA inserts were sequenced on both directions by the dideoxynucleotide chain termination method (31, 32) using sequenase Conversion 2.0 DNA sequencing kits (U. S. Biochemical Corp.).


Figure 5: Nucleotide and deduced protein sequences of the cDNA for apoC-IV. The cDNA was isolated from rabbit liver by PCR screening. The open reading frame of 124 amino acids, in one-letter code for the predicted amino acids, is indicated below the nucleotide sequence. A translation initiation codon, ATG, is indicated by bold letters. Positions of oligonucleotides used for PCR amplifications and their orientations with respect to the sense strand are indicated by arrows. * represents the stop codon. The polyadenylation signal is underlined. +1 indicates the start of the mature protein determined by protein sequencing.



We used the 5` rapid amplification of cDNA ends (5`-RACE) method (26) to clone the 5`-portion of apoC-IV cDNA. The cDNA was synthesized from 2.5 µg of total RNA with 12.5 units of avian myeloblastosis virus reverse transcriptase (Boehringer Mannheim) and 4 pmol of a specific primer P5A1 (5`GCCACCTCTCTCTGGTCCTAGTCA3`) at 42 °C for 1 h and 52 °C for 30 min. The RT products were purified on a Centricon 100 column (Amicon Co.) and tailed with oligo(dA) and 18 units of terminal DNA transferase (Pharmacia Biotech) at 37 °C for 5 min. The reaction mixtures were diluted 200-fold and used in 5`-RACE reaction. The amplification was performed as above except annealing was at 50 °C for 45 s. The nested PCR was carried out with 100-fold dilution of the first PCR product and 12 pmol each of CDS and the nested primer P5A2 (5`AGTCAGCAGCGGACCCACCAGCTCCTT3`) as above except annealing was at 62 °C for 30 s. A single band of PCR product (280 bp) was cloned and sequenced as above.

A full-length apoC-IV cDNA was obtained by sequencing the PCR product primed with the sense primer p8S (5`CCGAATTCGCAGAGAGGGACAGA3`) and the antisense primer p8A (5`CCGAATTCTGGGTGAGCAGGTGA3`) amplified at 94 °C, 45 s; 62 °C, 30 s; 72 °C, 90 s for 40 cycles, with a last extension at 72 °C for 7 min. This set of primers, with an inserted EcoRI site in each, was derived from the 5`- and 3`-ends of the separately cloned nucleotide sequences. Several deletions of the cloned cDNA were made and sequenced in both directions to resolve ambiguities. Each full-length RT-PCR and cloning were repeated three times.

Northern Analysis

RNA samples were subjected to electrophoresis in a denaturing 1.5% (w/v) agarose, 7% formaldehyde gel, transferred to Hybond N nylon membrane (Amersham), and probed with a P-labeled apoC-IV cDNA lacking the poly(A) tail (>5 times 10^8 cpm/µg of DNA). The prehybridization and hybridization were performed in 0.25 M Na(2)HP0(4), 0.1 mM EDTA, 7% SDS, pH 7.0, 100 µg/ml denatured salmon sperm DNA(28, 33) , with the probe present at 10^6 cpm/ml during the hybridization at 65 °C for 1 h or overnight. The blots were washed twice at room temperature for 20 min using 2 times SSC (1 times SSC is 0.15 M NaCl, 0.015 M sodium citrate, pH 7.0) and 0.1% SDS, and then in 0.1 times SSC, 0.1% SDS for 10 min. The final wash was carried out in 0.1 times SSC, 0.1% SDS at 50 °C for 15 min and was followed by autoradiography.

Southern Analysis

Rabbit genomic DNA, prepared from rabbit liver by standard methods, was digested with restriction endonucleases, and electrophoresis (0.8% w/v agarose) and Southern transfer were performed according to described procedures(28) . Hybridization and autoradiography were carried out as above except that the final washing steps were done twice at 65 °C for 15 min.

Expression in E. coli

A BamHI-BamHI fragment of apoC-IV cDNA coding the 124-amino acid polypeptide was produced by PCR and inserted into the BamHI fusion site of the expression vector pET3c containing a 14-amino acid linker sequence. Another construct was made by inserting the NdeI-BamHI fragment of apoC-IV cDNA encoding only the 97-amino acid mature protein into vector pET3b. Escherichia coli BL21(DE3) cells, transformed with these plasmids, were then induced with 0.4 mM isopropyl-1-thio-beta-D-galactopyranoside to an optical density of 0.5 at 600 nm for 2-4 h(34) . The whole cell extracts were resolved on SDS-15% polyacrylamide gel, transferred to a nitrocellulose membrane, and subjected to Western blotting as above.

Expression and Secretion from Mammalian Cells

Chinese hamster ovary (CHO)-K-1 cells were grown in monolayer in 10-cm dishes containing 10 ml of Dulbecco's modified Eagle's medium-Ham's 21 (DME-H21), 10% fetal calf serum, containing 1.1 mg of sodium pyruvate, and nonessential amino acids in a humidified atmosphere of 5% CO(2), 95% air. COS-7 African green monkey kidney cells were grown in DME-H21 medium containing 4.5 g of glucose/liter and 10% FBS as described(35, 36) . A full-length apoC-IV cDNA lacking the poly(A) tail was subcloned into the EcoRI sites of the expression vectors pMT2 (37) and pBJ1(38) . Two µg of plasmid prepared by density ultracentrifugation in cesium chloride (28) or a simplified large-scale alkaline lysis (39) followed by treatment with RNase A and proteinase K were transfected into the cells by the adenovirus/DEAE-dextran method(36) . Transfection efficiency was monitored by cotransfection of a plasmid containing Rous sarcoma virus beta-galactosidase and staining with 3 mM potassium ferricyanide, 3 mM potassium ferrocyanide, 1 mM MgCl(2), and 0.5 mg/ml 5-bromo-4-chloro-3-indolyl beta-D-galactoside (36) or by beta-galactosidase assay(33, 35) . After a 2-h transfection, the transfection solution was removed and cells were rinsed with 5 ml of 10% dimethyl sulfoxide in phosphate-buffered saline. Ten ml of fresh medium were added, and incubation was continued for 48 h before collection of the medium.

For immunoprecipitation, 2 µl of anti-apoC-IV antiserum was added to 5 ml of the medium collected from each dish of transfected cells and the mixture, supplemented with 2 µg/ml each of apoprotinin, leupeptin, and pepstatin A, and 0.02% sodium azide, was incubated with rotation at 4 °C overnight. Then, 80 µl (0.1 g/ml) of protein A-linked Sepharose CL-4B beads (Pharmacia) were added and incubation was continued for 4 h. Beads were sedimented at low speed for 5 s and washed twice with cold phosphate-buffered saline. For desialylation, immunoisolated protein on the beads was incubated with 7.5 milliunits of neuraminidase as above for 24 h. The protein was eluted with SDS sample buffer at 95 °C for 5 min and subjected to SDS-PAGE and Western blot analysis.


RESULTS

Isolation and Characterization of ApoC-IV

ApoC-IV was found during a systematic analysis of small molecular weight apoproteins (C apoproteins) of very low density lipoproteins (VLDL) from rabbit blood plasma. Upon isoelectric focusing, electrophoresis of apoVLDL from normal rabbits, several components with isoelectric points more acidic than apoE were evident (Fig. 1, lane 5). These proteins were also seen upon isoelectric focusing electrophoresis of the C apoprotein fraction separated from apoVLDL by gel filtration chromatography (Fig. 2, lane 1). The amino acid composition of the three most acidic of these proteins, isolated by preparative isoelectric focusing (Fig. 2), resembled that of apoC-III from other species, except that the component with a pI of 4.6 (designated ``apoC-III(0)'') contained somewhat more proline and arginine. The composition of the other components differed from that of known C apoproteins in other species. The component with a pI of 5.0 (third lane) had cofactor properties for lipoprotein lipase from bovine milk (data not shown), but its amino acid composition differed substantially from that expected for apoC-II. The amino acid composition of the components with pI 5.2 and 4.8 (second and fourth lanes) was similar, and they were particularly rich in proline and arginine (designated ``apoC-IV'' in Fig. 1and Fig. 2). These components had no detectable cofactor activity for lipoprotein lipase. After desialylation of the apoVLDL with neuraminidase, the isoelectric focusing pattern was simplified, with three major components evident (Fig. 2, last three lanes). The most acidic of these (pI 4.6) was identified as apoC-III(0) by its amino acid composition, and the next most acidic (pI 5.0) as apoC-II by its composition and cofactor activity for lipoprotein lipase. The amino acid composition of the most basic component (pI 5.7) was essentially identical with that of the proline and arginine-rich components from non-desialylated C apoproteins.


Figure 1: Isoelectric focusing polyacrylamide slab gel electrophoretograms (pH 3.5-7.0) of rabbit apoVLDL. Lanes 1 and 2, 70 µg of apoVLDL from a rabbit injected with turpentine 48 h before blood was sampled; lane 3, 6 µg of purified desialylated apoC-IV; lanes 4 and 5, 70 µg of apoVLDL from a normal rabbit injected with 0.15 M NaCl as above. The first of each pair of apoVLDL samples was treated with neuraminidase. SAA = serum amyloid A. The minor band in lane 3 may represent carbamylated apoC-IV.




Figure 2: Isoelectric focusing polyacrylamide tube gel electrophoretograms (pH 3.5-7.0) of C apolipoproteins of VLDL from cholesterol-fed rabbits. Lane at left shows unfractionated C apoproteins. Individual C apoproteins were separated by preparative isoelectric focusing electrophoresis and identified on the basis of amino acid composition (see text). The group of three gels at the right is from a separate preparation of apoVLDL that was treated with neuraminidase.



The individual proteins were also isolated from the C apoprotein fraction by ion exchange chromatography (Table 1). In addition to the three proteins separated by preparative isoelectric focusing, a major component was found with an amino acid composition that resembled human apoC-I together with a component tentatively identified as the 12-kDa thrombolytic fragment of apoE(42) . The novel proline- and arginine-rich protein (PARP) thus appeared to be a sialoglycoprotein, normally present as isoforms with up to 4 or 5 sialic acid residues. Its apparent molecular mass on SDS-gel electrophoresis was about 14 kDa. This was confirmed by two-dimensional electrophoresis with Coomassie Blue staining and immunoblotting (Fig. 3), which also showed that the apparent molecular mass of the sialylated forms of PARP was greater than that of the desialylated protein. PARP contained approximately 2 cysteine residues (Table 1). Upon immunoblotting apoVLDL separated by SDS gel electrophoresis without addition of reducing agents, bands of higher molecular weight, consistent with homodimers and homotrimers, were observed (data not shown). These components were absent in the presence of reducing agents, consistent with disulfide linkage of the oligomers.




Figure 3: Two-dimensional gel electrophoretic analysis of apoVLDL (A and C) and desialylated apoVLDL (B and D). Gels from one preparation of apoVLDL were stained with Coomassie Blue R-250 (A and B) and those from another preparation of apoVLDL were subjected to Western blotting with apoC-IV antiserum (C and D). Cathode was at left and anode at right of each gel.



PARP was also evident upon isoelectric focusing of apoHDL (data not shown). By radioimmunoassay, PARP was distributed mainly between VLDL and HDL of a rabbit fed regular chow, with undetectable amounts in IDL, LDL, and lipoprotein-free serum ( > 1.21 g/ml), whereas PARP was detected only in VLDL from a cholesterol-fed rabbit (Table 2). PARP was also found in apoVLDL isolated from perfusates of isolated rabbit livers (data not shown). Thus, PARP appeared to be a bona fide apolipoprotein that was distributed in a manner resembling that of other C apoproteins. Based upon these properties, PARP was provisionally designated as apoC-IV.



Properties of ApoC-IV

Since apoC-IV was first isolated from plasma of alloxan diabetic rabbits, in which increased amounts of serum amyloid A were also expressed (see also Fig. 2, first lane), we considered the possibility that apoC-IV is an acute phase protein. Within 48 h after administration of turpentine subcutaneously to two normal rabbits, the apparent mass of serum amyloid A was markedly increased, whereas that of apoC-IV was not (Fig. 1); furthermore, the concentration of apoC-IV, estimated by radioimmunoassay, was in the same range observed in control rabbits injected with physiological saline (data not shown).

The affinity of apoC-IV for lipids was confirmed by its ability to convert small unilamellar vesicles of dimyristoylphosphatidylcholine to discoidal particles. These discs were, however, considerably larger than those produced with rabbit apoE (Fig. 4).


Figure 4: Negatively stained preparations of discoidal complexes of apoC-IV with dimyristoyl phosphatidylcholine with apoC-IV (top) and rabbit apoE (bottom). For each preparation, unilamellar liposomes of dimyristoylphosphatidylcholine prepared in a French pressure cell (120-170 µg) were mixed with one-fifth the mass of apoC-IV or apoE in 0.25 ml of 0.15 M NaCl and incubated at 37 °C for 1 h before negative staining. times 180,000.



Molecular Cloning of ApoC-IV

Despite its prominence in normal rabbit VLDL, no component resembling apoC-IV was evident upon Coomassie Blue staining of isoelectric focusing electrophoretograms of apoVLDL from rats, mice, or humans and in these species no immunoreactive protein was found in lipoprotein fractions by radioimmunoassay with apoC-IV antiserum (data not shown). To learn more about the origin of apoC-IV, we used an RT-PCR-based cloning strategy to isolate a cDNA encoding apoC-IV. This strategy would encompass cDNAs in rabbit liver of less than 500 bp. To increase the likelihood of obtaining cDNAs from rare transcripts, we used asymmetric PCR in the first round amplification, which gave only a faint smear on ethidium bromide-stained agarose gels. In the subsequent amplifications with a nested primer, we obtained a band of 360 bp. Sequencing this band revealed a typical 3`-region of a cDNA containing a poly(A) tail, a polyadenylation signal and a stop codon, together with an open reading frame encoding 90 amino acids beginning with residue 8 of the N-terminal sequence (Fig. 5). The amino acid composition of the predicted polypeptide was in excellent agreement with that determined by amino acid analysis of apoC-IV (Table 3). The 5`-cDNA sequence (277 bp) was subsequently obtained by 5`-RACE cloning, as described under ``Materials and Methods,'' which displayed a typical 5` portion of a full-length cDNA containing predicted residues 1-7 of the N-terminal amino acid sequence of apoC-IV (Fig. 5). With this information, the full-length cDNA of apoC-IV was then cloned and sequenced. The sequence was identical except for T rather than C at positions 234 and 398 observed in the 3`-cloning. The earlier result probably reflects an error of incorporation during PCR amplification.



Sequence Analysis of Full-length ApoC-IV cDNA

ApoC-IV cDNA comprises 555 nucleotides, and the decoded cDNA sequence revealed an open reading frame of 372 nucleotides encoding a protein of 124 residues with a molecular mass of 14,097 Da (Fig. 5). The sequence encodes a typical signal peptide of 27 residues and a mature protein of 97 residues (as indicated by the N-terminal amino acid sequence) with a molecular mass of 11,020 Da. The sequence context about the first AUG, GAAAUG is among the most commonly found for functional initiation codons in vertebrates(43) . The apoC-IV cDNA sequence contains 97 nucleotides of the 5`-untranslated region composed of nine GGGACAG(A/G) repeats, and 65 nucleotides of the 3`-untranslated region with the polyadenylation signal, AATAAA, 21 nucleotides upstream from beginning of the poly(A) tail.

The amino acids at positions -3 and -1 upstream of the N terminus of the mature protein (Val and Cys, respectively) fit well into the ``(-3,-1) rule'' for the signal cleavage site of eukaryotes(44) . The calculated pI of the mature protein is 5.05. Amphipathic alpha-helix analysis (45, 46, 47) of the predicted amino acid sequence indicates two possible regions (residues 48-71 and 92-113) of the protein that may be responsible for lipid binding (Fig. 6). The sequence lacks the N-glycosylation consensus site NX(S/T), but there are several candidate Ser and Thr residues for O-glycosylation. Two cysteines are present near the C terminus.


Figure 6: Comparison of helical wheel diagrams (45) of predicted amino acids of homologous regions of rabbit apoC-IV cDNA (A and B), mouse ACL gene (C and D)(9) , and human apoC-IV cDNA (E and F)(54) . Negatively and positively charged amino acids are indicated by - and +, respectively. * indicates proline residue 57 in rabbit and mouse sequences.



DataBank (Swiss-prot, EMBL, and GenBank) searching revealed significant homology between apoC-IV cDNA (146-307 bp) and the 5`-flanking regions of cynomolgus monkey apoC-II (86-247 bp, 77%(48) ) and mouse apoC-II (114-244 bp, 82%(49) ). This 5`-flanking region of the mouse apoC2 gene has been sequenced recently (9) and contains a novel gene (the ``ACL'' gene) within the mouse apoE-apoC1-apoC2 gene cluster on chromosome 7. The complete nucleotide sequences of rabbit apoC-IV and mouse ACL cDNA shared 64% homology and the predicted amino acid sequences shared 62% amino acid identity.

Northern Analysis

Northern hybridizations were performed with total RNA isolated from several rabbit tissues. Only a single band of about 0.6 kb of the transcript from liver was detected (Fig. 7), suggesting that our cloned cDNA is complete, and that there is only one major form of mRNA in liver. Our rabbit cDNA failed to recognize any transcript in RNA from human adrenal or HepG2 cells (not shown).


Figure 7: Northern blot analysis of tissue expression of the apoC4 gene. A, 5 µg of poly(A) RNA (lane 1) or 20 µg of total RNA (lanes 2-7) from various tissues were subjected to electrophoresis in a denaturing agarose/formaldehyde gel, transferred to a nylon membrane, and probed with P-labeled full-length apoC-IV cDNA. The membrane was exposed to x-ray film for 9 days at -70 °C. Lanes 1 and 2, liver; 3, kidney; 4, brain; 5, heart; 6, small intestine; 7, skeletal muscle. Positions of standard RNA size markers (G319a, Promega) are indicated on the left. B, ethidium bromide-stained RNA blotted on a nylon membrane shows the equivalence of RNA mass after transfer.



Southern Analysis of Rabbit Genomic DNA

Fig. 8shows the pattern of hybridization obtained when 537 bp of apoC-IV cDNA was used to probe rabbit genomic DNA digested with four restriction enzymes. In all cases, a single band was found, suggesting a simple gene structure, consistent with the presence of a single copy of the apoC-IV gene per haploid rabbit genome.


Figure 8: Southern blot analysis of genomic DNA. Genomic DNA (20 µg) from rabbit liver was digested with restriction endonucleases: B, BamHI; H, HindIII; P, PstI; and E, EcoRI, subjected to electrophoresis in 0.8% agarose gel, transferred to a nylon membrane, and probed with P-labeled full-length apoC-IV cDNA. The membrane was exposed to x-ray film for 6 days at -70 °C. Positions of standard DNA markers (1-kb ladder, Life Technologies, Inc.) are indicated on the left.



Expression of ApoC-IV

E. coli cells, transformed with the expression vector containing the apoC-IV cDNA, but not the vector alone, synthesized proteins of apparent molecular mass of 16 kDa and 14 kDa for the 124-amino acid and 97-amino acid constructs, respectively, which were recognized by guinea pig anti-C-IV (Fig. 9A, lanes 4 and 6). Evidently, the mobility of both forms of apoC-IV is anomalous, like that of apoC-IV isolated from VLDL (Fig. 9A, lanes 2 and 7). To explore this further, we expressed full-length apoC-IV cDNA in mammalian cells.


Figure 9: Western blot analysis of rabbit apoC-IV expressed in E. coli (A) and CHO cells (B). A, lanes 1, molecular mass markers; 2, 0.25 µg of desialylated rabbit apoVLDL; 3, cell extract transformed with pET3c vector alone; 4, cell extract transformed with pET3c vector containing the cDNA encoding the 124 amino acids of apoC-IV and an additional 14-amino acid linker sequence; 5, cell extract transformed with pET3b vector alone; 6, cell extract transformed with pET3b vector containing the cDNA encoding only the 97 amino acids of mature apoC-IV; 7, 0.25 µg of sialylated rabbit apoVLDL. B, lanes 1, molecular mass markers; 2, medium from transfected CHO cells with pMT2 vector alone; 3 and 4, media from transfected CHO cells with pMT2 apoC-IV cDNA; 5 and 6, 0.1 µg of rabbit apoVLDL. Expressed protein in lanes 3 and 6 was desialylated with neuraminidase.



In cell extracts from transfected COS-1, COS-7, Chinese hamster ovary (CHO), and HepG2 cells, we observed several nonspecific bands around 10-18 kDa in Western blot analysis with our polyclonal antiserum (data not shown). Since apoC-IV was isolated from blood plasma and should be detected in medium of transfected cells as a secreted protein, we used the medium from cells to assess the expression of apoC-IV cDNA. Fig. 9B shows that the immunoisolated proteins from medium of CHO cells transfected with pMT2 vector harboring apoC-IV cDNA yielded a specific band of about 16 kDa (lane 4). After desialylation, this band shifted to about 14 kDa (lane 3). Thus, the mobility of asialoglycosylated apoC-IV was similar to that of the unglycosylated protein. Furthermore, the apparent molecular mass of the expressed apoC-IV cDNA product before and after desialylation was similar to the sialylated and desialylated forms of apoC-IV from rabbit apoVLDL, respectively, consistent with the presence of a sialylated carbohydrate moiety. Similar results were obtained with transfected COS-7 cells (data not shown).


DISCUSSION

We have identified a novel protein associated with lipoproteins in blood plasma of rabbits. Other proteins, such as serum amyloid A, which is secreted in large amounts as part of the acute phase response(50) , associate with lipoproteins and many others, such as complement 4B-binding protein(51, 52) , have been found to associate with lipoproteins to a limited extent. The novel protein, which we have designated as apoC-IV, appears to be a bona fide apolipoprotein by several criteria: 1) it is present primarily or exclusively in association with lipoproteins; 2) it binds to phospholipids, converting unilamellar liposomes of dimyristoylphosphatidylcholine to discoidal particles; 3) it is present in measurable quantities at all times and does not appear to be a typical acute phase reactant; 4) it contains two regions predicted to form amphipathic helices of the type found in most soluble apolipoproteins. Based on its molecular weight and distribution mainly in VLDL and HDL, apoC-IV appears to fall into the group of C apolipoproteins. Like apoC-III, it is a sialoglycoprotein, present in several isoforms.

The content of apoC-IV among rabbit C apolipoproteins is appreciable, as judged from Coomassie Blue staining and ion exchange chromatography, the mass of apoC-IV appears to be substantially greater than that of apoC-II and similar to that of apoC-I and apoC-III.

The N-terminal sequence of apoC-IV, with 5 prolines among the first 12 amino acids, gave no clue to its provenance. However, its structure otherwise is typical with respect to its signal peptide and the presence of amphipathic helical sequences downstream. As with other C apolipoproteins, apoC-IV mRNA is expressed mainly (possibly exclusively) in the liver. As predicted, it is evidently secreted from transfected mammalian cells as a sialoglycoprotein. The apparent molecular mass of sialylated apoC-IV is about 15,500 Da; that of asialoC-IV (approximately 14,000 Da) in SDS-PAGE is about 3,000 Da higher than that of the aminoacyl chain (11,020 Da). This anomalous behavior is evidently not related to the presence of one or more O-linked carbohydrate moieties (Fig. 9). The proline-rich N-terminal region may contribute to the reduced mobility.

The protein sequence of apoC-IV showed little homology with that of other apolipoproteins, but, surprisingly, the cDNA sequence showed a striking homology with the 5`-flanking region of monkey and mouse apoC-II(48, 49) . The basis for this homology has become clear from the recent work of van Eck, Hoffer and associates(9) , who initially found two additional exons 5` to the start site of the mouse apoC2 gene(49) . Evidence for a similar gene (the ``ECL'' gene) was found earlier in the rat by Shen and Howlett(53) , in the region of the apoE gene, but with an orientation apparently opposite to that found in the mouse. Van Eck and associates (9) have recently shown that these exons are contained within a novel gene in the apoE, apoC1, apoC2 gene cluster on mouse chromosome 7, the orientation of which is identical with that of the other three genes in this cluster (Fig. 10). This gene, which they designated the apoC2-linked (ACL) gene, is composed of three exons and is expressed in liver as a transcript of 473 bp. It encodes a putative protein of the precise length of apoC-IV (124 amino acids). The mRNA for the putative mouse protein, like apoC-IV, is expressed solely in the liver. From the sequence of the C-terminal portion of the putative signal peptide(9) , we predict that the site of signal peptide cleavage would be the same as for apoC-IV (following residue 27). The overall homology of amino acids 1-27 is 68%. Except for the first 7 amino acids (Fig. 5), the predicted sequence of the mature mouse protein also shows striking homology to apoC-IV, 71% for amino acids 35-124 and, when conservative substitutions are included, 80% (Fig. 10). Essentially the same amphipathic helical regions are predicted for the putative mouse protein encoded by the ACL gene and apoC-IV (Fig. 6).


Figure 10: Comparison of the deduced amino acid sequences of the rabbit apoC-IV cDNA and the mouse ACL gene(9) , partial sequence of the rat ECL gene(53) , the cynomolgus monkey 5`-flanking region of apoC-II cDNA(48) , and the human apoC4 gene (54) . Identical amino acids are indicated by - -. Gaps were introduced to maximize alignments. The stop codon in the rat sequence is indicated by an asterisk. The numbers assigned to the right are based on the methionine at the start of the open reading frame deduced from the nucleotide sequences.



After this work was completed, we became aware of the sequence of the human gene, also present in the same gene cluster just upstream of the apoC2 gene on chromosome 19(54) . The sequence (Fig. 10) includes a 9-nucleotide insert similar to that of the monkey gene, yielding a predicted protein of 127 amino acids. Based upon the hydropathy profile of the predicted protein and the assumption that the N terminus is initiated at the start of exon two, Allan and associates (54) predicted the signal peptide to have 25 amino acids. The amphipathic helices predicted for the human protein also include two regions. A 21-mer (residues 95-116) resembles that found for the rabbit and mouse sequences but differs with respect to the polar face, for which only the predicted human sequence contains a positively charged amino acid (Lys-110) (Fig. 6). A 28-mer amphipathic helix (residues 47-74) differs from the corresponding rabbit and mouse sequences by the presence of a codon for threonine (ACA) at residue 60 (Fig. 10). In all other species (monkey at predicted residue 60 and mouse, rat and rabbit at predicted residue 57), the equivalent codon specifies proline (CC(G/A)) which should produce a kink in the helix(55) . If this were not the case, the sequences for the species other than the human would predict a similar amphipathic helix in this region (Fig. 6).

The structure of the mouse and human apoC4 genes as well as that of the rabbit (^2)is similar, with three exons and two introns (two short exons upstream, the first encoding most of the signal peptide, and a longer terminal exon encoding the C-terminal region). The second and third exons each contain an amphipathic helical region. This genomic organization and the location of the apoC4 gene within the apoE, apoC1, apoC2 gene cluster provide strong support for inclusion of apoC-IV in the apolipoprotein gene family, which includes not only the four C apoproteins and apoE, but also apoA-I, -A-II, and -A-IV(1) . Conservation of the genomic and protein structure of apoC-IV across mammalian evolution from mouse to humans strongly suggests that apoC-IV has an important function. It is thus puzzling that the protein apparently is not expressed and secreted at a more substantial level in species other than the rabbit. A low level of secretion in humans has been predicted by Allan et al.(54) , based upon a level of expression of apoC-IV mRNA in human liver only about 1% of that of apoC-II. Clearly, further work is needed to clarify the reasons for such variable expression of apoC-IV (and in particular the basis for an important function at some stage of life).


FOOTNOTES

*
This work was supported by National Institutes of Health Grant HL-14237. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore by hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBank(TM)/EMBL Data Bank with accession number(s) U39356[GenBank].

§
To whom correspondence should be addressed. Tel.: 415-476-2226; Fax: 415-476-2283.

(^1)
The abbreviations used are: VLDL, very low density lipoprotein; LDL, low density lipoprotein; HDL, high density lipoprotein; apo, apolipoprotein; PAGE, polyacrylamide gel electrophoresis; bp, base pair(s); kb, kilobase(s); RT-PCR, reverse transcription-polymerase chain reaction; RACE, rapid amplification cDNA ends; PARP, proline- and arginine-rich protein.

(^2)
L.-H. Zhang, L. Kotite, and R. J. Havel, unpublished data.


ACKNOWLEDGEMENTS

We thank Philippe Duchateau for assistance in verifying the N-terminal sequence of apoC-IV, Clive Pullinger for synthesizing several oligonucleotides, Barney Welsh for providing pBJ1 expression vector, and Robert Hamilton for visualization of lipid-protein complexes. We also thank Clive Pullinger and Qian Jin Hu for critically reading the manuscript and John Kane for advice.


REFERENCES

  1. Li, W. H., Tanimura, M., Luo, C. C., Datta, S., and Chan L. (1988) J. Lipid Res. 29, 245-271 [Medline] [Order article via Infotrieve]
  2. Chapman, M. J. (1980) J. Lipid Res. 21, 789-853 [Medline] [Order article via Infotrieve]
  3. Imaizumi, K., Fainaru, M., and Havel, R. J. (1978) J. Lipid Res. 19, 712-722 [Abstract]
  4. Havel, R. J., Kane, J. P., and Kashyap, M. D. (1973) J. Clin. Invest. 52, 32-38 [Medline] [Order article via Infotrieve]
  5. Brown, W. V., and Baginsky, M. L. (1972) Biochem. Biophys. Res. Commun. 46, 375-382 [Medline] [Order article via Infotrieve]
  6. Havel, R. J., Fielding, C. J., Olivecrona, T., Shore, V. G., Fielding, P. E., and Egelrud, T. (1973) Biochemistry 12, 1828-1833 [Medline] [Order article via Infotrieve]
  7. Windler, E., and Havel, R. J. (1985) J. Lipid Res. 26, 556-565 [Abstract]
  8. Havel, R. J., Kita, T., Kotite, L., Kane, J. P., Hamilton, R. L., Goldstein, J. L., and Brown, M. S. (1982) Arteriosclerosis 2, 467-474 [Abstract]
  9. Van Eck, M. M., Hoffer, M. J. V., Havekes, L. M., Frants, R. R., and Hofker, M. H. (1994) Genomics 21, 110-115 [CrossRef][Medline] [Order article via Infotrieve]
  10. Duff, G. L., and McMillan, G. C. (1949) J. Exp. Med. 89, 611-630
  11. Saxena, U., Nagpurkar, A., and Mookerjea, S. (1985) Can. J. Biochem. Cell Biol. 63, 1014-1021 [Medline] [Order article via Infotrieve]
  12. Havel, R. J., Eder, H., and Bragdon, J. (1955) J. Clin. Invest. 34, 1345-1353 [Medline] [Order article via Infotrieve]
  13. Brown, W. V., Levy, R. I., and Fredrickson, D. S. (1969) J. Biol. Chem. 244, 5687-5694 [Abstract/Free Full Text]
  14. Fainaru, M., Havel, R. J., and Imaizumi, K. (1977) Biochim. Biophys. Acta 490, 144-155 [Medline] [Order article via Infotrieve]
  15. Havel, R. J., Kotite, L., and Kane, J. P. (1979) Biochem. Med. 21, 121-128 [Medline] [Order article via Infotrieve]
  16. Pagnan, A., Havel, R. J., Kane, J. P., and Kotite, L. (1977) J. Lipid Res. 18, 613-622 [Abstract/Free Full Text]
  17. Laemmli, U. K. (1970) Nature 227, 680-685 [Medline] [Order article via Infotrieve]
  18. O'Farrell, P. H. (1975) J. Biol. Chem. 250, 4007-4021 [Abstract]
  19. Spackman, D. H. (1967) Methods Enzymol. 11, 3-15 [CrossRef]
  20. Hirs, C. H. W. (1967) Methods Enzymol. 11, 59-62
  21. Havel, R. J., Kotite, L., Vigne, J.-L., Kane, J. P., Tun, P., Phillips, N., and Chen, G. C. (1980) J. Clin. Invest. 66, 1351-1362 [Medline] [Order article via Infotrieve]
  22. Yamada, N., Shames, D. M., Stoudemire, J. B., and Havel, R. J. (1986) Proc. Natl. Acad. Sci. U. S. A. 83, 3479-3483 [Abstract]
  23. Allain, C. C., Poon, L. S., Chan, C. S. G., Richmond, W., and Fu, P. C. (1974) Clin. Chem. 20, 470-475 [Abstract/Free Full Text]
  24. Towbin, H., Staehelin, T., and Gordon, J. (1979) Proc. Natl. Acad. Sci. U. S. A. 76, 4350-4354 [Abstract]
  25. Hamilton, R. L., Jr., Goerke, J., Guo, L. S. S., Williams, M. C., and Havel, R. J. (1980) J. Lipid Res. 21, 981-992 [Abstract]
  26. Frohman, M. A., Dush, M. K., and Martin, G. R. (1988) Proc. Natl. Acad. Sci. U. S. A. 85, 8998-9002 [Abstract]
  27. Chirgwin, J. M., Przybyla, A. E., MacDonald, R. J., and Rutter, W. J. (1979) Biochemistry 18, 5294-5299 [Medline] [Order article via Infotrieve]
  28. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual , 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  29. Borson, N. D., Salo, W. L., and Drewes, L. R. (1992) PCR Methods & Application 2, 144-148
  30. Wang, K., Koop, B. F., and Hood, L. (1994) BioTechniques 17, 236-238 [Medline] [Order article via Infotrieve]
  31. Sanger, F., Coulson, A. R., Barrell, B. G., Smith, A. J. H., and Roe, B. A. (1980) J. Mol. Biol. 143, 161-178 [Medline] [Order article via Infotrieve]
  32. Chen, E. Y., and Seeburg, P. H. (1985) DNA (NY) 4, 165-170 [Medline] [Order article via Infotrieve]
  33. Lin, D., Harikrishna, J. A., Moore, C. C. D., Jones, K. L., and Miller, W. L. (1991) J. Biol. Chem. 266, 15992-15998 [Abstract/Free Full Text]
  34. Studier, F. W., Rosenberg, A. H., Dunn, J. J., and Dubendorff, J. W. (1990) Methods Enzymol. 185, 60-89 [Medline] [Order article via Infotrieve]
  35. Zhang, L., Rodriquez, H., Ohno, S., and Miller, W. L. (1995) Proc. Natl. Acad. Sci. U. S. A. 92, 10619-10623 [Abstract]
  36. Forsayeth, J. R., and Garcia, P. D. (1994) BioTechniques 17, 354-358 [Medline] [Order article via Infotrieve]
  37. Kaufman, R. J., Davies, M. V., Pathak, V. K., and Hershey, J. W. B. (1989) Mol. Cell. Biol. 9, 946-958 [Medline] [Order article via Infotrieve]
  38. Takebe, Y., Seiki, M., Fujisawa, J., Hoy, P., Yokota, K., Arai, K., Yoshida, M., and Arai, N. (1988) Mol. Cell. Biol. 8, 466-472 [Medline] [Order article via Infotrieve]
  39. Wang, L., Voysey, R., and Meng, Y. U. (1994) BioTechniques 17, 26-28 [Medline] [Order article via Infotrieve]
  40. Herbert, P., Levy, R. I., and Fredrickson, D. S. (1971) J. Biol. Chem. 246, 7068-7071 [Abstract/Free Full Text]
  41. Herbert, P. N., Windmueller, H. G., Bersot, T. P., and Shulman, R. S. (1974) J. Biol. Chem. 249, 5718-5724 [Abstract/Free Full Text]
  42. Innerarity, T. L., Friedlander, E., Rall, S. C., Jr., Weisgraber, K. H., and Mahley, R. W. (1983) J. Biol. Chem. 258, 12341-12347 [Abstract/Free Full Text]
  43. Kozak, M. (1987) Nucleic Acids Res. 15, 8125-8148 [Abstract]
  44. von Heijne (1986) Nucleic Acids Res. 14, 4683-4690 [Abstract]
  45. Schiffer, J., and Edmundson, A. B. (1967) Biophys. J. 7, 121-135 [Medline] [Order article via Infotrieve]
  46. Eisenberg, D., Weiss, R. M., and Terwilliger, T. C. (1984) Proc. Natl. Acad. Sci. U. S. A. 81, 140-144 [Abstract]
  47. Segrest, J. P., Jones, M. K., Loof, H. D., Brouillette, C. G., Venkatachalapathi, Y. V., and Anantharamaiah, G. M. (1992) J. Lipid Res. 33, 142-166
  48. Whitted, B. E., Castle, C. K., Polites, H. G., Melchior, G. W., and Marotti, K. R. (1989) Mol. Cell. Biochem. 90, 69-79 [Medline] [Order article via Infotrieve]
  49. Hoffer, M. J. V., vanEck, M. M., Havekes, L. M., Hofker, M. H., and Frantz, R. R. (1993) Genomics 17, 45-51 [CrossRef][Medline] [Order article via Infotrieve]
  50. Rosenthal, C. J., and Franklin, E. C. (1975) J. Clin. Invest. 55, 746-753 [Medline] [Order article via Infotrieve]
  51. Sata, T., Havel, R. J., Kotite, L., and Kane, J. P. (1976) Proc. Natl. Acad. Sci. U. S. A. 73, 1063-1067 [Abstract]
  52. Matsuguchi, T., Okamura, S., Aso, T., Sata, T., and Niho, Y. (1989) Biochem. Biophys. Res. Commun. 165, 138-144 [Medline] [Order article via Infotrieve]
  53. Shen, P., and Howlett, G. J. (1992) Arch. Biochem. Biophys. 297, 345-353 [Medline] [Order article via Infotrieve]
  54. Allan, C. M., Walker, D., Segreist, J. P., and Taylor, J. M. (1995) Genomics 28, 293-300
  55. van Hoof, G., Goosens, F., DeMeester, I., Hendriks, D., and Scharpe, S. (1995) FASEB J. 9, 736-744 [Abstract/Free Full Text]

©1996 by The American Society for Biochemistry and Molecular Biology, Inc.