(Received for publication, September 9, 1996)
From the Glycobiology Institute, Department of
Biochemistry, University of Oxford, South Parks, Oxford OX1 3QU, United
Kingdom and the
Department of Molecular Microbiology and
Immunology, St. Louis University School of Medicine,
St. Louis, Missouri 63104
We have characterized the glycans at individual
sites on the hemagglutinin of three influenza A variants to obtain
information on the role of cell-specific glycosylation in determining
the receptor binding properties of this virus. The variants differ in
whether they have a glycosylation site at residue 129 on the tip of the
hemagglutinin and whether amino acid 184 (near to the receptor binding
site) is His or Asn. We found that all sites on each variant are
glycosylated in Madin-Darby bovine kidney cells, that the glycosylation
is site-specific, and that the glycans at the same site in each variant
are highly similar. One site that is buried in the hemagglutinin trimer
contains only oligomannose glycans. The remaining sites carry complex
glycans of increasing size as the distance of the site from the viral
membrane decreases. Most of these complex glycans are terminated with
-galactose residues, a consequence in bovine cells of the removal of
terminal sialic acids by the viral neuraminidase. Although the glycans at residue 129 are among the smallest on the molecule, they are large
enough to reach the receptor binding pocket on their own and adjacent
monomers. The results suggest that the reduction in receptor binding
observed with Madin-Darby bovine kidney cell-grown virus is due to the
combined effect of large complex glycans at the tip of the
hemagglutinin and a His to Asn substitution close to the receptor
binding pocket.
The influenza virus hemagglutinin (HA)1 is the virion surface glycoprotein that attaches the virus to its receptors on host cells and fuses the viral envelope with the membranes of endocytic vesicles to initiate the infectious process. It is also the virion component that stimulates the formation of protective antibodies. It is not surprising, therefore, that the nature and extent of glycosylation of the HA have been implicated in altering its receptor binding properties and in the emergence of viral variants with enhanced cytopathogenicity (1-3) and virulence (4) and in masking its antigenic sites (5).
The amino acid sequence of the HA and hence the location of its N-glycosylation sites is determined by the viral genome. This RNA genome is replicated by a polymerase that lacks editing functions, so that mutations in all of the viral genes occur at a high frequency. One of the consequences of this high mutation frequency is that virus populations contain mutants that differ from the majority in the number and position of the N-linked glycans on the HA. The structures of these oligosaccharides appear to be determined by their position on the HA (6, 7) and by the array of biosynthetic and trimming enzymes provided by the host cell in which the virus is grown. Thus, the plasticity of the viral genome and the host-specified glycosylation machinery can, together, create virus populations that are more heterogeneous in structure and function than could be developed by either process alone. This diversity is considered to be responsible for survival of these viruses in a variety of biological niches and for their ability to overcome the inhibitory effects of neutralizing antibodies and antiviral agents.
The HA appears to have regions that must be glycosylated, others that must be free of oligosaccharides, and still others in which glycosylation may be either advantageous or detrimental to the survival of the virus. Glycosylation sites at certain positions on the HA of influenza A viruses isolated from various animals and humans are highly conserved and therefore appear to be essential for the formation and/or maintenance of functional HA (8). Conversely, the generation of glycosylation sites in certain regions of the HA reduces its transport to the cell surface, and its stability and/or function (9). It is in the regions in which glycosylation is neither prohibited nor required for the formation of functional HA that oligosaccharide diversity may have a major selective effect, depending on the specific environment in which the virus is expected to grow.
We have been using laboratory-isolated variants of an H1 strain to study this type of selection and to evaluate the combined effects of genetic mutation and cell-specific glycan processing on the properties of the HA. The properties of these variants are as follows. The three, designated Fo, Cf, and Co, grow equally well in chick embryo fibroblasts (CEFs) (3, 10). The Fo variant produces low virus yields on Madin-Darby bovine kidney (MDBK) cells. It has a glycosylation site at residue 129 and an Asn at residue 184. This variant is lost from MDBK cell-grown virus populations, due to its low affinity for MDBK-cell receptors (1) and is replaced by Cf, which grows to high titer in MDBK cells. Cf has lost the glycosylation site at 129 by an Asn to Asp substitution but is identical to Fo at residue 184 (2). The remaining variant, Co, also grows well in MDBK cells despite having a glycosylation site at residue 129. This variant has His at residue 184 instead of Asn (11, 12), as do most of the HAs sequenced to date. Thus, the low receptor binding of the Fo variant appeared to be due to either the combined effects of MDBK cell glycosylation and an amino acid substitution at 184 or to the glycans on Fo being different from those on Co.
We therefore undertook the characterization of the glycans at individual sites on the HA1 subunits of these three variants grown in MDBK cells. The glycosylation of the HA2 subunit was not examined. Our aim was to gain information about what, in addition to the spectrum of glycosylating enzymes present in the cell, determines the structure of the glycans on the HA and to explain how these glycans affect the binding activities of these viruses.
In this report we relate the glycan structures that we have found to their location in the three-dimensional structure of this glycoprotein (13). Last, our work prompts new insights into the role of the viral neuraminidase in determining the structure of glycans put onto the HA by cells of different species.
MDBK cells were grown as described previously (1). The three plaque-type variants of A/WSN/33 (H1N1) used in this study have been previously described (1, 3, 10), and the predicted amino acid sequences of their HA1 subunits have been published (2, 12). Virus stocks were maintained in CEFs in which the sequence of the HA at the critical sites was known to remain unchanged.
Virus Growth and PurificationThe virus used in all experiments was grown in MDBK cell monolayers overlaid with Earle's basal medium supplemented with 2% calf serum. The cells were infected, and the virus was harvested and purified as described previously (1, 14).
HA PurificationVirus was diluted to a concentration of
5 × 104 HA units/ml in STE (saline/Tris/EDTA, pH 7.2)
and incubated for 20 min at 37 °C with 0.02 mg/ml
L-1-tosylamido-2-phenylethyl chloromethyl ketone-treated trypsin (Sigma) to remove the viral neuraminidase.
Trypsinized virus was pelleted through a 30% sucrose cushion and
resuspended in STE to give 1-ml aliquots containing 1 × 104 HA units. The virus was repelleted and washed twice in
STE containing 2 mM dithiothreitol (Sigma)
and 50 µg/ml each of the protease inhibitors,
N-p-tosyl-L-lysine
chloromethyl ketone hydrochloride (Sigma) and phenylmethylsulfonyl fluoride (Sigma). Washed virus
pellets were re-suspended in 50 mM Tris-HCl, pH 8.3 containing 2 mM dithiothreitol, and 50 µg/ml each of
N
-p-tosyl-L-lysine
chloromethyl ketone hydrochloride and phenylmethylsulfonyl fluoride.
The suspensions were made 50 mM in
n-octylglucoside (Sigma) and incubated for
30 min at room temperature. Disrupted virus suspensions were
centrifuged for 25 min (80,000 rpm in a TLA-100.2 Beckman rotor) to
remove most of the nucleoprotein and matrix protein. Supernatants,
containing essentially all of the HA, were added to immobilized
monoclonal antibody columns, prepared as described below (~2 × 104 HA units of starting material/column), and incubated
overnight with mixing at 4 °C. Unbound material was removed by
washing with 10 column volumes of 50 mM Tris-HCl, pH 8.3, containing dithiothreitol, detergent, and protease inhibitors at the
concentrations indicated above. Bound material was eluted with 10 column volumes of 50 mM diethylamine, pH 11.5, containing
50 mM n-octylglucoside. Fractions (0.8 ml) were
collected into 80 µl of 0.5 M
NaH2PO4 to neutralize the pH. The eluate was
assessed for HA content and purity by 15% SDS-polyacrylamide gel
electrophoresis. Those fractions containing pure HA were pooled, and
the protein concentration was determined using a BCA protein assay kit
(Pierce & Warriner, Chester, UK), with bovine serum albumin (BSA) for
calibration. Purified, quantitated HA was dialyzed against four changes
of 100 mM ammonium acetate, pH 7.0, to remove detergent and
then lyophilized and stored, desiccated, at
20 °C.
HA affinity columns were prepared by chemically cross-linking a monoclonal antibody specific for influenza A/WSN/33 virus HA (79AD.2; Ref. 3) to protein G-Sepharose beads (Pharmacia, Milton Keynes, UK), as described by Harlowe and Lane (15).
Isolation and Identification of GlycopeptidesHA, reduced and alkylated as described previously (16), was diluted to 1-2 mg/ml in 100 mM Tris-HCl, pH 8.5, containing 1 M guanidine HCl and subjected to exhaustive trypsin treatment, using an enzyme:substrate ratio of 1:33 (w/w) and an incubation time of 28 h at 37 °C. Aggregates in the reaction mixture were removed by low speed centrifugation, and the digested HA was loaded directly onto a Bio-Gel P-30 gel filtration column (0.5 × 100 cm) and eluted with 50 mM ammonium acetate, pH 7.0, at a flow rate of 2 ml/h. Fractions (1 ml) corresponding to peaks in absorbance at 214 nm were pooled. A 5% (volume) aliquot from each pool was removed for analysis of glucosamine content using a modified version of the Waters PicoTag method (17). Glycopeptides in the glucosamine-containing pools were separated by reversed-phase high performance liquid chromatography (HPLC) and detected as described previously (17).
The glycosylation sites to which isolated glycopeptide fractions corresponded were determined by amino acid analysis (18) and comparison with the predicted amino acid composition from a tryptic map of the HA1 subunits. The identities of the glycopeptides determined by this method were confirmed by N-terminal sequencing, performed by Tony Willis (Medical Research Council Immunochemistry Unit, Department of Biochemistry, University of Oxford).
Isolation and Radiolabeling of CarbohydratesCarbohydrates were released from glycopeptides by hydrazinolysis and radiolabeled with NaB3H4 (Amersham International plc, Little Chalfont, UK) as described previously (19).
Mass Spectrometry of Glycopeptides and Released OligosaccharidesMass profiles of glycopeptides and released
oligosaccharides were obtained on a Micromass AutoSpec QFPD magnetic
sector instrument (Micromass Ltd., Manchester, UK) fitted with a
matrix-assisted laser desorption/ionization (MALDI) source and a focal
plane detector (20). The matrix used for glycopeptides was a saturated
solution of -cyano-4-hydroxycinnamic acid (21) in CH3CN;
slides were prepared using 0.2-0.4 nmol of sample in a 4-µl matrix.
Released oligosaccharides were prepared for mass spectrometric analysis in the same way, except that the matrix used was 2,5-dihydroxybenzoic acid. Maximum entropy data processing was carried out using Micromass software.
Neutral and charged carbohydrates were separated by high voltage electrophoresis, and the isolated neutral carbohydrates were separated by Bio-Gel P-4 high resolution gel filtration chromatography as described previously (19).
Immobilized Lectin Affinity ChromatographyAgarose-bound
Aleuria aurantia lectin and Datura stramonium
lectin, both from Vector Laboratories (Peterborough, UK) were packed
into 5 × 50-mm glass columns (Pharmacia). Radiolabeled glycans
were loaded in 100 mM sodium acetate, pH 6.0, at a flow rate of 50 µl/min. Unbound and weakly bound material was eluted by
washing the column with 2.5 ml of loading buffer. Bound material was
eluted from A. aurantia lectin with 0.5 mM
L-fucose (Sigma) and from D. stramonium lectin with 10 mg/ml chitin hydrolysate at a flow rate
of 250 µl/min for both lectins. Agarose-bound Griffonia simplicifolia lectin I (Vector Laboratories) was packed into
disposable 2-ml Econocolumns (1.5 ml/column), and samples were loaded
in phosphate-buffered saline containing 0.1 mM
CaCl2. Unbound and weakly bound glycans were eluted with
200-µl aliquots of sample loading buffer; strongly bound material was
eluted with 50 mM -methyl galactoside
(Sigma), also in 200-µl increments. Elution of
glycans was monitored by liquid scintillation counting.
Glycosidase treatment was performed
either sequentially with individual enzymes or with a defined mixture
of enzymes; digest products were analyzed by mass spectrometry as
described above or by Bio-Gel P-4 gel filtration chromatography. The
exoglycosidases used were Aspergillus saitoi
-1,2-mannosidase (10 milliunits/ml in 50 mM sodium
acetate, pH 5.0, with 1 mg/ml BSA), jack bean
-mannosidase (10 units/ml in 10 mM citrate buffer, pH 4.5), coffee bean
-galactosidase (5 units/ml in citrate/phosphate buffer, pH 6.0, with
1 mg/ml BSA), jack bean
-galactosidase (9.3 units/ml in 50 mM citrate buffer, pH 3.5, with 1 mg/ml BSA), jack bean
-N-acetylhexosaminidase (3 or 10 units/ml in 50 mM citrate buffer, pH 5.0, with 1 mg/ml BSA),
Charonia lampas
-fucosidase (6 units/ml in 50 mM sodium acetate, pH 4.5, with 0.15 M NaCl),
and almond
-1,3/4-fucosidase (11 milliunits/ml in 50 mM
sodium acetate, pH 5.0, with 1 mg/ml BSA), all of which were prepared
and tested for purity and specificity by Dr. Terry Butters
(Glycobiology Institute, Department of Biochemistry, University of
Oxford). Endo-
-galactosidase was prepared from Escherichia
freundii by Dr. Peter Scudder (22). All enzyme digests were
performed at 37 °C for 18-22 h under toluene, and the reactions
were stopped by lowering the temperature to
20 °C. Digested
samples were desalted and filtered before analysis.
Isolation and Identification of Individual Glycosylation Sites
Individual glycopeptides corresponding to each glycosylation site
of the HA1 subunit were isolated from tryptic digests of Co-, Cf-, and
Fo-derived HA by gel filtration chromatography and subsequent
reversed-phase HPLC of glucosamine-containing fractions of the eluate
(Fig. 1). Glycopeptides were designated GP20/21, GP65,
GP129, and GP271 according to the glycosylation site that each
contained (Table I).
|
Previous observations had suggested that Asn184 in the sequence Asn-Pro-Ser served as a glycosylation site in two of the variants (Cf and Fo (2)). However, a glycopeptide that corresponded to such a glycosylation site was not present in the digests. We subsequently identified a tryptic peptide containing Asn184 (data not shown), indicating that this sequence is not glycosylated.
It was apparent from the amino acid sequence of GP20/21 that either Asn 20 or 21 could serve as an N-linked carbohydrate attachment site (see Table I). For all three variants, N-terminal sequencing of this glycopeptide gave a positive identification of Asn at position 20 but a blank in the cycle of Edman degradation for residue 21, indicating that residue 21 carried the carbohydrate moiety. A similar observation was made by Deshpande et al. (4) for a related sequence in the HA from an avian H5 strain of influenza.
Structural Characterization of N-Glycans
Isolation of Neutral GlycansThe charged and neutral radiolabeled N-glycans from each glycopeptide were separated by high voltage electrophoresis. In all cases, only a very small proportion of the glycans were charged. Site 21 carried the largest proportion of charged glycans (<5%), and this charge was due mainly to sulfation (determined by neuraminidase treatment and chemical desulfation; data not shown).
Bio-Gel P-4 gel filtration chromatography of the neutral glycans showed
that the elution profiles of the individual glycosylation sites
differed markedly, indicating site-specific glycosylation (Fig.
2). The simplest elution profiles were from
glycosylation site 65 (Fig. 2b), which showed little
heterogeneity. The elution volume of the major species (K in
Fig. 2b) was characteristic of the oligomannose structure
Man9GlcNAc2 (23).
Table II presents a summary of the glycans found on each of the three variants. The three HAs exhibit only small variation in the distribution of the structures present at each of the sites, indicating that the glycans put onto sites 21, 65, and 271 were not significantly affected by the presence or absence of a glycan at site 129.
|
The structural predictions made from the Bio-Gel P-4 elution
profiles of the glycans from site 65 were confirmed by sequential exoglycosidase treatment. Species K and L were reduced to single 9-gu
species on treatment with A. saitoi -1,2-mannosidase,
indicating the removal of four and three terminal
-1,2-linked Man
residues, respectively. This component was further reduced to a 5.5-gu
structure on treatment with jack bean
-mannosidase, indicative of
the removal of the four remaining
-linked Man residues, leaving only
the Man residue
-linked to the chitobiose core. This behavior was exactly that expected of authentic Man9GlcNAc2
and Man8GlcNAc2.
Initial attempts to sequence the
neutral glycans at site 129 of the Co and Fo variants by sequential
exoglycosidase treatment (devised on the basis of the hydrodynamic
volumes obtained from Bio-Gel P-4 gel filtration chromatography) were
unsuccessful. When the glycopeptides were reexamined by mass
spectrometry, it became apparent that these glycosylation sites
contained uncommon complex-type N-glycans. The mass profiles
for the two variants (Fig. 3) were highly similar,
suggesting that they carried highly similar structures at this
glycosylation site. Two major (4167 Da (B) and 4327 Da
(C)) and two minor (4005 Da (A) and 4489 Da (D)) mass species were found that, assuming complete
trypsinization of the HA and no mass variation due to peptide,
represent an oligosaccharide series differing by 162 Da (1 hexose
unit). These results suggested that the structures at this site
terminated in Gal-Gal sequences. Although these data gave no indication
of the anomericity of the linkage between these Gal residues, they were
likely to be -linked, since previous studies have identified
-1,3-galactosyltransferase activity in bovine cells (24, 25).
Serial Lectin Affinity Chromatography
The sequential use of
immobilized lectins with defined carbohydrate-binding specificities
affords a great deal of structural information with minimal loss of
material and is thus a useful alternative to sequential exoglycosidase
treatment for the structural characterization of complex mixtures of
oligosaccharides (reviewed in Ref. 26). Neutral glycans from sites 21, 129, and 271 were first fractionated on immobilized A. aurantia lectin, which binds structures with a fucose residue
-1,6-linked to the chitobiose core ("core-fucosylated")
(27). The major fractions from A. aurantia lectin affinity
chromatography were subsequently subfractionated on immobilized
D. stramonium lectin. D. stramonium lectin binds strongly to tri- and tetraantennary oligosaccharides containing the
pentasaccharide unit Gal
4GlcNAc
6(Gal
4GlcNAc
2)Man and/or lactosamine units (Gal
4GlcNac) extending one or more of the outer arms and less strongly to oligosaccharides containing
Gal
4GlcNAc
4(Gal
4GlcNAc
2)Man (28). Strongly bound
material is eluted with GlcNAc oligomers, and weakly bound material is
eluted as a retarded fraction without the addition of GlcNAc oligomers.
D. stramonium lectin binding properties are abolished by
outer arm fucosylation. Finally, the major glycan fractions from
D. stramonium lectin affinity chromatography were
fractionated on immobilized G. simplicifolia lectin I. This lectin binds strongly to glycans containing two or more terminal
-linked Gal or GalNAc residues and weakly to those containing only a
single terminal
-Gal/GalNAc.
The serial lectin affinity profiles of glycans from glycosylation sites
21, 129, and 271 (Fig. 4 and Table III)
were similar for the three HAs at a given glycosylation site, and the
majority of the glycans at all three sites were core-fucosylated as
judged by their binding to A. aurantia lectin. Most of the
core-fucosylated glycans at site 129 from Co and Fo failed to bind to
D. stramonium lectin, suggesting limited branching
(biantennary structures). This, together with the mass spectrometry
data and the hydrodynamic volumes, suggested the presence of bisecting
GlcNAc residues. The behavior of these A+/D glycans on G. simplicifolia lectin I indicated that they terminated in 0-2
-Gal residues.
|
As with the glycans from site 129, the majority of the core-fucosylated
oligosaccharides at site 271 were unbound by D. stramonium lectin, while most of the remaining glycans bound weakly to this lectin. The majority of the A+/D structures were bound strongly by
G. simplicifolia lectin I, indicating that they had two or more terminal
-Gal residues. The hydrodynamic volumes of the major
species on Bio-Gel P-4 gel filtration (Fig. 2d) indicated multiantennary structures and to abolish the D. stramonium
lectin binding properties associated with these types of structures, the glycans would have to carry outer arm fucosylation. Alternatively, these species could be bisected biantennary structures with 0-2 lactosamine units (Gal
4GlcNAc) extending one or both outer arms.
In contrast to sites 129 and 271, most of the core-fucosylated glycans
from site 21 bound strongly to D. stramonium lectin, and
most or all of this A+/D+ material bound strongly to G. simplicifolia lectin I. These results, in combination with the
hydrodynamic volumes of the glycans on Bio-Gel P-4 gel filtration
chromatography (Fig. 2a), suggested that the major
oligosaccharides at site 21 were highly branched, core-fucosylated
structures, with terminal -Gal residues on at least two outer arms
and 0-2 lactosamine units extending one or more outer arms.
In the final characterization of the
complex glycans, exo- and endoglycosidase treatments were used to
confirm and extend the structural predictions made from the experiments
described above. A combination of sequential exoglycosidase treatment
and MALDI-TOF mass spectrometry was used to sequence the major glycans at site 129 (Fig. 5 and Table IV). Three
components were observed in the untreated total glycan pool (Fig.
5a), and after treatment with coffee bean -galactosidase
(Fig. 5b) the two larger glycans (A and
B) shifted to the same mass as the smallest (C),
indicating the loss of one (B) or two (A) hexose
units. Subsequent treatment with jack bean
-galactosidase (Fig.
5c) gave a mass shift of 324 Da, consistent with the loss of
two hexose units. Treatment with jack bean
-hexosaminidase at an
enzyme concentration sufficient to ensure complete removal of bisecting
GlcNAc residues (10 units/ml) resulted in a mass shift of 663 Da,
consistent with the removal of three hexosamine units (Fig.
5d). Finally, C. lampas
-fucosidase treatment
(Fig. 5e) resulted in a mass shift of 146 Da, consistent with the removal of a single fucose residue. The mass of this last
species (F) was 934 Da, consistent with the core
pentasaccharide Man3GlcNAc2.
|
Further evidence for these structures was provided by the masses of the in-source decay fragment ions (29, 30) present in the spectra shown in Fig. 5, b and c. The presence of the ion at m/z 1321.5 in the spectrum shown in Fig. 5c, caused by the loss of GlcNAcFuc in a B-type cleavage (31), and the absence of an ion corresponding to the loss of unsubstituted GlcNAc confirmed the presence of a core fucose at the reducing terminus. The in-source fragmentation pattern of the spectrum in Fig. 5c closely matched that of a reference sample of a bisected biantennary glycan containing a core fucose and lacking terminal Gal residues.
In order to determine whether the additional lactosamine units on the
site 21 oligosaccharides extended from a single outer arm or were
distributed over several arms, the A+/D+ fraction was treated with
E. freundii endo--galactosidase. This enzyme specifically
hydrolyzes the
-1,4-linkage in Gal
4GlcNAc only when it is present
within an unbranched polylactosamine sequence, with no outer arm
fucosylation (32, 33). Bio-Gel P-4 gel filtration chromatography of the
glycan pools before and after endo-
-galactosidase treatment (Fig.
6, a and b, respectively) provided
evidence for the distribution of single lactosamine units over several
outer arms. The digestion products comprised several poorly resolved species (region Y in Fig. 6b), ranging in size
from ~23 to ~19 gu, as would be expected from
endo-
-galactosidase treatment of the structures shown in Fig.
6c. The structures at this site did not contain any outer
arm fucosylation, since the digestion products were identical when the
A+/D+ fraction was pretreated with almond
-fucosidase (data not
shown).
The A+/D fraction from site 271 was treated with a mixture of coffee
bean
-galactosidase, jack bean
-galactosidase, and jack bean
-hexosaminidase (3 units/ml). The Bio-Gel P-4 profiles of the A+/D
glycan pools after this treatment (Fig. 7b)
showed a major component at ~13 gu and a minor component at ~8.5 gu
(species P and Q, respectively, in Fig.
7b). This result would be consistent with the structures
depicted in Fig. 7c if the
-hexosaminidase had
successfully removed the outer GlcNAc residues, but had failed to
remove all of the innermost GlcNAc residues and the bisecting GlcNAc.
The activity of the
-hexosaminidase toward an authentic bisected
biantennary oligosaccharide substrate corresponding to structure P (13 gu) was only ~30%, showing that this was indeed the case. Digestion
products identical to those shown in Fig. 7b were obtained
when the A+/D
fraction was treated first with almond
-fucosidase
and then with the enzyme mixture described above (data not shown),
indicating that these structures were not substituted on the outer arms
with fucose residues.
The carbohydrate analysis described here shows conserved
glycosylation at individual sites on the HA1 subunits from the three virus variants studied. The major glycans observed at sites 21, 65, and
271 from Co, Cf, and Fo as well as those at site 129 of Co and Fo are
shown in Fig. 8. The most striking feature of this figure is the site specificity that it illustrates. The highly processed polylactosamine complex-type structures observed at site 21 indicate that MDBK cells possess the full range of enzyme activities
necessary for extensive processing of N-linked glycans, yet
none of the oligosaccharides at the other sites are processed to this
extent.
The most extreme example of restricted processing is site 65, which contains almost exclusively Man9GlcNAc2, a structure which has undergone very little processing from the glucosylated precursor common to all N-linked carbohydrates. This illustrates that, while the extent to which N-linked carbohydrates on influenza virus glycoproteins can be processed depends upon the array of glycosidases and glycosyltransferases available in the host, the nature of the protein backbone will determine the extent to which this potential is realized. In the case of site 65, the limited oligosaccharide processing is most likely due to the inaccessibility of the relevant enzymes to this site; since this site is almost buried within the monomer, between the head and stem regions of the HA1 subunit (see Fig. 8), it is probably no longer accessible after folding and trimerization of the HA polypeptides. Similar conclusions have been drawn by Keil et al. (6), who found oligomannose structures on a glycosylation site at the interface between HA monomers.
Although some secretory proteins have been found to contain oligosaccharides with one or two N-acetyllactosamine repeats, these structures are almost exclusive to membrane-bound glycoproteins in mammalian systems (34-36), and it has been suggested that the proximity of a glycosylation site to the membrane is important for the addition of polylactosaminoglycans (37). This hypothesis is supported by the observation that the Trypanosoma brucei type II variant surface glycoproteins have polylactosaminoglycans only at the site closest to the membrane (17). The HA is a membrane-bound glycoprotein, and of all the glycosylation sites on the HA1 subunits, site 21 is the most membrane-proximal (see Fig. 8). In addition, site 21 contained most of the charged oligosaccharides found on the HA. This charge is due to sulfation, which is most commonly found on GlcNAc residues in N-acetyllactosamine repeats. Thus, the carbohydrate structures at site 21 are not surprising when one considers the position of this glycosylation site in the three-dimensional structure of the HA.
The oligosaccharides at site 271 were not processed to the same extent as those at site 21. These two sites are apparently equally exposed on the HA surface and should be equally accessible to the glycosyl transferases involved in N-linked oligosaccharide processing. Site 129, which is the most membrane-distal glycosylation site on the HA1 subunit, contains glycans that are processed to a relatively high degree but less so than those at site 271. Thus, there appears to be an inverse correlation between the extent of oligosaccharide processing and the distance of the site from the membrane.
The level of -galactosylation observed in this study is unusually
high, although the presence of these moieties could be anticipated,
since the host in which the viruses were grown is a bovine cell line
and this species is known to possess
-1,3-galactosyltransferase activity (24, 25). Substrate specificity studies on
-1,3-galactosyltransferase purified from calf thymus demonstrated
that the oligosaccharide acceptor for this enzyme is Gal
4GlcNAc-R;
the activity of this enzyme is mutually exclusive with
-2,6-sialyltransferase and is blocked by the presence of
-1,3-fucosylation on the GlcNAc residue (38). The high level of
terminal
-1,3-galactosyl residues can be explained by the activity
of the viral neuraminidase, which ensures that terminal sialic acid
residues are removed from HA, leaving an abundance of oligosaccharide
acceptors for the action of
-1,3-galactosyltransferase. In addition,
the lack of outer arm fucosylation would further promote
-galactosylation. The presence of terminal
-Gal residues on the
majority of the glycans in this study explains the previous observation
that HA N-glycans obtained from virus grown in MDBK cells
are poor acceptors for sialyltransferase (39).
As indicated above, one of the aims of this study was to determine the mechanism by which the combination of an amino acid substitution at residue 184 (His to Asn) and a glycan at residue 129 on the HA reduces the receptor binding activity of the Fo virus. Since site 129 of Co- and Fo-HA contain highly similar arrays of N-glycans, it is apparent that the presence of Asn rather than His at position 184 does not influence the nature of the oligosaccharides at site 129. Furthermore, it is clear that the presence of an N-glycan at site 129 does not alter the glycosylation at the other sites. This points to the conclusion that the replacement of a highly conserved His at position 184 with an Asn residue affects the HA RBP such that the presence of a large, MDBK cell-synthesized glycan at site 129 can sterically hinder the interaction of this RBP with its sialic acid-containing receptor. The carbohydrates attached to the HA1 subunit of these variants by CEFs are known to be significantly smaller than those synthesized in MDBK cells (40), and the CEF-synthesized glycans on fowl plague virus are also smaller than the structures reported here (7). CEF-synthesized oligosaccharides may be too small to interfere with receptor binding in the way that the larger MDBK cell-synthesized oligosaccharides do. This would explain why all three virus variants are able to grow to equally high titer in CEFs and why Fo virus grown in CEFs is capable of binding MDBK more strongly than Fo virus grown in MDBK cells (1).
The hypothesis that large oligosaccharides near the RBP of influenza A virus HA can interfere with receptor binding activity, whereas smaller oligosaccharides may have no influence, is supported by the work of Kemble et al. (41). These authors proposed that a large oligosaccharide 7 Å from the RBP interfered with receptor binding activity by "filling or masking the receptor pocket on a neighboring monomer," (41) whereas a smaller oligosaccharide did not. Analysis of minimum energy models of the Fo- and Co-HA, based on the available crystal structure of an H3 HA complexed with sialyllactose (42), showed that glycosylation site 129 is about 11 Å from the RBP (43). Furthermore, we were able to show that an MDBK cell-synthesized oligosaccharide at this site was able to reach both the RBP of the monomer to which it was attached and that of the adjacent monomer.
The high degree of conservation of His184 in nature and its proximity to the RBP residue His183 suggest that a His to Asn substitution at this site may be detrimental to the receptor binding activity of the HA. We have looked at the RBPs of the minimum energy models of the Co- and Fo-HA and have found that the side chain of residue 184 points downwards and away from the RBP. This observation argues against this amino acid substitution resulting in conformational alterations to the RBP that could account for reduced receptor binding activity. It may, however, be that electrostatic effects arise from interactions between His183 and His184 in the HA. This type of interaction has been demonstrated for RNase A, where proton exchange between two adjacent His residues in the active site (His12 and His119) ensures that one of these residues is in the correct protonatable state to interact with the substrate and is therefore crucial to the substrate binding activity of this molecule (reviewed in Ref. 44). Although His183 does not interact directly with sialic acid, it forms part of a hydrogen bond network that defines the RBP surface topology (45). If there is proton exchange between His183 and His184 in the HA analogous to that in the RNase A active site, it is possible that replacing this His with Asn might disrupt the RBP architecture. This could perturb the RBP such that a glycan at site 129 now becomes detrimental to receptor binding activity.
This is the first report that provides a detailed description of site-specific glycosylation of the HA by mammalian cells. Our results clearly reveal the dual role of the amino acid sequence and the glycosylating enzymes in determining the final structure of the biologically active trimer. For example, one can predict that the amino acid sequence of the HA is likely to be a major determinant of the glycan structures found at site 65 and other sites buried in or between monomers in the HA trimer. However, given the smaller size of the HA when synthesized by other cells (CEFs and Madin-Darby canine kidney cells), the structures of the glycans at the other sites will undoubtedly depend, to varying degrees, on the processing enzymes of the host cell. Since the Fo variant used in this study is less cytopathic than either C variant (1), we postulated that it might contain somewhat larger complex-type oligosaccharides or glycans with different structures from those on the C viruses. However, we have shown here that this is not the case. Differences in the degree to which the C and F variants reduce host protein synthesis (46) also had no detectable effect on the ability of the glycosylating enzymes to process the HA N-glycans.
In contrast, a rather large effect on the final HA structure appears to result from the action of the viral neuraminidase. The effects of this viral enzyme activity go beyond the removal of sialic acid residues from the HA, since its activity provides substrates for outer arm substitutions by host glycosyltransferases. The data presented here might suggest that the action of the neuraminidase may enhance the difference between the glycans put onto the HA by cells of different species. Whether this has biological consequences in terms of epitope masking, trimer stability, or fusion activity remains to be investigated.
Last, many of the conclusions to be drawn from this study are similar to those made earlier by Keil et al. (6, 7), who examined the glycans at individual sites on an avian H7 influenza virus HA grown in CEFs. However, more work along the lines reported here is needed before information can be generalized to other influenza A strains. For example, H1 strains newly isolated from the human population contain eight or nine N-linked glycosylation sites, many of which are conserved during antigenic drift, whereas those isolated from avian species have only four sites (47). It is not known whether all of the N-glycan attachment sites on these human isolates are occupied, nor is it known whether the conserved sites common to human and avian strains would carry the same glycans if grown in the same host. Answers to questions such as these should contribute to our understanding of the epidemiology of influenza, to improved methods for the characterization of new isolates, and to the improvement of vaccines.
We thank Yan Liang and Geoffrey Guile for expert technical assistance, Terry Butters for providing the exoglycosidases and excellent advice, Tony Willis for the N-terminal sequencing, and Mark Wormald for assistance with the molecular modeling.