(Received for publication, July 24, 1995; and in revised form, September 25, 1995)
From the
Crystallographic analysis and site-directed mutagenesis have
been used to identify the catalytic and oligosaccharide recognition
residues of
peptide-N-(N-acetyl-
-D-glucosaminyl)asparagine
amidase F (PNGase F), an amidohydrolase that removes intact
asparagine-linked oligosaccharide chains from glycoproteins and
glycopeptides. Mutagenesis has shown that three acidic residues,
Asp-60, Glu-206, and Glu-118, that are located in a cleft at the
interface between the two domains of the protein are essential for
activity. The D60N mutant has no detectable activity, while E206Q and
E118Q have less than 0.01 and 0.1% of the wild-type activity,
respectively. Crystallographic analysis, at 2.0-Å resolution, of
the complex of the wild-type enzyme with the product, N,N`-diacetylchitobiose, shows that Asp-60 is in
direct contact with the substrate at the cleavage site, while Glu-206
makes contact through a bridging water molecule. This indicates that
Asp-60 is the primary catalytic residue, while Glu-206 probably is
important for stabilization of reaction intermediates. Glu-118 forms a
hydrogen bond with O
of the second N-acetylglucosamine residue of the substrate and the low
activity of the E118Q mutant results from its reduced ability to bind
the oligosaccharide. This analysis also suggests that the mechanism of
action of PNGase F differs from those of L-asparaginase and
glycosylasparaginase, which involve a threonine residue as the
nucleophile.
Peptide-N-(N-acetyl-
-D-glucosaminyl)asparagine
amidase F (PNGase F, (
)peptide N-glycanase) is a
34.8-kDa amidohydrolase secreted by Flavobacterium
meningosepticum(1) . The enzyme cleaves the
-aspartylglucosamine bond of asparagine-linked oligosaccharides,
converting the asparagine residue to an aspartic acid(2) . The
1-amino-oligosaccharide product spontaneously hydrolyzes to ammonia and
the intact oligosaccharide chain(3) . The enzyme has a broad
substrate specificity, but both the amino and carboxyl groups of the
asparagine residue have to be in peptide linkage, while the
oligosaccharide must consist at least of the N,N`-diacetylchitobiose core,
GlcNAc
1
4GlcNAc. The enzyme is highly sensitive to
modifications of this core; an
-1
3-fucose substituent on the
asparagine-proximal, or reducing end, GlcNAc completely blocks PNGase F
activity(4) , but an
-1
6-fucose substituent has no
effect. The enzyme is extensively used as a biochemical tool for the
study and analysis of glycoproteins(5) , including the analysis
of oligosaccharides as well as the deglycosylation of glycoproteins for
structural analysis.
Two crystal structures of PNGase F, in
different crystal forms, were recently
published(6, 7) . The molecular structure, shown in Fig. 1, consists of two eight-stranded antiparallel
-sandwiches that lie side-by-side so that the interface runs the
full length of the
-sheets. Very little information regarding the
mechanism of action of the enzyme was available when the structures
were determined, and the assignment of a possible active site was based
primarily on information obtained from the studies of other
asparaginases: L-asparaginase (8) and
glycosylasparaginases(9, 10) . The active site of L-asparaginase includes a triad of hydrogen-bonded Asp-Lys-Thr
residues, where the threonine is thought to be the nucleophile in the
reaction mechanism. Glycosylasparaginases also have an essential
threonine residue. This information, combined with observation of
surface features of the molecule, led to the identification of three
possible substrate binding sites(6) : a bowl on one face of the
molecule that contains a group of residues that somewhat resemble the
active site of L-asparaginase, a shallow S-shaped cleft on the
opposite face of the molecule that contains a number of acidic residues
and threonine residues, and a deep cleft at the interface between the
two domains at one end of the molecule. This cleft, formed by the loops
connecting
-strands between the
-sheets contains several
acidic residues and serines, as well as many aromatic residues. Most
remarkable is the presence of five tryptophan residues.
Figure 1:
Stereo ribbon diagram showing the
-carbon tracing of the three-dimensional structure of PNGase F
with N,N`-diacetylchitobiose in the substrate binding
cleft. The molecule consists of two nearly identical eight-stranded
antiparallel
-barrel domains with jelly roll motifs. Side chains
of representative residues in the three areas initially identified as
possible active sites are shown: Thr-101, lower back; Thr-72, lower
front; Glu-118, top front; and, Asp-60 and Glu-206, top rear. The
figure was prepared with program
Molscript(21) .
In this paper we describe the results of site-directed mutagenesis studies, resulting in the identification of the active site area, and of the crystallographic analysis of the complex of the enzyme with the product disaccharide N,N`-diacetylchitobiose. These studies lead to the unambiguous identification of the catalytic residues and the detailed analysis of the oligosaccharide recognition site.
The D60N, E206Q, and
E118Q mutants were overexpressed in E. coli, purified in large
quantities, and crystallized. All three crystallized in the same space
group under nearly identical conditions as the wild-type
enzyme(16) . The crystallographic analysis showed that their
structures are essentially identical to that of the wild-type enzyme,
confirming that the reduced activities are caused by diminished
catalytic functionality or impaired substrate binding, not by altered
protein conformation. ()
The
crystals of the complex were grown under identical conditions and
belong to the same space group as those of the uncomplexed protein. The
structure of the complex of the wild-type enzyme has been fully refined
to an R value of 0.197 at 2.0-Å resolution. Fig. 2shows the electron density of the disaccharide in the
refined structure. The protein conformation is essentially unaffected
by the binding of the ligand. The least-squares fit of the main chain
atoms of all but the four N-terminal residues has a root mean square
deviation of 0.33 Å and a maximum deviation of 1.30 Å. All
of the largest deviations are in loops that are far removed from the
binding cleft. The main difference in the cleft is a slight change in
the orientation of the side chain of Trp-191, with a shift in the
position of N by 0.95 Å. The average thermal
parameters for the protein molecule are significantly lower in the
complex: 18.4 Å
and 19.5 Å
for the
main chain and side chain atoms in the complex, compared with 23.2 and
25.4 Å
in the uncomplexed structure. Interestingly,
the change in the thermal parameters is uniformly distributed over the
molecule and does not result from large changes in the binding cleft
area. The N,N`-diacetylchitobiose molecule has an
average thermal parameter of 16.6 Å
with a range from
11.0 to 26.0 Å
. The N-acetyl group of the
first GlcNAc is most deeply buried into the protein molecule and
correspondingly has the lowest thermal parameters.
Figure 2:
Stereodiagram showing the electron density
for the N,N`-diacetylchitobiose in the final
2F - F
map, contoured at 1.5
of the
map.
In solution, N,N`-diacetylchitobiose exists in an equilibrium
state containing a mixture of - and
-configurations of the
O
-hydroxyl group at the reducing end. In the wild-type
PNGase F complex structure, only the
-conformation is observed,
despite the fact that the oligosaccharide moiety attached to the
N
of the asparagine is in the
-configuration. The
glycosidic link between the GlcNAc residues is in an extended
conformation with torsion angles,
(O
-C
-O
-C
)
-85° and
(C
-O
-C
-C
)
-123°. These angles are well within the range of those
observed for other
-1
4-linked disaccharides in complexes
with lectins(17, 18) but are most similar to those
seen in complexes of lysozyme with GlcNAc
multimers(19, 20) .
Fig. 3shows the
location and orientation of the N,N`-diacetylchitobiose molecule in the cleft of the
enzyme. The molecule is inserted edgewise into the cleft and spans the
distance between the two groups of acidic residues in the cleft, with
O of the reducing end GlcNAc residue forming hydrogen bonds
with Asp-60 and the water molecule, Wat
, that connects
Asp-60 and Glu-206. O
of the second GlcNAc forms a hydrogen
bond with Glu-118. Fig. 4shows a schematic diagram of all
important intermolecular contacts. The first GlcNAc residue makes
extensive contacts with the protein, although several are mediated by
tightly bound water molecules. One interesting feature, shown in Fig. 5, is the location of the N-acetyl group, which
extends into a deep hole in the interface between the two domains of
the protein. The contacts of the second GlcNAc are much weaker and only
O
and O
are involved in direct hydrogen bonds
with the protein. On the reducing end, Asp-60 forms hydrogen bonds with
O
of the N,N`-diacetylchitobiose on one
side and with Wat
on the other side. Glu-206 does not
make any direct contacts with the disaccharide. The position and
orientation of the disaccharide in the cleft easily explains the
inability of the enzyme to process substrates with an
-1
3-fucose on the first GlcNAc residue (5) and the
lack of interference of O
substitutions; O
is
buried and inaccessible while O
is fully exposed to the
solvent.
Figure 3:
Detailed image of the interactions of N,N`-diacetylchitobiose with PNGase F showing the
hydrogen bonding interactions between the disaccharide, water
molecules, and the protein. Aromatic residues also make important
contacts with the substrate. Trp-191 is positioned nearly perpendicular
to the disaccharide and forms a hydrogen bonding contact rather than a
hydrophobic interaction. Trp-120 forms a hydrogen bond with O but also appears to be positioned correctly to be able to make a
hydrophobic contact with the next residue, the first mannose, in the
intact substrate. The figure was prepared with the program
SECTOR(22) .
Figure 4:
Schematic diagram showing the
intermolecular hydrogen bonding contacts between PNGase F, N,N`-diacetylchitobiose and water molecules. Protein
residues are indicated with single-letter amino acid code and sequence
number in rectangular boxes, water molecules are indicated by a number,
corresponding to their number in the file deposited with the Protein
Data Bank. The reducing end GlcNAc residue is on the left. Hydrogen
bonding distances, in Å, are shown in italics. Note that
Wat (349) is present twice, once in contact with
O
and once with Arg-61.
Figure 5: GRASP-image (23) showing the molecular envelope of PNGase F, cut-off through the oligosaccharide binding cleft to show the penetration of the N-acetyl group of the first GlcNAc residue into the cavity in the core of the molecule. The carbonyl oxygen makes hydrogen bonds with two water molecules in the cavity, while the methyl group is in close proximity to a hydrophobic area, primarily formed by the Ile-82 side chain. The protein surface is colored according to its electrostatic potential. Potentials greater than 9 T are blue, those less than -22 T are red, and neutral ones are white.
The solvent-accessible surface of the protein is reduced by
less than 200 Å in the complex: 12,395 Å
for the protein in the complex versus 12,587
Å
in the uncomplexed structure. This is explained in
part by the location of the molecule toward one side of the cleft,
making close contacts with many residues of the N-terminal domain but
allowing for a layer of water molecules to form bridging contacts with
the residues of C-terminal domain. Many of these water molecules,
including Wat
, Wat
, and Wat
,
are in nearly identical positions in the uncomplexed and complex
structures. Of all the water molecules involved in contacts with the
protein and the disaccharide, only Wat
and Wat
have no water molecule within 0.5 Å in the uncomplexed
structure. However, all water molecules that only make contacts with
the N,N`-diacetylchitobiose molecule are disordered
in the uncomplexed structure.
Site-directed mutagenesis experiments were used to determine that the active site of PNGase F is located in the cleft at the interface between the two domains of the molecule and showed that three acidic residues, Asp-60, Glu-118, and Glu-206, are essential for activity.
The combination of this information with the crystal
structure analysis of the complex with N,N`-diacetylchitobiose clearly reveals the distinct
roles of these residues in the mechanism of action of the enzyme.
Mutagenesis of Asp-60 to the corresponding asparagine results in total
loss of activity. This residue is in direct hydrogen bonding contact
with O of the reducing-end GlcNAc residue. This atom
replaces the N
of the asparagine in the substrate.
Therefore, this residue must be of primary importance in the catalytic
mechanism. The fact that the enzyme selects for the
-configuration
of O
of the N,N`-diacetylchitobiose while
the N
of the substrate is in the
-configuration
may prove to be important for further studies of the mechanism of
action. Glu-206, whose amide mutant has very low residual activity, is
not in direct contact with the product and is more likely to play a
secondary role, such as stabilization of a reaction intermediate or
interaction with the O
of the substrate asparagine.
The geometry of the complex structure in the area of Glu-118 clearly
illustrates why this residue is important for substrate binding and why
the E118Q mutation impairs substrate binding. O of the
second GlcNAc residue accepts a hydrogen atom in its hydrogen bond with
N
of Trp-120. Therefore, it must donate a hydrogen
atom in its hydrogen bond to O
of Glu-118.
O
of the Glu-118 side chain is also involved in a
hydrogen bond with another hydrogen donor, Ser-155. The glutamine side
chain of the E118Q mutant cannot accept two hydrogen atoms, thus
destabilizing this interaction and decreasing the ability to bind the
substrate. Co-crystallization experiments of the mutants with the N,N`-diacetylchitobiose confirm this interpretation.
While the D60N and E206Q mutants also crystallize as complexes with the N,N`-diacetylchitobiose bound in the active site
cleft, no electron density corresponding to the N,N`-diacetylchitobiose molecule could be located in
the electron density map of crystals of the E118Q mutant grown under
identical conditions.
These studies have unambiguously identified the active site residues and the oligosaccharide binding mechanism of PNGase F. The exact nature of the mechanism of action of this enzyme is at this time still unclear. However, it is obvious from the lack of serine or threonine residues in the active site area that, contrary to what was initially expected, the mechanism is not related to those of L-asparaginase or glycosylasparaginase. These studies will form the basis for further analysis, which will be focused on the analysis of the interaction of the enzyme with the peptide component of the substrate.
The atomic coordinates (code 1PNF) have been deposited in the Protein Data Bank, Brookhaven National Laboratory, Upton, NY.