Functional Analysis of Glycosyltransferases Encoded by the Capsular Polysaccharide Biosynthesis Locus of Streptococcus pneumoniae Serotype 14*

(Received for publication, April 21, 1997)

Marc A. B. Kolkman , Bernard A. M. van der Zeijst Dagger and Piet J. M. Nuijten §

From the Department of Bacteriology, Institute of Infectious Diseases and Immunology, School of Veterinary Medicine, University of Utrecht, 3508 TD Utrecht, The Netherlands.

ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS
DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES


ABSTRACT

Bacteria belonging to the species Streptococcus pneumoniae vary in their capsule. Presently, 90 capsular serotypes are known, all possessing their own specific polysaccharide structure. Little is known about the biosynthesis of these capsular polysaccharides. The cps locus of S. pneumoniae serotype 14 was cloned. So far, 7 open reading frames have been sequenced, cps14B to cps14H. The gene products are similar to proteins involved in bacterial polysaccharide biosynthesis, both of Gram-negative and -positive micro-organisms. Gene-specific mutants were created for cps14D to cps14H by insertional mutagenesis. All mutants no longer agglutinated with a monoclonal antibody against type 14 capsule polysaccharides. The biosynthetic function of cps14E and cps14G was determined by analysis of the intermediates in the synthesis of the oligosaccharide subunit, formed in membrane preparations of the wild-type and mutant strains and in membrane preparations of Escherichia coli expressing the pneumococcal glycosyltransferases. The enzyme encoded by cps14E is a glucosyl-1-phosphate transferase that links glucose to a lipid carrier, the first step in the biosynthesis of the type 14 repeating unit. The gene product of cps14G encodes a beta -1,4-galactosyltransferase, the enzyme responsible for the second step in the subunit synthesis, the transfer of galactose to lipid-linked glucose.


INTRODUCTION

The expression of a polysaccharide capsule is a common feature of many bacterial species, particularly those causing serious invasive infections of man (1). The capsule confers the bacterium resistance to complement-mediated opsonophagocytosis (2). In addition, some bacteria express capsular polysaccharides (CPs)1 that mimic host molecules, thereby avoiding the specific immune system of the host (3). Bacterial CPs are generally composed of repeating oligosaccharides, consisting of two to ten monosaccharides, sometimes complemented with other components. Because of the variable number and different types of monosaccharides within a subunit and the different ways these monosaccharides can be linked together, an enormous variety of polysaccharide structures can be formed. The species Streptococcus pneumoniae (the pneumococcus), a human pathogen causing invasive diseases such as pneumonia, meningitis, and otitis media, comprises 90 serotypes (4), each one having its own specific CP. The capsule of the pneumococcus is an important virulence factor. This became clear from the observation that unencapsulated mutants lose their virulence (5). Also, antibodies directed against CPs are protective (6), and this protection is serotype-specific. CPs from 23 serotypes are included in a vaccine (7). Although this vaccine protects healthy adults against infections caused by the serotypes included in the vaccine, its efficacy is poor in high-risk groups such as young children (8, 9).

The chemical structures of most pneumococcal CP types have been determined (10), but little is known about the genes involved in CP synthesis (cps), and the biosynthesis itself. Biosynthesis should require a complex enzymatic pathway, starting with the uptake or synthesis of the monosaccharides and their activation by conversion to nucleotide derivatives. A membrane-bound transferase complex would catalyze the successive linkage of the monosaccharides to a membrane-bound lipid carrier, followed by polymerization of the subunits and subsequent export and attachment of the complete CP to the cell surface.

Classic genetic experiments performed by Avery et al. (11) indicated that the cps genes are clustered on the chromosome of the pneumococcus. Recent reports by others (12, 13) and ourselves (14) confirm this observation. Also in other bacterial species (e.g. Escherichia coli, Haemophilus influenzae), cps genes are clustered. Between these loci, there appears to be a considerable degree of sequence homology. Moreover, they have a common genetic organization involving three functional regions (15, 16). The central region encodes serotype-specific biosynthetic functions and is flanked on both sides by two conserved regions presumed to encode proteins for common functions like CP transport. Little is known about the genetic organization of cps loci in Gram-positive bacteria, including S. pneumoniae.

We study the capsule biosynthesis of S. pneumoniae serotype 14, whose CP has a linear backbone of right-arrow6)-beta -D-GlcpNAc-(1right-arrow3)-beta -D-Galp-(1right-arrow4)-beta -D-Glcp-(1right-arrow repeating units, with monosaccharide side chains of a beta -D-Galp-(1right-arrow linked to C4 of each N-acetylglucosamine residue. We recently identified a chromosomal region of serotype 14 involved in CP synthesis and cloned at least a large part of the cps locus into a low copy number cosmid vector (14). Two genes, cps14E and cps14F, involved in CP synthesis were identified. Cps14E encodes a glycosyltransferase. In this communication, we report on the structure of five other genes of the serotype 14 cps locus. We also determined the functions of gene products by inactivating the genes and by functional expression of these genes in E. coli.


MATERIALS AND METHODS

Bacterial Strains and Growth Conditions

S. pneumoniae serotype 14 was strain NCTC11902. A serotype 4 strain was obtained from A. J. W. van Alphen (Academic Medical Center Amsterdam, The Netherlands). The pneumococcal strain CP1200 has been described previously (17). Pneumococci were grown at 37 °C in Todd-Hewitt broth supplemented with 0.5% yeast extract (THY-medium) or on blood agar plates. Tetracycline was used at a concentration of 10 µg/ml for selection of the S. pneumoniae mutants. E. coli CG120 (18), containing pAM120, was a gift from C. G. Rubens (University of Washington, Seattle, WA). E. coli DH5alpha (19) was used as a host for pBluescript II KS (Stratagene), and all clones were grown in or on LB or TB media (19) containing the appropriate antibiotic concentration.

Plasmids and Antibodies

Cosmid cMK02 and the plasmids pGV100, pMK100, pMK102, and pMK103 have been described before (14). pBluescript II KS was used for subcloning of fragments from cosmid clone cMK02. Monoclonal antibody Hasp14.1 (specific for serotype 14 polysaccharide) was obtained from U. B. Skov Sørensen (Statens Seruminstitut, Copenhagen, Denmark).

DNA Techniques

Cloning, transformation, Southern blot analysis, and most other DNA techniques were performed as described before (19). Chromosomal DNA was isolated as described by Ausubel et al. (20). Restriction endonucleases and T4 DNA ligase were purchased from Pharmacia-LKB. High-fidelity PCR was performed with Pfu DNA polymerase (Stratagene).

DNA Sequencing and Analysis

Plasmid DNA was sequenced using an A.L.F. DNA sequencer (Pharmacia-LKB), and sequencing data were assembled using pc/gene (IntelliGenetics). The programs TFASTA and BLAST were used to compare sequences to data bases available at the National Center for Biotechnology Information, Bethesda. Hydrophobic stretches within proteins were predicted by the method of Klein et al. (21). The nucleotide sequences of cps14B to cps14H have been deposited with GenBankTM/EBI DataBank.

Construction of Gene-specific Knock-out Mutants

Construction of cps14E and cps14F mutants has been described (14). An insertional mutation in cps14D, cps14G, and cps14H was constructed in vitro by blunt ligation of a 4.9-kilobase HincII fragment of pAM120 containing the tetracycline resistance gene of Tn916 (tetM) in the HpaI site of cps14D in pMK104, in the SacII site of cps14G in pGV100, and in the SpeI site of cps14H in pGV100, respectively. After transformation to E. coli DH5alpha , clones were selected in which the direction of transcription of the tetracycline resistance gene was similar to that of the adjacent cps genes. This yielded the plasmids pKOM13, pKOM01, and pKOM08, respectively. The cps14D, cps14G, and cps14H insertion mutation was introduced into the chromosome of the wild-type by natural transformation as described by Yother et al. (22), using the three constructs. Pneumoccoccal strain CP1200 was used as a donor strain for competence factor. Transformants were selected for tetracycline resistance, and mutations were confirmed by Southern hybridization analysis.

Construction of Plasmid pMK113

An in-frame deletion in cps14F (from nucleotide 3564 to 3893) was introduced in plasmid pMK100REV by performing a high fidelity PCR with the primers 14FBGLP1 (5'-ATAAAGATCTATGAGTTAAATGACCTCC-3') and 14FBGLP2 (5'-CAGGAGATCTGTACAATGGGAAGAAATG-3'), which both contain an internal BglII site, and pMK100REV as target DNA. The PCR product was digested with BglII, religated, and transformed to E. coli DH5alpha . The resulting plasmid of one of the clones, designated pMK113, was sequenced to confirm the in-frame deletion in cps14F and to check whether any mutations were introduced into cps14E and cps14G.

Membrane Preparations

Pneumococcal membranes were prepared as described (14). E. coli membranes were isolated by the same procedure with some modifications. Cultures were grown overnight in TB containing 50 µg/ml ampicillin at 37 °C and then diluted 10-fold in the same medium. Cells were grown to mid-log phase (A600 = 0.4-0.6), and then expression of the cps genes was induced by adding isopropyl-beta -D-thiogalactopyranoside (IPTG) to a final concentration of 0.5 mg/ml. The incubation was extended for 2 h at 37 °C and then cells were harvested by centrifugation. The cells were resuspended in 10 ml of 0.7 M sucrose, 50 mM Tris-Cl, pH 8.0, 1 mM EDTA, 20 mg of lysozyme was added, and the suspension was incubated for 3 h at 4 °C. The rest of the procedure was as described for pneumococcal membranes (14). The pneumococcal and E. coli membranes were stored at -70 °C and proved to be stable for several months.

Glycosyltransferase Activity Assays

Glycosyltransferase activity was essentially determined as described previously (14). For each reaction, 40 µl of membrane preparation (approximately 25 µg of protein) was incubated at 10 °C for 1 h with 0.025 µCi UDP-[14C]glucose (Amersham Life Science, Inc., 296 mCi/mmol), and/or 0.025 µCi UDP-[14C]galactose (Amersham, 305 mCi/mmol), or 0.025 µCi UDP-[14C]N-acetylglusosamine (Amersham, 251 mCi/mmol) as indicated in each experiment, and 10 mM MgCl2 in a final volume of 50 µl. When indicated, unlabeled UDP-glucose was added to a final concentration of 500 µM. Reactions were stopped by the addition of 1 ml chloroform:methanol (2:1). This solution was extracted three times with 0.2 ml of PSUP (1.5 ml of chloroform, 25 ml of methanol, 23.5 ml of water, and 0.183 g of KCl). The incorporation of 14C into the glycolipid fraction in the organic phase was measured in a scintillation counter (Beckman).

Analysis of Lipid-linked Intermediates by Thin Layer Chromatography

Lipid-linked intermediates were hydrolyzed from the lipid carriers by mild acid hydrolysis: one-fifth of the glycolipid fraction was dried in a Speed-Vac and resuspended in 100 µl of 1-butanol. 100 µl of 0.05 M trifluoroacetic acid was added, and the solution was heated for 20 min at 90 °C. The solution was dried again in a Speed-Vac, and the pellet was resuspended in 40% 2-propanol containing 5 mg/ml unlabeled carrier glucose, galactose, and lactose, and subjected to thin layer chromatography (TLC). TLC was carried out on HPTLC silica gel (Merck), developed in 1-butanol/ethanol/water (5:3:2), sprayed with En3hance spray (DuPont) and autoradiographed for 1-2 days. To visualize unlabeled sugar standards, the TLC plate was sprayed with 5% H2SO4 in ethanol and heated to 100 °C for 10 min.

Reduction Analysis of the [14C]-labeled Lactose Intermediate

[14C]-labeled lactose, released by mild acid hydrolysis of the lipid carrier of wild-type pneumococcal membranes incubated with UDP-[14C]glucose and UDP-[14C]galactose, was scraped off a TLC plate and eluted in water. Half of the sample was reduced with sodium borohydride as described previously (23) and desalted using a column of mixed-bed resin (IWT TMD-8, Aldrich). Both samples (reduced and unreduced lactose) were subjected to strong acid hydrolysis. The dried saccharides were dissolved in 80 µl H2O, 20 µl of 100% trifluoroacetic acid was added, and the solution was heated for 4 h at 100 °C. The solution was dried again, and the pellet was resuspended in 40% 2-propanol containing 5 mg/ml unlabeled glucose, galactose, sorbitol, and lactose, serving as carriers and internal standards. The sample was analyzed by TLC as described above, but to achieve a better separation of glucose, galactose, and sorbitol, the TLC plate was developed twice in in ethyl acetate/methanol/water/acetic acid (7:2:1:0.1).


RESULTS

The Biosynthesis of the Serotype 14 Polysaccharide Subunit Starts with Glucose

In this study, we focussed on the first two steps in the type 14 CP biosynthesis. Lipid-linked oligosaccharide intermediates were prepared by glycosyltransferase assays, performed on membrane preparations. Membranes were incubated with 14C-labeled UDP-activated monosaccharides that are part of the serotype 14 polysaccharide subunit: glucose, galactose, or N-acetylglucosamine. Serotype 4, which contains none of these monosaccharides in its polysaccharide structure, was used as a negative control. The incorporation of 14C-label into the glycolipid fraction was measured (Fig. 1A). Serotype 14 membranes efficiently incorporated [14C]glucose and [14C]galactose but hardly any [14C]N-acetylglucosamine. The incorporation of all labeled sugars into the glycolipid fractions of serotype 4 membranes was much lower than that of serotype 14. Our data agree well with those of Distler and Roseman (24), confirming that the 14C-labeled monosaccharides are incorporated into a serotype-specific glycolipid fraction and that either glucose or galactose is the first sugar linked to a lipid carrier. To address this further, the glycolipid fractions were subjected to mild acid hydrolysis and then analyzed by TLC and autoradiography (Fig. 1A). Incubation with only UDP-[14C]glucose resulted in incorporation of mainly glucose (lane 1), whereas UDP-[14C]galactose gave mainly lactose (beta -Gal(1right-arrow4)beta -Glc) and some glucose (lane 2). UDP-glucose will be formed from UDP-galactose by a UDP-glucose-4-epimerase activity, which is apparently present in this system (24). Incubation with UDP-[14C]N-acetylglucosamine gave no significant spot (lane 3), whereas unlabeled UDP-glucose, together with UDP-[14C]galactose, results in a prominent lactose spot (lane 4). Strong acid hydrolysis of 14C-labeled lactose, released by mild acid hydrolysis of membranes incubated with both UDP-[14C]glucose and UDP-[14C]galactose, gives glucose and galactose, whereas reduction of the 14C-labeled lactose intermediate, followed by strong acid hydrolysis, results in [14C]sorbitol and [14C]galactose (Fig. 1B). These data show that glucose is the reducing terminal sugar attached to a lipid carrier. In the second biosynthesis step, galactose is linked to this glucose residue, resulting in lipid-linked Galbeta (1-4)Glc (lactose).


Fig. 1. A, thin-layer chromatogram of [14C]-labeled intermediates of capsule polysaccharide synthesis in S. pneumoniae serotype 14 membranes. Glc, glucose; Gal, galactose; GNAc, N-acetylglucosamine; and Gal-Glc, lactose. The amount of incorporation of 14C-labeled monosaccharides (cpm/µg of protein) into the glycolipid fraction of membranes of S. pneumoniae types 14 and 4 is indicated at the bottom. B, thin-layer chromatogram of the 14C-labeled lactose intermediate after strong acid hydrolysis (left lane) and after reduction with NaBH4 followed by strong acid hydrolysis (right lane). Glc, glucose; Gal, galactose; and Sor, sorbitol.
[View Larger Version of this Image (15K GIF file)]

Sequence Analysis of Seven Open Reading Frames of the cps Locus of Serotype 14

We previously cloned the cps locus of serotype 14 into the low copy number vector pPR691, yielding cosmid clone cMK02 (14). Restriction fragments of this clone were subcloned in pBluescript, mapped (Fig. 2), and sequenced. Seven complete ORFs were found in this region. No transcription terminator signals were detected within the sequenced region, suggesting that the ORFs are part of one operon. Each ORF is preceded by a ribosome binding site sequence, and they were all closely coupled, being separated by a maximum of 18 nucleotides. The first four ORFs were almost identical to cps19fB-E of S. pneumoniae serotype 19F (13) and therefore designated cps14B-E. This homology abruptly ends from the start of cps14F. cps14F-H do not show any homology with cps genes of serotype 19F. Gene-specific knock-out mutants of cps14D-H no longer agglutinated with a monoclonal antibody against type 14 CP (see below), confirming that the genes are essential for type 14 CP synthesis. An overview of the properties of the ORFs and their translation products is given in Table I.


Fig. 2. Physical map of part of the cps locus on the chromosome of S. pneumoniae serotype 14. The location of seven ORFs and the direction of transcription are indicated by the arrows. pGV100 is a 2.4-kilobase EcoRI chromosomal DNA fragment cloned in pBluescript. pMK100, pMK102 to pMK106, pMK108, and pMK110 are subclones of cosmid clone cMK02. E, EcoRI; H, HindIII; R, RsaI; S, SpeI; and X, XbaI.
[View Larger Version of this Image (10K GIF file)]

Table I. Properties of the ORFs in the cps locus of S. pneumoniae serotype 14 and homologies with gene products of other bacteria


ORF Nucleotide position in sequence G + C% Number of amino acids Predicted mol weight Function gene product Homologous gene products (% identity) Reference

kDa
cps14B 1 -729 37.3 243 28.6 Unknown S. pneumoniae 19F Cps19fB (96.3%) 13
S. agalactiae type III CpsA (62.1%) 25
S. thermophilus sf16 EpsB (57.1%) 26
S. aureus type 1 CapC (34.1%) 27
cps14C 741 -1430 38.1 230 25.5 Chain length S. pneumoniae type 3 Orf1 (94.3%) 28
determinationa S. pneumoniae 19F Cps19fC (92.6%) 13
S. agalactiae type III CpsB (47.2%) 25
S. thermophilus Sfi6 EpsC (49.8%) 26
S. aureus type 1 CapA (24.4%) 27
R. meliloti ExoPc (23.4%) 29
cps14D 1443 -2123 34.9 227 24.8 Chain length S. pneumoniae 19F Cps19fD (95.2%) 13
determination/Export S. pneumoniae type 3 Orf2 (91.7%) 28
CPa S. agalactiae type III CpsC (58.0%) 25
S. thermophilus Sfi6 EpsD (57.5%) 26
S. aureus type 1 CapB (37.2%) 27
R. meliloti ExoPd (33.0%) 29
P. solanacearum EpsBd (29.5%) 31
E. amylovora AmsAd (31.4%) 32
K. pneumoniae Orf6d (28.8%) 33
cps14E 2142 -3506 32.7 455 52.7 Glucosyl-1-phosphate transferaseb see Kolkman et al. (1996) 14
cps14F 3513 -3959 32.4 149 17.3 Unknown Sphingomonas S88 SpsKc (33.8%) 35
cps14G 3962 -4462 27.7 167 19.6 Galactosyl transferaseb Sphingomonas S88 SpsKd (27.7%) 35
cps14H 4455 -5627 27.3 390 45.9 CP polymerasea Shigella flexneri Rfc (25.6%) 37
Salmonella typhimurium Rfc (21.0%) 38

a Predicted by sequence similarity.
b Experimentally determined in this study.
c Homology refers to the N-terminal part of the gene product of exoP or spsK, respectively.
d Homology refers to the C-terminal part of the gene product of exoP, epsB, amsA, orf6, or spsK, respectively.

The cps14B gene product is homologous to proteins encoded by genes of several other Gram-positive bacteria involved in CP or exopolysaccharide synthesis (Table I), but their function is unknown.

The proteins encoded by cps14C and cps14D are homologous to a family of ExoP-like proteins (Table I). They represent two domains that are mostly divided over two proteins, but they are also found combined in one protein (ExoP). The amino acid sequence AISPSSPNIKRNTLIGFLAG (residues 168-187) of Cps14C matches a consensus sequence motif identified for proteins involved in the chain length determination of polysaccharides (30, 26). Furthermore, the hydrophobicity plot of Cps14C resembles those of these proteins, with two hydrophobic segments at its N and C termini, and a hydrophilic domain in the cental part. These data suggest that Cps14C is involved in the chain length determination of type 14 CP. Cps14D contains nucleotide-binding motives as reported for most of the proteins homologous to Cps14D. The sequences GEGKTT (amino acids 46-51) and YIIVD (amino acids 128-132) match the consensus proposed for the Walker nucleotide-binding motifs A and B (34). The amino acid sequence FARAGYKTLLIDGDTR downstream of the Walker A motif matches for 90% another conserved sequence motif found in ATPases (LAXXGXKXLLIDXDXR) reported by Huang and Schell (31). Rhizobium meliloti strains carrying truncated ExoP proteins devoid of the C-terminal domain, including the nucleotide-binding motif, produced smaller amounts of exopolysaccharide with a lower molecular weight, indicating a less efficient transport and an altered chain length regulation (30). These data indicate that Cps14D may have a role in chain length determination and export of type 14 CP.

Cps14E, a glycosyltransferase, and its homology with other bacterial transferases has been described previously (14). Cps14E contains four potential membrane spanning stretches.

Cps14F shows significant homology with the N-terminal part of SpsK of Sphingomonas S88 (35). It contains one hydrophobic segment by which it may be anchored into the bacterial membrane. Cps14G is homologous to the C-terminal part of SpsK of Sphingomonas S88. These homologies suggest that the gene products of cps14F and cps14G act together. They may have been one gene originally, or spsK may have evolved from a gene fusion. Based on marginal similarities with putative UDP binding sites of other glycosyltransferases, it was suggested that SpsK is a glycosyltransferase (35). Also in Cps14G, a small region shows similarity with a proposed UDP binding site domain of UDP-glycosyltransferases (36). Fig. 3 shows sequence similarities of the putative UDP binding site domains of Cps14G (amino acids 58-101), Crtx of Erwinia herbicola (36), SpsK of Sphingomonas S88 (35), and a UDP-binding consensus sequence proposed by Hundle et al. (36). Four out of nine amino acids of Cps14G match this consensus sequence.


Fig. 3. Alignment of a region of Cps14G with a homologous domain of Sphingomonas S88 SpsK and the conserved domain of Erwinia herbicola zeaxanthin UDP-glucose transferase (Crtx) associated with UDP binding. UDP seq is the UDP-binding consensus sequence proposed by Hundle et al. (32). x represents variable residues.
[View Larger Version of this Image (16K GIF file)]

The deduced amino acid sequence of cps14H shows low homology with the O-antigen polymerases encoded by the rfc genes of Shigella flexneri (37) and Salmonella typhymurium (38). The hydrophobicity plot of Cps14H shows 9 to 11 potential membrane spanning regions (data not shown), characteristic for the O-antigen polymerases identified so far. Cps14H may be the type 14 polysaccharide polymerase.

Subunit Synthesis in Gene-specific Knock-out Mutants

The gene-specific knock-out mutants Spn#E and Spn#F, in which cps14E and cps14F are inactivated, have been described previously (14). Three more mutants were created as described under "Experimental Procedures," Spn#D, Spn#G, and Spn#H, in which cps14D, cps14G, and cps14H were disrupted. The correct location of the tetM insertions on the chromosome was confirmed by Southern hybridization analysis (data not shown). All five mutants were not agglutinated by Hasp14.1, a monoclonal antibody specific for serotype 14 CP, indicating that all five genes are involved in CP synthesis. Membranes of wild-type bacteria and the five mutants were prepared and incubated with radioactively labeled UDP-glucose and UDP-galactose. Membranes of wild-type bacteria and the mutants Spn#D, Spn#F, Spn#G, and Spn#H showed comparable levels of incorporation but Spn#E demonstrated a reduced transferase activity (Fig. 4), as described earlier (14). The lipid-linked intermediates of these mutant membranes were removed from the lipid carrier by mild acid hydrolysis and characterized by thin-layer chromatography (Fig. 4). Wild-type membranes and membranes of Spn#D both showed glucose and lactose as lipid-linked intermediates. These products were absent in the Spn14#E lane. Thus the cps14E gene product is responsible for the first step of CP biosynthesis, the transfer of glucose-1-phosphate onto the lipid carrier. We assume that the biosynthesis of other pneumococcal polysaccharides (e.g. C-polysaccharide) also proceeds via lipid-linked intermediates but at a much lower level than CP production. This may explain the low background of incorporated glucose into the glycolipid fraction of mutant Spn#E. Membranes of both Spn#F and Spn#G incorporated only glucose. This result shows that at least Cps14G is involved in the second step in the biosynthesis of serotype 14 CP, the addition of galactose to lipid-linked glucose. Since we cannot exclude the possibility that insertion of the tetracycline resistance cassette in cps14F has a polar effect on cps14G, we cannot draw any conclusions from this experiment whether cps14F is involved in galactosyltransferase activity or not. Therefore, a non-polar in-frame deletion was made in cps14F in an E. coli clone that is able to express cps14E to cps14G (see below). Membranes of Spn14#H demonstrated both glucose and lactose as lipid-linked intermediates, which shows that Cps14H (the putative CP polymerase) is not involved in the first two biosynthesis steps.


Fig. 4. Incorporation of [14C]glucose and [14C]galactose (cpm/µg of protein) into the glycolipid fraction of wild type and mutant membranes of S. pneumoniae serotype 14 (at bottom) and TLC of the 14C-labeled intermediates in the CP synthesis, hydrolyzed from the glycolipid fractions. 14, wild-type intermediates; #D, #E, #F, #G, and #H are intermediates produced by the mutants Spn#D, Spn#E, Spn#F, Spn#G, and Spn#H, respectively; Glc, glucose; Gal, galactose; and Gal-Glc, lactose.
[View Larger Version of this Image (29K GIF file)]

Expression of Pneumococcal Glycosyltransferase Genes in E. coli

It appeared that pneumococcal glycosyltransferases can be expressed in E. coli. Six different clones containing cps genes of serotype 14 were used (Fig. 5A). Membranes of the E. coli cells were isolated 2 h after induction with IPTG, and transferase assays were performed using labeled UDP-glucose and UDP-galactose and labeled UDP-galactose with an excess of unlabeled UDP-glucose. The incorporation of [14C]-label into the glycolipid fraction was measured (Table II), and formed oligosaccharide intermediates were hydrolyzed and characterized by TLC (Fig. 5B). As a marker, the lipid-linked intermediates hydrolyzed from pneumococcal membranes incubated with 14C-labeled UDP-glucose and UDP-galactose were run on the same TLC.


Fig. 5. A, overview of the plasmids, containing pneumococcal cps genes, used in the glycosyl transferase assays in E. coli (panel B). Fragments of the cps locus of serotype 14 cloned in pBluescript and the direction of transcription are indicated by the arrows. LP is the lac promoter of pBluescript. X, XbaI; E, EcoRI; and S, SacII. B, TLC of 14C-labeled intermediates in the CP synthesis of serotype 14, isolated from membranes of the wild type (wt) (first lane) and E. coli(DH5alpha ) clones expressing type 14 cps genes (panel A): pMK112 (second lane), pMK100REV (third lane), pMK100 (fourth lane), pMK113 (fifth lane), pGV100 (sixth lane), and pMK104 (seventh lane). Intermediates were hydrolyzed from membranes to which both UDP-[14C]glucose and UDP-[14C]galactose were added (see Table II). Glc, glucose; Gal, galactose; and Gal-Glc, lactose.
[View Larger Version of this Image (20K GIF file)]

Table II. Incorporation of 14C-labeled monosaccharides (cpm/µg of protein) into the glycolipid fraction of membranes of E. coli clones expressing pneumococcal cps genes


E. coli DH5alpha  + clone UDP-monosaccharides added
14C-Glc + 14C-Gal Glc + 14C-Gal

pMK112 121 3
pMK100REV 174 116
pMK100 53 51
pMK113 85 64
pGV100 8 4
pMK104 4 2

Clone pMK104 contains a major part of cps14B, the complete cps14C and cps14D genes and a small part of cps14E (287 bp) in the wrong orientation with respect to the lac promoter of pBluescript. This clone was used as a negative control since it contains no putative glycosyltransferase genes, and indeed, no glucose or galactose was incorporated.

Clone pMK112 contains the larger part of cps14E, cps14F, and half of cps14G (the first 276 bp) in the correct orientation with respect to the lac promoter. In this clone, the first 283 nucleotides of cps14E are missing, but a start codon (ATG) with a ribosome binding site (AGAAG) are present, resulting, when expressed, in a polypeptide missing the first 98 amino acids of Cps14E. When cultures were induced with IPTG, membranes of this clone showed considerable glycosyltransferase activity, but only [14C]glucose and not [14C]galactose was recovered from the glycolipid fraction. When 14C-labeled UDP-galactose and an excess of unlabeled UDP-glucose was added to these membranes, no radioactive monosaccharides could be detected in the glycolipid fraction.

Clones pMK100REV and pMK100 only differ in orientation of the insert. They contain the same major part of cps14E as clone pMK112, plus the entire cps14F and cps14G genes, and a part of cps14H. Membranes of clone pMK100REV showed a significant glycosyltransferase activity. Even with an excess of unlabeled glucose, [14C]galactose was incorporated. The TLC shows that lactose could be released from the lipid fraction. Clone pMK100 shows a weaker lactose spot on the TLC, indicating that there is some expression of the cps14 genes, independent of the lac promoter. Also in clone pMK113, a derivative of clone pMK100REV containing an in-frame deletion in cps14F, lactose could be hydrolyzed from the lipid fraction although the incorporation of 14C label is lower than in clone pMK100REV. These data confirm that that cps14E encodes a glucosyl-1-phosphate transferase and that cps14G encodes a beta -1,4-galactosyltransferase. Moreover, these transferase assays show that Cps14F is not required for the transfer of galactose to lipid-linked glucose.

Clone pGV100 contains a small part of the carboxyl-terminal end of cps14E (577 bp), the entire cps14F and cps14G genes, and a large part of cps14H. This clone did not show any glycosyltransferase activity. Since no functional Cps14E is expressed in this clone, lipid-linked glucose is not present as an acceptor for Cps14G activity, and therefore lipid-linked lactose is not formed.


DISCUSSION

In this study, we characterized a chromosomal region involved in capsular polysaccharide biosynthesis of S. pneumoniae serotype 14. Seven complete open reading frames (cps14B-H), which seem to belong to a single transcriptional unit, were found. Five gene-specific knock-out mutants were created by insertion of a tetracycline resistance cassette (tetM) in cps14D-H. All five mutants did not react with a type 14 specific monoclonal antibody, indicating that cps14D-H are part of the cps locus of serotype 14.

The first four ORFs, cps14B-E, are almost identical to cps19fB-E (97% identity at the DNA level) of serotype 19F. This homology ends abruptly from the start of cps14F, suggesting that a recombination event in this area led to exchange of cps genes. Data that suggest that horizontal gene transfer of CP biosynthesis genes in pneumococci occur have been reported before (39). In Gram-negative bacteria (e.g. E. coli and H. influenzae), serotype-specific genes that are involved in CP synthesis are located between two conserved regions that are presumed to code for common functions (15, 16). Recombination between the flanking conserved CP genes would be expected to result in another capsular type and evidence for the in vivo horizontal transfer of serotype-specicific CP genes of H. influenzae has been reported (40). Cross hybridization experiments between pneumococcal cps genes and chromosomal DNA of several serotypes (13)2 showed that the cps14/19fA-D genes are conserved in many serotypes. Whether this forms one of the two conserved regions as observed in the organization of cps loci in Gram-negative bacteria remains to be clarified.

In earlier work, we showed that cps14E encodes a glycosyltransferase (14), and we suggested that both Cps14E and Cps19fE are glucosyltransferases since glucose is the only monosaccharide that serotypes 14 and 19F have in common. In this study, we have shown that the subunit synthesis of serotype 14 starts with the addition of glucose to a lipid carrier and that Cps14E is responsible for this first step in the oligosaccharide synthesis. Also in serotype 19F, the subunit synthesis starts with the addition of glucose to a lipid carrier (data not shown), and therefore, it seems likely that Cps19fE catalyzes the first step of CP synthesis in serotype 19F. The CP structure of serotype 14 is very similar to that of Streptococcus agalactiae type III, the only difference is an extra sialic acid residue linked to the galactose side group of S. agalactiae type III CP. Cps14E has a considerable similarity with CpsD of S. agalactiae type III (14). Rubens et al. (25) found reduced galactosyltransferase activity in a CpsD mutant, indicating that cpsD encodes a galactosyltransferase. This suggests that CP biosynthesis in both streptococcal species is rather different. However, the possibility that also in their study UDP-galactose is converted into UDP-glucose by an epimerase, and that glucose is transferred to a lipid carrier cannot be excluded. E. coli clone DH5alpha (pMK112) contains a major part of cps14E downstream of the lac promoter of pBluescript, and, when induced, membranes of this clone showed considerable glucosyltransferase activity. Most likely, a polypeptide, missing the first 98 amino acids of Cps14E, is expressed that still has glucosyltransferase activity. This is in agreement with the observation that expression of only the 3'-half of wbaP (rfbP) of Salmonella enterica, a gene homologous to cps14E and involved in the first step of O-antigen synthesis, results in the synthesis of a polypeptide with glycosyltransferase activity (41).

Both the cps14F and the cps14G mutant still showed glucosyltransferase activity, but both mutants were blocked in the second step of the subunit synthesis, the transfer of galactose to lipid-linked glucose. Insertion of tetM in cps14D has no polar effect on cps14E, which is situated immediately downstream of cps14D, since mutant Spn#D is still able to incorporate glucose and galactose into the glycolipid fraction. However, the possibility that the lack of galactosyltransferase activity of Spn#F is caused by a polar effect on cps14G, rather then the cps14F mutation itself, could not be excluded. Therefore, a non-polar in-frame deletion was made in cps14F in E. coli clone DH5alpha (pMK100REV), resulting in clone DH5alpha (pMK113). Membranes of E. coli DH5alpha (pMK100REV), which contains cps14E to cps14G, showed glucosyl and galactosyl transferase activity. Both activities were still present in clone DH5alpha (pMK113), which shows that Cps14F is not required for galactosyltransferase activity. Thus cps14G solely encodes the beta -1,4-galactosyltransferase activity required for the second step in the biosynthesis of the oligosaccharide subunit in S. pneumoniae serotype 14, the transfer of galactose to lipid-linked glucose. The homology of Cps14F and Cps14G to the N-terminal half and C-terminal half of the Sphingomonas SpsK protein, respectively, suggests a combined function for both proteins. The function of CpsF is not clear, but the reduced glycosyltransferase activity in clone DH5alpha (pMK113) compared with the activity of DH5alpha (pMK100REV) indicates that Cps14F has an enhancing role in glycosyltransferase activity. It is unlikely that cps14F encodes another glycosyltransferase since preliminary sequencing data revealed the presence of two ORFs downstream of cps14H, which both show homology to several other glycosyltransferases. These putative genes would encode the beta -1,3-N-acetylglucosaminyltransferase and the beta -1,4-galactosyltransferase, required for the last two sugar additions in the oligosaccharide subunit synthesis of S. pneumoniae serotype 14 CP.

Cross-hybridizations between cps14F and cps14G probes and chromosomal DNA of several other pneumococcal serotypes, showed that only serotypes 13, 15A, 15B, 15C, and 15F contain cps14F- and cps14G-like sequences. Both probes reacted with EcoRI fragments of the same size, indicating that both genes also exist together in these serotypes. In addition, these five types all contain Galbeta (1-4)Glc as part of their subunit structure (10), and transferase assays performed with membranes of these serotypes showed that lipid-linked Galbeta (1-4)Glc is formed (data not shown). These data suggest that genes homologous to cps14G encode similar beta -1,4-galactosyltransferases, required for the CP synthesis in these pneumococcal serotypes.


FOOTNOTES

*   This work was supported by the Dutch Organization for Scientific Research (NWO) Grant 900-514-150 (to M. K.) and by the Utrecht Institute for Infection and Immunity (to P.N.).The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide(s) sequences reported in this paper has been submitted to the GenBankTM/EBI DataBank with accession number(s) X85787.


Dagger    To whom correspondence should be addressed: Dept. of Bacteriology, Institute of Infectious Diseases and Immunology, P. O. Box 80.165, 3508 TD Utrecht, The Netherlands. Tel.: 31-30-253-4888; Fax. 31-30-254-0784; E-mail: zeijst{at}vetmic.dgk.ruu.nl.
§   Present address: Intervet International BV, 5830 AA Boxmeer, The Netherlands.
1   The abbreviations used are: CP, capsular polysaccharide; IPTG, isopropyl-beta -D-thiogalactopyranoside; TLC, thin layer chromatography; ORF(s), open reading frame(s); bp, base pair(s); LB, Luria-Bertani; TB, terrific broth.
2   M. A. B. Kolkman, B. A. M. van der Zeijst, and P. J. M. Nuijten, unpublished data.

ACKNOWLEDGEMENTS

We thank Colin Pritchard and Olga Tap for technical assistance and Hans Kamerling and Hans Vliegenthart for helpful discussions and advice.


REFERENCES

  1. Boulnois, G. J., and Roberts, I. S. (1990) Curr. Top. Microbiol. Immunol. 150, 1-18 [Medline] [Order article via Infotrieve]
  2. Moxon, E. R., and Kroll, J. S. (1990) Curr. Top. Microbiol. Immunol. 150, 65-85 [Medline] [Order article via Infotrieve]
  3. Cross, A. S. (1990) Curr. Top. Microbiol. Immunol. 150, 87-95 [Medline] [Order article via Infotrieve]
  4. Henrichsen, J. (1995) J. Clin. Microbiol. 33, 2759-2762 [Abstract]
  5. Watson, D. A., and Musher, D. M. (1990) Infect. Immun. 58, 3135-3138 [Medline] [Order article via Infotrieve]
  6. Bruyn, G. A. W., and Furth, R. (1991) Eur. J. Clin. Microbiol. Infect. Dis. 10, 897-910 [Medline] [Order article via Infotrieve]
  7. Jennings, H. J. (1990) Curr. Top. Microbiol. Immunol. 150, 97-128 [Medline] [Order article via Infotrieve]
  8. Siber, G. R. (1994) Science 265, 1385-1387 [Medline] [Order article via Infotrieve]
  9. Douglas, R. M., Paton, J. C., Duncan, S. J., and Hansman, D. J. (1983) J. Infect. Dis. 148, 131-137 [Medline] [Order article via Infotrieve]
  10. van Dam, J. E. G., Fleer, A., and Snippe, H. (1990) Antonie van Leeuwenhoek 58, 1-47 [Medline] [Order article via Infotrieve]
  11. Avery, O. T., MacLeod, C. M., and McCarthy, M. (1944) J. Exp. Med. 79, 137-158
  12. Dillard, J. P., Vandersea, M. W., and Yother, J. (1995) J. Exp. Med. 181, 973-983 [Abstract]
  13. Guidolin, G., Morona, J. K., Morona, R., Hansman, D., and Paton, J. C. (1994) Infect. Immun. 62, 5384-5396 [Abstract]
  14. Kolkman, M. A. B., Morrison, D. A., van der Zeijst, B. A. M., and Nuijten, P. J. M. (1996) J. Bacteriol. 178, 3736-3741 [Abstract]
  15. Boulnois, G. J., and Jann, K. (1989) Mol. Microbiol. 3, 1819-1823 [Medline] [Order article via Infotrieve]
  16. Kroll, J. S., Zamze, S., Loynds, B., and Moxon, E. R. (1989) J. Bacteriol. 171, 3343-3347 [Medline] [Order article via Infotrieve]
  17. Morrison, D. A., Lacks, S. A., Guild, W. R., and Hageman, J. M. (1983) J. Bacteriol. 156, 281-290 [Medline] [Order article via Infotrieve]
  18. Gawron-Burke, C., and Clewell, D. B. (1984) J. Bacteriol. 159, 214-221 [Medline] [Order article via Infotrieve]
  19. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  20. Ausubel, F. M., Brent, R., Kingston, R. E., Moore, D. D., Smith, J. A., Seidman, J. G., and Struhl, K. (1987) Current Protocols in Molecular Biology, John Wiley & Sons, NY
  21. Klein, P., Kanehisa, M., and DeLisi, C. (1985) Biochim. Biophys. Acta 815, 468-476 [Medline] [Order article via Infotrieve]
  22. Yother, J., McDaniel, L. S., and Briles, D. E. (1986) J. Bacteriol. 168, 1463-1565 [Medline] [Order article via Infotrieve]
  23. Kennedy, E. P., Rumley, M. K., Schulman, H., and van Golde, L. M. G. (1976) J. Biol. Chem. 251, 4208-4213 [Abstract]
  24. Distler, J., and Roseman, S. (1964) Proc. Natl. Acad. Sci. U. S. A. 51, 897-905 [Medline] [Order article via Infotrieve]
  25. Rubens, C. E., Heggen, L. M., Haft, R. F., and Wessels, M. R. (1993) Mol. Microbiol. 8, 843-855 [Medline] [Order article via Infotrieve]
  26. Stingele, F., Neeser, J-R., and Mollet, B. (1996) J. Bacteriol. 178, 1680-1690 [Abstract]
  27. Lin, W. S., Cunneen, T., and Lee, C. Y. (1994) J. Bacteriol. 176, 7005-7016 [Abstract]
  28. Arrecubieta, C., García, E., and López, R. (1995) Gene (Amst.) 167, 1-7 [CrossRef][Medline] [Order article via Infotrieve]
  29. Glucksmann, M. A., Reuber, T. L., and Walker, G. C. (1993) J. Bacteriol. 175, 7045-7055 [Abstract]
  30. Becker, A., Niehaus, K., and Pühler, A. (1995) Mol. Microbiol. 16, 191-203 [Medline] [Order article via Infotrieve]
  31. Huang, J., and Schell, M. (1995) Mol. Microbiol. 16, 977-989 [Medline] [Order article via Infotrieve]
  32. Bugert, P., and Gelder, K. (1995) Mol. Microbiol. 15, 917-933 [Medline] [Order article via Infotrieve]
  33. Arakawa, Y., Wacharotatayankun, R., Nagatsuka, T., Ito, H., Kato, N., and Ohta, M. (1995) J. Bacteriol. 177, 1788-1796 [Abstract]
  34. Higgins, C. F. (1992) Annu. Rev. Cell. Biol. 8, 67-113 [CrossRef]
  35. Yamazaki, M., Thorne, L., Mikolajczak, M., Armentrout, R. W., and Pollock, T. J. (1996) J. Bacteriol. 178, 2676-2687 [Abstract]
  36. Hundle, B. S., O'Brien, D. A., Alberti, M., Beyer, P., and Hearst, J. E. (1992) Proc. Natl. Acad. Sci. U. S. A. 89, 9321-9325 [Abstract]
  37. Morona, R., Mavris, M., Fallarino, A., and Manning, P. A. (1994) J. Bacteriol. 176, 733-747 [Abstract]
  38. Collins, L. V., and Hackett, J. (1991) J. Bacteriol. 173, 2521-2529 [Medline] [Order article via Infotrieve]
  39. Coffey, T. J., Dowson, C. G., Daniels, M., Zhou, J., Martin, C., Spratt, B. G., and Musser, J. M. (1991) Mol. Microbiol. 5, 2255-2260 [Medline] [Order article via Infotrieve]
  40. Kroll, J. S., and Moxom, E. R. (1990) J. Bacteriol. 172, 1374-1379 [Medline] [Order article via Infotrieve]
  41. Wang, L., Liu, D., and Reeves, P. R. (1996) J. Bacteriol. 178, 2598-2604 [Abstract]

©1997 by The American Society for Biochemistry and Molecular Biology, Inc.