Key words: Agrobacterium sp. ATCC31749/(1->3)-[beta]-glucan/repetitive [beta]-glycosyl transferases/[beta]-glucan synthases/biosynthesis/curdlan/hydrophobic cluster analysis/protein sequence analysis
Extracellular homo- and hetero-polysaccharides are produced by many bacterial species (Leigh and Coplin, 1992; Sutherland, 1993). The best known is the (1->4)-[beta]-glucan, cellulose, from Acetobacter xylinum, also produced by Agrobacterium tumefaciens, Rhizobium spp., Sarcina ventriculi, and other species (Ross et al., 1991). Members of the Rhizobiaceae, including Agrobacterium, Bradyrhizobium, and Rhizobium, synthesize linear and cyclic (1->2)-[beta]-glucans and cyclic (1->3, 1->6)-[beta]-glucans (Breedveld and Miller, 1994).
A linear (1->3)-[beta]-glucan, known as curdlan, is produced by an Agrobacterium sp. and some other bacteria (Harada and Harada, 1996). This polysaccharide is obtained in good yields in an insoluble, microfibrillar form when the Agrobacterium is grown on glucose-rich medium, and its synthesis is induced on depletion of the nitrogen source (Phillips and Lawford, 1983). The (1->3)-[beta]-glucan molecules in native curdlan probably exist as an association of single helical chains (Kasai and Harada, 1980). When suspensions of native curdlan are heated at 55-70°C, a low set gel is produced. In this state, x-ray diffraction (Okuyama et al., 1991) and 13C NMR (Saito et al., 1977) studies have demonstrated that most of the molecules are single chain helices but some triple-stranded helices may also occur. On heating to higher temperatures, the strength of the gel increases and the (1->3)-[beta]-glucan molecules are present as triple-stranded helices (Kasai and Harada, 1980). The unique gel forming properties of curdlan have led to its use as a food texture modifier (Harada et al., 1993; Harada and Harada, 1996) and as an enhancer of flow properties of concrete (Sakamoto et al., 1991).
Structurally comparable (1->3)-[beta]-glucans are found in the walls of yeasts and certain groups of filamentous fungi (Stone and Clarke, 1992; Cid et al., 1995) and as storage polysaccharides in brown algae (laminarin), euglenoids (paramylon), chrysophytes (leucosin) and some fungi (cellulin, mycolaminarin, pachyman; Stone and Clarke, 1992). Higher plants also produce a linear (1->3)-[beta]-glucan, callose, in the cell plate at the earliest stage of cell wall deposition, and in special walls in cells of reproductive tissues (e.g., pollen mother cells, pollen tubes) as well as forming deposits on the plasma membrane in abiotic and biotic stress (Stone and Clarke, 1992).
Curdlan is specifically stained by the triphenylmethane dye aniline blue (Nakanishi et al., 1974) and by the aniline blue fluorochrome (Evans et al., 1984) as are other (1->3)-[beta]-glucans such as yeast (Saccharomyces cereviseae) glucan, pachyman (Poria cocus; Nakanishi et al., 1974), and callose (Vitis vinifera; Aspinall and Kessler, 1957). Aniline blue has thus been used both to detect curdlan production by bacteria grown on agar medium (Nakanishi et al., 1974) and to identify mutants unable to produce curdlan (Nakanishi et al., 1976).
A considerable body of biochemical and molecular genetic information is available concerning bacterial cellulose synthesis, including the intriguing indication that the mechanisms may differ between A.xylinum (Saxena et al., 1991; Saxena and Brown, 1995) and A.tumefaciens (Matthysse et al., 1995a) even though these bacteria encode highly homologous cellulose synthases. Nothing is known about the molecular genetics of curdlan biosynthesis in bacteria or callose synthesis in higher plants, but there is growing information about the genes required for (1->3)-[beta]-glucans produced by yeasts and filamentous fungi (Douglas et al., 1994; Inoue et al., 1995; Mazur et al., 1995; Kelly et al., 1996; Mio et al., 1997). Thus, knowledge of the synthesis and secretion of curdlan is also of considerable interest from a comparative viewpoint. In addition, this information might prove useful in the isolation of plant (1->3)-[beta]-glucan synthase gene homologues as has been achieved for plant cellulose synthase genes (Pear et al., 1996). As a first stage in such a study of curdlan, we have cloned several essential curdlan genes from the industrially used strain, Agrobacterium sp. ATCC31749 (formerly Alcaligenes faecalis var. myxogenes; Harada et al., 1966). The genes occur in two nonoverlapping genomic clones that complement the mutations in curdlan-deficient insertion-mutants. The nucleotide sequence of one gene, crdS, was determined and, based on its homology with known [beta]-glycan synthases, is proposed to encode the (1->3)-[beta]-glucan (curdlan) synthase catalytic protein. Isolation of curdlan-deficient mutants of Agrobacterium
Curdlan-producing (Crd+) strains of bacteria form colonies that stain dark blue on agar medium containing aniline blue whereas curdlan nonproducers form nonstaining colonies (Nakanishi et al., 1976). This phenotypic difference permitted the isolation of curdlan-deficient mutants of LTU50 (a Cmr mutant of Agrobacterium sp. ATCC31749) after mutagenesis with transposon TnphoA(Manoil and Beckwith, 1985). This was achieved by conjugation experiments in which a suicide plasmid containing TnphoA(Kmr) (i.e., pRT733; Taylor et al., 1986) was mobilized from E.coli SM10 to LTU50. Kanamycin-resistant transconjugant colonies were isolated at frequencies of about 5 × 10-3 per donor cell and some (about 0.15%) stained poorly or not at all with aniline blue. When curdlan production by such mutants (LTU61-LTU113) was assessed from the amount of alkali-soluble polysaccharide that they produced (Nakanishi et al., 1976), it was significantly less than that from LTU50 (6 g l-1). Moreover, the amount correlated with the staining intensity of the colonies (i.e., <0.2-0.5 g l-1 from white-colony mutants (Crd-) and 1-4 g l-1 from mutants that formed weakly staining colonies (Crd+/-)). Southern analysis of the mutants showed that each had TnphoA inserted in one of nine distinguishable StuI or PvuI fragments (from 1-9 kb in size). Thus, it was likely that several different genes required for curdlan synthesis had been affected. Cloning of curdlan synthesis genes from Agrobacterium
The broad host range plasmid RP1 (=RK2) (Pansegrau et al., 1994) was found to transfer efficiently from an HB101 derivative to Agrobacterium LTU50, and to be stably maintained in this strain (data not shown). The cosmid vector pLAFR1, which is derived from RK2 (Friedman et al., 1982), was therefore chosen for the construction of an LTU50 gene library in E.coli. Wild type curdlan genes in the library were detected using probes containing disrupted crd gene sequences from Crd- mutants carrying distinguishable insertion mutations. The probes were obtained by cloning the Kmr-containing EcoRI-junction fragments from five mutants (LTU61-LTU64 and LTU87) into pUC19 (forming pVS1500-pVS1504, respectively). Each junction fragment contained a different length of Agrobacterium DNA (from 0.2-4.0 kb) but the same portion of TnphoA (including the HpaI (0.18 kb) site; Figure
Figure 1. (A) Simplified map of transposon TnphoA (Manoil and Beckwith, 1985) showing the location of IS50L containing the phoA gene, the kan gene, IS50R, and various restriction sites. The IS50 elements are shaded. (B) Map of curdlan locus I of Agrobacterium sp. LTU50 in an 8.8 kb EcoRI fragment cloned in pUC19 (pVS1512) and pLAFR1 (pVS1513). The location of Crd- TnphoA-insertion mutations (in LTU61-LTU66) is shown relative to the putative curdlan synthase gene, crdS, and various restriction sites. The orientation of insertion of each insertion is indicated by the flag which points towards the Hpa1 site The direction of transcription of crdS is from left to right. The bars below the map represent EcoRI-junction fragments that were cloned from LTU61-LTU66 into pUC19 (pVS1501-pVS1504). HpaI(TnphoA)-EcoRI(LTU) fragments recovered from pVS1501-pVS1504 were used as probes (P1501-P1504). Restriction site abbreviations: B, BamHI; Bg, BglII; E, EcoRI; H, HpaI; K, KpnI; S, SalI; Sm, SmaI; Ss, SstII.
When 4200 colonies containing the gene library were screened with the probes, 25 colonies (Group I) hybridized with P1501-P1504, whereas 12 colonies (Group II) hybridized only with P1500. This suggested that the recombinant cosmids in the two groups of bacteria contained nonoverlapping genomic clones and hence that at least one different gene required for curdlan synthesis must be present in each clone. The first of these conclusions was confirmed by EcoRI-digestion of a genomic clone from each of Group I (pVS1506) and Group II (pVS1511). In each case, both the number and the sizes of all of the fragments differed (five fragments from pVS1506 (of 1.2-9.5 kb) and more than nine from pVS1511 (from about 0.2-5.5 kb)). The second conclusion, that each clone carried different crd genes, was confirmed by complementation analysis of the set of Crd- mutants (LTU61-LTU88) that formed white colonies on ABA medium. Plasmids pVS1506 and pVS1511 were each transferred individually into these mutants by mobilization with the helper plasmid (an RK2-derivative) located in the chromosome of E.coli S17-1 (Simon et al., 1983). Curdlan production was restored in 26 mutants (LTU61-LTU86) by pVS1506 and in the remaining two mutants (LTU87 and LTU88) by pVS1511. Identification of two regions (locus I and locus II) in the Agrobacterium genome encoding crd genes
The region of pVS1506 responsible for complementation of the mutations in LTU61-LTU86 was found to be located in an 8.8 kb EcoRI fragment. This fragment was identified initially because it was the only one from pVS1506 that hybridized with probes P1501-P1504 (derived from pVS1501-pVS1504, the clones recovered from mutants LTU61-LTU64). When the 8.8 kb fragment was cloned into pLAFR1, the resulting plasmid (pVS1513; Figure
The Crd- mutations in LTU87 and LTU88 that were complemented only by the second cosmid recombinant (pVS1511) were found by Southern analysis to be located in the same 2.5 kb PvuI- and 8 kb StuI-fragments but different EcoRI-fragments (an 0.4 kb fragment in LTU87 and another, unidentified, fragment in LTU88; data not shown). These mutations are therefore closely linked and define the position of locus II. Sequencing and analysis of the locus I gene, crdS
The plasmids pVS1501-pVS1504 carry junction fragments cloned from four different locus I insertion mutants. This permitted a TnphoA-specific primer to be used to determine the sequence of the Agrobacterium DNA adjacent to the transposon in each plasmid. Of the sequences obtained, only those from LTU61 (in pVS1501) and LTU62 (in pVS1502) shared homology with known genes (see below). The single crd gene that had been inactivated in these two mutants was then sequenced and named crdS. This 1623 nt gene commences with an ATG start codon and terminates with a TGA stop codon. A putative ribosome binding site (GCGAGGT) (Shine and Dalgarno, 1974) occurs 8 nt upstream of the initiation codon, but no consensus -10 and -35 E.coli-like promoter sequences (Staden, 1984) were detected in the upstream (120 nt) sequences. The G+C content of crdS is 57% and is thus within the range found for Agrobacterium species (57-63%; Kersters and DeLey, 1984).
The crdS gene encodes a predicted 540 amino acid protein (59 kDa) that contains a high proportion of hydrophobic amino acids (50.4%) suggestive of a membrane-associated protein. This possibility was supported by hydropathy analysis (Kyte and Doolittle, 1982) which revealed stretches of hydrophobic residues, four of which are predicted by the ALOM program (Klein et al., 1985) to be transmembrane domains (residues 9-33, 35-58, 379-395, and 482-498; Figure
Figure 2. Hydropathy profile of CrdS. The plot (Kyte and Doolittle, 1982) is of the derived amino acid sequence of CrdS, using a window of 12 residues. The number of amino acids is plotted on the x-axis, and hydropathy is plotted on the y-axis. Of the six hydrophobic stretches, four (indicated by bars I-IV) are predicted by the ALOM program (Klein et al., 1985) to be transmembrane domains. Domain I is also a putative cleavable signal sequence (von Heijne, 1986).
The CrdS protein was found by a BLASTX search (Atshul et al., 1990; Gish and States, 1993) to share homology with members of the HasA family of [beta]-glycosyl transferases with repetitive action patterns (Keenleyside and Whitfield, 1996). Further analysis using LFASTA (Pearson and Lipman, 1988) revealed that homology over regions exceeding 300 amino acid residues was highest (ca. 46% similarity, 26% identity) with bacterial cellulose synthases, namely, with Acetobacter xylinum AcsA (Saxena et al., 1994), AcsAII (Saxena and Brown, 1995) and BcsA (Wong et al., 1990), Agrobacterium tumefaciens CelA (Matthysse et al., 1995b) and with cellulose synthase homologs from E.coli (YhjO; Sofia et al., 1994) and a chlorella virus (A473L; Kutish et al., 1996). Lower homology was found with other repetitive glycosyl transferases from bacteria (e.g., with Rhizobium meliloti NodC (30% similarity, 22% identity); Jacobs et al., 1985) and plants (i.e., with a partial gene sequence from Arabidopsis thaliana T88271 (ca. 36% similarity, 27% identity); Saxena and Brown, 1997) and with cotton CelA1 and CelA2 cellulose synthases (ca. 40% similarity, 23% identity; Pear et al., 1996) when the cotton-specific regions of these proteins were omitted from the comparison]. Interestingly, no significant homology was detected with closely related proteins believed to be components of fungal and yeast (1->3)-[beta]-glucan synthases (Cid et al., 1995; Inoue et al., 1995; Mazur et al., 1995; Kelly et al., 1996, Mio et al., 1997) nor with bacterial proteins involved in cyclic (1->3,1->6)-[beta]-glucan or (1->2)-[beta]-glucan production (Zorrigueta et al., 1988; Ielpi et al., 1990; Bhagwat et al., 1996).
A CLUSTALW (Thompson et al., 1994) alignment of CrdS and the bacterial cellulose synthase proteins, which are all about 200 residues longer, showed that homology was confined mainly to the central regions corresponding to CrdS residues 118 to 353 (Figure
Figure 3. Multiple alignment of the entire CrdS amino acid sequence (residues 1-540) and portions of the polypeptides of A.xylinum (residues 41-589), BcsA (residues 41-591), and AcsAII (residues 37-587), E.coli YhjO (residues 1-531), and A.tumefaciens CelA (residues 17-583), using the CLUSTALW program (v. 1.60). Each protein name is preceded by an acronym for the name of the bacterium from which the sequence was derived. Domains A and B previously identified in repetitive [beta]-glycosyl transferases are indicated by the horizontal lines above the sequence (Saxena et al., 1995; Keenleyside and Whitfield, 1996). Boxed regions enclose other conserved motifs described in Table I including the components of the D,D,D,QXXRW motif that are distributed across domains A and B (Saxena and Brown, 1997). Identical amino acid residues in all six sequences shown here are highlighted in boldface and marked with asterisks, whereas conservative amino acid replacements are indicated with dots.
Figure 4. Phylogenetic tree constructed by the maximum parsimony algorithm (TREEALIGN; Hein, 1990) using the amino acid sequence of CrdS (Figure 3) and other sequences retrieved from GenBank.
Computer analyses have revealed that [beta]-glycosyl transferases share conserved sequences and structural features (based on hydrophobic cluster analysis (HCA); Saxena et al., 1995; Saxena and Brown, 1997). The repetitive transferases in this group contain a D,D,D35QXXRW motif distributed over two domains, A and B (Saxena et al., 1995; Keenleyside and Whitfield, 1996). Non-repetitive transferases have domain A but not B, and so have only the first two Asp residues of the motif (Saxena and Brown, 1995; Keenleyside and Whitfield, 1996). Inspection of the CrdS sequence (Figure
Domain A also contains an incompletely conserved sequence proposed to be a potential UDP-glucose/UDP-N-acetylglucosamine binding motif (Delmer and Amor, 1995). A modified motif derived from the Delmer sequences (Delmer and Amor, 1995; Kawagoe and Delmer, 1997) was detected in all cellulose synthases and several other [beta]-glycosyl transferases as well as in domain A of CrdS (residues 146-157; Table I), and was located in the expected position (Figure
Figure 5. HCA plots (Lemesle-Varloot et al., 1990) of CrdS, CelA, and AcsA for the amino acid sequence indicated under each plot. The analysis was performed using the HCA-Plot program (Doriane Informatique, Le Chesnay, France). The sequences are written on a classical [alpha]-helix (3.6 amino acids per turn) smoothed on a cylinder. To make the three-dimensional representation easier to handle, the cylinder is cut parallel to its axis and then spread out. Since some adjacent amino acids are now separated by the unfolding of the cylinder, their representation is duplicated to restore the full connectivity of the amino acids. Clusters of adjacent hydrophobic residues (Ala, Val, Leu, Ile, Met, Phe, Gln) are within contours. Amino acids are denoted with the one-letter code except for proline (stars), glycine (diamonds), serine (squares with dot), and threonine (squares). Vertical lines are drawn to indicate structurally conserved features including transmembrane domains (indicated by bars but numbered only for CrdS), domains A and B identified in AcsA (Saxena et al., 1995), and the various conserved motifs (enclosed in boldface outline).
The experiments described here are the first report of the cloning of genes required for the biosynthesis of the bacterial (1->3)-[beta]-glucan, curdlan. These genes occur in two unlinked loci (I and II) and were identified in nonoverlapping genomic clones of Agrobacterium sp. ATCC31749 by their ability to complement transposon (TnphoA)-insertion mutations that blocked curdlan synthesis. The crdS gene, which was fully sequenced, occurs in locus I together with at least two other curdlan genes. This was suggested by the finding that mutations within crdS, and on either side of it, prevented curdlan production (Figure
Evidence that locus I contains a gene required for curdlan synthesis rests on the characteristics of crdS, the curdlan gene that was sequenced. Mutations in this gene (in LTU61 and LTU62; Figure
Although computer-based comparisons have provided a means for identifying (Saxena et al., 1995) and classifying glycosyl transferases (Keenleyside and Whitfield, 1996; Campbell et al., 1997; Saxena and Brown, 1997), experimental evidence supporting functional roles for the conserved sequences is still limited. Amino acids in the D,D,D35QXXRW motif of Saccharomyces cerevisiae chitin synthase (Chs2p; Nagahashi et al., 1995) and A.xylinum cellulose synthase (AcsAB; Saxena and Brown, 1997) have been implicated in the catalytic mechanism by site-directed mutagenesis studies. In AcsAB, replacement of the first or second Asp residue impaired in vitro enzyme activity, as did replacement in Chs2p of the first or third Asp, or of Gln, Arg, or Trp in the QXXRW sequence. Even conservative replacements of the third Asp, or of Gln, Arg, or Trp in Chs2p were not tolerated, suggesting that these residues are involved in the catalytic reaction. The two residues flanking the third Asp in Chs2p were also essential for enzyme activity. This is of interest as one of these, Glu, is conserved in CrdS and the bacterial cellulose synthases (Figure
The binding motif for UDPGlc, the presumptive curdlan synthase substrate, was initially recognized on the basis of photoaffinity labeling which showed that a peptide from spinach (Spinacea oleracea) sucrose phosphate synthase (EC 2.4.1.13) could bind this monosaccharide donor (Salvucci and Klein, 1993). Part of this peptide sequence occurs in the UDPGlc-binding motif recognized by Delmer and Amor (1995) in plant and other glycosyl transferases (Kawagoe and Delmer, 1997), and includes the first Asp of the D,D,D35QXXRW motif (Figure
The detection of two potential UDPGlc-binding motifs in CrdS was based on a search using a modified motif derived from sequences identified by Delmer and Amor (1995) and Kawagoe and Delmer (1997). Both CrdS motifs differ from the consensus sequence at three positions, but the first aligns with the corresponding single motif in the cellulose synthases (Figure
Perhaps the most intriguing finding of this study is that CrdS, which catalyses (1->3)-[beta]-linkage formation, shares considerable homology in both sequence and structure with cellulose synthases which specifically form (1->4)-[beta]-linkages. This homology also extends to two bifunctional enzymes that catalyze the formation of both (1->3) and (1->4) linkages between two different monomeric units. These are the hyaluronan synthase (HasA) from Streptococcus pyogenes (Dougherty and van de Rijn, 1994) and Type 3 capsular polysaccharide synthase (Cps3S) from S.pneumoniae (Arrecubieta et al., 1995; Dillard et al., 1995). HasA and Cps3S also display the sequence and structural characteristics of repetitive glycosyl transferases (Keenleyside and Whitfield, 1996) and have been grouped into the same family as the bacterial cellulose synthases (Campbell et al., 1997). On the basis of the results presented here, this family would also include CrdS. The important issue of which features determine linkage (and donor) specificity in these enzymes remains to be resolved. One proposal is that linkage specificity is determined by subtle features of the active site rather than by the global folding reflected by HCAs (Campbell et al., 1997). There are two examples in which limited replacement of amino acids over a restricted sequence determines specificity. One example is in the family of human [alpha]-fucosyltransferases which share 85% sequence homology and discriminate between acceptor substrates to form either (1->3)- or (1->4)-[alpha]-fucosyl linkages (Legault et al., 1995). The other example is the (1->3)- and (1->3, 1->4)-[beta]-glucan hydrolases (EC 3.2.1.39 and EC 3.2.1.73, respectively) which hydrolyze (1->3)- or (1->4)-[beta]-glucosidic linkages in their respective substrates. Because their polypeptide backbones are structurally very similar, the differences in their substrate specificity is acquired primarily by substitutions in amino acids lining a deep substrate-binding groove on the enzyme surface (Varghese et al., 1994; Høj and Fincher, 1995). Alternatively, linkage specificity may be provided by ancillary proteins. An analogy would be the [beta]-galactosyl transferases (EC 2.4.1.22 and EC 2.4.1.90) which, in the presence of [alpha]-lactalbumin, use glucose as an acceptor to form Gal[beta](1->4)Glc (lactose) but in its absence the acceptor is GlcNAc on N-linked oligosaccharides forming a Gal[beta](1->4)GlcNAc structure (Hill and Brew, 1975).
Table II. Introduction
Results
Discussion
Table I.
Strain or plasmid
Relevant characteristics
Source or reference
E.coli
DH5[alpha]
recA1 hsdR17
Gibco-BRL
SM10
recA chr-RP4-2-Tc::Mu Kmr Tra+
Simon et al. (1983)
S17-1
recA hsdR chr-RP4-2-Tc::Mu-Km::Tn7 Tpr Smr Tra+
Simon et al. (1983)
HB101
recA13 hsdR hsdM rpsL20
Sambrook et al. (1989)
LE392
hsdR
Stratagene Cloning Systems
Agrobacterium
ATCC31749
Type strain; Crd+
ATCC
LTU50a
Spontaneous Cmr derivative of ATCC31749
This work
LTU61 & LTU62
LTU50crdS::TnphoA
This work
LTU63-LTU86
LTU50::TnphoA locus I Crd- mutants
This work
LTU87 & LTU88
LTU50::TnphoA locus II Crd- mutants
This work
LTU89-LTU113
LTU50::TnphoA Crd+/-
This work
Plasmids
RP1
Apr Tcr Kmr Tra+ IncP
Pansegrau et al. (1994)
pLAFR1
Broad host range cosmid vector; oriVRK2 oriTRK2;Tcr
Friedman et al. (1982)
pUC19
Apr, cloning vector
Yanisch-Perron et al. (1985)
pRT733
Suicide plasmid; oriVR6K oriTRP4 TnphoA; AprKmr
Taylor et al. (1989)
pVS1500
pUC19 with 6.8 kb EcoRI-junction fragment of LTU87; Kmr Apr
This work
pVS1506
LTU50 locus I clone in pLAFR1; Tcr
This work
pVS1511
LTU50 locus II clone in pLAFR1; Tcr
This work
Bacterial strains, plasmids, and media
The bacteria and plasmids used in this study are listed in Table I. Additional plasmids and their modes of construction are shown in Figure
Detection and assay of curdlan
Curdlan production by bacteria was detected on ABA medium which contains the (1->3)-[beta]-glucan-specific dye, aniline blue (Nakanishi et al., 1976). After incubation at 28°C for 5 d, curdlan-producing colonies are stained blue by the dye. Curdlan production in broth was assayed quantitatively by the method of Nakanishi et al. (1976) and involved stirring a 50 ml ADB culture of Agrobacterium (cell density ~5 × 108/ml) together with an equal volume of 1 M NaOH. After 15 min the mixture was centrifuged and the supernatant neutralized (pH 7.0) by adding 4 M HCl. The precipitate that formed was collected by centrifugation at 20,000 × g for 10 min, washed three times with distilled water, then frozen at -20°C before being freeze-dried and weighed.
Conjugation procedures for transposon mutagenesis and complementation studies
Bacterial cultures in 10 ml NB were grown to about 5 × 108 cells/ml either at 37°C for E. coli donors or 28°C for Agrobacterium recipients. Conjugations were performed at 28°C for 3-4 h on nitrocellulose filters as described by Palombo et al. (1989) and then the transconjugants recovered on selective medium incubated at 28°C for 3-5 d. Transposon mutagenesis of Agrobacterium was performed using the suicide plasmid pRT733 which carries TnphoA and is mobilizable from E.coli SM10 (Taylor et al., 1989). To ensure the isolation of non-sibling mutants, separate filter matings between E.coli SM10 carrying pRT733 and LTU50 were performed and the transconjugants selected on ABA medium containing Cm and Km (the phoA characteristic was not utilized). Colonies that failed to stain with aniline blue were purified and the inheritance of TnphoA and loss of pRT733 confirmed by the Kmr Aps phenotype of the bacteria. Complementation studies involved conjugation between E.coli S17-1 carrying a pLAFR1-recombinant and recipient cultures (TnphoA-induced curdlan mutants). The transconjugants were selected on ABA medium containing Tc and Cm and after growth of the bacteria at 28°C for 3-5 d, complementation was detected by the blue staining of the transconjugant colonies.
DNA techniques and preparation of an Agrobacterium genomic library
Methods for DNA isolation, manipulation, cloning, and transformation of electrocompetent DH5[alpha] cells were primarily adapted from the manual by Sambrook et al. (1989). DNA fragments were purified from low melting agarose gels using the Bresa-Clean DNA isolation kit (Bresatec) and then used for cloning or as probes in Southern or colony hybridizations. DNA-DNA hybridizations were performed by the method of Sambrook et al. (1989) for the detection of low abundance sequences. Probes were prepared from the 2.8 kb BglII fragment of TnphoA (Figure
DNA sequencing and sequence analysis
The DNA sequence of crdS was determined using a TnphoA-based primer (5[prime]-CATGAACGTTACCATGTTAG-3[prime]) to sequence the Agrobacterium DNA adjoining TnphoA in pVS1501 and pVS1502 (derived from LTU61 and LTU62; Figure
Nucleotide sequence accession numbers
The nucleotide sequence of crdS was submitted to GenBank under the accession number AF057142. GenBank accession numbers of other sequences used in this study (see Table I) were AcsA (X54676), BcsA (M96060), AcsAII (U15957), CelA (L38609), CelA1 (U58283), CelA2 (U58284), A472L (U42580), HasA (L21187), NodC (M11268), and Chs1 (M14045). The YhjO accession number was from SwissProt P37653 and accession number T88271 was from the EMBL EST database.
We thank Dr. B. T. O. Lee and Dr. S. R. Long for providing SM10 and pLAFR1, respectively; Dr. J. Beckwith for the nucleotide sequence of TnphoA; and Dr. B. Henrissat for the HCA program.
ATCC, American Type Culture Collection; HCA, hydrophobic cluster analysis; UDPGlc, uridine diphosphate-[alpha]-d-glucose.
2To whom correspondence should be addressed
3Present address: Department of Biochemistry, Mt. Sinai School of Medicine, New York, NY 10029