©1996 by The American Society for Biochemistry and Molecular Biology, Inc.
Genomic Organization and Chromosomal Mapping of the Gal1,3GalNAc/Gal1,4GlcNAc 2,3-Sialyltransferase (*)

(Received for publication, July 31, 1995; and in revised form, October 17, 1995)

Hiroshi Kitagawa (§) Marie-Geneviève Mattei (1) James C. Paulson (¶)

From the From Cytel Corporation, San Diego, California 92121 and Groupe Hospitalier de la Timone, U242 INSERM, 13385 Marseille, Cedex 5, France

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
REFERENCES

ABSTRACT

In this report we describe the chromosome mapping and genomic organization of the human Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase gene. The gene is localized to human chromosome 11(q23-q24) by in situ hybridization of metaphase chromosomes. It spans more than 25 kilobases of human genomic DNA and is distributed over 14 exons that range in size from 61 to 679 base pairs. Previous characterization of cDNAs encoding the Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase revealed that the gene produces at least three transcripts in human placenta, which code for identical protein sequences except at the 5` ends (Kitagawa, H., and Paulson, J. C. (1994a) J. Biol. Chem. 269, 1394-1401). Repeated screening for clones that contain the 5` end of the cDNA has identified two additional distinct mRNAs that are expressed in human placenta. Comparison of the genomic DNA sequence with that of the five different mRNAs indicates that these transcripts are produced by a combination of alternative splicing and alternative promoter utilization. Northern analysis indicated that one of them is specifically expressed in placenta, testis, and ovary, indicating that its expression is independently regulated from the others.


INTRODUCTION

Sialic acid-containing oligosaccharide structures found on glycoproteins and glycolipids are known to vary with species, tissue type, and stage of development. The structural diversity of these carbohydrates is believed to be used by the cell to mediate specific cellular recognition processes including protein targeting, cell adhesion, and cellular differentiation and development (Kornfeld, 1987; Rademacher et al., 1988; Paulson, 1989; Brandley et al., 1990; Varki, 1992; Powell and Varki, 1995). The high degree of structural diversity observed in the terminal glycosylation sequences of glycoprotein carbohydrates is generally believed to be specified by the glycosyltransferases produced by the cell. Accumulating evidence suggests that the regulated expression of these enzymes may account for the synthesis of cell type-specific carbohydrate structures (Paulson et al., 1989; Kleene and Berger, 1993; Kitagawa and Paulson, 1994b; Natsuka and Lowe, 1994). Despite the growing number of glycosyltransferase cDNAs which have been cloned, limited information is available concerning the organization and regulation of the expression of glycosyltransferase genes (Joziasse, 1992; Kleene and Berger, 1993).

The sialyltransferase family consists of 12-15 or more glycosyltransferases grouped by their common function of transferring sialic acid from CMP-NeuAc to terminal positions on the sugar chains of glycoproteins and glycolipids. To date, 11 cDNAs of these enzymes have been cloned (Weinstein et al., 1987; Gillespie et al., 1992; Wen et al., 1992b; Livingston and Paulson, 1993; Sasaki et al., 1993; Kitagawa and Paulson, 1994a; Kurosawa et al., 1994a, 1994b; Lee et al., 1994; Nara et al., 1994; Sasaki et al., 1994; Haraguchi et al., 1994; Kojima et al., 1995; Yoshida et al., 1995; Eckhardt et al., 1995). Of these, only the beta-galactoside alpha2,6-sialyltransferase gene has been extensively characterized (Svensson et al., 1990; Wang et al., 1990; O'Hanlon and Lau, 1992; Wen et al., 1992a; Svensson et al., 1992; Aasheim et al., 1993; Wang et al., 1993). This gene was found to be relatively large, spanning over 80 kb (^1)in length, producing at least six different messages, via alternative promoter usage and mRNA splicing in a tissue-specific fashion.

In this report we have examined the gene of human Galbeta1, 3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase for which the cDNA has recently been cloned from human melanoma cell line WM266-4 and human placenta and partial characterization of the human gene has been recently reported (Sasaki et al., 1993; Kitagawa and Paulson, 1994a; Chang et al., 1995). The gene was found to span more than 25 kb and produce at least five distinct transcripts in human placenta. Northern analysis indicated that one of them is specifically expressed in placenta, testis, and ovary. The results suggested that the human alpha2,3-sialyltransferase gene is expressed tissue specifically by a combination of alternative splicing and alternative promoter utilization. Finally, we document that this gene and the human Galbeta1,3(4)GlcNAc alpha2,3-sialyltransferase gene, which has the highest homology to this gene, reside in different human chromosomes, 11q23-q24 and 1p34-p33, respectively.


EXPERIMENTAL PROCEDURES

Materials

Bluescript plasmid vector, FIXII vector, Escherichia coli strains XL1-Blue, XL1-Blue MRA, and XL1-Blue MRA(P2), Gigapack II XL packaging extracts, and Pfu polymerase were purchased from Stratagene. Restriction enzymes were purchased from New England Biolabs, Pharmacia LKB Biotech Inc., and Life Technologies, Inc. [alpha-P]dCTP was obtained from Amersham.

Isolation of Human Genomic Clones

Human genomic DNA (Clontech) was partially digested with Sau3AI and then ligated into XhoI-digested FIXII (Stratagene). The resultant library was packaged using a Stratagene Gigapack II XL packaging extract and plated on E. coli XL1-Blue MRA(P2) (Stratagene). Approximately 1 million plaques were screened with radiolabeled sialyltransferase cDNA as described (Kitagawa and Paulson, 1994a). Multiple clones (C1-C21) were isolated, and three of them, C1, C11, and C21, were characterized in detail. Insert DNA fragments were initially characterized by restriction digestion and Southern blot analysis (Sambrook et al., 1989). Human genomic DNA fragments that hybridized to the sialyltransferase cDNA probes were subcloned into Bluescript plasmid vectors. Nucleotide sequencing was carried out on double-stranded templates using a Sequenase sequencing kit (United States Biochemical Corp.).

PCR Amplification of 5`-cDNA End (RACE)

Human placenta poly(A) RNA was isolated from total RNA (Clontech) using oligo(dT)-cellulose type 2 (Collaborative Research). Amplification of the 5` end of the Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase cDNA was performed according to the manufacturer's instruction (5`-AmpliFINDERRACE kit, Clontech), using 1 µg of poly(A) RNA as a template. The primer, 5`-AGCTTAGAGGCCTTGCTCTCTG-3`, was used for the initial reverse transcription. After synthesis of the first strand of DNA, an anchor primer provided by the company and each isoform-specific internal primer, 5`-GAGGGCTTGACCCTGATTGA-3`, 5`-AGCTTCCAGCCCACTGTCCT-3`, and 5`-TTGCTGACCATGTTTCTCAG-3`, from the type A1 (former Long A), type A2 (former Short), and type B (former Long B) forms of the alpha2,3-sialyltransferase cDNAs (Kitagawa and Paulson, 1994a), respectively, were used for PCR. Each cDNA was amplified using Pfu polymerase (Stratagene) for 30 cycles of a step program (95 °C, 45 s; 55 °C, 45 s; 73 °C, 2 min). The PCR products were subcloned into Bluescript (Stratagene) and then sequenced using a T7 primer (Stratagene).

Northern Analysis

Multiple tissue Northern blots of poly(A) RNAs were purchased from Clontech Laboratories for the analysis. The blots were probed with a gel-purified, radiolabeled (>1 times 10^9 cpm/µg), 1.3-kb EcoRI fragment isolated from STZ-2 (Kitagawa and Paulson, 1994a). The type A1 (former Long A) form-specific fragment (191 bp) was generated by PCR using 5` primer, 5`-CTTCATCTTGAAGGACAGTGG-3`, and 3` primer, 5`-CTTCCAGCCTGCAGGACACAT-3`. PCR reaction was carried out with Pfu polymerase by 30 cycles of 95 °C for 45 s, 55 °C for 45 s, and 73 °C for 90 s. The PCR fragment was purified, radiolabeled (>1 times 10^9 cpm/µg), and then used for Northern blots as a probe. The hybridized blots were washed at room temperature in 2 times SSC, 0.1% SDS for 10 min, then twice at 65 °C in 0.1 times SSC, 0.1% SDS for 20 min, and exposed to x-ray film for several days.

In Situ Chromosome Hybridization

In situ hybridization was carried out on chromosome preparations obtained from phytohemagglutinin-stimulated human lymphocytes cultured for 72 h. 5-Bromodeoxyuridine was added for the final 7 h of culture (60 µg/ml medium), to ensure a posthybridization chromosomal banding of good quality. The clones STZ-1 (Kitagawa and Paulson, 1994a) and ST3NHP-1 (Kitagawa and Paulson, 1993) were used as probes, respectively. The conditions for labeling of probes and hybridization and washing were as described previously (Nguyen et al., 1986). After coating with nuclear track emulsion (KODAK NTB(2)), the slides were exposed for 18 days at 4 °C and were developed. To avoid any slipping of silver grains during the banding procedure, chromosome spreads were first stained with a buffered Giemsa solution, and metaphases were photographed. R-banding was then performed by the fluorochrome-photolysis-Giemsa method, and metaphases were rephotographed before analysis.


RESULTS

Identification of the Novel Isoforms of the Human Galbeta1, 3GalNAc/Galbeta1,4GlcNAc alpha2,3-Sialyltransferase

The cDNAs of Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase were reported previously to consist of at least three isoforms in human placenta (Kitagawa and Paulson, 1994a). To identify the 5` ends of the Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase transcripts and possibly identify other mRNA isoforms that differ in the 5` end, we employed a rapid amplification of cDNA ends-polymerase chain reaction (RACE-PCR) cloning strategy combined with reverse transcription polymerase chain reaction. Human placenta poly(A) RNA was reverse-transcribed using a common primer among the three isoforms cloned previously (type A1, type A2, and type B; see ``Experimental Procedures''). PCR amplification was then performed using the anchor primer, and each isoform-specific internal primer revealing five distinct PCR products. Two bands were identified with the type A1 or type A2 form-specific primers, respectively, and three bands were detected with the type B form-specific primer. After subcloning the PCR product and sequencing individual clones, we found that there were two other novel isoforms related to the previously described type B form (Kitagawa and Paulson, 1994a) which differed in their 5` noncoding sequences and are referred to as type B2 and type B3. The nucleotide sequences of the nonhomologous region at the 5` end of the three type B forms are presented in Fig. 1. These two additional type B transcripts indicate that the human alpha2,3-sialyltransferase gene produces at least five transcripts in human placenta.


Figure 1: Comparison of the sequences in the 5`-untranslated region of the type B1, type B2, and type B3 forms of alpha2,3-sialyltransferase cDNA isolated from human placenta. Homologous sequence (only the first 16 bp) is represented by boldface letters.



Isolation and Characterization of the Human alpha2,3-Sialyltransferase Gene

In order to gain information about the organization and the regulation of Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase gene in human tissues, it was necessary to isolate the genomic sequences containing this gene. A cDNA (type A1) was used as a probe to screen a human placenta genomic DNA library. Several independent clones were isolated and subjected to a Southern blot analysis. Three were found to overlap with each other and contained all of the exons corresponding to the type A1 form of alpha2,3-sialyltransferase cDNA. As summarized in Fig. 2, coding sequences for the type A1 and type A2 forms protein of human alpha2,3-sialyltransferase gene are divided into 9 exons, and that for the type B form protein is divided into 10 exons, ranging in size from 61 bp to 679 bp. Exons E3 and E6-E14 contain the coding sequence indicated in Fig. 2. Exon E14 also contains the 3`-untranslated region which included the poly(A) attachment site, ATTAAA. The locations of the alpha2,3-sialyltransferase exons were determined by sequencing of the genomic clones. Moreover, sequencing was done to determine the exact size of the exons as shown in Table 1as well as the sequence of the intron/exon junctions as shown in Table 2. All the intron/exon junctions were found to follow the GT/AG rule and were flanked by conserved sequences (Breathnach and Chambon, 1981). The sequence encoded by these exons is identical with the sequence of the human alpha2,3-sialyltransferase cDNA reported by us (Kitagawa and Paulson, 1994a). Sasaki et al.(1993) has also cloned the type B1 form cDNA from human melanoma cell line WM266-4 using the expression cloning method with the cytotoxic lectin. However, the cDNA lacks the 12 bp, from nucleotide position 87 to 108, in the stem region of the type B1 form reported by us (Kitagawa and Paulson, 1994a). The sequence of the cDNA at the boundary of the 12-bp deletion matches consensus splice donor site sequence. From both this observation and our analysis of the intron/exon structure of the gene, it became apparent that alternative splicing of the mRNA takes place within exon E6. Although we have screened human placenta cDNA extensively, this type of isoform has not been detected. Although no genomic clones were identified that contained the 5`-most exon (exon E1) of the type B1 form, restriction mapping by Southern blot analysis indicates that the exon is found to be more than 15 kb upstream of exon E2 (data not shown).


Figure 2: Genomic map of the alpha2,3-sialyltransferase gene. Exons are labeled E1-E14. EcoRI (E) and HindIII (H) restriction sites are shown as hash marks. Exon regions are denoted by boxes. Black boxes represent coding sequence and open boxes denote 5`- and 3`-untranslated sequences. Shown below are the splicing patterns for each type of message described here. Abbreviations for each message are used in the text. In the previous publication (Kitagawa and Paulson (1994a)), types A1, A2, and B1 were referred to as Long A, Short, and Long B, respectively.







In summary, the analysis suggests that the entire alpha2,3-sialyltransferase gene spans over 25 kb of human genomic DNA. It should be noted that, in contrast to the genomic organization of alpha2,6-sialyltransferase (Svensson et al., 1990), the highly conserved sialylmotif, used to clone this alpha2,3-sialyltransferase cDNA (Kitagawa and Paulson, 1994a), is divided into two exons, exon E10 and E11 (Fig. 2). In addition, the unique sequence found on the 5` end of the type B2 and type B3 forms (see Fig. 1) were mapped between exon E1 and exon E3, indicating that these were produced by alternative promoter utilization (Fig. 2). The exon E2 for the 5` end of type B2 form was located 397 bp upstream of exon E3. The 5`-most transcriptional start site of the type B3 form was located only 44 bp upstream of exon E3, and the mRNA was formed to the 3` end of exon E3 without splicing. These results suggest that the five mRNA isoforms are produced by a combination of alternative splicing and alternative promoter utilization, and, consequently, that the mRNA is formed from a combination of 14 exons of the alpha2,3-sialyltransferase gene.

Analysis of the Transcriptional Start Sites and the Sequence of the alpha2,3-Sialyltransferase Promoter Region

The sequence of the 5`-flanking region of the type A1, type A2, and type B3 forms, and the type B2 form of the alpha2,3-sialyltransferase gene is shown in Fig. 3and Fig. 4, respectively. The transcriptional start sites were determined by sequencing the RACE-PCR products as described above and were marked by arrows in the figures. Sequencing of the PCR product of the type A1, type A2, the type B2, and the type B3 forms also revealed that transcription of the former two forms, type A1 and type A2, initiates at two positions and that of the latter two forms, type B2 and type B3, initiates at several positions, indicated by arrows in Fig. 3and Fig. 4. As expected from the observation that there are multiple sites of transcription initiation, both of the 5`-flanking regions lack canonical TATA or CCAAT boxes, but do contain several other well characterized promoter elements as shown in Table 3and Table 4. As shown in Fig. 3and Table 3, this region contains six sequence motifs similar to AP2 recognition element (Mitchell et al., 1987), of which four are just upstream of the transcriptional start sites of the type B3 form and two are just upstream of those of the type A1 and type A2 forms. In addition, three potential mammary cell activating factor (MAF) consensus sequences (Mink et al., 1992) were identified; one potential Sp1 consensus sequence (Kadonaga et al., 1986) was found; and four ETF consensus sequences which stimulate transcription of promoters lacking TATA boxes (Kageyama et al., 1989) were identified. Moreover, two LF-A1 consensus sequences (Hardon et al., 1988), one HLH consensus sequence (Blackwell and Weintraub, 1990), and one NF-1-like protein binding site (Paonessa et al., 1988) were found.


Figure 3: Sequence of the type A1, type A2, and type B3 forms of human alpha2,3-sialyltransferase promoter region. The start sites of transcription for each isoform are shown by arrows. The consensus binding sites for the transcription factors AP2, Sp1, LF-A1, HLH (helix-loop-helix proteins), NF-1, and MAF are underlined and those for the transcription factor ETF are boxed.




Figure 4: Sequence of the type B2 form of human alpha2,3-sialyltransferase promoter region. The start sites of transcription are shown by arrows. The consensus binding sites for the transcription factors AP1, AP2, MAF, CArG, HLH, PEA3, NF-1, and OCT are underlined, and those for the transcription factor ETF are boxed.







As shown in Fig. 4and Table 4, the 5`-flanking region of the type B2 form also contains three sequence motifs similar to the MAF recognition element, one sequence similar to the AP1 binding site (Lee et al., 1987), four sequences similar to the AP2 binding site, seven ETF consensus sequences, one HLH consensus sequence, and one NF-1-like protein binding site. Moreover, three additional sequence motifs were detected. Three CArG consensus binding sites are present, a sequence motif required for expression of smooth muscle-specific genes (Reddy et al., 1990), one OCT (octamer binding transcription factor) consensus binding site (Cox et al., 1988) is identified, a sequence motif recognized by an octamer-related proteins which have been implicated in the control of the histone 2b gene and the melanocyte-specific tyrosinase-related protein TRP1 (Lowings et al., 1992), and one PEA3 consensus sequence is also present (Faisst and Meyer, 1992).

Tissue-specific Expression of the Type A1 Form

As described in the previous paper (Kitagawa and Paulson, 1994a), the alpha2,3-sialyltransferase exhibits a unique tissue-specific pattern of expression. In order to determine whether the alpha2,3-sialyltransferase transcripts are tissue-specifically expressed by a combination of alternative splicing and alternative promoter utilization, Northern blots with mRNAs from human adult and fetal tissues were probed with the type A1 form-specific fragment (Fig. 5b). For comparison, the same Northern blot was probed by a full-length cDNA probe of the alpha2,3-sialyltransferase shown in Fig. 5a, which should detect all five transcripts. This result indicates that the type A1 form mRNA is specifically expressed in placenta, ovary, and testis. For the three type B forms, however, the length of the cDNA specific for each type is relatively short so that it was not possible to get a visible signal.


Figure 5: Tissue-specific expression of the type A1 form transcript in various human tissues. Northern blots with mRNA from various adult and fetal human tissues were hybridized with a probe corresponding to the coding sequence of the alpha2,3-sialyltransferase (a) or with a probe corresponding to the specific sequence of the type A1 form cDNA (b), as described under ``Experimental Procedures.''



Chromosomal Mapping of Two Human alpha2,3-Sialyltransferase Genes

Using a cDNA probe of the Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase, in situ hybridization to normal metaphase chromosomes was performed to determine the chromosomal localization of the human gene. Of 100 metaphase cells examined from this hybridization, 197 silver grains were associated with chromosomes, and 42 of these (21.3%) were located on chromosome 11. The distribution of grains on this chromosome was not random, and 83.3% of them were mapped to the q23-q24 region of chromosome 11 long arm (Fig. 6a). These results clearly indicate that the gene is located at human chromosome 11q23-q24.


Figure 6: Distribution of labeled sites on chromosome 11 for Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase gene (a) and chromosome 1 for Galbeta1,3(4)GlcNAc alpha2,3-sialyltransferase gene (b). The peak of hybridization occurs on band q23-q24 of chromosome 11 (a) and on band p34-p33 on chromosome 1 (b).



Comparison of sequences of nine cloned sialyltransferases revealed that the highest homology of Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase to the other 10 sequences was with that of Galbeta1,3(4)GlcNAc alpha2,3-sialyltransferase. Accordingly, an experiment similar to that described above was carried out to determine the chromosomal location of the Galbeta1,3(4)GlcNAc alpha2,3-sialyltransferase gene. Of 100 metaphase cells examined for hybridization, 188 silver grains were associated with chromosomes, and 51 of these (27.1%) were located on chromosome 1. As shown in Fig. 6b, 78.4% of them were mapped to the p34-p33 region of chromosome 1 short arm. These results indicate that the Galbeta1,3(4)GlcNAc alpha2,3-sialyltransferase gene is localized at chromosome 1p34-33.


DISCUSSION

To date, the genomic organization has been reported for several glycosyltransferases (Joziasse, 1992; Kleene and Berger, 1993; Chang et al., 1995). The rat alpha2,6-sialyltransferase gene is divided into at least 12 exons which span over 80 kb in length (Wen et al., 1992a). Similarly, the alpha1,3-galactosyltransferase gene is distributed over 9 exons that span over 35 kb (Joziasse et al., 1992), and the beta1,4-galactosyltransferase gene is also distributed over 6 exons that span over 40 kb of genomic sequence (Hollis et al., 1989). This alpha2,3-sialyltransferase gene falls into the same pattern. In contrast, several exceptions to this pattern are beta1,2-GlcNAc-transferase I, beta1,4-GlcNAc-transferase III, several alpha1,3-fucosyltransferases, and beta1,6-GlcNAc-transferase genes. The entire coding sequence of these genes appears to be contained within a single exon (Hull et al., 1991; Lowe et al., 1991; Weston et al., 1992a, 1992b; Bierhuizen et al., 1993; Ihara et al., 1993). It is unclear whether the occurrence of two patterns of glycosyltransferase genomic organization has an evolutionary significance.

The gene for human Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase is distributed over 14 exons that span at least 25 kb of genomic sequence. Transcription of this gene results in the production of five distinct mRNAs (type A1, type A2, type B1, type B2, and type B3 forms) in human placenta, each approximately 2.0 kb in size, that are generated by a combination of alternative splicing and alternative promoter utilization. Translation of these individual mRNAs predicts the biosynthesis of three related protein isoforms of the alpha2,3-sialyltransferase which were previously referred to as the Long A, Long B, and Short forms, of 332, 333, and 322 amino acids, respectively, which has been confirmed by in vitro translation. (^2)Structurally, these three protein isoforms differ from each other only at its N-terminal that is the cytoplasmic tail and the part of transmembrane domain (Kitagawa and Paulson, 1994a). The biological significance of the three different protein isoforms is presently unclear.

The observation of multiple transcripts for this alpha2,3-sialyltransferase gene has also been observed with other glycosyltransferase genes including those of the alpha2,6-sialyltransferase and the beta1,4-galactosyltransferase (Paulson et al., 1989; Russo et al., 1990; Wen et al., 1992a; Aasheim et al., 1993; Wang et al., 1993). In case of the alpha2,6-sialyltransferase, at least six different transcripts were produced via alternative splicing and alternative promoter usage. The most well-characterized one is a 4.3-kb mRNA found almost exclusively in the liver (Wen et al., 1992a), which is generated from six exons of the gene (Svensson et al., 1990). Two distinct forms of a 4.7-kb mRNA, one is highly expressed in B-cells and another is expressed at low levels in most tissues (Aasheim et al., 1993; Wang et al., 1993), have been identified. The two transcripts are also produced from the same six exons as the 4.3-kb one with the addition of one or two 5`-untranslated exons (Aasheim et al., 1993; Wang et al., 1993). Thus, these three transcripts have identical coding sequences but having different 5`-untranslated sequences. Since the coding sequences are identical, the different mRNA species in this case are a consequence of the cell type-specific regulation of the expression of this complex gene. In addition, three forms of a 3.6-kb mRNA have been isolated from rat kidney. Although these transcripts are generated from the alpha2,6-sialyltransferase gene, they retain less than 50% of the coding region and do not have sialyltransferase activity (Wen et al., 1992a; Harduin-Lepers et al., 1993). Moreover, they have only been detected in the kidney in rat and not in human (Kitagawa and Paulson, 1994b).

In the case of the alpha2,3-sialyltransferase, as shown in this report, at least five transcripts are produced from a single gene locus by a combination of alternative splicing and alternative promoter usage, which each codes for identical protein, Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase, except at the 5` ends. Why are alternative promoters and alternative splicing used for the production of five mRNAs coding for the same enzyme? Northern analysis of the tissue distribution of them demonstrated that one of the alpha2,3-sialyltransferase mRNA isoforms, type A1 form, is specifically expressed in placenta, testis, and ovary although either of them is constitutively expressed in all the tissues examined (Fig. 5). This pattern of expression is likely a consequence of differential regulation at the level of transcription as demonstrated for the alpha2,6-sialyltransferase and the beta1,4-galactosyltransferase (Svensson et al., 1992; Harduin-Lepers et al., 1993).

The sequence analysis of the 5`-flanking region of the alpha2,3-sialyltransferase isoforms revealed the heterogeneous transcriptional start sites and the absence of typical TATA and CCAAT boxes coupled with the presence of GC boxes ( Fig. 3and Fig. 4). These structural features are believed to be typical of the so-called housekeeping genes, which are expressed at low levels in essentially all tissues (Kadonaga et al., 1986), suggesting that its regulation would be governed, at least in part, by the Sp1 binding sites like that of the beta1,4-galactosyltransferase (Harduin-Lepers et al., 1993). Further work is required to confirm this mechanism.

Chromosomal assignments have been reported for several glycosyltransferases including two sialyltransferases, Galbeta1,4GlcNAc alpha2,6-sialyltransferase and NeuAcalpha2,3Galbeta1, 4Glcbeta1-1`Cer alpha2,8-sialyltransferase, which reside at human chromosome 3q27-q28 and on human chromosome 12, respectively (Kleene and Berger, 1993; Wang et al., 1993; Sasaki et al., 1994). The present study demonstrates that the two additional sialyltransferases, the Galbeta1,3GalNAc/Galbeta1,4GlcNAc alpha2,3-sialyltransferase and the Galbeta1,3(4)GlcNAc alpha2,3-sialyltransferase, are also localized on entirely different human chromosomes, 11q23-q24 and 1p34-33, respectively, despite the fact that their four genes share the highly conserved region, sialylmotif. These results strongly suggest that the four sialyltransferases diverged from an ancestor gene early in evolution. It remains to be determined whether the rest of the sialyltransferase genes are likewise dispersed in the human genome.


FOOTNOTES

*
This work was supported in part by United States Public Health Service Grant GM27904. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore by hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBank(TM)/EMBL Data Bank with accession number(s) L29553[GenBank].

§
Present address: Dept. of Biochemistry, Kobe Pharmaceutical University, Higashinada-ku, Kobe 658, Japan.

To whom correspondence should be addressed: 3525 John Hopkins Ct., San Diego, CA 92121. Tel.: 619-552-3002; Fax: 619-552-8801.

(^1)
The abbreviations used are: kb, kilobase(s); bp, base pair(s); PCR, polymerase chain reaction; RACE, rapid amplification of cDNA ends; MAF, mammary cell activating factor.

(^2)
H. Kitagawa and J. C. Paulson, unpublished data.


REFERENCES

  1. Aasheim, H.-C., Aas-Eng, D. A., Deggerdal, A., Blomhoff, H. K., Funderud, S., and Smeland, E. B. (1993) Eur. J. Biochem. 273, 467-475
  2. Bierhuizen, M. F. A., Mattei, M.-G., and Fukuda, M. (1993) Genes & Dev. 7, 468-478
  3. Blackwell, T. K., and Weintraub, H. (1990) Science 250, 1104-1110 [Medline] [Order article via Infotrieve]
  4. Brandley, B. K., Swiedler, S. J., and Robbins, P. W. (1990) Cell 63, 861-863 [Medline] [Order article via Infotrieve]
  5. Breathnach, R., and Chambon, P. (1981) Annu. Rev. Biochem. 50, 349-383 [CrossRef][Medline] [Order article via Infotrieve]
  6. Chang, M. L., Eddy, R. L., Shows, T. B., and Lau, J. L. (1995) Glycobiology 5, 319-325 [Abstract]
  7. Cox, P. M., Temperley, S. M., Kumar, H., and Goding, C. R. (1988) Nucleic Acids Res. 16, 11047-11056 [Abstract]
  8. Eckhardt, M., Muhlenhoff, M., Bethe, A., Koopman, J., Frosch, M., and Gerardy-Schahn, R. (1995) Nature 373, 715-718 [CrossRef][Medline] [Order article via Infotrieve]
  9. Faisst, S., and Meyer, S. (1992) Nucleic Acids Res. 20, 3-26 [Medline] [Order article via Infotrieve]
  10. Gillespie, W., Kelm, S., and Paulson, J. C. (1992) J. Biol. Chem. 267, 21004-21010 [Abstract/Free Full Text]
  11. Haraguchi, M., Yamashiro, S., Yamamoto, A., Furukawa, K., Takamiya, K., Lloyd, K. O., Shiku, H., and Furukawa, K. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 10455-10459 [Abstract/Free Full Text]
  12. Hardon, E. M., Frain, M., Paonessa, G., and Cortese, R. (1988) EMBO J. 7, 1711-1719 [Abstract]
  13. Harduin-Lepers, A., Shaper, J. H., and Shaper, N. L. (1993) J. Biol. Chem. 268, 14348-14359 [Abstract/Free Full Text]
  14. Hollis, G. F., Douglas, J. G., Shaper, N. L., Shaper, J. H., Stafford-Hollis, J. M., Evans, R. J., and Kirsch, I. R. (1989) Biochem. Biophys. Res. Commun. 162, 1069-1075 [Medline] [Order article via Infotrieve]
  15. Hull, E., Sarkar, M., Spruiji, M. P. N., Hoppener, J. W. M., Dunn, R., and Schachter, H. (1991) Biochem. Biophys. Res. Commun. 176, 608-615 [Medline] [Order article via Infotrieve]
  16. Ihara, Y., Nishikawa, A., Tohma, T., Soejima, H., Nikawa, N., and Taniguchi, N. (1993) J. Biochem. (Tokyo) 113, 692-698 [Abstract]
  17. Joziasse, D. H. (1992) Glycobiology 2, 271-277 [Abstract]
  18. Joziasse, D. H., Shaper, N. L., Kim, D., Van den Eijnden, D. H., and Shaper, J. H. (1992) J. Biol. Chem. 267, 5534-5541 [Abstract/Free Full Text]
  19. Kadonaga, J. T., Jones, K. A., and Tjian, R. (1986) Trends Biochem. Sci. 11, 20-23 [CrossRef]
  20. Kageyama, R., Merlino, G. T., and Pastan, I. (1989) J. Biol. Chem. 264, 15508-15514 [Abstract/Free Full Text]
  21. Kitagawa, H., and Paulson, J. C. (1993) Biochem. Biophys. Res. Commun. 194, 375-382 [CrossRef][Medline] [Order article via Infotrieve]
  22. Kitagawa, H., and Paulson, J. C. (1994a) J. Biol. Chem. 269, 1394-1401 [Abstract/Free Full Text]
  23. Kitagawa, H., and Paulson, J. C. (1994b) J. Biol. Chem. 269, 17872-17878 [Abstract/Free Full Text]
  24. Kleene, R., and Berger, E. G. (1993) Biochim. Biophys. Acta 1154, 283-325 [Medline] [Order article via Infotrieve]
  25. Kojima, N., Yoshida, Y., Kurosawa, N., Lee, Y.-C., and Tsuji, S. (1995) FEBS Lett. 360, 1-4 [CrossRef][Medline] [Order article via Infotrieve]
  26. Kornfeld, S. (1987) FASEB J. 1, 462-468 [Abstract/Free Full Text]
  27. Kurosawa, N., Hamamoto, T., Lee, Y.-C., Nakaoka, T., Kojima, N., and Tsuji, S. (1994a) J. Biol. Chem. 269, 1402-1409 [Abstract/Free Full Text]
  28. Kurosawa, N., Kojima, N., Inoue, M., Hamamoto, T., and Tsuji, S. (1994b) J. Biol. Chem. 269, 19048-19053 [Abstract/Free Full Text]
  29. Lee, W., Mitchell, P., and Tjian, R. (1987) Cell 49, 741-752 [Medline] [Order article via Infotrieve]
  30. Lee, Y.-C., Kojima, N., Wada, E., Kurosawa, N., Nakaoka, T., Hamamoto, T., and Tsuji, S. (1994) J. Biol. Chem. 269, 10028-10033 [Abstract/Free Full Text]
  31. Livingston, B. D., and Paulson, J. C. (1993) J. Biol. Chem. 268, 11504-11507 [Abstract/Free Full Text]
  32. Lowe, J. B., Kukowska-Latallo, J. F., Nair, R. P., Larsen, R. D., Marks, R. M., Macher, B. A., Kelly, R. J., and Ernst, L. K. (1991) J. Biol. Chem. 266, 17467-17477 [Abstract/Free Full Text]
  33. Lowings, P., Yavuzer, U., and Goding, C. R. (1992) Mol. Cell. Biol. 12, 3653-3662 [Abstract]
  34. Mink, S., Hartig, E., Jennewein, P., Doppler, W., and Cato, A. C. B. (1992) Mol. Cell. Biol. 12, 4906-4918 [Abstract]
  35. Mitchell, P. J., Wang, C., and Tjian, R. (1987) Cell 50, 847-861 [Medline] [Order article via Infotrieve]
  36. Nara, K., Watanabe, Y., Maruyama, K., Kasahara, K., Nagai, Y., and Sanai, Y. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 7952-7956 [Abstract]
  37. Natsuka, S., and Lowe, J. B. (1994) Curr. Opin. Struct. Biol. 4, 683-691
  38. Nguyen, C., Mattei, M.-G., Mattei, J.-F., Santoni, M.-J., Goridis, C., and Jordan, B. R. (1986) J. Cell Biol. 102, 711-715 [Abstract]
  39. O'Hanlon, T. P., and Lau, J. T. Y. (1992) Glycobiology 2, 257-266 [Abstract]
  40. Paonessa, G., Goanari, F., Frank, R., and Cortese, R. (1988) EMBO J. 7, 3115-3123 [Abstract]
  41. Paulson, J. C. (1989) Trends Biochem. Sci. 14, 272-276 [CrossRef][Medline] [Order article via Infotrieve]
  42. Paulson, J. C., Weinstein, J., and Schauer, A. (1989) J. Biol. Chem. 264, 10931-10934 [Abstract/Free Full Text]
  43. Powell, L. D., and Varki, A. (1995) J. Biol. Chem. 270, 14243-14246 [Free Full Text]
  44. Rademacher, T. W., Parekh, R. B., and Dwek, R. A. (1988) Annu. Rev. Biochem. 57, 785-838 [CrossRef][Medline] [Order article via Infotrieve]
  45. Reddy, S., Ozgur, K., Lu, M., Chang, W., Mohan, S. R., Kumar, C. C., and Ruley, H. E. (1990) J. Biol. Chem. 265, 1683-1687 [Abstract/Free Full Text]
  46. Russo, R. N., Shaper, N. L., and Shaper, J. H. (1990) J. Biol. Chem. 265, 3324-3331 [Abstract/Free Full Text]
  47. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual , 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  48. Sasaki, K., Watanabe, E., Kawashima, K., Sekine, S., Dohi, T., Oshima, M., Hanai, N., Nishi, T., and Hasegawa, M. (1993) J. Biol. Chem. 268, 22782-22787 [Abstract/Free Full Text]
  49. Sasaki, K., Kurata, K., Kojima, N., Kurosawa, N., Ohta, S., Hanai, N., Tsuji, S., and Nishi, T. (1994) J. Biol. Chem. 269, 15950-15959 [Abstract/Free Full Text]
  50. Svensson, E. C., Soreghan, B., and Paulson, J. C. (1990) J. Biol. Chem. 265, 20863-20868 [Abstract/Free Full Text]
  51. Svensson, E. C., Conley, P. B., and Paulson, J. C. (1992) J. Biol. Chem. 267, 3466-3472 [Abstract/Free Full Text]
  52. Varki, A. (1992) Curr. Opin. Cell Biol. 4, 257-266 [Medline] [Order article via Infotrieve]
  53. Wang, X. C., O'Hanlon, T. P., Young, R. F., and Lau, J. T. Y. (1990) Glycobiology 1, 25-31 [Abstract]
  54. Wang, X. C., Vertino, A., Eddy, R. L., Byers, M. G., Jani-Sait, S. N., Shows, T. B., and Lau, J. T. Y. (1993) J. Biol. Chem. 268, 4355-4361 [Abstract/Free Full Text]
  55. Weinstein, J., Lee, E., McEntee, K., Lai, P., and Paulson, J. C. (1987) J. Biol. Chem. 262, 17735-17743 [Abstract/Free Full Text]
  56. Wen, D. X., Svensson, E. C., and Paulson, J. C. (1992a) J. Biol. Chem. 267, 2512-2518 [Abstract/Free Full Text]
  57. Wen, D. X., Livingston, B. D., Medzihradzky, K. F., Kelm, S., Burlingame, A. L., and Paulson, J. C. (1992b) J. Biol. Chem. 267, 21011-21019 [Abstract/Free Full Text]
  58. Weston, B. W., Nair, R. P., Larsen, R. D., and Lowe, J. B. (1992a) J. Biol. Chem. 267, 4152-4160 [Abstract/Free Full Text]
  59. Weston, B. W., Smith, P. L., Kelly, R. J., and Lowe, J. B. (1992b) J. Biol. Chem. 267, 24575-24584 [Abstract/Free Full Text]
  60. Yoshida, Y., Kojima, N., Kurosawa, N., Hamamoto, T., and Tsuji, S. (1995) J. Biol. Chem. 270, 14628-14633 [Abstract/Free Full Text]

©1996 by The American Society for Biochemistry and Molecular Biology, Inc.