©1995 by The American Society for Biochemistry and Molecular Biology, Inc.
Mouse 1- and 2-Syntrophin Gene Structure, Chromosome Localization, and Homology with a Discs Large Domain (*)

(Received for publication, April 29, 1995; and in revised form, July 6, 1995)

Marvin E. Adams (1) Timothy M. Dwyer (1) Lisa L. Dowler (2) Robert A. White (2) Stanley C. Froehner (1)(§)

From the  (1)Department of Physiology, University of North Carolina, Chapel Hill, North Carolina 27599-7545 and the (2)Section of Genetics, Children's Mercy Hospital/University of Missouri at Kansas City School of Medicine, Kansas City, Missouri 64108

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES

ABSTRACT

The syntrophin family of dystrophin-associated proteins consists of three isoforms, alpha1, beta1, and beta2, each encoded by a distinct gene. We have cloned and characterized the mouse alpha1- and beta2-syntrophin genes. The mouse alpha1-syntrophin gene (>24 kilobases) is comprised of eight exons. The mouse beta2-syntrophin gene (>33 kilobases) contains seven exons, all of which have homologues at the corresponding position in the alpha1-syntrophin gene. Primer extension analysis reveals two transcription initiation sites in the alpha1-syntrophin gene and a single site in the beta2-syntrophin gene. The sequence immediately 5` of the transcription start sites of both genes lacks a TATA box but is GC-rich and has multiple putative SP1 binding sites. The alpha1-syntrophin gene is located on human chromosome 20 and mouse chromosome 2, while the beta2-syntrophin gene is on human chromosome 16 and mouse chromosome 8. Analysis of the amino acid sequence of the syntrophins reveals the presence of four conserved domains. The carboxyl-terminal 56 amino acids are highly conserved and constitute a syntrophin unique domain. Two pleckstrin homology domains are located at the amino-terminal end of the protein. The first pleckstrin homology domain is interrupted by a domain homologous to repeated sequences originally found in the Drosophila discs-large protein.


INTRODUCTION

Syntrophin is a peripheral membrane protein of M(r) 58,000 that was first identified in the postsynaptic membrane of Torpedo electric organ and subsequently shown to be present in many mammalian tissues(1) . Interest in syntrophin came first from its location at the neuromuscular junction and more recently from the demonstration that it is directly associated with dystrophin, the product of the Duchenne/Becker muscular dystrophy gene. Although the precise function of syntrophin is unknown, a potential role for the dystrophin-associated proteins in agrin-stimulated nicotinic acetylcholine receptor clustering has implicated syntrophin in the process of synaptogenesis(2) .

Three different but highly conserved syntrophin isoforms encoded by distinct genes have been identified and cloned. Each syntrophin has approximately 50% amino acid identity with the other two(3) . The three syntrophins can be separated into two classes based on isoelectric point(4) . The acidic isoform, alpha1-syntrophin, (pI 6.7) has been cloned from Torpedo, mouse, rabbit, and human (5, 6, 7) . There are two basic forms, beta1- and beta2- syntrophin (pI 9.0). A full-length human cDNA encoding beta1-syntrophin has been cloned, and the gene has been localized to human chromosome 8q23-24 (8) . Partial clones encoding mouse and human beta2-syntrophin have been reported previously(5, 8) .

The function of syntrophin is likely to be related to its association with dystrophin and other members of the dystrophin protein family (9, 10, 11, 12, 13) . Proteins of the dystrophin family are derived from a combination of three genes, the use of alternative promoters within these genes, and alternative splicing. Dystrophin, the major product in skeletal muscle, is a 427-kDa protein with an actin-binding amino terminus, 24 spectrin-like coiled coil repeats, a cysteine rich (CR) (^1)region and a unique carboxyl terminus (CT)(14, 15) . Shorter forms of dystrophin that contain CRCT and either no (Dp 71) or only a few spectrin repeats (e.g. Dp 116) are produced from the same gene via internal promoters (for review, see (54) ). Utrophin is highly related to dystrophin but is encoded by a separate gene(16, 17) . A third gene encodes a protein of M(r) 87,000 from Torpedo that has modest but significant homology with the CRCT of dystrophin(18) . The association of syntrophin with each of these members of the dystrophin family suggests that it has a general role in their functions at the membrane(9) .

Most members of the dystrophin protein family exhibit highly restricted tissue distributions. The exception is utrophin, which, as its name implies, is nearly ubiquitous. Dystrophin is expressed primarily in skeletal muscle but can also be detected in cardiac muscle and brain. Dp 71 is found primarily in brain glial cells, liver, and stomach (19, 20, 21) . Dp 116 is found only in glial cells, particularly the Schwann cells(22) . Other recently described proteins derived from alternate promoters within the dystrophin gene also appear to have restricted tissue distributions(23, 24) .

The three syntrophins also show unique expression patterns. Northern blot analysis has shown that alpha1-syntrophin, like dystrophin, is expressed primarily in skeletal muscle but is also present in cardiac muscle, kidney, and brain(5) . In skeletal muscle, the subcellular distribution of alpha1-syntrophin is virtually identical to that of dystrophin. Both proteins are associated with the sarcolemma and concentrated at the neuromuscular junction(25) . beta2-Syntrophin message is expressed at highest levels in testis, brain, cardiac muscle, kidney, and lung(5) , but only at low levels in skeletal muscle where the protein is restricted to the neuromuscular junction(25) . beta1-Syntrophin message is found primarily in liver with moderate levels in kidney, skeletal muscle, and lung but at very low levels in brain and cardiac muscle(8) . These unique patterns of syntrophin are most likely a result of tissue-specific transcription. The predominant interactions between syntrophins and dystrophin family members may reflect these differential tissue expressions and thus subserve the specific functional requirements of that tissue.

As a prelude to understanding the transcriptional control of syntrophin, we report the cloning and characterization of the genes encoding two of the three syntrophins. We have mapped the loci for alpha1-syntrophin (gene symbol Snta1) and beta2-syntrophin (gene symbol Sntb2) to the mouse and human chromosomes in order to identify potential links between mutations in these genes and known genetic disorders. Finally, analysis of the full-length amino acid sequence of the syntrophins has revealed the presence of two pleckstrin homology (PH) domains(26) , a PDZ domain (so named to indicate its presence in postsynaptic density protein-95 (PSD-95), the Drosophila discs large tumor suppresser protein, and the zonula occludens-1 protein (ZO-1), (^2)and a highly conserved carboxyl-terminal syntrophin unique (SU) domain.


EXPERIMENTAL PROCEDURES

Isolation of Genomic Clones

Genomic clones encoding alpha1- and beta2-syntrophin were isolated from a 129SV mouse genomic DNA library in Fix II (Stratagene, La Jolla, CA) by screening 10^6 phage with each of the respective cDNAs as described previously(5) . Twelve alpha1-syntrophin clones and 17 beta2-syntrophin clones were isolated. Based on restriction mapping and hybridization to cDNA fragments, four alpha1- and five beta2-syntrophin clones were selected for further analysis. Restriction fragments of the genomic clones that hybridized to cDNA sequence were subcloned into pUC 19 for sequencing. None of the beta2-syntrophin clones contained exon 5, and repeated screening proved unsuccessful in isolating this part of the gene. Therefore, this exon and flanking sequence was obtained by amplification of mouse genomic DNA using the GeneAmp XL PCR kit (Perkin Elmer) and primer sets that spanned intron 4 (5`-AGCAGTGGAGGCCTGTTCTCATGG; 5`-GACTTCCTTGATCAGCTCAGCAGC) and intron 5 (5`-GGTTCATTCAGGTTCTGGATGTCGG; 5`-TCCATCATCAGCAGACATCTTCAGC). Intron lengths were determined by a combination of sequencing, restriction mapping of the genomic clones, and PCR analysis of mouse genomic DNA.

DNA Sequencing and Analysis

DNA sequencing was performed manually (27) and by the University of North Carolina, Chapel Hill, Automated DNA Sequencing Facility on a model 373A DNA sequencer (Applied Biosystems). Sequence assembly and analysis was performed using DNA Star Lasergene computer software. The BLAST network service through the National Center for Biotechnology Information was used for databank searches.

Primer Extension

Total cellular RNA was isolated from adult mouse (C57BL/6J) tissue by the method of Chomczynski and Sacchi(28) . Oligonucleotide primers specific for each isoform (alpha1-syntrophin 5`-CCTCGAAGTCCCAGTAC; beta2-syntrophin 5`-CTTAGAAGCAGCTCCACCAGCCCCGCTTTGG) were labeled with [-P]ATP using T4 polynucleotide kinase (New England Biolabs, Beverly, MA). Labeled primer (0.15 pmol) was combined with 30 µg of total cellular RNA, heated to 80 °C for 3 min, and allowed to anneal for 2 h at 47 °C (alpha1-syntrophin) or 65 °C (beta2-syntrophin) in 20 µl of 10 mM Tris, pH 7.5, 250 mM KCl, 1 mM EDTA, 10 mM dithiothreitol, and 0.5 units/µl human placental RNasin (Promega, Madison, WI). Reverse transcription was performed for 1 h at 42 °C in 75 mM Tris, pH 7.5, 83 mM KCl, 0.3 mM EDTA, 15 mM dithiothreitol, 12 mM MgCl(2), 0.5 units/µl RNasin, 0.5 mM dNTP, 3.3 units/µl Moloney murine leukemia virus reverse transcriptase (Life Technologies, Inc.) and 33 µg/ml actinomycin D (Sigma). After extension, samples were treated with 1 µg of RNase A (Life Technologies, Inc.) and 10 mM EDTA for 30 min at 37 °C. The resulting cDNA was precipitated with ethanol, resuspended in formamide, run on a 6% polyacrylamide sequencing gel, and subjected to autoradiography. A second primer extension method was also used to confirm the alpha1-syntrophin transcription start sites(29) .

Chromosome Localization

Genomic DNAs from C57BL/6J, Mus spretus, and a (M. Spretus times C57BL/6J) F1 times M. spretus back-cross DNA panel consisting of 94 DNAs were obtained from The Jackson Laboratory (Bar Harbor, ME)(30) . Southern blots and hybridizations were carried out as described previously(31) . Approximately 5 µg of genomic DNAs from the C57BL/6J and M. spretus progenitors were digested with 28 different restriction enzymes to find a suitable restriction fragment length polymorphism (RFLP) for mapping. Southern blots were probed with either mouse alpha1-syntrophin or beta2-syntrophin cDNA(5) . Approximately 2 µg of DNA from the back-cross panel was digested for each sample with HincII overnight. Segregation of Snta1 and Sntb2 alleles was compared with other loci from a data base at The Jackson Laboratory Backcross DNA map Panel Service(30) . The gene symbols Snta1 and Sntb2 have been approved by the International Mouse Nomenclature Committee.

To identify the human chromosomes encoding syntrophin, a hamster-human somatic cell hybrid panel (BIOS, New Haven, CT) was screened with the alpha1- and beta2-syntrophin cDNAs according to a Southern blot protocol described previously(5) .


RESULTS

Exon/Intron Structure of Mouse alpha1- and beta2-Syntrophin Genes

In order to analyze the syntrophin gene structure and identify putative transcriptional regulatory elements, genomic clones encoding alpha1- and beta2-syntrophin were isolated from a 129SV mouse phage library. The sequence of each exon and adjacent sequence was determined using sequencing primers derived from the cDNA. The relative positions and sizes of the exons and introns for each gene are shown in Fig. 1.


Figure 1: Exon/intron structure of the alpha1- and beta2-syntrophin genes. The lengths of the exons (open boxes) are given in number of base pairs, and the intron lengths are given in kilobase pairs. Introns are not drawn to scale. The lengths of all exons and alpha1-syntrophin introns IV and VII were determined by sequencing. Two sizes for alpha1-syntrophin exon 1 are possible, depending on the transcription start site. Alternative splicing potentially produces two sizes of alpha1-syntrophin exon 6 (see text). The lengths of the remaining alpha1-syntrophin introns and of beta2-syntrophin intron VI were estimated by restriction mapping of genomic clones, and the lengths of beta2-syntrophin introns I-V were determined by PCR of mouse genomic DNA. The positions of the start (ATG) and stop (TGA) codons are shown.



The alpha1-syntrophin gene is over 24 kb in length and contains eight exons. The smallest is exon 5 (131 bp) and the longest is exon 8 (580 bp). The length of exon 1 is either 367 or 408 bp depending on the site used for transcription initiation (see below). The length of exon 6 is dependent on the 3` splice site used by intron 5. One of the cDNA clones previously isolated, BC10, encoded an additional 4 amino acids (SSAH) not present in three other independently isolated clones(5) . The 12 nucleotides encoding this sequence could be included in exon 6 by using an alternative 3` splice site (Fig. 2). This splice occurs at a 3` TG instead of the highly conserved AG dinucleotide and is therefore likely to be a rare splicing event. The additional 12 nucleotides were observed in only one of four mouse cDNA clones and are not present in the Torpedo(5) , rabbit(6) , or human (7) alpha1-syntrophin cDNAs, further suggesting that this is a rare splicing event. Exon 8 contains the TAG stop codon followed by 490 bp of 3`-untranslated sequence, which includes the polyadenylation signal sequence.


Figure 2: Intron border sequences of the alpha1- and beta2-syntrophin genes. The nucleotide and corresponding amino acid exon sequences immediately before and after each intron are shown in upper case. Intronic sequence is shown in lower case. The conserved GT and AG dinucleotides that begin and end each intron are in boldface. Intron 5 of the alpha1-syntrophin gene contains a second 3` splice site 12 nucleotides prior to the first splice site that can result in the incorporation of 4 additional amino acids (see text).



The beta2-syntrophin gene is over 33 kb long and contains seven exons. The smallest is exon 4 with 143 bp, and the largest, exon 7, is over 1690 bp. The exact size of exon 7 is unknown because no polyadenylation signal sequence was found in either the genomic clone or the 3` end of the cDNA(5) . The positions of the introns relative to the amino acid sequence and the exon/intron border sequence of both syntrophin genes are shown in Fig. 2. All introns have the conserved GT and AG dinucleotides present at the donor and acceptor sites, respectively.

The exon sequences of the alpha1-syntrophin and beta2-syntrophin genes were identical to the sequences of the previously characterized cDNAs(5) . The cDNA for mouse beta2-syntrophin previously reported (5) was missing a substantial portion of the 5` sequence. Genomic clones containing beta2-syntrophin exon 1 extended the cDNA sequence by 327 bp, including 264 bp of coding region (88 amino acids). We have also isolated a beta2-syntrophin cDNA clone with sequence corresponding to that of exon 1. The cDNA sequence contains the exon 1 coding sequence, but its 5` untranslated region is 15 bp shorter than that of the genomic sequence (data not shown). The full-length amino acid sequence of beta2-syntrophin derived from the exon 1 sequence coupled with the previously reported cDNA sequence (5) is shown in Fig. 3. The additional amino acids derived from the genomic sequence are underlined. The first 5` methionine codon in frame with the previously reported beta2-syntrophin sequence is in a context favorable for translation initiation(32) . The beta2-syntrophin encoded by this full-length sequence is 520 amino acids long with a calculated molecular mass of 56,388.5 Da. The full-length mouse beta2-syntrophin amino acid sequence shares 46% identity with mouse alpha1-syntrophin and 55% identity with human beta1-syntrophin. beta2-Syntrophin was classified as a basic (beta) form because the partial sequence had indicated a high isoelectric point. The calculated pI of 8.7 for the full-length protein indicates that it is correctly placed among the beta-syntrophins.


Figure 3: Complete amino acid sequence of mouse beta2-syntrophin. The complete sequence of mouse beta2-syntrophin was derived from the novel amino-terminal 88 amino acids translated from the genomic sequence (underlined) linked to the previously published partial cDNA sequence(5) .



Transcription Initiation Sites

The transcription initiation sites of both genes were determined by primer extension and confirmed by ribonuclease protection assays. RNA isolated from skeletal muscle and testis (the richest sources of alpha1- and beta2-syntrophin message, respectively) were used for these studies. Ribonuclease protection assays showed two bands using an alpha1-syntrophin probe and a single band using a beta2-syntrophin probe (data not shown). Using primer extension assays, two initiation sites were identified in the alpha1-syntrophin gene at nucleotide positions -76 and -117 relative to the translation initiator ATG codon (Fig. 4, left panel). The primer extension band at position -117 is more intense than that at -76 and is therefore likely to be the major site of initiation. A single site was identified in the beta2-syntrophin gene at position -65 (Fig. 4, right panel). A similar experiment using RNA isolated from heart produced a band at position -86, although brain and kidney RNA showed only the -65 band (data not shown).


Figure 4: Primer extension analysis of the transcription initiation sites of the alpha1- and beta2-syntrophin gene. Primer extension of mouse skeletal muscle RNA using a primer to alpha1-syntrophin gives two bands (arrows) separated by 41 nucleotides (lanes 1 and 2). Lanes 1 and 2 represent primer extension products obtained with two different protocols (see ``Experimental Procedures''). A potential third band seen in lane 1 was not confirmed in either the second primer extension (lane 2) or in ribonuclease protection assays (data not shown). Extension of mouse testis RNA with a primer to beta2-syntrophin results in a single major band (lane 3). The marker sequence (M) was derived from M13mp18 and is loaded in the order GATC.



Sequence of the Promoter Region

The sequence 5` of the alpha1- and beta2-syntrophin coding region is shown in Fig. 5. Neither gene contains a standard TATA box, but both have multiple GGGCGG elements (putative SP1 binding sites). The alpha1-syntrophin promoter contains two CANNTG E-box motifs at positions -240 and -659. E-box elements in many other promoters bind members of the basic helix-loop-helix family of regulatory factors, often regulating the expression of muscle-specific genes(33) . Other putative transcription regulatory elements in the alpha1-syntrophin gene include a GGAA core purine-rich Ets-1 site at position -262(34) , a CCAAT box at position -377, and an 8-nucleotide inverted repeat starting at nucleotides -625 and -595. The beta2-syntrophin gene contains an Ets-1 site at position -183, two CCAAT boxes at nucleotides -407 and -539, a CArG box at -482(35) , a CANNTG at -629, and a stretch of 25 thymidines beginning at position -440.


Figure 5: Promoter region sequence of the alpha1- and beta2-syntrophin genes. The nucleotide sequence 5` of the translation initiation codon is shown for the alpha1-syntrophin gene (A) and the beta2-syntrophin gene (B). The location of the transcription start sites (bent arrows) and selected putative regulatory elements (underlined) are shown. The alpha1-syntrophin promoter contains an 8-nucleotide inverted repeat (underlined by arrows). The nucleotides are numbered relative to the translation initiation codon.



Chromosomal Mapping of the Snta1 and Sntb2 Genes

The chromosomal locations of the genes encoding mouse alpha1- and beta2-syntrophin were determined by analyzing the segregation of RFLPs in DNAs derived from the offspring of a (M. spretus times C57BL/6J) F(1) times M. spretus back-cross. A HincII RFLP for Snta1 (encoding alpha1-syntrophin) was identified by the presence of 13.5- and 5.4-kb genomic DNA fragments in C57BL/6J or the absence of these fragments in M. spretus (Fig. 6A). This allele was characterized in 88 DNAs from the (C57BL/6J times M. spretus) F(1) times M. spretus back-cross panel. Haplotype analysis of these mapping data (Fig. 6B) indicated that the Snta1 locus is closely linked to D2Mit22 (DNA fragment, MIT 22) on mouse chromosome 2. Allelic segregation patterns for D2Mit22 and Snta1 were identical (no recombinants), indicating a distance of less than 1 cM between these two genes. The calculated map distances between Snta1 and adjacent loci D2Bir21 (DNA fragment BIR 21) and D2Mit52 (DNA fragment MIT 52), including 95% confidence limits, were determined: D2Bir21-6.8 ± 2.7 cM-Snta1-8.0 ± 2.9 cM-D2Mit52.


Figure 6: Mapping of Snta1 to mouse chromosome 2. A, HincII restriction enzyme pattern of M. spretus (S) and C57BL/6J (B) genomic DNAs probed alpha1-syntrophin cDNA. The molecular sizes of the fragments in kb are indicated. B, haplotype analysis of chromosome 2 genetic markers in (C57BL/6J times M. spretus)F(1) times M. spretus back-cross mice showing linkage and relative position of Snta1. Closed boxes indicate inheritance of the C57BL/6J (B) allele, and open boxes indicate the inheritance of the M. spretus (S) allele from the (C57BL/6J times M. spretus)F(1) parent. Gene names and reference to these loci can be found in GBASE(48) . The first two columns indicate the number of back-cross progeny with no recombinations. The following columns indicate recombinational events between adjacent loci (signified by a change from open box to closed box). The number of recombinants are listed below each column and the recombination frequency (REC %) between adjacent loci is indicated.



A HincII RFLP was also identified for Sntb2 (encoding beta2-syntrophin) by the presence of a 2.2-kb genomic DNA fragment in C57BL/6J or its absence in M. spretus (Fig. 7A). This allele was characterized in the 88 DNAs from the (C57BL/6J times M. spretus) F(1) times M. spretus back-cross panel. Haplotype analysis of these mapping data was performed and is indicated in Fig. 7B. The Sntb2 locus is closely linked to D8Bir25 (DNA fragment BIR 25) on mouse chromosome 8. The calculated map distances between Sntb2 and adjacent loci D8Bir25 and D8Bir26 (DNA fragment BIR 26), including 95% confidence limits were determined: D8Bir25-1.1 ± 1.1 cM-Sntb2-5.7 ± 2.5 cM-D8Bir26.


Figure 7: Mapping of Sntb2 to mouse chromosome 8. This figure is similar to Fig. 6. A, HincII restriction enzyme pattern for M. spretus (S) and C57BL/6J (B) genomic DNAs probed with beta2-syntrophin cDNA. The molecular sizes of the fragments in kb are indicated. B, haplotype analysis of chromosome 8 genetic markers in (C57BL/6J times M. spretus)F(1) times M. spretus back-cross mice showing linkage and relative position of Sntb2.



The human chromosomes containing the alpha1- and beta2-syntrophin genes were identified using hamster-human and mouse/human hybrid cell lines. Southern blot and PCR analyses indicated that SNTA1 is located on human chromosome 20, and SNTB2 is on human chromosome 16 (data not shown).

Protein Domain Homology

Acquiring the full-length amino acid sequence of beta2-syntrophin enabled us to compare this sequence to that of other syntrophins and with the sequences in GenBank. These comparisons revealed that all three syntrophins are comprised of four domains (Fig. 8). The presence of two pleckstrin homology domains in the syntrophins and comparison of their sequences with those of other proteins with similar domains has been reported previously(26) . In addition, we and Lue et al.(36) have found that a 90-amino acid segment of syntrophin is homologous to the PDZ domain. The alignment of this syntrophin domain with the PDZ domains of PSD-95, human and Drosophila discs large protein, ZO-1, and selected other proteins shows that the syntrophin PDZ domain contains many of the defining elements of this motif, including the conserved GLG(F/I) sequence (Fig. 8A). Finally, the carboxyl-terminal 56 amino acids of syntrophin are highly conserved across species and isoforms (5, 6, 8) and thus represent a potential fourth domain. This region has no homology with proteins other than the syntrophins and, therefore, constitutes a SU domain.


Figure 8: Alignment of the syntrophin PDZ domain with the PDZ domain of other proteins. A, identical and structurally conserved amino acids are shaded. Proteins included are mouse alpha1- and beta2-syntrophin(5) ; human beta1-syntrophin (8) ; Torpedo alpha1-syntrophin(5) ; Drosophila discs-large protein(49) ; human discs-large protein(36) ; rat postsynaptic density protein (psd95)(46) ; rat nitric-oxide synthase(50) ; human lymphocyte chemoattractant factor (GenBank number M90391); human tyrosine phosphatase(51) ; mouse zonula occludens-1 protein (ZO-1)(52) ; and human erythrocyte p55(53) . B, structural diagram of syntrophin showing the relative locations of the PH domains, the PDZ domain, and the SU domain.




DISCUSSION

The interactions between the syntrophins and dystrophin (and other members of the dystrophin family) is direct(7, 10, 11, 12, 13) . One site of interaction occurs at a region of the cysteine-rich domain encoded by exon 74 of the dystrophin gene(11, 12, 13) , an exon that is subject to alternative splicing(37) . Furthermore, the syntrophins can interact with more than one member of the dystrophin family. For example, alpha1, beta1, and beta2 syntrophins each bind homologous sequences of dystrophin, utrophin, and 87-kDa protein(7) . Thus, the specificity of the interaction between the syntrophin isoforms and the dystrophin family members that is apparent from the differential localization of alpha1 and beta2 syntrophins in skeletal muscle (25) must depend on other factors. These may include posttranslational modifications, associations with other proteins, and transcriptional regulation of expression. The tissue distribution patterns of the syntrophin isoforms and members of the dystrophin protein family implicate tissue-specific transcription as a potential regulator of the dystrophin/syntrophin isoform association. As a first step in investigating syntrophin transcriptional regulation, we have isolated and characterized genomic clones encoding alpha1- and beta2-syntrophin.

The alpha1-syntrophin gene is over 24 kb in length and contains seven introns. The beta2-syntrophin gene is over 33 kb long and contains at least six introns. Comparison of the positions where the introns interrupt the coding sequence shows that all introns in the beta2-syntrophin gene occur at the corresponding position in the alpha1- syntrophin gene ( Fig. 1and Fig. 2). The alpha1-syntrophin gene contains an additional intron dividing the sequence that in beta2-syntrophin is a continuous first exon (Fig. 1). This positioning of introns at similar locations is frequently observed in genes derived from a common ancestor.

The alpha1-syntrophin gene has two major transcription initiation sites, 41 nucleotides apart. cDNAs with 5` ends near each start site were obtained during cDNA cloning(5) . The site at position -76 is 35 nucleotides 5` of the start of cDNA clone BC5, and the site at -117 extends cDNA clone BC10 by 7 nucleotides. Since the two transcripts are only 41 nucleotides different, Northern blots did not resolve the 2 bands, rather a single broad band was observed at 2.4 kb(5) . A single transcription start site was identified in the beta2-syntrophin gene. This suggests that the three sizes of message observed on Northern blots (2, 5, and 10 kb) (5) must result from different sized 3`-untranslated region and/or incomplete removal of introns.

The promoter region of both syntrophin genes is very GC rich, contains no identifiable TATA box, and has multiple putative SP1 binding sites. This type of promoter is often present in housekeeping genes, although the Dp 71 gene has a similar promoter(38) . Expression of syntrophin mRNAs varies greatly among different tissues, suggesting that tissue-specific regulatory elements are present within the gene. A candidate for such an element is the E-box motif present in both genes. Surprisingly, the beta2-syntrophin gene, which is expressed in many tissues but only at low levels in muscle, contains a putative CArG box element that is often present in genes encoding muscle-specific proteins. Functional analysis of each promoter will enable identification of the elements responsible for alpha1- and beta2-syntrophin's unique pattern of expression.

To identify potential associations between known genetic disorders and mutations in syntrophin genes, we have mapped the location of the alpha1- and beta2-syntrophin genes (Snta1 and Sntb2) to both human and mouse chromosomes. Human chromosome mapping using somatic cell hybrid indicated that the SNTA1 and SNTB2 loci are located on human chromosomes 20 and 16, respectively. Our data are in agreement with results from the Kunkel laboratory(7, 8) , which show that the human genes for alpha1, beta1, and beta2 syntrophins are located at 20q11, 8q23-34, and 16q23, respectively.

The mouse syntrophin genes were mapped to loci that are part of long conserved linkage groups between human chromosome 20 and mouse chromosome 2 (for alpha1-syntrophin) or human chromosome 16 and mouse chromosome 8 (for beta2-syntrophin). This information enabled us to compare these locations with those of known mouse neuromuscular genetic disorders. No likely candidates were found within a reasonable distance of the alpha1-syntrophin gene. A potential candidate for a beta2-syntrophin mutation may be the myodystrophic (Myd) mouse(39) . Myd harbors the mutation responsible for the myodystrophic phenotype and has been mapped to chromosome 8 at a site 3 cM from the beta2-syntrophin gene. Given the error limits of genetic mapping, the Myd locus and the beta2-syntrophin gene cannot be resolved as separate genes. Experiments are currently underway to determine if beta2-syntrophin is the gene product altered by the Myd mutation.

Analysis of the amino acid sequence of all three syntrophins shows that syntrophin is comprised of four protein domains. Each syntrophin contains two PH domains, a PDZ domain, and a highly conserved, carboxyl-terminal SU domain. The PH domain is a sequence of approximately 100 residues found in many proteins involved in intracellular signaling(26) . Nuclear magnetic resonance and crystal structure analyses have shown that this domain in beta-spectrin and dynamin consists of seven strands of beta-sheet and one alpha-helix(40, 41) . In all three syntrophins, the amino-terminal PH domain is interrupted by the insertion of the PDZ domain in the loop connecting the c and d beta-sheet strands. Thus, the structure of this PH domain is unlikely to be disrupted by the presence of intervening sequence at this site. Only a few other proteins with PH domains contain insertions at this site(26) . The largest of these is the 340-amino acid insertion in the PH domain of phospholipase C, which contains two SH2 domains and an SH3 domain. The syntrophin PDZ is very conserved among the three isoforms and across species from Torpedo to human (Fig. 8). The SU region is also highly conserved among the isoforms and across species(5) .

The roles these domains play in the function of syntrophin is not understood but will likely be aided by functional studies of these motifs in other proteins. The PH domain has been implicated in beta-adrenergic receptor kinase association with G proteins (G)(42) , phospholipase C binding to phosphatidylinositol 4,5-bisphosphate(43) , beta(G) spectrin binding to membranes(44) , and protein kinase C association with Bruton's tyrosine kinase(45) . Thus, the presence of two PH domains in syntrophin raises the possibility that this protein, and therefore the dystrophin complex, may mediate intracellular signaling pathways.

The PDZ domain may be involved in targeting syntrophin to the membrane in a manner similar to that proposed for PSD-95(46) . Recently, two proteins containing PDZ domains have been shown to bind erythroid protein 4.1(36, 47) . Thus, syntrophin could potentially associate with members of the protein 4.1 family. Since syntrophin is the only protein containing a PH domain or a PDZ domain that is known to bind dystrophin, it is unlikely that either of these domains alone is responsible for the syntrophin-dystrophin association. Rather, it appears that the PH and PDZ domains must work in harmony to bind dystrophin or that the dystrophin binding site is located in the SU domain. The immunoprecipitation of dystrophin family proteins by the carboxyl-terminal two thirds of syntrophin (7) further supports the SU region as the dystrophin binding domain. Further studies of the syntrophin domains will allow us to identify the binding sites for dystrophin and potentially for other proteins of the dystrophin complex.


FOOTNOTES

*
This work was supported by grants from the Paul Patton Memorial Trust (to R. A. W.) and the William T. Kemper Foundation (to R. A. W.), a postdoctoral fellowship from the Muscular Dystrophy Association (to M. E. A.), a Muscular Dystrophy Association Research Grant (to S. C. F.), and National Institutes of Health Grant NS 14871 (to S. C. F.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore by hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBank(TM)/EMBL Data Bank with accession number(s) U00678[GenBank], U30900[GenBank], and U30901[GenBank].

§
To whom correspondence should be addressed. Tel.: 919-966-1239; Fax: 919-966-6413; froehner@med.unc.edu.

(^1)
The abbreviations used are: CR, dystrophin cysteine-rich region; CT, dystrophin carboxyl-terminal region; Dp 71, dystrophin protein of 71 kDa; Dp 116, dystrophin protein of 116 kDa; PH, pleckstrin homology domain; PDZ, repeated domain found in postsynaptic density protein-95/Drosophila disks large tumor suppresser protein/zonula occludens-1 protein; SU, syntrophin unique domain; PCR, polymerase chain reaction; RFLP, restriction fragment length polymorphism; kb, kilobase(s); bp, base pair(s); cM, centimorgan.

(^2)
M. B. Kennedy, personal communication.


ACKNOWLEDGEMENTS

We thank The Jackson Laboratory Backcross DNA Map Panel Service especially Lucy Rowe, Mary Barter, and Ed Birkenmeier for supplying the DNA panel and for performing analyses of linkage data. We also thank A. Ahn and L. Kunkel for sharing unpublished data on human syntrophins and B. Popko, R. Sealock, and our laboratory colleagues for helpful discussions and comments on the manuscript.


REFERENCES

  1. Froehner, S. C. (1984) J. Cell Biol. 99, 88-96 [Abstract/Free Full Text]
  2. Sealock, R., and Froehner, S. C. (1994) Cell 77, 617-619 [Medline] [Order article via Infotrieve]
  3. Tinsley, J. M., Blake, D. J., Zuellig, R. A., and Davies, K. E. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 8307-8313 [Abstract]
  4. Yamamoto, H., Hagiwara, Y., Mizuno, Y., Yoshida, M., and Ozawa, E. (1993) J. Biochem. (Tokyo) 114, 132-139 [Abstract]
  5. Adams, M. E., Butler, M. H., Dwyer, T. M., Peters, M. F., Murnane, A. A., and Froehner, S. C. (1993) Neuron 11, 531-540 [Medline] [Order article via Infotrieve]
  6. Yang, B., Ibraghimov-Beskrovnaya, O., Moomaw, C. R., Slaughter, C. A., and Campbell, K. P. (1994) J. Biol. Chem. 269, 6040-6044 [Abstract/Free Full Text]
  7. Ahn, A. H., Feener, C. A., Gussoni, E., Yoshida, M., Ozawa, E., and Kunkel, L. M. (1995) J. Biol. Chem, in press
  8. Ahn, A. H., Yoshida, M., Anderson, M. S., Feener, C. A., Selig, S., Hagiwara, Y., Ozawa, E., and Kunkel, L. M. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 4446-4450 [Abstract]
  9. Kramarcy, N. R., Vidal, A., Froehner, S. C., and Sealock, R. (1994) J. Biol. Chem. 269, 2870-2876 [Abstract/Free Full Text]
  10. Cartaud, A., Stetzkowski-Marden, F., and Cartaud, J. (1993) J. Biol. Chem. 268, 13019-13022 [Abstract/Free Full Text]
  11. Ahn, A. H., and Kunkel, L. M. (1995) J. Cell Biol. 128, 363-371 [Abstract]
  12. Suzuki, A., Yoshida, M., and Ozawa, E. (1995) J. Cell Biol. 128, 373-381 [Abstract]
  13. Yang, B., Jung, D., Rafael, J. A., Chamberlain, J. S., and Campbell, K. P. (1995) J. Biol. Chem. 270, 4975-4978 [Abstract/Free Full Text]
  14. Hoffman, E. P., Brown, R. H., and Kunkel, L. M. (1987) Cell 51, 919-928 [Medline] [Order article via Infotrieve]
  15. Koenig, M., Monaco, A. P., and Kunkel, L. M. (1988) Cell 53, 219-228 [Medline] [Order article via Infotrieve]
  16. Love, D. R., Hill, D. F., Dickson, G., Spurr, N. K., Byth, B. C., Marsden, R. F., Walsh, F. S., Edwards, Y. H., and Davies, K. E. (1989) Nature 339, 55-58 [CrossRef][Medline] [Order article via Infotrieve]
  17. Tinsley, J. M., Blake, D. J., Roche, A., Fairbrother, U., Riss, J., Byth, B. C., Knight, A. E., Kendrick-Jones, J., Suthers, G. K., Love, D. R., Edwards, Y. H., and Davies, K. E. (1992) Nature 360, 591-593 [CrossRef][Medline] [Order article via Infotrieve]
  18. Wagner, K. R., Cohen, J. B., and Huganir, R. L. (1993) Neuron 10, 511-522 [Medline] [Order article via Infotrieve]
  19. Blake, D. J., Love, D. R., Tinsley, J., Morris, G. E., Turley, H., Gatter, K., Dickson, G., Edwards, Y. H., and Davies, K. E. (1992) Hum. Mol. Genet. 1, 103-109 [Abstract]
  20. Lederfein, D., Levy, Z., Augier, N., Mornet, D., Morris, G., Fuchs, O., Yaffe, D., and Nudel, U. (1992) Proc. Natl. Acad. Sci. U. S. A. 89, 5346-5350 [Abstract]
  21. Hugnot, J. P., Gilgenkrantz, H., Vincent, N., Chafey, P., Morris, G. E., Monaco, A. P., Berwald-Netter, Y., Koulakoff, A., Kaplan, J. C., Kahn, A., and Chelly, J. (1992) Proc. Natl. Acad. Sci. U. S. A. 89, 7506-7510 [Abstract]
  22. Tinsley, J. M., Blake, D. J., and Davies, K. E. (1993) Hum. Mol. Genet. 2, 521-524 [Abstract]
  23. Gorecki, D. C., Monaco, A. P., Derry, J. M. J., Walker, A. P., Barnard, E. A., and Barnard, P. J. (1992) Hum. Mol. Genet. 1, 505-510 [Abstract]
  24. D'Souza, V. N., Man, N., Morris, G. E., Karges, W., Pillers, D. M., and Ray, P. N. (1995) Hum. Mol. Genet. 4, 837-842 [Abstract]
  25. Peters, M. F., Kramarcy, N. R., Sealock, R., and Froehner, S. C. (1994) NeuroReport 5, 1577-1580 [Medline] [Order article via Infotrieve]
  26. Gibson, T. J., Hyvonen, M., Birney, E., Musacchio, A., and Saraste, M. (1994) Trends Biochem. Sci. 19, 349-353 [CrossRef][Medline] [Order article via Infotrieve]
  27. Sanger, F., Nicklen, S., and Coulson, A. R. (1977) Proc. Natl. Acad. Sci. U. S. A. 74, 5463-5467 [Abstract]
  28. Chomczynski, P., and Saachi, N. (1987) Anal. Biochem. 162, 156-159 [CrossRef][Medline] [Order article via Infotrieve]
  29. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  30. Rowe, L. B., Nadeau, J. H., Turner, R., Frankel, W. N., Letts, B. A., Eppig, J. T., Ko, M. S. H., Thurston, S. J., and Birkenmeier, E. H. (1994) Mammalian Genome 5, 253-274 [Medline] [Order article via Infotrieve]
  31. White, R. A., Peters, L. L., Adkison, L. R., Korsgren, C., Cohen, C. M., and Lux, S. E. (199 2) Nature Genet. 2, 80-83 [Medline] [Order article via Infotrieve]
  32. Kozak, M. (1986) Cell 44, 283-292 [Medline] [Order article via Infotrieve]
  33. Olson, E. N. (1990) Genes & Dev. 4, 1454-1461
  34. Karim, F. D., Urness, L. D., Thummel, C. S., Klemsz, M. J., McKercher, S. R., Celada, A., Van Bevern, C., Maki, R. A., Gunther, C. V., Nye, J. A., and Graves, B. J. (1990) Genes & Dev. 4, 1451-1453
  35. Minty, A., and Kedes, L. (1986) Mol. Cell. Biol. 6, 2125-2136 [Medline] [Order article via Infotrieve]
  36. Lue, R. A., Marfatia, S. M., Branton, D., and Chishti, A. H. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 9818-9822 [Abstract/Free Full Text]
  37. Feener, C. A., Koenig, M., and Kunkel, L. M. (1989) Nature 338, 509-511 [CrossRef][Medline] [Order article via Infotrieve]
  38. Lederfein, D., Yaffe, D., and Nudel, U. (1993) Hum. Mol. Genet. 2, 1883-1888 [Abstract]
  39. Lane, P. W., Beamer, T. C., and Myers, D. D. (1976) J. Heredity 67, 135-138 [Medline] [Order article via Infotrieve]
  40. Macias, M. J., Musacchio, A., Ponstingl, H., Nilges, M., Saraste, M., and Oschkinat, H. (1994) Nature 369, 675-677 [CrossRef][Medline] [Order article via Infotrieve]
  41. Ferguson, K. M., Lemmon, M. A., Schlessinger, J., and Sigler, P. B. (1994) Cell 79, 199-209 [Medline] [Order article via Infotrieve]
  42. Touhara, K., Inglese, J., Pitcher, J. A., Shaw, G., and Lefkowitz, R. J. (1994) J. Biol. Chem. 269, 10217-10220 [Abstract/Free Full Text]
  43. Cifuentes, M. E., Honkanen, L., and Rebecchi, M. J. (1993) J. Biol. Chem. 268, 11586-11593 [Abstract/Free Full Text]
  44. Davis, L. H., and Bennett, V. (1994) J. Biol. Chem. 269, 4409-4416 [Abstract/Free Full Text]
  45. Yao, L., Kawakami, Y., and Kawakami, T. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 9175-9179 [Abstract]
  46. Cho, K.-O., Hunt, C. A., and Kennedy, M. B. (1992) Neuron 9, 929-942 [Medline] [Order article via Infotrieve]
  47. Marfatia, S. M., Lue, R. A., Branton, D., and Chishti, A. H. (1994) J. Biol. Chem. 269, 8631-8634 [Abstract/Free Full Text]
  48. Maltais, L. J., Doolittle, D. P., Hilyard, A. L., Guidi, J. N., Davisson, M. T., and Roderick, T. H. (1993) GBASE , Jackson Laboratory, Bar Harbor, ME
  49. Woods, D. F., and Bryant, P. J. (1991) Dev. Biol. 134, 222-235
  50. Bredt, D. S., Hwang, P. M., Glatt, C. E., Lowenstein, C., Reed, R. R., and Snyder, S. H. (1991) Nature 351, 714-718 [CrossRef][Medline] [Order article via Infotrieve]
  51. Maekawa, K., Imagawa, N., Nagamatsu, M., and Harada, S. (1994) FEBS Lett. 337, 200-206 [CrossRef][Medline] [Order article via Infotrieve]
  52. Itoh, M., Nagafuchi, A., Yonemura, S., Kitani-Yasuda, T., Tsukita, S., and Tsukita, S. (1993) J. Cell Biol. 121, 491-502 [Abstract]
  53. Ruff, P., Speicher, D. W., and Husain-Chishti, A. (1991) Proc. Natl. Acad. Sci. U. S. A. 88, 6595-6599 [Abstract]
  54. Ahn, A. H., and Kunkel, L. M. (1993) Nature Genet. 3, 283-291 [Medline] [Order article via Infotrieve]

©1995 by The American Society for Biochemistry and Molecular Biology, Inc.