©1996 by The American Society for Biochemistry and Molecular Biology, Inc.
A Large Family of Putative Transmembrane Receptors Homologous to the Product of the Drosophila Tissue Polarity Gene frizzled(*)

(Received for publication, November 1, 1995)

Yanshu Wang (1) (5) Jennifer P. Macke (1) (5) Benjamin S. Abella (1) Katrin Andreasson (2) (3) Paul Worley (2) (3) Debra J. Gilbert (6) Neal G. Copeland (6) Nancy A. Jenkins (6) Jeremy Nathans (1) (5) (2) (4)

From the  (1)Departments of Molecular Biology and Genetics, (2)Neuroscience, (3)Neurology, and (4)Ophthalmology, (5)Howard Hughes Medical Institute, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205 and the (6)Mammalian Genetics Laboratory, ABL Basic Research Program, NCI-Frederick Cancer Research and Development Center, Frederick, Maryland 21702

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES

ABSTRACT

In Drosophila melanogaster, the frizzled gene plays an essential role in the development of tissue polarity as assessed by the orientation of cuticular structures. Through a combination of random cDNA sequencing, degenerate polymerase chain reaction amplification, and low stringency hybridization we have identified six novel frizzled homologues from mammals, at least 11 from zebrafish, several from chicken and sea urchin, and one from Caenorhabditis elegans. The complete deduced amino acid sequences of the mammalian and nematode homologues share with the Drosophila frizzled protein a conserved amino-terminal cysteine-rich domain and seven putative transmembrane segments. Each of the mammalian homologues is expressed in a distinctive set of tissues in the adult, and at least three are expressed during embryogenesis. As hypothesized for the Drosophila frizzled protein, the frizzled homologues are likely to act as transmembrane receptors for as yet unidentified ligands. These observations predict the existence of a family of signal transduction pathways that are homologous to the pathway that determines tissue polarity in Drosophila.


INTRODUCTION

Much of animal development is orchestrated through the transmission of intercellular signals. In Drosophila melanogaster, genetic approaches have identified both receptors and ligands that mediate a number of these intercellular signaling processes. Examples include the receptors Sevenless(1) , Notch(2) , torso(3) , and Toll(4) , and the ligands Hedgehog(5) , and wingless(6, 7) . For several of these Drosophila proteins, vertebrate homologues have been identified, and in each case the homologue(s) appear to play important roles in development(8, 9, 10, 11) . These studies demonstrate a remarkable conservation of biochemical mechanisms between vertebrate and invertebrate cell-cell communication pathways despite differences in the morphologies of the corresponding organisms.

In Drosophila, as in other insects, the cuticle contains a variety of specialized structures, such as bristle sense organs and hairs. At each surface location these extended structures exhibit a predetermined orientation with respect to the body axes. For example, on the wings and legs they point away from the body, and on the thorax and abdomen they point posteriorly. Mutations that alter the polarity of these structures define a set of genes that are referred to as tissue polarity genes(12) . Mutations in frizzled, the most extensively studied tissue polarity gene, disrupt to various extents the polarity of hairs on the wing, leg, thorax, and abdomen(13) . Most alleles also disrupt the orderly arrangement of eye bristles, giving a rough eye phenotype. Genetic mosaic experiments in which homozygous frizzled/frizzled patches are generated in a heterozygous background show that for some alleles the loss of tissue polarity is confined to the mutant patch, whereas for other alleles the loss of tissue polarity also spreads to adjacent cells distal to the mutant patch(14) . These experiments have led Adler and colleagues (14) to propose that the frizzled gene product may be involved in both sending and receiving polarity information.

Strong support for a model in which frizzled acts directly in tissue polarity signaling comes from the sequence of the frizzled gene(15, 16) . This sequence reveals a deduced protein product of 587 amino acids that begins with a putative signal peptide and contains seven putative transmembrane segments. Immunostaining of cells expressing frizzled cDNA shows that the protein accumulates in the plasma membrane and that the amino and carboxyl termini face the extracellular and intracellular spaces, respectively(17) . Although the frizzled protein shares no primary sequence homology with any known receptor, its role in development and its plasma membrane location strongly suggest that it acts as a receptor, a ligand, or both in the transmission of tissue polarity information.

The goal of the present work is to determine the extent to which frizzled-like proteins exist outside of Drosophila. Previous work by Chan et al.(18) identified two closely related rat genes, the protein products of which share approximately 40% amino acid identity with the Drosophila frizzled protein. These genes were identified serendipitously, and their biological functions are currently unknown. In the present work, we show that at least one frizzled-like gene exists in the nematode, Caenorhabditis elegans, and that a large number of frizzled-like genes exist in vertebrates. These observations predict the existence of a family of signal transduction pathways homologous to the pathway that determines tissue polarity in Drosophila.


EXPERIMENTAL PROCEDURES

Random Sequencing of Retina cDNA Clones

From an adult human retina cDNA library in gt10(19) , DNA inserts were isolated en mass by EcoRI cleavage and preparative agarose gel electrophoresis. To sample different regions of each cDNA and to generate smaller DNA segments for reproducible and high efficiency template preparation by PCR, (^1)the cDNA inserts were processed in either of two ways. In the first method, the cDNA inserts were used as a template for DNA synthesis with the Klenow fragment of Escherichia coli DNA polymerase I using 5`-GACGAGATATTAGAATTCTACTCGNNNNNN as a primer (where N indicates a mixture of all four bases). Following heat denaturation and a second round of priming with the same primer, those DNA segments carrying the primer sequence at both ends were PCR amplified with the primer 5`-CCCCCCCCCGACGAGATATTAGAATTCTACTC, cleaved with EcoRI, size fractionated to obtain segments of 400-600 base pairs, and recloned into gt10. In the second method, the cDNA inserts were cut with Tsp509I, size fractionated to obtain segments of 400-600 base pairs, and recloned into gt10. The resulting libraries of size fractionated cDNA fragments were plated at low density, picked into microtiter wells, and screened by hybridization to eliminate clones that contain repetitive DNA or that are derived from the mitochondrial genome. Templates were prepared for sequencing by PCR amplification using primers from gt10 that flank the unique EcoRI site, and DNA sequences were determined using an ABI 373 sequencer. Each sequence was translated in all six reading frames and the resulting protein sequences used to search the GenBank data base with the BLASTX algorithm(20) .

Identification of Frizzled Homologues by Degenerate PCR

Fully degenerate primers with flanking EcoRI or HindIII restriction sites were designed for the amino acid sequences shown in Table 1and used for PCR with Thermus aquaticus polymerase under the following conditions: 1 times (94 °C, 4 min), 35 times (50 °C, 2.5 min; 72 °C, 1.5 min; 94 °C 1 min), 1 times (50 °C, 5 min; 72 °C, 10 min). PCR products were cleaved with EcoRI and HindIII, fractionated by preparative agarose gel electrophoresis, subcloned into pBluescript (Stratagene), and individually sequenced.



cDNA and Genomic Clones

Oligo-dT primed cDNA libraries from the following sources were screened with cloned frizzled PCR products obtained by degenerate PCR: adult human retina in gt10, gestational day 8.5 and 12.5 mouse embryo, and day 18.5 mouse brain in ZAP II (Stratagene). A partial MboI digest mouse genomic DNA library in FIX II (Stratagene) was similarly screened. Hybridization was performed in 30% formamide, 5 times SSC at 42 °C. Inserts were characterized by oligonucleotide hybridization, restriction mapping, and sequencing. The C. elegans cDNA clone cm08b2 (21) encoding Cfz1 was obtained from Dr. Robert Waterston (Washington University) and used to screen a C. elegans partial Sau3A genomic library in EMBL3 (a gift of Dr. Claudia Cummins, University of Wisconsin). The C. elegans frizzled gene was mapped on the C. elegans genome by hybridization to a nitrocellulose filter array of yeast artificial chromosome recombinants provided by Dr. Alan Coulson (MRC Sanger Center, Cambridge, U.K.).

5` End Analysis Using RACE-PCR

RACE-PCR analysis of the mouse fz6 5` end was performed essentially as described by Frohman et al.(22) . In brief, 10 µg of total brain RNA was used as a template for first strand cDNA synthesis primed with a gene-specific internal primer located within the coding region several hundred bases from the 5` end of the longest cDNA clone. First strand cDNA was tailed with terminal transferase in the presence of dATP and used as a template for second strand synthesis with an oligo(dT) primer carrying a 5` EcoRI site. The resulting double stranded cDNA was used as a template for two rounds of nested PCR with gene-specific antisense primers, the last of which carried a flanking EcoRI site. Following EcoRI digestion, the products were cloned into gt10 and screened by hybridization with a nested gene-specific probe. The inserts from multiple hybridizing phage clones were sequenced as described above for random cDNA sequencing. RACE-PCR to define the 5` end of the C. elegans Cfz1 gene was performed using a random primed first strand cDNA template and PCR amplified with a spliced leader-specific sense primer, 5`-GATCGAATTCGGTTTAATTACCCAAGT, and a Cfz1-specific antisense primer. The PCR products were cloned into gt10 and processed as described above.

RNase Protection

Total RNA was prepared from adult mouse brain, eye, heart, kidney, liver, lung, spleen, and testes by homogenization in guanidinium thiocyanate and phenol extraction, followed by centrifugation through 5.7 M cesium chloride (23) . Ten micrograms of total RNA from each tissue or of yeast tRNA was used for the RNase protection assay. Riboprobes were synthesized using either T7 or T3 RNA polymerase on linearized templates in pBluescript. Each mouse frizzled probe contained 150-250 bases from the antisense strand linked to 25-50 bases of vector sequence. Reagents were obtained from Ambion (Austin, TX), and the hybridization and digestion conditions were as recommended by Ambion.

Interspecific Mouse Backcross Mapping

Interspecific backcross (IB) progeny were generated by mating (C57BL/6J times Mus spretus)F1 females and C57BL/6J males as described(24) . A total of 205 N2 mice were used to map the frizzled family members (see ``Results'' for details). DNA isolation, restriction enzyme digestion, agarose gel electrophoresis, Southern blot transfer, and hybridization were performed essentially as described(25) . All blots were prepared with Hybond N nylon membrane (Amersham). The probes used in the mapping studies were labeled with [alpha-P]dCTP using a random primed labeling kit (Stratagene); washing was done to a final stringency of 0.5-1.0 times SSCP, 0.1% SDS, at 65 °C. The RFLP's identified with each probe are listed in Table 3. In all cases, the presence or absence of the M. spretus-specific fragments was followed in backcross mice.



A description of the probes and RFLPs for nearly all the loci linked to the frizzled genes has been reported. These include: Gls, Ctla4, and Fn1 on chromosome 1(26, 27) ; Gnai1 and En2 on chromosome 5(28) ; Tyr on chromosome 7(29) ; Csfg and Wnt3 on chromosome 11(30) ; Ctla1, Gja3, and Nfl on chromosome 14(31, 32) ; and Hspg1 and Myc on chromosome 15 (33) . One locus, olfactory marker protein (Omp), has not been reported previously for the Frederick IB. The probe was kindly provided by Dr. Randall Reed and it detected a 2.2-kb BamHI fragment in C57BL/6J DNA and 3.3-kb BamHI fragment in M. spretus DNA. Recombination distances were calculated as described (34) using the computer program SPRETUS MADNESS. Gene order was determined by minimizing the number of recombination events required to explain the allele distribution patterns.

In Situ Hybridization

Freshly dissected adult brains, whole embryos, or heads were rapidly frozen in plastic molds placed on a dry ice/ethanol slurry and processed for sectioning as described previously (35) . S-Labeled antisense riboprobes were prepared from linearized pBluescript plasmid subclones using either T3 or T7 RNA polymerase. In situ hybridization was performed as described by Saffen et al.(36) , and hybridized sections were exposed to x-ray film and digitized.


RESULTS

Identification of New Members of the Frizzled Family

In an ongoing effort to identify novel genes expressed in the mammalian retina, we have determined partial sequences from 5,000 human retina cDNA clones. (^2)Conceptual translation of one of these sequences showed statistically significant similarity to the protein product of the Drosophila tissue polarity gene frizzled(15) , and to two frizzled-related sequences of unknown function previously identified in the rat, Rfz1 and Rfz2(18) . This partial cDNA sequence, hereafter referred to as Hfz3, showed approximately 40% amino acid identity with each of the three previously reported frizzled sequences. (Throughout this paper the species source of each frizzled sequence will be referred to by the first letter(s) of its abbreviated name: H, human; M, mouse; D, Drosophila; R, rat; Z, zebrafish; C, C. elegans; Ch, chicken; S, sea urchin.)

To search for additional frizzled family members, we used PCR with different pairs of degenerate primers (Table 1) to amplify genomic DNA and cDNA from both vertebrates and invertebrates. Alignment of the Drosophila, rat, and partial human sequences revealed two regions of amino acid identity that were used to design a first set of degenerate PCR primers, YW-157 and YW-158. PCR amplification of mouse genomic DNA and human retina cDNA revealed several novel frizzled family members: Mfz2, Mfz3, Mfz4, Mfz6, Mfz7, and Hfz3, and Hfz5, respectively. A mixture of these cloned PCR products were used in a low stringency screen of day 8.5 and day 12.5 mouse embryo cDNA libraries, a day 18.5 embryonic mouse brain library, and a mouse genomic DNA library. Multiple independent cDNA clones were obtained that fell into five classes: Mfz2, Mfz3, Mfz4, Mfz6, and Mfz7. One-half of the Mfz2 coding region was sequenced and was found to be greater than 96% identical at the amino acid level to Rfz2. Mfz2 is therefore likely to represent the orthologue of Rfz2 (Chan et al., 1992). From the Mfz3 and Mfz4 cDNA clones the complete coding region was sequenced (Fig. 1). For Mfz6, the complete coding region was sequenced from a combination of RACE-PCR products and cDNA clones (Fig. 1).


Figure 1: Amino acid sequences of mammalian and nematode frizzled homologues. The conserved cysteine-rich domains and the transmembrane domains were aligned using GeneWorks software; the divergent signal sequences, linker regions (connecting the cysteine-rich domain and the transmembrane domain), and carboxyl-terminal tails were aligned manually to minimize the total number of gaps. The region of collagen XVIII (37) that has homology to the cysteine-rich domain is aligned beneath that segment. The consensus sequence indicates those residues that occur in five or more of the 10 frizzled family members. Lines above the sequences indicate the locations of putative membrane spanning segments. Dfz1, the frizzled gene of Drosophila melanogaster(15, 16) ; Rfz1 and Rfz2, two rat homologues of frizzled(18) ; M, mouse; H, human; C, C. elegans; coll XVIII, collagen XVIII.



Mouse genomic clones encompassing part or all of the coding region were isolated for Mfz2, Mfz3, Mfz4, Mfz6, Mfz7, and Mfz8; partial genomic DNA sequences were obtained from the Mfz3, Mfz4, and Mfz6 clones and these sequences confirm the cDNA and RACE PCR sequences in the regions in which they overlap. The Mfz7 and Mfz8 coding regions were deduced from genomic DNA sequence as no Mfz7 cDNA clones obtained thus far contain the 5` end of the coding region and Mfz8 cDNA clones have not yet been identified. Conceptual translation of 2.3 and 2.4 kb of contiguous sequence from Mfz7 and Mfz8 genomic clones, respectively, predicts in each case a single long open reading frame that begins with a putative signal sequence and includes all of the characteristic features of a frizzled family member (Fig. 1). Based upon this contiguous homology we tentatively propose that the Mfz7 and Mfz8 genes lack introns within the coding region.

In a parallel screen of an adult human retina cDNA library using PCR products derived from Hfz3 and Hfz5, multiple cDNA clones encoding Hfz3, Hfz5, and Hfz7 were identified. From this collection, the complete sequence of the Hfz5 coding region was determined (Fig. 1). Surprisingly, neither genomic nor cDNA clones were obtained for fz5 orthologues from the mouse library screens described above. However, PCR amplification of mouse genomic DNA using Hfz5-specific primers generated a product, the translated sequence of which is distinct from other mouse frizzled sequences and is 95% identical to Hfz5 over 143 amino acids, indicating that the mouse genome contains a Hfz5 orthologue. This Mfz5 PCR product was used for subsequent RNase protection and in situ hybridization studies (see below).

To examine more fully the diversity of the frizzled family, we searched the NCBI data base of partial cDNA sequences (expressed sequence tags, ESTs) for frizzled homologues. Statistically significant homology was detected in a C. elegans partial cDNA sequence, cm08b2(21) . This cDNA clone was used to identify the corresponding genomic segment from a C. elegans genomic DNA library, and was mapped on the C. elegans genome to yeast artificial chromosomes Y95B8 and Y34D9 on chromosome I. The complete sequence of the mRNA was determined by sequencing this cDNA clone, 5` RACE PCR products, and the exon regions of the corresponding genomic clone (Fig. 1). These sequences reveal a highly divergent frizzled family member that is encoded by nine exons distributed over 8 kb of genomic DNA (Fig. 2). This gene will be referred to as Cfz1.


Figure 2: Intron-exon structure of the nematode frizzled gene, Cfz1. A restriction map derived from a cloned 15.2-kb segment of C. elegans genomic DNA in EMBL3 is shown above the intron-exon structure of the Cfz1 gene. Sequences were obtained across each intron-exon boundary in the cloned genomic DNA, and exon locations were determined by restriction mapping and by PCR amplification of plasmid subclones using exon-specific primers and flanking vector primers. Introns 1-8 begin after the following base pairs in the cDNA (numbered relative to the first nucleotide in the coding region): 128, 270, 483, 624, 716, 825, 1002, and 1457. The message contains a trans-spliced leader joined to exon 1 at position -29. B, BamHI; E, EcoRI; H, HindIII; S, SalI.



As another approach to exploring the diversity of the frizzled family, a set of seven degenerate PCR primers were designed based on an alignment of the sequences of the human and mouse clones described above and the known Drosophila and rat sequences (Table 1; YW164-YW285). These, as well as the original pair of degenerate PCR primers were used for PCR amplification of chicken, sea urchin, and zebrafish genomic DNA. PCR products of the predicted size were obtained from a subset of the PCR reactions, and these were subcloned and sequenced. The resulting sequences revealed two, three, and 13 distinct frizzled-like sequences from chicken, sea urchin, and zebrafish, respectively (Fig. 3). The 13 zebrafish PCR products are partially overlapping and define at least 11 distinct genes.


Figure 3: Partial amino acid sequences of zebrafish, chicken, and sea urchin frizzled homologues. Frizzled-related sequences were PCR amplified from genomic DNA using the degenerate primers listed in Table 1. The names of the PCR products indicate the source of the genomic DNA templates: ZG, zebrafish; ChG, chicken; SG, sea urchin. Individual subcloned PCR products were sequenced on both strands; each sequence shown was confirmed from two or more independent PCR products. The consensus shown below lists those amino acids that occur in the majority of the PCR products in those regions where six or more sequences are available. The aligned sequences begin in transmembrane segment 2 (top left) and end in transmembrane segment 7 (lower right).



The original pair of degenerate PCR primers, YW157 and YW158, was also used to amplify Drosophila genomic DNA. This PCR reaction was designed to be selective for novel frizzled-like sequences, as this region in the Drosophila frizzled gene is known to be interrupted by a large intron(16) . This reaction generated a single PCR product that matched the expected size of a contiguous frizzled coding region. Subsequent work has shown that the corresponding gene, which we will refer to as Dfz2, encodes a frizzled-like protein with less than 35% amino acid identity with the original Drosophila frizzled gene product. (^3)

The sequences of the six novel mouse and human frizzled genes reported here brings to eight the number of distinct frizzled-like sequences identified in mammalian genomes. This probably represents an incomplete sampling of frizzled family members because: 1) the PCR reactions selected for family members with high homology to the original group of frizzled sequences used in primer design, 2) PCR amplification on genomic DNA selected for family members that did not contain introns between the primers, and 3) cleavage of PCR products with EcoRI and HindIII followed by preparative gel electrophoresis eliminated products with internal EcoRI or HindIII sites. In support of this idea, whole genome Southern blot hybridization with ZG05, one of the zebrafish PCR products, reveals a large number of hybridizing fragments in chicken, mouse, and human genomic DNA (Fig. 4). As this region shows significant divergence among the cloned frizzled family members at the DNA level, it is likely that some of the hybridizing fragments are derived from frizzled sequences other than the ones reported here.


Figure 4: Southern blot hybridization of chicken, mouse, and human genomic DNA probed with the zebrafish frizzled PCR product ZG05. The filter was hybridized at 37 °C in 40% formamide and 5 times SSC, and washed at 50 °C in 0.5 times SSC. The amino acid sequence of ZG05 is shown in Fig. 3. H, human; Ch, chicken; M1, mouse cell line BW; M2, mouse Balb/c. Size standards (in kb) are shown to the left.



Implications for Frizzled Structure and Evolution

In Fig. 1, the complete deduced amino acid sequences of Mfz3, Mfz4, Hfz5, Mfz6, Mfz7, Mfz8, and Cfz1 are aligned beneath the sequences of Drosophila frizzled (abbreviated Dfz1), Rfz1, and Rfz2. The lengths of the frizzled proteins range from 525 amino acids (Cfz1) to 709 amino acids (Mfz6). It is apparent from the alignment and from the hydropathy profiles of each frizzled family member (Fig. 5) that there are distinct domains that differ in their degree of amino acid sequence conservation and in their chemical properties. A putative signal peptide is found at the amino termini of all frizzled sequences, consistent with the postulated role of these proteins as integral membrane receptors and with the demonstration that the amino terminus of the Drosophila frizzled protein is extracellular(17) . The putative signal peptide region is not conserved at the level of primary sequence. Following the signal peptide, each frizzled protein has a conserved domain of 120 amino acids containing 10 invariant cysteines (Table 2), suggesting that this region contains a number of disulfide bonds. This cysteine-rich motif has also been found within a non-helical portion of the mouse alpha1 chain of type XVIII collagen(37) . This is the only sequence in the current GenBank data base that shows statistically significant homology with any part of a frizzled sequence. The cysteine-rich domain is followed by a hydrophilic and highly divergent region that ranges in length from 40 amino acids in Mfz4 to 100 amino acids in Mfz8. These characteristics suggest that this divergent region is minimally ordered and that it may serve as an extended linker.


Figure 5: Hydropathy profiles of mammalian and nematode frizzled homologues. Solid lines indicate the putative signal peptide and transmembrane domains. Hydropathy values were calculated using MacVector version 3.5 software according to the Kyte-Doolittle algorithm with a window size of 12 amino acids.





The carboxyl-terminal half of each frizzled protein consists primarily of a 220-250 amino acid region in which seven hydrophobic segments of approximately 20-25 amino acids each are joined by short hydrophilic segments ( Fig. 1and Fig. 5). Pairwise comparisons among mammalian frizzled sequences show that the overall amino acid sequence identity in this region ranges from 37% to 85% (Table 2). Interestingly, several small stretches of amino acid sequence in this region are highly conserved among all of the frizzled family members (Fig. 1). Although this region in the nematode Cfz1 sequence shows only 13-16% identity in any pairwise comparison, its hydropathy profile closely matches that of the other family members. Among the mammalian and Drosophila frizzled sequences, gaps in the alignment in this region are confined to the hydrophilic segments; alignment of the more divergent Cfz1 sequence requires several small gaps in the hydrophobic segments. These characteristics are reminiscent of those found in sequence comparisons among G-protein coupled receptors, and suggest that, like G-protein coupled receptors, the hydrophobic segments of the frizzled proteins form seven membrane spanning alpha-helices linked by hydrophilic loops(38, 39) . In the discussion that follows, we will refer to this part of the protein as the transmembrane region. The transmembrane regions show no primary sequence homology with G-protein coupled receptors, and they lack the most conserved motifs, such as the (Glu/Asp)-Arg-(Tyr/Phe) triplet at the carboxyl terminus of the third transmembrane segment, that are found in nearly all G-protein coupled receptors(40) .

Carboxyl-terminal to the transmembrane region, the frizzled proteins show little or no homology. In Drosophila frizzled this region has been shown to reside on the cytosolic face of the plasma membrane, consistent with the extracellular location of the amino terminus and the assignment of seven transmembrane segments based on the hydropathy profile(17) . The lengths of the carboxyl-terminal regions range from 25 amino acids in Mfz7 to 200 amino acids in Mfz6.

Chromosomal Locations of frizzled Genes

The present members of the frizzled gene family presumably arose by duplication and divergence from a common ancestral gene. In some instances members of a gene family are clustered at a single locus following their duplication (e.g. the globins). To test this possibility for the frizzled genes, and to begin to look for naturally occurring mouse mutants that may result from frizzled gene defects, the murine chromosomal locations of the frizzled family members (mouse locus designation, fz) were determined using an interspecific backcross mapping panel from crosses of [(C57BL/6J times Mus spretus)F1 X C57BL/6J] mice. This mapping panel has been typed for over 2000 loci that are well distributed among all 19 mouse autosomes and the X chromosome(24) . (^4)C57BL/6J and M. spretus DNAs were digested with several different restriction enzymes and analyzed by Southern blot hybridization for informative RFLPs using probes specific to each locus (Table 3). The strain distribution pattern of each RFLP was then determined for the backcross mice. All backcrosses were to C57BL/6J and, as expected, backcross progeny were either homozygous for the C57BL/6J allele or heterozygous for the C57BL/6J and M. spretus alleles. The simple presence or absence of RFLPs specific for M. spretus was followed in backcross mice. The chromosomal location of each locus was then determined by comparing its strain distribution pattern with the strain distribution patterns for all other loci already mapped in the backcross.

The mapping results are summarized in Fig. 6. Eight loci were detected on seven mouse autosomes. From 130 N2 mice examined, no recombinants were identified that separate Mfz5 and Mfz7, on mouse chromosome 1. Interestingly, Mfz5 and Mfz7 share a degree of similarity that is no greater than that seen in an average pairwise comparison of the available mammalian frizzled genes. The remaining family members are all unlinked. In addition, two loci were detected with a probe derived from Mfz2 (the mouse orthologue of Rfz2): one on chromosome 5 (Mfz2-rs1) and one on 11 (Mfz2-rs2). The human orthologue of Rfz1 has recently been mapped to human chromosome 17q21.1(41) . Mouse chromosome 11 shares a large region of homology with the long arm of human chromosome 17q (Fig. 6; data not shown). The syntenic relationship between these chromosomes and the high degree of homology between Rfz1 and Rfz2 (18) suggests that Mfz2-rs2 is the mouse orthologue of Rfz1 and that the authentic Mfz2 locus (represented by Mfz2-rs1) resides in the proximal region of mouse chromosome 5. The remaining frizzled family members have not been mapped in humans. However, based on synteny, we predict that the human orthologues of Mfz2 will map to human chromosome 7q, Mfz3 to 14q, 13, or 8p, Mfz4 to 11q, Mfz5 to 2q, Mfz6 to 8q, Mfz7 to 2q, and Mfz8 to 10p or 18q (Fig. 6).


Figure 6: Partial chromosome linkage maps showing the locations of the frizzled (Fz) family members in mouse. The genes were mapped by interspecific backcross analysis. Partial chromosome linkage maps showing the locations of the frizzled genes in relation to linked loci are shown. The number of recombinant N2 animals over the total number of N2 animals typed plus the recombination frequencies, expressed as genetic distance in centiMorgans (+/-1 S.E.), is shown for each pair of loci on the left of the chromosome maps. Where no recombinants were found between loci, the upper 95% confidence limit of the recombination distance is given in parentheses. The positions of loci in human chromosomes, where known, are shown to the right of the maps. References for the map positions of human loci described in this study can be obtained from GDB (Genome Data Base), a computerized data base of human linkage information maintained by The William H. Welch Medical Library of The Johns Hopkins University (Baltimore, MD).



Finally, we have compared our interspecific maps with composite mouse linkage maps that report the map location of many uncloned mouse mutations (provided from Mouse Genome Data base, a computerized data base maintained at The Jackson Laboratory, Bar Harbor, ME). The frizzled genes mapped in regions of the composite maps that contain a number of mouse mutations (data not shown). It will be of interest to determine whether any of these mutations are candidates for alterations in a frizzled locus.

Expression Patterns of Mouse Frizzled Genes

As an initial step in exploring the biological roles of the mammalian frizzled proteins, the distribution of transcripts among major organs in the adult mouse was determined for each gene by RNase protection. As seen in Fig. 7, each frizzled gene has a distinct tissue distribution of transcripts, and in all cases transcripts are found in more than one tissue. The most specific distributions are those of Mfz3, which is expressed at highest levels in the brain, and Mfz5, which is expressed at highest levels in the eye, kidney, and lung. Using the same RNase protection method, Mfz8 transcripts were not detected in any of the nine adult tissues examined.


Figure 7: Tissue distribution of frizzled transcripts in the adult mouse. Ten micrograms of RNA was used for RNase protection with each of the mouse frizzled probes. The RNA samples are as follows: 1, brain; 2, eye; 3, heart; 4, kidney; 5, liver; 6, lung; 7, spleen; 8, testis; 9, yeast tRNA. A control reaction with a mouse RNA polymerase II probe is shown at the bottom.



An anatomic examination at higher resolution was undertaken by in situ hybridization of mouse embryos at days 11.5 and 17.5, of postnatal day 1 animals, and of adult brains. While a detailed account of these experiments is beyond the scope of the present paper, Fig. 8shows a sample of some of the in situ hybridization patterns for Mfz2, Mfz3, Mfz4, and Mfz5 that suggest in several cases a higher degree of tissue specificity than was indicated by the RNase protection analysis. At embryonic day 17.5, Mfz2 transcripts are most abundant in the ventricular zone in the developing central nervous system, the region that contains actively dividing neuroblasts (Fig. 8C). At all stages tested, Mfz3 transcripts are present at high levels throughout the central nervous system, consistent with the pattern seen by RNase protection (Fig. 8, B, E, F, and H). Mfz4, which was observed by RNase protection to be relatively abundant in the adult central nervous systems, is found by in situ hybridization to be present exclusively within the choroid plexus (Fig. 8G); strong Mfz4 hybridization is seen in the lateral ventricles and along the midline in the third ventricle. At embryonic day 17.5 and postnatal day 1, Mfz5 transcripts are most abundant in the developing retina and/or the underlying choroid, consistent with the strong signal obtained with eye RNA by RNase protection (Fig. 8, A and D).


Figure 8: In situ hybridization to Mfz3, Mfz4, and Mfz5 transcripts during mouse development and in the adult mouse central nervous system. A-C, embryonic day 17.5 saggital sections probed with Mfz5 (A), Mfz3 (B), and Mfz2 (C). D and E, postnatal day 1 coronal section of the head probed with Mfz5 (D) and Mfz3 (E). F, embryonic day 11.5 saggital section probed with Mfz3. G and H, adult brain probed with Mfz4 (G) and Mfz3 (H).




DISCUSSION

The experiments reported here reveal a large family of putative transmembrane receptors that are related to the Drosophila tissue polarity gene frizzled. The identification of eight frizzled genes in mammals and at least 11 in zebrafish indicates that members of this family are likely to play multiple roles in vertebrate development and/or physiology. The finding of frizzled family members in organisms as diverse as humans and nematodes indicates that the frizzled proteins are likely to be part of a universal signaling system in the animal kingdom. Moreover, the expression of frizzled family members in many different tissues and during embryonic development suggests that they are involved in a wide variety of developmental and/or homeostatic processes. We consider below the structures and possible functions of the frizzled proteins in light of our current knowledge regarding Drosophila frizzled.

The aberrant orientation of surface structures seen in Drosophila frizzled mutants indicates that frizzled is involved in transmitting or interpreting a signal that determines the polarity of the bristle and hair cell precursors. Interestingly, in weak frizzled alleles hairs remain locally aligned but show global disorientation, resulting in whorls composed of several dozen hairs. As discussed in Adler et al.(13) , one interpretation of this observation is that frizzled receives distinct global and local polarity signals, with receipt of the latter signal being more robust. Alternately, frizzled may be involved in the maintenance of a single external polarity signal which becomes globally disorganized but remains locally intact when frizzled activity is diminished. The latter model would also account for the observed local spread of the altered tissue polarity phenotype in genetic mosaics. Although these models differ in the hypothesized role for frizzled, they both postulate the existence of an extracellular ligand-receptor system that must ultimately organize the cytoskeleton of the hair and bristle cells to direct their growth.

The frizzled sequences reported in Vinson et al.(14) , Chan et al.(18) , and the present work bring to 10 the number of full-length frizzled sequences known. All frizzled sequences share the same overall domain structure. Sequence comparisons show that the cysteine-rich domains and the transmembrane domains are strongly conserved, although in the case of Cfz1 the conservation in the transmembrane domain is principally at the level of hydropathy. These sequence analyses suggest a working model in which a cysteine-rich extracellular domain is tethered by a variable linker region to a bundle of seven membrane spanning alpha-helices (Fig. 9). If it is assumed for this discussion that frizzled proteins act as receptors, then this structural model suggests a dynamic model of transmembrane signal transduction analogous to that proposed for the subfamily of G-protein coupled receptors with amino-terminal ligand binding domains(42, 43) . Specifically, we envision frizzled signal transduction to proceed through the following steps: 1) extracellular ligand binds to the cysteine-rich domain, 2) ligand binding changes the interaction of the cysteine-rich region with the transmembrane domain (possibly as a result of direct contact between the ligand and the transmembrane domains), 3) changes in this interaction on the extracellular face of the protein result in a rearrangement of transmembrane alpha-helices, and 4) rearrangement of transmembrane helices alters the cytosolic face of the protein which either promotes or inhibits interactions with second messenger components. Elaborations or variations on this model are readily envisioned, including the possibility that in vivo the frizzled proteins function not as monomers but as part of a homo- or heteromultimeric complex.


Figure 9: Working model of frizzled proteins. The amino terminus is located on the extracellular face of the plasma membrane and the carboxyl terminus on the cytosolic face. Seven alpha-helical segments span the membrane.



If frizzled proteins act as receptors, that would imply the existence of a corresponding family of ligands. In the simplest case, if there is one ligand for each receptor, then mammals would be expected to have at least eight frizzled ligands, and zebrafish at least 11. Currently, the only family of ligands known to be of this size and for which no receptors have been identified are the Wnt proteins(10) . The availability of a large number of frizzled genes should facilitate a biochemical test of the possibility that these two families of proteins interact directly. Finally, the lack of primary sequence homology between the frizzled family and known receptor families suggests that these proteins may couple to a novel effector pathway. The existence of multiple frizzled proteins implies either the existence of a large number of homologous effector pathways or a convergence on a smaller number of pathways, as seen in G-protein coupled signal transduction.


FOOTNOTES

*
This work was supported by the Howard Hughes Medical Institute, the Ruth and Milton Steinbach Fund, the Medical Scientist Training Program (to B. S. A.), and by the National Cancer Institute, DHHS, under contract with ABL. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore by hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBank(TM)/EMBL Data Bank with accession number(s) U43205 [GenBank]and U43316[GenBank]-U43321[GenBank].

(^1)
The abbreviations used are: PCR, polymerase chain reaction; BLAST, basic local alignment search tool; RACE, rapid amplification of cDNA ends; RFLP, restriction fragment length polymorphism; IB, interspecific backcross; kb, kilobase pair(s).

(^2)
J. P. Macke, P. M. Smallwood, and J. Nathans, unpublished data.

(^3)
P. Bhanot, D. Andrew, Y. Wang, and J. Nathans, unpublished data.

(^4)
N. G. Copeland and N. A. Jenkins, unpublished results.


ACKNOWLEDGEMENTS

We thank Dr. Alan Coulson (MRC Sanger Center, Cambridge, U.K.) for providing C. elegans YAC grids; Dr. Claudia Cummins (University of Wisconsin) for the C. elegans genomic library; Phil Smallwood for mouse RNA samples; Hao Zhou for providing the mammalian Southern blot and zebrafish genomic DNA; Doris von Kessler for a gift of sea urchin DNA; Dr. Andy Fire and Siqun Xu for first strand cDNA and advice on C. elegans; Jie Cheng and Dr. Clark Riley for advice on sequence analysis; Dr. Clark Riley, Carol Davenport, Janine Ptak, and Lisa Eiben for synthetic DNA and assistance with DNA sequencing; Maria Papapavlou for assistance with tissue preparation; Jeffrey Corden for clones of mouse RNA polymerase II; Se-jin Lee for mouse cDNA and genomic DNA libraries; Deborah A. Householder and Mary Barnstead for technical assistance with chromosome mapping; Mengqing Xiang and Purnima Bhanot for general advice; and Patrick Tong and Purnima Bhanot for comments on the manuscript.


REFERENCES

  1. Banerjee, U., and Zipursky, S. L. (1990) Neuron 4, 177-187 [Medline] [Order article via Infotrieve]
  2. Fortini, M. E., and Artavanis-Tsakonas, S. (1993) Cell 75, 1245-1247 [Medline] [Order article via Infotrieve]
  3. Sprenger, F., Stevens, L. M., and Nusslein-Volhard, C. (1989) Nature 338, 478-483 [CrossRef][Medline] [Order article via Infotrieve]
  4. Hashimoto, C., Hudson, K. L., and Anderson, K. V. (1988) Cell 52, 269-279 [Medline] [Order article via Infotrieve]
  5. Lee, J. J., von Kessler, D. P., Parks, S., and Beachy, P. A. (1992) Cell 71, 33-50 [Medline] [Order article via Infotrieve]
  6. Baker, N. E. (1987) EMBO J. 6, 1765-1773
  7. Rijsewijk, F., Schuermann, M., Wagenaar, E., Parren, P., Weigel, D., and Nusse, R. (1987) Cell 50, 649-657 [Medline] [Order article via Infotrieve]
  8. Swiatek, P. J., Lindsell, C. E., Franco del Amo, F., Weinmaster, G., and Grindley, T. (1994) Genes & Dev. 8, 707-719
  9. Conlon, R. A., Reaume, A. G., and Rossant, J. (1995) Development 121, 1533-1545 [Abstract/Free Full Text]
  10. Nusse, R., and Varmus, H. E. (1992) Cell 69, 1073-1087 [Medline] [Order article via Infotrieve]
  11. Riddle, R. D., Johnson, R. L., Laufer, E., and Tabin, C. (1993) Cell 75, 1401-1416 [Medline] [Order article via Infotrieve]
  12. Gubb, D., and Garcia-Bellido, A. (1982) J. Embryol. Exp. Morphol. 68, 37-57 [Medline] [Order article via Infotrieve]
  13. Adler, P. N., Charlton, J., and Vinson, C. (1987) Dev. Genet. 8, 99-119
  14. Vinson, C. R., and Adler, P. N. (1987) Nature 329, 549-551 [CrossRef][Medline] [Order article via Infotrieve]
  15. Vinson, C., Conover, S., and Adler, P. N. (1989) Nature 338, 263-264 [CrossRef][Medline] [Order article via Infotrieve]
  16. Adler, P. N., Vinson, C., Park, W.-J., Conover, S., and Klein, L. (1990) Genetics 126, 401-416 [Abstract/Free Full Text]
  17. Park, W.-J., Liu, J., and Adler, P. N. (1994) Mech. Dev. 45, 127-137 [CrossRef][Medline] [Order article via Infotrieve]
  18. Chan, S. D. H., Karpf, D. B., Fowlkes, M. E., Hooks, M., Bradley, M. S., Vuong, V., Bambino, T., Liu, M. Y. C., Arnaud, C. D., Strewler, G. J., and Nissenson, R. A. (1992) J. Biol. Chem. 267, 25202-25207 [Abstract/Free Full Text]
  19. Nathans, J., Thomas, D., and Hogness, D. S. (1986) Science 232, 193-202 [Medline] [Order article via Infotrieve]
  20. Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lippman, D. J. (1990) J. Mol. Biol. 215, 403-410 [CrossRef][Medline] [Order article via Infotrieve]
  21. Waterston, R. Martin, C., Craxton, M., Huynh, C., Coulson, A., Hillier, L., Durbin, R., Green, P., Shownkeen, R., Halloran, N., Metzstein, M., Hawkins, T., Wilson, R., Berks, M., Thomas, K., Thierry-Mieg, J., and Sulston, J. (1992) Nature Genet. 1, 114-123 [Medline] [Order article via Infotrieve]
  22. Frohman, M. A., Dush, M. K., and Martin, G. (1988) Proc. Natl. Acad. Sci. U. S. A. 85, 8998-9002 [Abstract]
  23. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual , 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  24. Copeland, N. G., and Jenkins, N. A. (1991) Trends Genet. 7, 113-118 [Medline] [Order article via Infotrieve]
  25. Jenkins, N. A., Copeland, N. G., Taylor, B. A., and Lee, B. K. (1982) J. Virol. 43, 26-36 [Medline] [Order article via Infotrieve]
  26. Wilkie, T. M., Chen, Y., Gilbert, D. J., Moore, K. J., Yu, L., Simon, M. I., Copeland, N. G., and Jenkins, N. A. (1993) Genomics 18, 175-184 [CrossRef][Medline] [Order article via Infotrieve]
  27. Pathak, B. G., Shull, G. E., Jenkins, N. A., and Copeland, N. G. (1996) Genomics , in press
  28. Wilkie, T. M., Gilbert, D. J., Olsen, A. S., Chen, X.-N., Amatruda, T. T., Korenberg, J. R., Trask, B. J., de Jong, P., Reed, R. R., Simon, M. I., Jenkins, N. A., and Copeland, N. G. (1992) Nature Genet. 1, 85-9 [Medline] [Order article via Infotrieve]
  29. Pritchard, M. A., Baker, E., Whitmore, S. A., Sutherland, G. R., Idzerda, R. L., Park, L. S., Cosman, D., Jenkins, N. A., Gilbert, D. J., Copeland, N. G., and Beckmann, M. P. (1991) Genomics 10, 801-806 [Medline] [Order article via Infotrieve]
  30. Buchberg, A. M., Brownell, E., Nagata, S., Jenkins, N. A., and Copeland, N. G. (1989) Genetics 122, 153-161 [Abstract/Free Full Text]
  31. Haefliger, J.-A., Bruzzone, R., Jenkins, N. A., Gilbert, D. J., Copeland, N. G., and Paul, D. L. (1992) J. Biol. Chem. 267, 2057-2064 [Abstract/Free Full Text]
  32. Jenkins, N. A., Gilbert, D. J., Cho, B. C., Strobel, M. C., Williams, S. C., Copeland, N. G., and Johnson, P. F. (1995) Genomics 28, 333-336 [CrossRef][Medline] [Order article via Infotrieve]
  33. Spring, J., Goldberger, O. A., Jenkins, N. A., Gilbert, D. J., Copeland, N. G., and Bernfield, M. (1994) Genomics 21, 597-601 [CrossRef][Medline] [Order article via Infotrieve]
  34. Green, E. L. (1981) in Genetics and Probability in Animal Breeding Experiments , pp. 77-113, Oxford University, NY
  35. Cole, A., Abu-Shakra, S., Saffen, D., Baraban, J., and Worley, P. (1990) J. Neurochem. 55, 1920-1927 [Medline] [Order article via Infotrieve]
  36. Saffen, D., Cole, A., Worley, P., Christy, B., Ryder, K., and Baraban, J. (1988) Proc. Natl. Acad. Sci. U. S. A. 85, 7795-7799 [Abstract]
  37. Rehn, M., and Pihlajaniemi, T. (1995) J. Biol. Chem. 270, 4705-4711 [Abstract/Free Full Text]
  38. Schertler, G. F. X., Villa, C., and Henderson, R. (1993) Nature 362, 770-772 [CrossRef][Medline] [Order article via Infotrieve]
  39. Borjigin, J., and Nathans, J. (1994) J. Biol. Chem. 269, 14715-14722 [Abstract/Free Full Text]
  40. Probst, W. C., Snyder, L. A., Schuster, D. I., Brosius, J., and Sealfon, S. C. (1992) DNA Cell Biol. 11, 1-20 [Medline] [Order article via Infotrieve]
  41. Zhou, Z. Y., Lee, C. C., Baldini, A., and Caskey, T. C. (1995) Genomics 27, 370-373 [CrossRef][Medline] [Order article via Infotrieve]
  42. McFarland, K. C., Sprengel, R., Phillips, H. S., Kohler, M., Rosemblitt, N., Nikolics, K., Segaloff, D. L., and Seeburg, P. H. (1989) Science 245, 494-499 [Medline] [Order article via Infotrieve]
  43. Brown, E. M., Gamba, G., Riccardi, D., Lombardi, M., Butters, R., Kifor, O., Sun, A., Hediger, M. A., Lytton, J., and Hebert, S. C. (1993) Nature 366, 575-580 [CrossRef][Medline] [Order article via Infotrieve]

©1996 by The American Society for Biochemistry and Molecular Biology, Inc.