Molecular Cloning, Genomic Organization, and Developmental Regulation of a Novel Receptor from Drosophila melanogaster Structurally Related to Members of the Thyroid-stimulating Hormone, Follicle-stimulating Hormone, Luteinizing Hormone/Choriogonadotropin Receptor Family from Mammals*

(Received for publication, February 22, 1996, and in revised form, October 16, 1996)

Frank Hauser Dagger , Hans-Peter Nothacker § and Cornelis J. P. Grimmelikhuijzen

From the Department of Cell Biology and Anatomy, Zoological Institute, University of Copenhagen, Universitetsparken 15, DK-2100 Copenhagen Ø, Denmark

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
Acknowledgments
REFERENCES


ABSTRACT

Using oligonucleotide probes derived from consensus sequences for glycoprotein hormone receptors, we have cloned an 831-amino acid residue-long receptor from Drosophila melanogaster that shows a striking structural homology with members of the glycoprotein hormone (thyroid-stimulating hormone (TSH); follicle-stimulating hormone (FSH); luteinizing hormone/choriogonadotropin (LH/CG)) receptor family from mammals. This homology includes a very large, extracellular N terminus (20% sequence identity with rat TSH, 19% with rat FSH, and 20% with the rat LH/CG receptor) and a seven-transmembrane region (53% sequence identity with rat TSH, 50% with rat FSH, and 52% with the rat LH/CG receptor). The Drosophila receptor gene is >7.5 kilobase pairs long and contains 17 exons and 16 introns. Seven intron positions coincide with introns in the mammalian glycoprotein hormone receptor genes and have the same intron phasing. This indicates that the Drosophila receptor is evolutionarily related to the mammalian receptors. The Drosophila receptor gene is located at position 90C on the right arm of the third chromosome. The receptor is strongly expressed starting 8-16 h after oviposition, and the expression stays high until after pupation. Adult male flies express high levels of receptor mRNA, but female flies express about 6 times less. The expression pattern in embryos and larvae suggests that the receptor is involved in insect development. This is the first report on the molecular cloning of a glycoprotein hormone receptor family member from insects.


INTRODUCTION

Insects are very successful and adaptive animals and are of extreme, ecological and economical importance, since about 70% of all flowering plants depend on insects for pollination and insects can be severe pest animals. Despite the importance of insects, however, the molecular basis of their reproduction is not understood very well. This is in contrast to the situation in mammals, where reproduction is controlled by (i) steroid hormones, (ii) a family of closely related glycoprotein hormones produced by the adenohypophysis (luteinizing hormone (LH)1; follicle-stimulating hormone (FSH)), or placenta (choriogonadotropin, CG), and (iii) hypothalamic releasing hormones (e.g. gonadotropin-releasing hormone) that control the release of LH and FSH from the adenohypophysis. A few years ago, the mammalian receptors for FSH, LH, and CG were cloned, as well as the receptor for a fourth glycoprotein hormone, thyroid-stimulating hormone (TSH) that also is produced by the adenohypophysis (1, 2, 3, 4, 5, 6, 7, 8, 9). The receptors for human LH and CG appeared to be identical and closely related to the TSH and FSH receptors (4, 5, 6, 7, 8, 10). All three receptor types form a distinct subfamily belonging to the large family of G protein-coupled (seven-transmembrane) receptors. A remarkable feature of the mammalian TSH, FSH, LH/CG receptors, which separate them from the other G protein-coupled receptors, is the presence of a very large, extracellular N terminus, which composes about half of the receptor molecule. This N terminus probably constitutes the binding site for the glycoprotein hormone ligand (11, 12, 13).

Recently, using the polymerase chain reaction (PCR) and primers coding for consensus sequences of G protein-coupled receptors, we have found that cnidarians (which are the most primitive animals in the animal kingdom having a nervous system, such as sea anemones) produce a receptor that shows a striking structural homology with members of the mammalian TSH, FSH, LH/CG receptors (14). This was an exciting finding, since glycoprotein hormone receptors had, so far, only been cloned from mammals and not from other vertebrates or invertebrates. Our results suggested that glycoprotein hormone receptors and possibly all of the processes that are mediated by these types of receptors are conserved throughout the animal kingdom, from cnidarians to mammals. The presence of putative glycoprotein hormone receptors in invertebrates was confirmed shortly after our own discovery in cnidarians by a report on the cloning of a related receptor in molluscs (15). In the present paper, we have focused on insects, and we have cloned and characterized a Drosophila melanogaster receptor that shows a strong, structural homology with the TSH, FSH, LH/CG receptors from mammals.


EXPERIMENTAL PROCEDURES

Radioactive Labeling of DNA Probes

DNA fragments to be labeled were cleaved from vector DNA by restriction enzymes and purified by agarose gel electrophoresis. They were labeled with [alpha -32P]dCTP (Amersham Corp.; specific activity, 110 TBq/mmol) using the Ready-To-Go DNA-labeling kit from Pharmacia Biotech. Inc. (16).

DNA Sequencing and Sequence Analysis

DNA sequences were determined by the dideoxynucleotide chain termination method (17) using a Sequenase version 2.0 DNA sequencing kit (U.S. Biochemical Corp.). cDNA insertions were excised from the lambda  vector with restriction enzymes, subcloned into pBluescript SK+ (Stratagene), and sequenced. All subclones were sequenced in both directions. For the sequencing of larger insertions, the Double Stranded Nested Deletion kit (Pharmacia) or sequence-specific oligonucleotides were used. DNA sequence compilation, nucleotide and amino acid sequence comparisons, and data base searches were performed using the Lasergene software package (DNASTAR Inc.).

PCR

In order to obtain the initial PCR product used to screen the cDNA library, we used two primers, sense 5'-CCITG(T/C)GA(A/G)GA(T/C)ATIATGGGITA-3' and antisense 5'-GC(G/A)AAGG(C/A)(C/G)AG(G/A)TT(A/G)CACATIAG(G/A)AA-3' and a lambda ZAPII cDNA library from adult D. melanogaster (a kind gift from Dr. E. Gundelfinger, Federal Institute for Neurobiology, Magdeburg, Germany) as a template to perform PCR (18). The reaction mixture consisted of 50 µl of 1 × Taq buffer (Promega) containing 2.5 mM MgCl2, 1 × 106 plaque-forming units of cDNA library, 0.8 µM of each primer, 0.2 mM of all four dNTPs, and 1.5 units of Taq polymerase (Promega). Thermal cycling parameters were 3 min of initial denaturation at 94 °C, followed by 30 cycles of the following step program: 94 °C for 1 min, 48 °C for 1 min, 72 °C for 1 min. The PCR products were separated on 2% agarose gel, and bands of the expected size were isolated (Qiaquick extraction kit, Qiagen), subcloned into the SfrI site of pCR-Script (Stratagene), and sequenced.

5'-RACE

All reactions of the 5'-RACE were carried out following the protocol and using the chemicals of the 5'/3'-RACE kit of Boehringer Mannheim. mRNA from D. melanogaster Canton S embryos (16-24 h after oviposition) was used as template. mRNA was isolated with the Oligotex Direct mRNA kit (Qiagen). We used an antisense primer corresponding to positions 115-135 of Fig. 2 for cDNA synthesis and two nested antisense primers corresponding to positions 40 to 63 and -175 to -146 (Fig. 2) for PCR. The same PCR cycling parameters were used as recommended by the manufacturer. The PCR products were provided with blunt ends using T4 DNA polymerase (Pharmacia), phosphorylated using T4 polynucleotide kinase (Amersham) (19), subcloned into the SmaI site of pBluescript SK+ (Stratagene), and sequenced.


Fig. 2.

cDNA and deduced amino acid sequence of the Drosophila receptor. This figure consists of the insertions of cDNA clone lambda dgr8 (positions -254 to 2925) and of the longest cDNA clone derived by 5'-RACE PCR (positions -265 to -146). Nucleotides are numbered from 5' to 3'-end, and the amino acid residues are numbered starting with the first ATG in the open reading frame. Introns are indicated by arrows and numbered 1-16. The seven membrane-spanning domains are boxed and labeled TM I-VII. The translation termination codon is indicated by an asterisk. An in-frame stop codon in the 5'-noncoding region is underlined. A putative polyadenylylation site at the 3'-end is underlined twice. Putative glycosylation sites following the N-X-S or N-X-T consensus sequence are marked by triangles.


[View Larger Version of this Image (90K GIF file)]


Primer Extension

Primer extension was carried out according to Ref. 19 using mRNA from embryos (16-24 h after oviposition) purified as described above. We used an antisense primer corresponding to positions -174 to -146 of Fig. 2 for the extension reaction.

Isolation of cDNA Clones

The cDNA insertion of one of the PCR clones (clone pD5) was 32P-labeled and used as a probe to screen a lambda gt11 cDNA library from adult D. melanogaster Canton S (Clontech). A second cDNA library from adult D. melanogaster Canton S, which was cloned into the BamHI and XbaI sites of lambda DR2 (Clontech), was screened with a 32P-labeled insertion of clone lambda dgr1. Plaque lifting and processing of the nitrocellulose filters were performed as described in Ref. 19.

Isolation of Genomic Clones

A genomic lambda FIX II library from embryos of D. melanogaster Canton S (Stratagene) was screened with a 32P-labeled insertion of cDNA clone lambda dgr1. Further procedures were as described above.

Northern Blot Analysis

mRNA from 102 to 104 animals was isolated with the Oligotex Direct mRNA kit (Qiagen), electrophoresed on 1% agarose, 6% formaldehyde gels, and capillary-transferred onto Hybond-N membranes (Amersham) as described (19). After baking for 1 h at 80 °C, the blot was prehybridized for 4 h at 42 °C in a solution containing 50% (v/v) formamide, 5 × SSC (1 × SSC: 150 mM NaCl, 15 mM sodium citrate, pH 7.0), 5 × Denhardt's solution (1 × Denhardt's solution: 0.02% bovine serum albumin, 0.02% polyvinylpyrrolidone, 0.02% Ficoll), 0.5% SDS, 0.002% herring sperm DNA. For hybridization, the heat-denatured radioactive probe was added at a concentration of 2 × 106 cpm/ml and incubated for 18 h at 42 °C. Final washing was 2 × 10 min at 50 °C in 0.1 × SSC, 0.1% SDS. To obtain a hybridization probe for the ribosomal protein 49 (RP49) mRNA, positions -347 to +689 of the RP49 gene (20) were PCR-amplified using two primers, both 23 nucleotides long, and genomic DNA from Drosophila as a template. The hybridization signals of the Drosophila receptor and RP49 mRNAs were quantified by scanning of the autoradiograms and integration using the CREAMTM software package (Kem-En-Tec A/S, Copenhagen). As a size marker, the RNA Molecular Weight Marker II (Boehringer Mannheim) was used.

Southern Blot Analysis

D. melanogaster Canton S genomic DNA (Clontech) was digested with various restriction enzymes and separated on 0.8% agarose gels. Transfer to a Hybond-N membrane, hybridization, and washing were done as recommended by the supplier (Amersham).

Receptor Gene Localization on Polytene Chromosomes

D. melanogaster salivary gland polytene chromosome squashes were prepared on microscopic slides as described (21). These slides were incubated in 2 × SSC for 30 min at 65 °C, after which the DNA was denatured for 2 min in 0.07 M NaOH. After neutralization in 2 × SSC, the slides with the chromosome squashes were washed with increasing concentrations of ethanol and air-dried. The preparations were subsequently hybridized at 58 °C for 16 h in a solution containing 5 × SSC, 0.1% SDS, and a heat-denatured digoxygenin-labeled cDNA probe. The labeling was performed with a digoxygenin DNA-labeling kit (Boehringer Mannheim). Washing was for 30 min at 53 °C in 2 × SSC. Sites of hybridization were visualized using digoxygenin antibodies conjugated with horseradish peroxidase and diaminobenzidine/H2O2 as substrate. Chromosomes were stained with Giemsa stain (Fluka).


RESULTS

cDNA Cloning of a Drosophila Receptor Protein

We constructed one sense PCR primer that was based on conserved amino acid sequences located N-terminally of the first membrane-spanning domain of several mammalian glycoprotein hormone receptors and a second antisense primer corresponding to conserved amino acid sequences of the second membrane-spanning region. Using these primers and a cDNA library from adult D. melanogaster as a template, we carried out PCR. After subcloning of the PCR products, we obtained one clone (pD5) that contained a 161-bp insertion coding for a protein fragment comprising the first and second membrane-spanning regions of a putative Drosophila receptor related to the TSH, FSH, LH/CG receptors from mammals. Subsequently, we screened 5 × 105 phages of a cDNA library of adult D. melanogaster Canton S with the cDNA insertion of clone pD5 and obtained three positive clones (lambda dgr1, lambda dgr2, and lambda dgr4). These clones contained cDNA insertions of 2520, 2159, and 1248 bp, respectively, all coding for the same transcript (Fig. 1A). All three insertions terminated at their 3'-ends with the same internal EcoRI restriction site (Fig. 1A). We therefore used the insertion of clone lambda dgr1 to screen 5 × 105 phages of another D. melanogaster Canton S cDNA library, which was subcloned into the BamHI and XbaI sites of lambda DR2. This resulted in the isolation of two positive clones, of which one (clone lambda dgr8) contained the longest cDNA (Figs. 1 and 2).


Fig. 1. Schematic representation of the Drosophila receptor cDNA clones and the organization of the receptor gene. A, position of the cDNA clones. The triangle at clone lambda dgr1 indicates an additional insertion, drawn in scale, that originates from alternative splicing (shown in Fig. 5). B, schematic drawing of the cDNA insertion of clone lambda dgr8 and the additional 11 bases connected to its 5'-end that were obtained by 5'-RACE. This combined cDNA represents the whole receptor cDNA. The two narrow bars flanking the broad bar are the 5'- and 3'-noncoding regions, whereas the broad bar itself represents the coding region. The boxes I-VII highlighted in gray indicate the seven membrane-spanning domains. C, representation of the receptor gene. The introns are given as lines and are numbered 1-16. The exons are numbered 1-17 (exon numbers are in italic type). E, EcoRI; B, BamHI; X, XbaI; S, SalI restriction site. D, genomic, overlapping restriction fragments used for sequencing. All fragments are derived from a single genomic clone, lambda g7/3 (not shown). The numbers in parentheses give the size of the clones in kb. For comparison, all parts of Fig. 1, with the exception of intron 1, are drawn to the same scale.
[View Larger Version of this Image (18K GIF file)]


The transcription start site was determined using 5'-RACE. This yielded a major start site at position -264 of Fig. 2, which added 11 nucleotides to the 5'-end of clone lambda dgr8. This major transcription start site was confirmed by primer extension. Other 5'-RACE products pointed to additional transcription start sites at positions -261 and -246.

The composite cDNA of Fig. 2 is 3190 bp long. The ATG codon at nucleotide positions 1-3 of Fig. 2 is probably the start codon, because it is flanked by a consensus sequence for the initiation of translation (22). Furthermore, there is an in-frame stop codon upstream of this start codon (at positions -33 to -31). The cDNA codes for an 831-amino acid residue-long protein with a calculated molecular mass of 92.7 kDa. Hydropathy analysis suggests the presence of seven membrane-spanning regions, which is characteristic of G protein-coupled receptors. The protein contains four potential N-glycosylation sites in the putative extracellular N terminus and two others in the extracellular loops of the transmembrane domain, suggesting that the actual molecular mass of the receptor might be higher than 92.7 kDa. The intracellular loops of the putative transmembrane domain and the C-terminal, intracellular part of the protein contain multiple serine and threonine residues, which are potential phosphorylation sites.

Comparison of the Drosophila Receptor with Other Related Receptors

Fig. 3 shows a comparison of the Drosophila receptor with the TSH, FSH, and LH/CG receptors from rat and the putative glycoprotein hormone receptor from sea anemones. Especially in the seven-transmembrane area (from the Drosophila receptor amino acid positions 493-759 in Fig. 3), there is a high sequence homology between the Drosophila and the mammalian receptor proteins (53% sequence identity with rat TSH, 50% with rat FSH, and 52% with rat LH/CG receptor). Less sequence identity is found in the extracellular N terminus (amino acid positions 1-492 in Fig. 3; 20% sequence identity with rat TSH, 19% with rat FSH, and 20% with rat LH/CG receptor).


Fig. 3. Amino acid sequence comparison of the Drosophila receptor (GHRD), the rat TSH (TSHR), FSH (FSHR), and LH (LHR) receptors and the putative glycoprotein hormone receptor from the sea anemone Anthopleura elegantissima (GHRA). Amino acid residues that are identical between the Drosophila receptor and at least one of the other receptors are boxed. Known intron-exon transitions in the genes coding for four receptors are highlighted in gray at the corresponding amino acid residues. The intron-exon transitions in the rat TSH receptor gene are not known, but in other mammalian TSH receptor genes the first eight intron-exon transitions occur at exactly the same positions as in the genes for the rat FSH and LH receptor. The positions of the aliphatic residues characteristic for the Leu-rich repeats in the Drosophila receptor (see Fig. 4) are marked by asterisks or filled circles. The filled circles also mark intron-exon transitions in the Drosophila receptor gene that occur at the same positions and have the same intron phasing (see Table I) as in the mammalian receptor genes. The seven membrane-spanning domains are indicated by I-VII. Dashed lines represent spaces introduced to optimize alignment. The amino acid positions are given at the right. The amino acid sequences of the sea anemone and rat receptors and the intron-exon positions in their genes are from Refs. 3, 8, 14, 23, 28, and 29.
[View Larger Version of this Image (110K GIF file)]


One feature of the mammalian glycoprotein hormone receptors is the existence of Leu (or Ile/Val)-rich repeats in their N termini (12, 13, 23). These Leu-rich repeats also occur in the N terminus of the Drosophila receptor (Fig. 4). The positions of most of the aliphatic residues in the Leu-rich repeats of the Drosophila receptor follow a distinct pattern, which is very similar to that found in the mammalian receptors (Fig. 4; asterisks and filled circles in Fig. 3).


Fig. 4. Leu (Ile/Val)-rich repeats in the N-terminal region of the Drosophila receptor and the rat FSH receptor. A, consecutive segments of the Drosophila receptor N terminus (L1-L10) were aligned, and small gaps (dashed lines) were introduced to show that many of the Leu (Ile/Val) residues in one segment occur at a similar position in the other segments. These residues are boxed. The aliphatic residues highlighted in gray correspond to intron-exon transitions in the receptor gene and are given at the start of seven repeating segment. The segments L2-L7, therefore, each correspond to a distinct exon. Amino acid positions are given at the right. B, a similar alignment of Leu (Ile/Val)-rich repeats of the N terminus of the rat FSH receptor. The segments L1-L7 each correspond to a distinct exon (data from Ref. 23).
[View Larger Version of this Image (38K GIF file)]


DNA Sequence and Organization of the Drosophila Receptor Gene

Using the cDNA insertion of clone lambda dgr1 as a probe, we screened 2.5 × 105 phages of a genomic library of D. melanogaster Canton S embryos and identified 21 positive clones. One positive clone (lambda g7/3), which contained the Drosophila receptor gene, was digested with restriction enzymes, and overlapping fragments were subcloned and sequenced (Fig. 1D). The DNA of the receptor gene is >7.5 kb long. It consists of 17 exons and 16 introns (Fig. 1C). Only introns 1, 2, and 5 are more than 400 bp in length. The others are considerably shorter, in the range of 50-150 bp (Fig. 1C; Table I). The donor and acceptor sites of the introns are in agreement with the GT/AG consensus sequence (24), with the exception of the 5'-donor site of intron 8, which contains an unusual GC sequence (Table I). The existence of this unusual GC sequence was confirmed by sequencing of another, independent genomic clone.

Table I.

Intron/exon boundaries of the D. melanogaster receptor gene

The sequence of each of the intron-exon boundaries is shown, as well as the codons for the amino acid residues. Uppercase and lowercase letters represent nucleotides in the exons and introns, respectively. Each intron has been fully sequenced with the exception of intron 1, of which only 300 bases have been determined. The sequences of the introns are given in our GenBankTM/EMBL Data Bank submission. The overall positions of the introns are shown in Figs. 1 and 2. Introns 3-9 occur at the same positions within the gene as seven introns in the glycoprotein hormone receptor genes of mammals (cf. filled circles in Fig. 3). The triplets coding for the aliphatic amino acid residues at intron-exon transitions 3-9 are always interrupted after the second nucleotide (intron phase = 2).
 Intron       5'-Donor      Intron size (bp)      3'-Acceptor Intron phase

 1 CCG gtgagca.~2300...tttgcag AGG  ---
 2 CAA gtaagtg...505...ttcgcag GCG 3
Gln                         Ala
 3 GT  gtgagtt....88...cttgcag   G 2
Val                         Val
 4 AT  gtaaggc....80...ctttcag   C 2
Ile                         Ile 2
 5 AT  gtgagtc...462...cccacag   A 2
Ile                         Ile
 6 AT  gtgagta....61...tccgcag   A 2
Ile                         Ile
 7 TT  gtcagta....63...tttgcag   A 2
Leu                         Leu
 8 CT  gcaagtt....55...tttttag   A 2
Leu                         Leu
 9 CT  gtaagct....67...tccatag   C 2
Leu                         Leu
10 CGG gtgagta....59...tttatag AAT 3
Arg                         Asn
11 AG  gtgaatt....63...actgcag   A 2
Arg                         Arg
12 AG  gtaatcg...144...gttccag   G 2
Arg                         Arg
13 G   gtgggtg....71...cccgcag  GG 1
Gly                         Gly
14 AG  gtgagca....68...attgcag   C 2
Ser                         Ser
15 TTG gtgagta....70...cctgtag GTC 3
Leu                         Val
16 AA  gtaagac....68...tttccag   A 2
Lys                         Lys

Intron 1, which has a size of 2.3 kb, is located in the 5'-noncoding region, whereas all of the other introns are located in the coding region of the gene (Fig. 1, B and C, and Fig. 2). Exon 1 (Fig. 1C) contains most of the 5'-noncoding region of the corresponding receptor mRNA. Exons 2-11 code for the extracellular, N-terminal domain of the receptor, exons 12-16 for the transmembrane regions, and exon 17 for the intracellular C-terminal part (Fig. 1, B and C, and Fig. 2). Exon 17 also contains the 3'-noncoding region.

The compiled DNA in the coding regions of the Drosophila receptor gene is identical to that of the cDNA sequence with a total of 16 exceptions (Table II). In these exceptions, the coding triplets were always changed in such a way that no change in the amino acid residues resulted (Table II). Eight nucleotide differences occur between the genomic DNA and the 3'-noncoding region of the receptor cDNA (given in our GenBankTM/EMBL Data Bank submission). All of these small differences may point to two subpopulations of the D. melanogaster Canton S strain from which the commercial genomic and cDNA libraries have been constructed, or to allelic variation.

Table II.

Codon differences between the cDNA and the gene coding for the Drosophila receptor

Amino acid residue and position within the receptor protein (Fig. 2) is given in the first column, the triplet in the gene in the second column, and the triplet in the cDNA in the third column. None of the codon differences leads to a difference in amino acid residue.
  Amino acid   residue and position Codon in the gene Codon in the cDNA

Gly154 GGA GGG
Thr194 ACC ACT
Phe195 TTT TTC
Leu218 CTC CTG
Leu227 CTA CTG
Leu289 CTA CTG
Ser318 TCA TCT
Leu335 TTG TTA
Arg375 CGC CGG
Glu445 GAG GAA
Val523 GTG GTT
Leu538 TTG CTG
Ala560 GCT GCC
Gly666 GGA GGG
Ser694 AGT AGC
Pro753 CCC CCT

The cDNA clone lambda dgr1 contains an additional 70-bp insertion in the 5'-noncoding region compared with cDNA clone lambda dgr8 (Fig. 1A). This insertion probably originates from alternative splicing at the 3'-end of the first intron (Fig. 5).


Fig. 5. Partial nucleotide sequence of intron 1. Intron 1 (see Fig. 1C) is about 2300 bp long, and has only partially been sequenced. The numbers in this figure refer to the cDNA nucleotide positions in Fig. 2. Uppercase and lowercase letters represent the nucleotides in exons 1 and 2 and in intron 1, respectively. The conserved gt sequence at the splicing donor site is underlined, and the two ag sequences at both splicing acceptor sites are underlined twice. The extra insertion found in cDNA clone lambda dgr1, but not in cDNA clone lambda dgr8 (Fig. 1A) probably originates from alternative splicing at one of the two different acceptor sites of intron 1. Its 70-bp nucleotide sequence is shown in boldface type.
[View Larger Version of this Image (13K GIF file)]


Seven intron-exon transitions in the DNA coding for the extracellular N terminus of the Drosophila receptor occur at exactly the same positions as in the genes for the mammalian TSH, FSH, and LH/CG receptors (see those residues that are marked by a filled circle in Fig. 3). This indicates that the Drosophila receptor and the mammalian glycoprotein hormone receptors are evolutionarily related.

Southern Blot Analysis

Southern blot analysis using a BamHI/SalI fragment of the insertion of cDNA clone lambda dgr1 as a hybridization probe (Fig. 1A) showed single hybridization bands after digestion of Drosophila genomic DNA with EcoRI, BamHI, or SalI (Fig. 6). The presence of two hybridization bands seen after digestion of the genomic DNA with XbaI (Fig. 6) is consistent with an XbaI restriction site within the genomic sequence recognized by the hybridization probe (marked X in intron 5 of Fig. 1C). The size of the bands was as expected from the restriction map of the isolated genomic clones. No additional bands were visible after long autoradiographic exposure. All of this indicates the presence of a single gene coding for the Drosophila receptor.


Fig. 6. Southern blot analysis. Genomic DNA (20 µg) from D. melanogaster Canton S was digested with either EcoRI (E), BamHI (B), SalI (S), or XbaI (X). After electrophoresis and blotting, the fragments were hybridized with a 1457-bp-long BamHI/SalI fragment from cDNA clone lambda dgr1 (nucleotide positions 461-1918 in Fig. 2). The size of the DNA fragments (right) is given in kb. All lanes show a single hybridization band with the exception of the lane containing the XbaI fragment.
[View Larger Version of this Image (99K GIF file)]


Chromosomal Localization of the Receptor Gene

A digoxygenin-labeled insertion of cDNA clone lambda dgr1 (Fig. 1A) was used as a probe to localize the gene for the receptor on Drosophila salivary gland chromosomes. A single hybridization band was observed at position 90C on the right arm of the third chromosome (Fig. 7).


Fig. 7. Localization of the Drosophila receptor gene on polytene chromosomes. The labeled cDNA insertion of clone lambda dgr1 was used as a hybridization probe for Drosophila salivary gland chromosomes. A single band was observed at position 90C on the right arm of the third chromosome (arrowhead).
[View Larger Version of this Image (108K GIF file)]


Developmental Regulation of the Drosophila Receptor

mRNAs isolated from different developmental stages were subjected to a Northern blot analysis using the entire insertion of clone lambda dgr1 (Fig. 1A) as a hybridization probe. This showed the presence of a major 3.5-kb transcript in all developmental stages starting 8-16 h after oviposition (Fig. 8). The size of the 3.5-kb transcript corresponds well to the size of the cloned receptor cDNA (3.2 kb; see Fig. 2). In addition to the 3.5-kb transcript, a 6.5-kb transcript that was only very weakly hybridizing was observed having the same time course of expression as the 3.5-kb transcript (Fig. 8). The nature of this larger transcript is unknown.


Fig. 8. Expression of the Drosophila receptor mRNA during development. A, poly(A+) RNA (8 µg) from each of the indicated developmental stages was hybridized with the cDNA insertion of clone lambda dgr1. Numbers above the lanes refer to hours (h) or days (d) after oviposition. The size of the RNA (right) is given in kb. B, hybridization of the same blot as in A with a probe for the RP49 mRNA that is regarded to be expressed in equal amounts in all developmental stages (20, 37). C, the ratio of the receptor mRNA and RP49 mRNA hybridization signals in all developmental stages. This normalized plot for the Drosophila receptor mRNA expression shows a strong increase 8-16 h after oviposition. Receptor mRNA stays high until after pupation. Adult, female flies have 6 times less receptor mRNA than male flies.
[View Larger Version of this Image (46K GIF file)]



DISCUSSION

We have cloned a novel Drosophila receptor that shows a strong, structural homology with members of the TSH, FSH, LH/CG receptor family from mammals. This conclusion is based on the following findings. (i) The Drosophila receptor is a seven-transmembrane receptor having a very large, extracellular N terminus that comprises about half of the total receptor molecule (Fig. 2). This overall structure is similar to that of the mammalian glycoprotein hormone receptors. (ii) There is 19-20% sequence identity in the N terminus between the Drosophila receptor and the mammalian receptors (Fig. 3). (iii) The N terminus of the Drosophila receptor contains 9-10 Leu (Ile/Val)-rich repeats that occur in a similar number in the N termini of the mammalian receptors (Figs. 3 and 4). The number of those aliphatic residues in each repeating segment that can be aligned with the aliphatic residues of the other segments (which is nearly always 5; see Fig. 4A) and also the spacings between these aliphatic residues of the Drosophila receptor N terminus are either the same or very similar to those found in the mammalian receptors (Fig. 4B; cf. Refs. 12 and 23). The function of the Leu-rich repeats in the mammalian receptors has not been established yet. Recently, however, two research groups (12, 13) have developed two similar, three-dimensional models for the Leu-rich repeats in the N termini of various mammalian glycoprotein hormone receptors. These models are based on the crystal structure of the porcine ribonuclease-ribonuclease inhibitor complex, where the Leu-rich repeats of the inhibitor are arranged in a horseshoe-like structure and where these repeats function as the ribonuclease binding motifs (25, 26). In the models of Refs. 12 and 13, the Leu-rich repeats of the mammalian glycoprotein hormone receptor N terminus each have an inwardly directed beta -sheet (at the concave surface) and an outwardly directed alpha -helix (at the convex surface of the horseshoe). The inwardly directed beta -sheet structures are supposed to bind the ligand. The N terminus of the Drosophila receptor might have a similar configuration and ligand-binding capacity. Thus, the Drosophila receptor might not only be structurally, but also functionally, related to the mammalian receptors. (iv) There exists 50-53% sequence identity between the membrane-spanning domain of the Drosophila receptor and those of the mammalian glycoprotein hormone receptors. This sequence identity is very high if one takes into consideration that the sequence identity between the membrane-spanning regions of the rat LH/CG and TSH receptor is 70% and that the identity between the rat TSH and FSH receptor is 67% (10).

In addition to the structural homologies between the Drosophila and mammalian receptor proteins, the organization of their genes is also similar. Seven introns (introns 3-9; Fig. 1) in the DNA coding for the extracellular N terminus of the Drosophila receptor occur at exactly the same positions as in the genes coding for the mammalian receptors (indicated by the aliphatic residues marked by filled circles in Fig. 3). The triplets coding for the aliphatic residues at these seven intron-exon transitions are always interrupted after the second nucleotide (intron phase = 2; see Table I). The same intron phasing is found in the corresponding introns of the mammalian receptor genes (23, 27, 28, 29). This strongly suggests that the Drosophila receptor is evolutionarily related to the TSH, FSH, and LH/CG receptors from mammals.

The Drosophila receptor gene has also properties that are different from the mammalian receptor genes. The most striking difference is the existence of five introns in the region coding for the transmembrane area and one intron in the region coding for the intracellular C terminus (Figs. 1 and 2; Table I). In the genes coding for the mammalian glycoprotein hormone receptors, introns have only been found in the region coding for the large, extracellular N terminus (23, 27, 28, 29). For some of the other G protein-coupled receptor genes, however, it is known that introns do also occur in the regions coding for the seven-transmembrane domain and the intracellular C terminus, and these receptor genes have been found both in vertebrates and invertebrates, among them Drosophila (30, 31). It has always been assumed that the mammalian glycoprotein hormone receptors have originated from a recombination of a DNA sequence containing several exons coding for the repeating Leu-rich elements and an intronless sequence with one exon coding for a G protein-coupled receptor (23, 27, 28, 29). Our present findings now suggest that the gene for this ancestral G protein-coupled receptor had at least five introns (if one agrees with the theory that the number of introns in a certain gene does not increase but tends to decrease in the course of eucaryote evolution (32)). The intron-exon pattern in the gene region coding for the transmembrane and intracellular domains of the Drosophila receptor does not resemble that of any other known G protein-coupled receptor gene, indicating that the putative ancestral G protein-coupled receptor is not closely related to any of the known receptors that have been characterized at the genomic level so far. The recombination event creating the family of glycoprotein hormone receptors must have occurred early in evolution, since receptors similar to mammalian glycoprotein hormone receptors can already be found in sea anemones (14). In sea anemones, the genomic organization of the hormone receptor has only been partly investigated, and two introns have been found in the DNA region coding for the N terminus (at precisely the same positions as in the mammalian and Drosophila receptors; cf. Fig. 3). In light of our findings in Drosophila, it would be interesting to focus on the genomic organization of the region coding for the transmembrane domain of the sea anemone receptor to see whether this has indeed five or more introns.

When we started our research on the glycoprotein hormone receptor in Drosophila, it was our expectation to obtain a hormone receptor that was functionally related to the mammalian LH or FSH receptors and, thereby, to gain a better insight into the molecular mechanisms of insect reproduction. However, the developmental regulation of the novel Drosophila receptor (Fig. 8) clearly shows that receptor expression is not confined to sexually mature animals. Instead, high levels of mRNA are already produced in Drosophila embryos 8-16 h after oviposition, which is at a stage where the gonads have just started their embryonic development (33). This indicates that the Drosophila receptor must be involved in some developmental process. In this respect, it is interesting to know that mammalian LH and its receptor appear to play a role in Leydig cell proliferation and differentiation (34) and also that TSH stimulates the proliferation of mammalian thymocytes (35). In amphibians, a TSH-related protein hormone induces metamorphosis in tadpoles (36). These are all examples that glycoprotein hormone receptors can regulate development. On the other hand the possibility cannot be excluded that the Drosophila receptor is also involved in sexual reproduction, since adult male flies still have a high level of receptor mRNA (Fig. 8). In contrast to males, adult female flies have 6 times less receptor mRNA, which could point to a role of the receptor in spermatogenesis or other male-specific reproductive processes. Future studies on the localization of the receptor mRNA and on the identification of the receptor ligand will help us to understand all of the functions of the novel insect receptor.


FOOTNOTES

*   This work was supported in part by the Danish Biotechnological Research and Development Program of the Danish Research Councils, Carlsberg Foundation, Læge- og Naturvidenskabelige Komité of the Novo Nordisk Foundation, and Fonds der Chemischen Industrie (Germany). The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EMBL Data Bank with accession number(s) U47005[GenBank], U47006[GenBank], and U49733[GenBank].


Dagger    Recipient of a postdoctoral fellowship from the Max Planck Society (Germany).
§   Present address: CNS Research, Pharma Devision, F. Hoffmann-La Roche AG, CH-4070 Basel, Switzerland.
   To whom correspondence should be addressed: Dept. of Cell Biology and Anatomy, Zoological Institute, University of Copenhagen, Universitetsparken 15, DK-2100 Copenhagen Ø, Denmark. Tel.: 0045-35 32 12 27; Fax: 0045-35 32 12 00.
1    The abbreviations used are: LH, luteinizing hormone; bp, base pair(s); CG, choriogonadotropin; FSH, follicle-stimulation hormone; kb, kilobase pair(s); PCR, polymerase chain reaction; RP49, ribosomal protein 49; TSH, thyroid-stimulating hormone; RACE, rapid amplification of cDNA ends.

Acknowledgments

We thank Christina Færch-Jensen for technical assistance, Astrid Juel Jensen and Marie Hansen for typing the manuscript, Dr. Leif Søndergaard (Department of Genetics, University of Copenhagen, Denmark) for support with setting up the Drosophila culture, and Drs. Eckart Gundelfinger, Bounpheng Phannavong, and Ulrich Thomas (Federal Institute for Neurobiology, Magdeburg, Germany) for hospitality and help with the localization of the receptor gene on salivary gland chromosomes.


REFERENCES

  1. Libert, F., Lefort, A., Gérard, C., Parmentier, M., Perret, J., Ludgate, M., Dumont, J. E., and Vassart, G. (1989) Biochem. Biophys. Res. Commun. 165, 1250-1255 [Medline] [Order article via Infotrieve]
  2. Loosfelt, H., Misrahi, M., Atger, M., Salesse, R., Vu Hai-Luu Thi, M. T., Jolivet, A., Guiochon-Mantel, A., Sar, S., Jallal, B., Garnier, J., and Milgrom, E. (1989) Science 245, 525-528 [Medline] [Order article via Infotrieve]
  3. McFarland, K. C., Sprengel, R., Philips, H. S., Köhler, M., Rosemblit, N., Nikolics, K., Segaloff, D. L., and Seeburg, P. H. (1989) Science 245, 494-499 [Medline] [Order article via Infotrieve]
  4. Parmentier, M., Libert, F., Maenhaut, C., Lefort, A., Gerard, C., Perret, J., van Sande, J., Dumont, J. E., and Vassart, G. (1989) Science 246, 1620-1622 [Medline] [Order article via Infotrieve]
  5. Minegish, T., Nakamura, K., Takakura, Y., Miyamoto, K., Hasegawa, Y., Ibuki, Y., and Igarashi, M. (1990) Biochem. Biophys. Res. Commun. 172, 1049-1054 [Medline] [Order article via Infotrieve]
  6. Minegish, T., Nakamura, K., Takakura, Y., Ibuki, Y., and Igarashi, M. (1991) Biochem. Biophys. Res. Commun. 175, 1125-1130 [Medline] [Order article via Infotrieve]
  7. Misrahi, M., Loosfelt, H., Atger, M., Sar, S., Guiochon-Mantel, A., and Milgrom, E. (1990) Biochem. Biophys. Res. Commun. 166, 394-403 [Medline] [Order article via Infotrieve]
  8. Sprengel, R., Braun, T., Nikolics, K., Segaloff, D. L., and Seeburg, P. H. (1990) Mol. Endocrinol. 4, 525-530 [Abstract]
  9. Kelton, C. A., Cheng, S. V. Y., Nugent, N. P., Schweickhart, R. L., Rosenthal, J. L., Overton, S. A., Wands, G. D., Kuzeja, J. B., Luchette, C. A., and Chappel, S. C. (1992) Mol. Cell. Endocrinol. 89, 141-151 [CrossRef][Medline] [Order article via Infotrieve]
  10. Salesse, R, Remy, J. J., Levin, J. M., Jallal, B., and Garnier, J. (1991) Biochimie 73, 109-120 [CrossRef][Medline] [Order article via Infotrieve]
  11. Nagayama, Y., and Rapoport, B. (1992) Mol. Endocrinol. 6, 145-156 [Abstract]
  12. Jiang, X., Dreano, M., Buckler, D. R., Cheng, S., Ythier, A., Wu, H., Hendrickson, W. A., and El Tayar, N. (1995) Structure 3, 1341-1353 [Medline] [Order article via Infotrieve]
  13. Kajava, A. V., Vassart, G., and Wodak, S. J. (1995) Structure 3, 867-877 [Medline] [Order article via Infotrieve]
  14. Nothacker, H.-P., and Grimmelikhuijzen, C. J. P. (1993) Biochem. Biophys. Res. Commun. 197, 1062-1069 [CrossRef][Medline] [Order article via Infotrieve]
  15. Tensen, C. P., van Kesteren, E. R., Planta, R. J., Cox, K. J. A., Burke, J. F., van Heerikhuizen, H., and Vreugdenhil, E. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 4816-4820 [Abstract]
  16. Feinberg, A. P., and Vogelstein, B (1983) Anal. Biochem. 132, 6-13 [Medline] [Order article via Infotrieve]
  17. Sanger, F., Nicklen, S., and Coulson, A. R. (1977) Proc. Natl. Acad. Sci. U. S. A. 74, 5463-5467 [Abstract]
  18. Saiki, R. K., Gelfand, D. H., Stoffel, S., Scharf, S. J., Higuchi, R., Horn, G. T., Mullis, K. B., and Ehrlich, H. A. (1988) Science 239, 487-491 [Medline] [Order article via Infotrieve]
  19. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  20. O'Connell, P. O., and Rosbash, M. (1984) Nucleic Acids Res. 12, 5495-5513 [Abstract]
  21. Pardue, M. L. (1986) in Drosophila: A Practical Approach (Roberts, D. B., ed), pp. 111-137, IRL Press, Oxford
  22. Kozak, M. (1987) Nucleic Acids Res. 15, 8125-8132 [Abstract]
  23. Heckert, L. L., Daley, I. J., and Griswold, M. D. (1992) Mol. Endocrinol. 6, 70-80 [Abstract]
  24. Mount, S. M. (1982) Nucleic Acid Res. 10, 459-472 [Abstract]
  25. Kobe, B., and Deisenhofer, J. (1993) Nature 366, 751-756 [CrossRef][Medline] [Order article via Infotrieve]
  26. Kobe, B., and Deisenhofer, J. (1995) Nature 374, 183-186 [CrossRef][Medline] [Order article via Infotrieve]
  27. Gross, B., Misrahi, M., Sar, S., and Milgrom, E. (1991) Biochem. Biophys. Res. Commun. 177, 679-687 [Medline] [Order article via Infotrieve]
  28. Koo, Y. B., Ji, I., Slaughter, R. G., and Ji, T. H. (1991) Endocrinol. 128, 2297-2308 [Abstract]
  29. Tsai-Morris, C. H., Buczko, E., Wang, W., Xie, X.-Z, and Dufau, M. L. (1991) J. Biol. Chem. 266, 11355-11359 [Abstract/Free Full Text]
  30. Saudou, F., Boschert, U., Amlaiky, N., Plassat, J.-L., and Hen, R. (1992) EMBO J. 11, 7-17 [Abstract]
  31. Takahashi, K., Tanaka, A., Hara, M., and Nakanishi, S. (1992) Eur. J. Biochem. 204, 1025-1033 [Abstract]
  32. Gilbert, W., Marchionni, M., and McKnight, G. (1986) Cell 46, 151-154 [Medline] [Order article via Infotrieve]
  33. Hartenstein, V. (1993) in Atlas of Drosophila Development: Appendix to The Developmental Biology of Drosophila Melanogaster (Bate, M., and Martinez Arias, A., eds), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY
  34. Kremer, H., Kraaij, R., Toledo, S. P. A., Post, M., Fridman, J. B., Hayashida, C. Y., van Reen, M., Milgrom, E., Ropers, H.-H., Mariman, E., Themmen, A. P. N., and Brunner, H. G. (1995) Nat. Genet. 9, 160-164 [Medline] [Order article via Infotrieve]
  35. Vassart, G., and Dumont, J. E. (1992) Endocr. Rev. 13, 596-611 [Medline] [Order article via Infotrieve]
  36. Buckbinder, L., and Brown, D. D. (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 3820-3824 [Abstract]
  37. Kerrebrock, A. W., Moore, D. P., Wu, J. S., and Orr-Weaver, T. L. (1995) Cell 83, 247-256 [Medline] [Order article via Infotrieve]

©1997 by The American Society for Biochemistry and Molecular Biology, Inc.