A Highly Complex Organization of the Regulatory Region of the Human CYP19 (Aromatase) Gene Revealed by the Human Genome Project

Siby Sebastian and Serdar E. Bulun

Departments of Obstetrics and Gynecology and Molecular Genetics, University of Illinois at Chicago, Chicago, Illinois 60612

Address all correspondence and requests for reprints to: Serdar E. Bulun, M.D., Department of Obstetrics and Gynecology, University of Illinois at Chicago, 820 South Wood Street M/C 808, Chicago, Illinois 60612. E-mail: sbulun{at}uic.edu

Aromatase, the key enzyme for E biosynthesis, is encoded by a single copy of the CYP19 gene, localized at chromosome 15q21.21 (1, 2). In most vertebrates, aromatase is expressed only in gonads and the brain, whereas primates express this gene in additional extragonadal sites (1). Aromatase expression and E production continuously increase, as evolutionary tree progresses and reach the maximum in the human. This is achieved by the more efficient use of existing promoters and recruitment of additional novel tissue-specific promoters in fat, skin, placenta, and the bone (1). E is essential in females for the development of reproductive organs, and in both sexes for bone mineralization and gonadal function (3, 4). Moreover, in E-dependent pathologic tissues such as breast cancer and endometriosis, aromatase is up-regulated via inappropriate activation of aberrant promoters (5, 6). Alternative use of multiple promoters, which regulate mature aromatase mRNA levels by splicing of each first exon or 5'-untranslated region (UTR) onto a common splice junction immediately upstream of the coding region, is the key molecular mechanism conferring tissue-specific expression of the CYP19 gene. For instance, the proximal ovary-specific promoter (PII) gives rise to a 5'-UTR contiguous with the first coding exon (exon II), whereas a constitutively active distal promoter (I.1) in placenta is the basis of strikingly elevated levels of circulating E (100–1000 times normal) in pregnant women (7, 8). Because all mRNA species contain the identical open reading frame (exons II to X), the encoded protein is the same regardless of the promoter used. As a further twist, some of these promoters do not contain canonical TATA and CAAT elements, and each promoter is regulated in response to a distinct set of hormones or cytokines (1). Simpson et al. (1) estimated the size of the CYP19 (aromatase) gene as larger than 80 kb. Characterization of the entire genomic organization and the accurate size of this large gene, however, had not been possible to date using conventional phage or cosmid genomic libraries (1). Recently published Human Genome Project Data (http://www.ncbi.nlm.nih.gov/genome/guide/human/) allowed us, for the first time, to precisely locate all known promoters and elucidate the extraordinarily complex organization of the entire human CYP19 gene.

To achieve this, nucleotide sequence information of various 5'-UTRs and the coding region of the CYP19 gene were subjected to the BLAST (Basic Local Alignment Search Tool) homology search against the High Throughput Genomic Sequence (HTGS) database of the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/HTGS) (9). GenBank Accession numbers of the previously characterized unique 5'-UTRs and corresponding promoters of the human CYP19 gene used for pairwise local sequence alignments were: M22246 (human full-length aromatase cDNA), S52794 (ovary-specific promoter PII), D21241 (adipose/breast cancer, I.3), S96437 (placenta-minor, I.2), D29757 (brain, I.f), S71536 (fetal tissues, I.5), L21982 (skin and adipose, I.4), D14473 (placenta-minor, 2a), and X55983 (placental-major, I.1). Sequence of bone-specific 5'-UTR and its promoter I.6 was obtained from the original publication (10). BLAST search permitted us to align the CYP19 gene coding region (exons II to X) and promoters II, I.3, I.6, and I.2 to a 178,762-bp bacterial artificial chromosome (BAC) clone (RP11-522G20) mapped to chromosome 15q21.2 (GenBank accession no. AC012169). Similarly, the remaining promoters were precisely aligned within another 144,714-bp BAC clone (RP11-108K3) mapped to chromosome15q21.2 (GenBank accession no. AC020891). Next, using the BLAST 2 search program (http://www.ncbi.nlm.nih.gov/blast/bl2seq/bl2.html), we set out to examine the alignment between these two clones, and results showed an overlap of a 6442-bp region, demonstrating continuity of the gene sequence in these BAC clones.

An analysis of these two BAC clones revealed that the entire gene spans over approximately 123-kb of DNA. Additionally, the CYP19 gene is found between markers stSG12786 and stSG47530 with the 3'-end of the gene telomeric to the 5'-end of the gene, showing the direction of transcription as from centromere to telomere. Only the 30-kb 3'-region encodes aromatase, whereas a large 93-kb 5'-flanking region serves as the regulatory unit of the gene. The most proximal ovarian-specific promoter II and two other proximal promoters, I.3 (expressed in adipose tissue and breast cancer) and I.6 (expressed in bone), are found to be located within the 1-kb region upstream of the ATG translation start site in exon II, as expected (Fig. 1Go). Promoter I.2, the minor placenta-specific promoter, is located approximately 13 kb upstream of the ATG site in exon II. The promoters specific for the brain (I.f), fetal tissues (I.5), adipose (I.4), and placenta (2a and I.1) are localized in tandem order at ~33, 43, 73, 78, and 93 kb, respectively, upstream of the first coding exon, exon II. Intriguingly, placental promoter I.1 located approximately 93,000 bp upstream of the coding region is the most distally located promoter, which gave rise to splicing of a 103-bp first exon onto the common splice junction immediately (38 bp) upstream of the ATG translational start site. The activity of promoter I.1 is the basis for 100-1000 times elevated levels of circulating E in pregnant women (11, 12). Thus, recruitment of this most distal promoter may have an evolutionary impact, since, of all species, humans are unique to acquire and maintain extraordinarily high levels of aromatase expression in placenta.



View larger version (47K):
[in this window]
[in a new window]
 
Figure 1. Genomic organization of the human CYP19 gene. Diagrammatic representation depicting location of each tissue-specific promoter of the CYP19 gene. The expression pattern and characterization of 5'-upstream regulatory elements have been published elsewhere (1 ). BLAST searches of various promoters and coding region revealed alignment to distinct locations in two overlapping (an end to end overlap of 6442 bp) BAC clones of chromosome 15q21.2 region. The first clone, RP11-522G20 (178,762 bp) aligned with the entire coding region of CYP19 gene and promoters II (Ovary), I.3 (Adipose-Minor and Breast Cancer), I.6 (Bone), and I.2 (Placenta-Minor). The overlapping clone, RP11-108K3 (144,714 bp), was aligned with remaining 5'-UTRs of the CYP19 gene [i.e. I.f (Brain), I.5 (Fetal tissue), I.4 (Skin/Adipose), 2a (Placenta-Minor), and I.1 (Placenta-Major)]. The distance of each promoter with respect to the first coding exon (exon II) was also determined. The major placental promoter, I.1, is the most distally located (approximately 93 kb). Even though each tissue expresses a unique untranslated first exon 5'-UTR, by splicing into a highly promiscuous splice acceptor site (AG/GACT) of the exon II, the coding region and the translated protein product are identical in all the tissues. Inset, Diagrammatic sketch of the structure of CYP19 mRNA species found in various human tissues. The untranslated first exon (5'-UTR) of mRNA species may be viewed as a signature of the alternatively used promoter in that particular tissue.

 
Kamat et al. (13) recently demonstrated the activity of human promoter I.1 transgene in mouse placenta. Thus, although mouse placental tissue does not express aromatase endogenously, it contains the necessary transcriptional factors for human promoter I.1 expression. This is suggestive that the recruitment of aromatase expression in human placenta occurred via alterations in the mammalian genome throughout the evolution.

Elucidation of the complete structure and organization of the human CYP19 gene in its fine details will facilitate further characterization of various molecular mechanisms by which the tissue-specific and temporal expression of this gene is regulated, in normal tissues and pathological conditions such as breast cancer and endometriosis.

Additionally, this report highlights a potential use of the Human Genome Project data and powerful bioinformatics tools (especially the freely available web-based programs) to disseminate information about the structure and organization of very large and complex genes. In absence of this freely available database and other resources, this type of work not only may be technically challenging to many laboratories, but also require a significant amount of time and effort devoted to traditional molecular biology techniques such as cloning, library screening, and sequencing.

Acknowledgments

Footnotes

This work was supported in part by NIH Grants HD38691 and CA67167 (to S.E.B.).

Abbreviations: BAC, Bacterial artificial chromosome; BLAST, Basic Local Alignment Search Tool; UTR, untranslated region.

Received April 16, 2001.

Accepted May 24, 2001.

References

  1. Simpson ER, Mahendroo MS, Means GD, et al. 1994 Aromatase cytochrome P450, the enzyme responsible for estrogen biosynthesis. Endocr Rev 15:342–355[Medline]
  2. Chen SA, Besman MJ, Sparkes RS, et al. 1988 Human aromatase: cDNA cloning, Southern blot analysis, and assignment of the gene to chromosome 15. DNA 7:27–38[Medline]
  3. Smith EP, Boyd J, Frank GR, et al. 1994 Estrogen resistance caused by a mutation in the estrogen-receptor gene in a man. N Engl J Med 331:1056–1061[Abstract/Free Full Text]
  4. Morishima A, Grumbach MM, Simpson ER, Fisher C, Qin K 1995 Aromatase deficiency in male and female siblings caused by a novel mutation and the physiological role of estrogens. J Clin Endocrinol Metab 80:3689–3698[Abstract]
  5. Zeitoun K, Takayama K, Michael MD, Bulun SE 1999 Stimulation of aromatase P450 promoter (II) activity in endometriosis and its inhibition in endometrium are regulated by competitive binding of steroidogenic factor-1 and chicken ovalbumin upstream promoter transcription factor to the same cis-acting element. Mol Endocrinol 13:239–253[Abstract/Free Full Text]
  6. Agarwal VR, Bulun SE, Leitch M, Rohrich R, Simpson ER 1996 Use of alternative promoters to express the aromatase cytochrome P450 (CYP19) gene in breast adipose tissues of cancer-free and breast cancer patients. J Clin Endocrinol Metab 81:3843–3849[Abstract]
  7. Shozu M, Akasofu K, Harada T, Kubota YJ 1991 A new cause of female pseudohermaphroditism: placental aromatase deficiency. J Clin Endocrinol Metab 72:560–566[Abstract]
  8. Bulun SE 1996 Clinical review 78: aromatase deficiency in women and men: would you have predicted the phenotypes? J Clin Endocrinol Metab 81:867–871[Medline]
  9. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ 1990 Basic local alignment search tool J. Mol Biol 215:403–410[CrossRef]
  10. Shozu M, Zhao Y, Bulun SE, Simpson ER 1998 Multiple splicing events involved in regulation of human aromatase expression by a novel promoter, I.6. Endocrinology 139:1610–1617[Abstract/Free Full Text]
  11. Toda K, Yang LX, Shizuta SJ 1995 Transcriptional regulation of the human aromatase cytochrome P450 gene expression in human placental cells. Steroid Biochem Mol Biol 53:181–190[CrossRef]
  12. Ryan KJ 1959 Biological aromatization steroids. J Biol Chem 234:268–272[Free Full Text]
  13. Kamat A, Graves KH, Smith ME, Richardson JA, Mendelson CR 1999 A 500-bp region, approximately 40kb upstream of the human CYP19 (aromatase) gene mediates placenta-specific expression in transgenic mice. Proc Natl Acad Sci USA 96:4575–4580[Abstract/Free Full Text]