(Received for publication, November 9, 1995; and in revised form, December 12, 1995)
From the
We describe the cDNA sequences for two human transcription factors, Forkhead RElated ACtivator (FREAC)-1 and -2, that belong to the forkhead family of eukaryotic DNA binding proteins. FREAC-1 and -2 are encoded by distinct genes, are almost identical within their DNA binding domains and in the COOH termini, but are otherwise divergent. Cotransfections with a reporter carrying FREAC binding sites showed that both proteins are transcriptional activators, and deletions located the activation domains to the COOH-terminal side of the forkhead domains. Expression of FREAC-1 and FREAC-2 is restricted to lung and placenta. We show that the promoters of genes for lung-specific proteins such as pulmonary surfactant proteins A, B, and C (SPA, SPB, and SPC) and the Clara cell 10-kDa protein (CC10) contain potential binding sites for FREAC-1 and FREAC-2. DNaseI footprinting verified that FREAC proteins bind to the predicted sites in the CC10 and SPB promoters. While an SPB promoter construct could be transactivated by both FREAC-1 and FREAC-2, CC10 was only activated by FREAC-1. Efficient activation of CC10 by FREAC-1 is shown to be specific for a lung cell line with Clara cell characteristics (H441) and to involve a region of the FREAC-1 protein unable to activate in other cell types.
Regulated gene expression depends on the concerted action of sequence-specific DNA binding proteins. Several structural motifs have been described that interact with DNA in a sequence-dependent manner and which define families of DNA binding proteins capable of regulating the initiation of transcription. Each family is defined by the structure of its DNA binding domain, but in many cases the members share other properties as well, such as the ability to heterodimerize or to convey certain intracellular signals.
The forkhead motif is a
100-amino acid DNA binding domain that defines a family of
transcription factors found in metazoans and Saccharomyces.
X-ray crystallography of the forkhead domain from HNF3 revealed a
three-dimensional structure that is a variation on the helix-turn-helix
motif (Clark et al., 1993). The forkhead domain binds DNA as a
monomer and contains two loops (or wings) on the COOH-terminal side of
the helix-turn-helix, which has given the structure the name ``the
winged helix'' (Brennan, 1993; Clark et al., 1993; Lai et al., 1993). Binding of the forkhead proteins FREAC-3 and
FREAC-4 to their cognate sites results in bending of the DNA at an
angle of 80-90° (Pierrou et al., 1994). Selection of
binding sites from random sequence oligonucleotides has shown that a
number of forkhead proteins share the requirement for a RTAAAYA core
sequence to bind with high affinity to DNA (Overdier et al.,
1994; Pierrou et al., 1994; Kaufmann et al., 1995).
Sequences flanking the core on both sides and minor variations within
the core provide the specificity unique to each forkhead protein.
Many forkhead genes have been isolated based on their homology to
the first identified members of this family: forkhead from Drosophila (Weigel et al., 1989; Weigel and
Jäckle, 1990) and HNF3 from rat (Lai et al., 1990); little is known about their function
(Bork et al., 1992; Häcker et
al., 1992; Clevidence et al., 1993; Kaestner et
al., 1993; Pierrou et al., 1994). Developmental mutants
in Drosophila (Grossniklaus et al., 1992;
Häcker et al., 1992), Caenorhabditis
elegans (Miller et al., 1993), and zebrafish
(Strähle et al., 1993) have been shown to
be caused by mutations in genes that contain the forkhead homology, and
several lines of evidence prove the importance of this gene family for
the embryonic development of mammals as well. Targeted disruption of
the mouse genes for HNF3
(Ang and Rossant, 1994; Weinstein et
al., 1994) and BF-1 (Xuan et al., 1995) cause severe
malformation of the central nervous system. The nude mice mutant, which
causes defective development of the thymus and hair follicles, results
from deletions within the forkhead gene whn (Nehls et
al., 1994).
The oncogenic potential of forkhead proteins was first demonstrated by the isolation of qin, a retroviral oncogene from Avian sarcoma virus 31 (Li and Vogt, 1993) with homologies to the mammalian forkhead gene BF-1 (Tao and Lai, 1992). The t(2;13)(q35;q14) translocation associated with alveolar rhabdomyosarcoma fuses part of the PAX3 gene with a forkhead gene named FKHR or ALV (Galili et al., 1993; Shapiro et al., 1993). Fusion transcripts produced from the chimerical gene give rise to a protein where the activation domain of PAX3 is replaced by the COOH-terminal part of FKHR (Galili et al., 1993; Fredericks et al., 1995). A similar situation exists in the t(X;11) translocation observed in a case of acute lymphocytic leukemia where the forkhead gene AFXI is fused with the gene for the putative transcription factor HTRXI (Parry et al., 1994).
Forkhead proteins are also involved in the control of genes expressed in terminally differentiated cells. The best studied examples are the HNF3 proteins, which have been found to regulate a number of genes expressed in liver or other endodermal tissues (reviewed by Costa (1994)).
We have previously described partial cDNA and genomic clones for seven human genes encoding Forkhead RElated ACtivator (FREAC)-1 to -7 (Pierrou et al., 1994). Two of these genes, FREAC-1 and FREAC-2, are only expressed in lung and placenta. In this paper, we report the cDNA sequences for FREAC-1 and FREAC-2. We identify binding sites for the FREAC-1 and -2 proteins in the promoter regions of several lung-specific genes. Although both FREAC proteins are potent transcriptional activators and bind with high affinity to the promoter of the gene for Clara cell 10-kDa protein (CC10), only FREAC-1 activates the CC10 gene, and this activation occurs only in a lung cell line with Clara cell-like characteristics.
A clone for mouse FREAC-1 was isolated by screening a genomic mouse library with a human FREAC-1 cDNA probe. Relevant fragments were identified by Southern blotting, subcloned, and sequenced.
The FREAC-2 expression plasmid was created by
filling in an EcoRI fragment spanning nt ()1-1880 with Klenow enzyme and inserting it into the SmaI site of pEVRF0 (Matthias et al., 1989).
Plasmids expressing truncated versions of FREAC proteins were generated through deletions of parts of the FREAC-1 and -2 genes from the full-length constructs with exonuclease Bal3I or restriction enzymes.
FREAC-1 and FREAC-2 are identical in the amino-terminal DNA binding domains and in the COOH termini. To isolate full-length cDNA clones for FREAC-1 and FREAC-2, we screened cDNA libraries derived from human lung. From several overlapping clones we were able to compile a cDNA sequence for FREAC-1 of 2509 nt, excluding the poly(A) tail. On Northern blots, we have estimated the size of the FREAC-1 mRNA to 2.6 kilobases (Pierrou et al., 1994). Given an average length of the poly(A) tail of 150 nt, the FREAC-1 cDNA sequence must be very close to full-length.
The first ATG codon in the FREAC-1 cDNA sequence is located 19 nt from the 5`-end. However, this codon is situated in a poor context for initiation of translation (Kozak, 1989), having pyrimidines in positions -3 and +4 and a purine in position -1. The second ATG codon in the FREAC-1 cDNA is located 94 nt from the 5`-end. This codon is positioned in a near-optimal context for translational initiation, and a polypeptide initiated at this codon will proceed into the forkhead homology in the correct reading frame without intervening stop codons. We have therefore assigned this codon as the start of the conceptual translation of the FREAC-1 protein. The open reading frame continues for 1062 nt, which corresponds to a protein of 354 amino acids (Fig. 1A), and is followed by an A/T-rich untranslated sequence of 1354 nt. A canonical polyadenylation signal, AATAAA, is located 50 nt upstream of the poly(A) addition site.
Figure 1: FREAC-1 and FREAC-2 are identical in the DNA binding domains and in the COOH termini. A, nucleotide sequence of human FREAC-1 cDNA (GenBank accession no. U13219) and deduced amino acid sequence. The conserved forkhead motif, which mediates DNA binding, is underlined. The lower DNA strand shows the sequence of mouse FREAC-1; gaps represent regions of the mouse gene that were not sequenced, and dots indicate nucleotides missing in the mouse sequence compared to the human. Positions of the mouse FREAC-1 sequence that differ from the published HFH-8 sequence are 111, 112, 149, 765, 766, 834, 835 (insertions), and 149 (deletion). B, nucleotide sequence of FREAC-2 cDNA (GenBank accession no. U13220) and deduced amino acid sequence. The forkhead motif is underlined. C, dot matrix comparison of amino acid sequences of FREAC-1 and FREAC-2 showing the similarities within the forkhead motifs and the COOH termini.
The size of the FREAC-2 mRNA was
estimated to be 2.4 kilobases on Northern blots. Despite extensive
library screening, we were unable to isolate more than 1964 nt of FREAC-2 cDNA from overlapping clones. Allowing for
100-200 nt of poly(A) tail leaves approximately 300 nt missing
from the 5`-end of the cDNA sequence. The reading frame defined by the
forkhead homology is open from the beginning of the cDNA sequence and
contains no ATG codon before the start of the forkhead motif. Thus, the
408 amino acids of deduced FREAC-2 sequence (Fig. 1B)
does not represent the full-length protein but lacks the proper amino
terminus. The 3`-untranslated sequence of 740 nt contains a (CA) repeat in antisense orientation around nt 1580 and an AATAAA
polyadenylation signal 17 nt upstream of the poly(A) addition site.
A comparison of the amino acid sequences of FREAC-1 and FREAC-2, derived from conceptual translation of the cDNAs, suggests that the two genes have evolved from a common ancestor (Fig. 1C). Within the forkhead domain and immediately adjacent sequences, the two proteins are virtually identical; three conservative amino acid substitutions, one serine/threonine and two serine/alanine, occur within 112 residues. Since the forkhead domain is responsible for DNA binding, it seems reasonable to assume that the two proteins have identical, or near identical, DNA binding specificity. FREAC-2 extends further on the amino-terminal side of the forkhead domain than FREAC-1 and has a serine-rich stretch in this region with 15 serines out of 18 residues. Also on the carboxyl-terminal side of the forkhead domain does the FREAC-2 sequence contain several homopolymeric runs of amino acids such as serine, glycine, and histidine. The central parts of the proteins are divergent, although islands of homology indicate that the sequences have a common origin. In the carboxyl-termini the similarity is again more obvious, and the eight last amino acids of FREAC-1 and FREAC-2 are identical. Except for the homopolymeric runs of certain amino acids, which are common among transcription factors, no conserved sequence motifs were found outside the forkhead homology when the amino acid sequences were used to search the data bases.
A comparison of the FREAC-1 and -2 cDNA sequences with other known forkhead genes revealed that FREAC-1 is very similar to HFH-8 from mouse (Clevidence et al., 1994). The cDNA sequence similarity, the matching tissue distribution of expression (Clevidence et al., 1994; Pierrou et al., 1994), and the fact that FREAC-1 and HFH-8 are located at homologous chromosomal positions in man and mouse (Avraham et al., 1995; Larsson et al., 1995) suggested that HFH-8 and FREAC-1 are homologous genes. However, the predicted amino acid sequences of HFH-8 and FREAC-1 differ on both sides of the forkhead motif due to insertions or deletions in the HFH-8 cDNA sequence, compared to that of FREAC-1. This leads to frameshifts in five different positions throughout the coding sequence. To assess whether the apparent discrepancy in use of reading frame was due to a species difference or sequencing errors, we isolated a genomic clone for the mouse homologue of FREAC-1 and sequenced the relevant regions. As shown in Fig. 1A, the human and mouse FREAC-1 sequences are indeed colinear, and the aberrant amino acid sequence of HFH-8 is most likely explained by sequencing errors introducing frameshifts in five positions of the published HFH-8 cDNA sequence (Fig. 1A).
Figure 2: Human lung cell lines express FREAC-2. Northern blot is shown with RNA from three human lung cell lines hybridized with a probe specific for FREAC-2. No expression could be detected when a FREAC-1 probe was hybridized to an identical blot. kb, kilobases.
Figure 4:
FREAC-1 and FREAC-2 have COOH-terminal
activation domains. A, luciferase reporter constructs (top) and proteins encoded by the different FREAC expression plasmids (bottom). B, relative
luciferase activity produced by COS-7 cells transfected with 4
FREAC-luc and various FREAC expression plasmids. C,
gel shift assay with a FREAC probe and extracts from COS-7 cells
transfected with different FREAC expression plasmids or empty
pEVRF expression vector. The shifted complexes present in the
``vector'' lane represent endogenous COS-7 cell proteins
capable of binding to the probe. In the FREAC-2, FREAC-2(1-242),
FREAC-1, and FREAC-1(1-326) lanes, faster migrating bands are
present in addition to the full-length complexes, which result from
protease cleavage at hypersensitive sites immediately COOH-terminal of
the forkhead domains of both FREAC-1 and
FREAC-2.
When the luciferase activity
produced by the 4 FREAC-luc reporter was compared to that of
the parental apoB-luc, we found that the presence of four FREAC binding
sites enhanced promoter activity, even without cotransfection with FREAC expression plasmids (data not shown). The activation
varied between cell lines and indicates that endogenous transcriptional
activators capable of binding to the FREAC sites are present in a
variety of cell types where no expression of FREAC-1 or FREAC-2 can be detected. This is not surprising since our
results with binding site selection (Pierrou et al., 1994)
show that forkhead proteins are closely related with regard to sequence
specificity and that the differences often are quantitative rather than
qualitative. Furthermore, the large size of the forkhead gene family
and the wide tissue distribution of its expression suggest that there
may be forkhead proteins present in virtually every cell type.
Fig. 4B illustrates the effect on luciferase
activity from 4 FREAC-luc of cotransfection with plasmids that
express FREAC-1 and FREAC-2 in COS-7 cells. FREAC-1
and FREAC-2 both activate 4
FREAC-luc 8-10-fold. When
FREAC-1(1-117), which lacks the 237 COOH-terminal amino acids of
FREAC-1, replaced the full-length construct, a repression to less than
one-tenth was observed instead of activation. A similar result was
obtained for FREAC-2(1-242), with 166 amino acids missing from
the COOH terminus. These results show that activation domains
COOH-terminal of (and distinct from) the forkhead domains are necessary
for transcriptional activation by both FREAC-1 and FREAC-2. When the
truncated FREAC proteins bind the sites in the reporter, endogenous
proteins are outcompeted and luciferase activity is brought back to
approximately the same level as that of apoB-luc. Hence, the repression
serves to verify that the loss of activation is not a consequence of
destabilized proteins or obstructed DNA binding and supports the idea
that the deletions remove true activation domains. The true level of
activation, as judged from the ratio between the activity produced by
the full-length protein and the truncated, is around 100-fold.
When we searched a data base of regulatory regions from mammalian genes for matches to the FREAC core sequence, a number of occurrences were found in genes specifically expressed in lung. Examples include the genes for pulmonary surfactant proteins A, B, and C (SPA, SPB, and SPC) and for the Clara cell 10-kDa protein (CC10). Genes for which the promoter regions have been sequenced from more than one mammalian species were examined to check whether the identified sequences have been conserved during evolution. In several cases this turned out to be the case, e.g. the sequence at position -117 of the human SPC promoter is conserved in mouse and rat, and the two sites in the CC10 promoter are found in approximately the same positions in the human, rat, mouse, and rabbit promoter sequences. A summary of putative binding sites from four genes is shown in Fig. 3C.
Figure 3: FREAC proteins bind to the promoter regions of lung-specific genes. A, DNaseI footprinting of the rat CC10 and human SPB promoters. CC10 promoter fragments with either wild type (wt) sequence or mutated in the 5`-, 3`-, or both (dbl) sites were footprinted in the absence(-) or presence of increasing amounts of FREAC-2/glutathione S-transferase (GST). The SPB promoter was footprinted with an amount of FREAC-2/GST that corresponds to the highest amount used with CC10 (approximately 10 ng). B, sequence of the two FREAC binding sites in the rat CC10 promoter. Arrows indicate the FREAC core motifs, and brackets indicate the sequences protected from DNaseI digestion. The three nucleotides in each site that were targeted with in vitro mutagenesis are marked in bold, and the actual sequences of the mutants are shown below. C, summary of putative FREAC binding sites in the promoter regions of four lung-specific genes from four mammalian species. Numbers indicate the position relative to the transcriptional start site, and rev indicates that the sequence from the antisense strand is shown.
In the human SPB and rat CC10 promoters, the predicted binding sites are located in regions to which regulatory function has been assigned based on transfections with reporter constructs and to which nuclear proteins from lung cells have been shown to bind (Stripp et al., 1992; Bohinski et al., 1993; Bohinski et al., 1994). Therefore, we chose to investigate the effect of FREAC expression on the activity of these two promoters. Fragments from the rat CC10 and human SPB promoters that had proven to be active in transient transfections were isolated by PCR. DNaseI footprinting was used to test if the predicted sites would bind FREAC-2. As shown in Fig. 3A, specific binding of FREAC-2/GST was observed for two closely positioned sites in the CC10 promoter and for one site in the SPB promoter.
To investigate if the observed binding of FREAC proteins to the promoter regions of the SPB and CC10 genes influenced transcription, we transfected luciferase reporter constructs, driven by these promoters (Fig. 4A), together with plasmids expressing FREAC-1 and FREAC-2.
Figure 5: The CC10 promoter is activated by FREAC-1-but not FREAC-2, specifically in H441 cells. H441 and HC11 cells were transfected with 300 ng of CC10-luc and increasing amounts of FREAC expression plasmids. The amount of cotransfected plasmid was held constant at 300 ng by the addition of pEVRF1 expression cloning vector. For a schematic view of the reporter and expression constructs, see Fig. 4A.
In the lung cell line H441 is CC10-luc
approximately 40-fold more active than the promoterless luciferase
plasmid pGL2-Basic (data not shown). Cotransfection with a plasmid
expressing full-length FREAC-1 activated the CC10 promoter in this construct up to 20-fold above the basal level. FREAC-1(1-326), which encodes a protein with 28 amino
acids deleted from the COOH terminus and which repressed
4FREAC-luc in COS-7, activated CC10-luc at least as efficiently
as the full length protein (25-fold), and was much more effective when
limiting concentrations of plasmid was used. No activation was observed
when FREAC-1(1-117) was used.
Although FREAC-2
appears to activate as potently as FREAC-1, judged from transfections
in COS-7 with 4 FREAC-luc, FREAC-2 failed to activate CC10-luc
in H441 cells. Neither did the truncated version of either protein,
FREAC-1(1-117) or FREAC-2(1-242), repress the basal
activity of CC10-luc in these cells.
When the same set of constructs was transfected into an epithelial cell line derived from a tissue where the CC10 gene is not normally transcribed, the murine mammary gland cell line HC11 (Ball et al., 1988), an entirely different result was obtained. CC10-luc produced a low, basal activity in these cells. This activity was extinguished by cotransfection with plasmids expressing truncated, non-activating versions of either FREAC-1 or FREAC-2. The degree of repression depended on the amount of FREAC plasmid transfected and was equally efficient for FREAC-1(1-117) and FREAC-2(1-242). This result suggests that the truncated FREAC proteins repress transcription through competition with endogenous proteins for the same binding sites. It also shows that, in this repression of the CC10 promoter, FREAC-2 is as efficient as FREAC-1. Thus, the selective activation of the CC10 promoter by FREAC-1 in H441 cells is unlikely to reflect a difference in DNA binding between FREAC-1 and FREAC-2. Rather, it implies that other factors present in H441 cells synergize with FREAC-1 but not FREAC-2.
FREAC-1 and FREAC-2 only activated the CC10 promoter 1.8-2.5-fold in HC11 cells. This modest activation was seen using low levels of FREAC expression plasmids, and at higher levels activity again declined, possibly due to squelching. In contrast to what we observed in H441 cells, FREAC-2 was here a slightly better activator than FREAC-1. Finally, FREAC-1(1-326), which in H441 cells activated as well as, or better than, full-length FREAC-1, did not activate in HC11 cells. Instead, it repressed the CC10 promoter with approximately the same efficiency as FREAC-1(1-117) or FREAC-2(1-242).
Taken together, these results suggest that entirely different mechanisms are behind the efficient activation of the CC10 promoter by FREAC-1 seen in H441 cells and the limited activation by both FREAC-1 and FREAC-2 in HC11 cells. In H441, the CC10 promoter appears to be in a context that makes it susceptible to activation by FREAC-1 but not FREAC-2, an activation which is independent of the last 28 amino acids but requires a region between amino acids 118 and 326 in the FREAC-1 sequence. In HC11 cells, a weak activation is produced by both FREAC-1 and FREAC-2, and in the case of FREAC-1 this activation depends on the integrity of the last 28 amino acids.
To verify that all the expression constructs produced proteins that were correctly folded and able to bind DNA, we prepared extracts from transfected COS-7 cells and analyzed these for the presence of FREAC proteins with a gel shift assay. As seen in Fig. 4C, all the truncated proteins as well as the full-length proteins are expressed and bind to a FREAC site oligonucleotide in this assay. Despite the fact that it is the best activator of the CC10 promoter in H441 cells, FREAC-1(1-326) was consistently present in lower amounts than the other proteins in extracts from transfected cells.
The ability of the CC10 promoter to be activated by FREAC-1 was, however, reduced by the mutations. Both single mutations were about equally effective in reducing the responsiveness to FREAC-1, and the double mutant showed the lowest level of induction. This result implies that FREAC-1 is able to activate the CC10 promoter from a single site and that no synergy exists between the two sites.
Figure 6: The SPB promoter is activated by both FREAC-1 and FREAC-2. H441 and HC11 cells were transfected with 300 ng of SPB-luc and 300 ng of the indicated FREAC expression plasmids. The deletion mutants FREAC-1(1-326), FREAC-1(1-117), and FREAC-2(1-242) were not tested in HC11 cells. For a schematic view of the reporter and expression constructs, see Fig. 4A.
We have cloned the cDNA:s for two novel transcription factors, FREAC-1 and FREAC-2, which belong to the forkhead family. Previous work demonstrated expression of FREAC-1 and FREAC-2 only in lung and placenta. The restricted expression pattern suggested that FREAC-1 and FREAC-2 could be involved in regulation of lung-specific genes. In this paper, we show that a number of genes specifically expressed in the lung epithelium contain potential binding sites for FREAC proteins. For two of these genes, the Clara cell 10-kDa protein gene and the surfactant protein B gene, we verify that the identified sites are targets for FREAC proteins.
Recently, the cDNA sequence of the mouse homologue of FREAC-1, HFH-8, was published (Clevidence et al., 1994). The nucleotide sequence of HFH-8 is very similar to that of FREAC-1: 90% homology in the coding region with differences fairly evenly distributed. In five locations, however, deletions or insertions of one or two nucleotides change the reading frame of the HFH-8 cDNA sequence compared to that of FREAC-1. As a consequence, the predicted amino acid sequence of HFH-8 differs significantly from that of FREAC-1. Sequencing of a genomic clone for mouse FREAC-1 showed that each one of the five frameshifts results from an error in the HFH-8 sequence and that FREAC-1 from mouse and man are nearly identical throughout the coding sequence.
The similarity between sequences of FREAC-1 and FREAC-2, at DNA as well as protein level, indicate a close
evolutionary relationship. On the other hand, the lack of homology in
the amino-terminal and central parts of the proteins shows that there
has been ample time for the two genes to diverge. In other words,
strong selective pressures must be behind the almost perfect
conservation of the DNA binding domains and the extensive similarities
in the COOH-terminal part. That the duplication of a presumed ancestral
gene was not a recent event is supported by the fact that FREAC-1 and FREAC-2 are located on different chromosomes (Larsson et al., 1995). ()
The sequence homology in conjunction with the similarity in tissue distribution of expression suggested that FREAC-1 and FREAC-2 may be functionally redundant. However, the qualitative difference in their ability to activate the CC10 promoter shows that although both proteins are transcriptional activators with similar or identical DNA binding specificity, they are functionally distinct. It also stresses the importance of interactions other than DNA binding for the specificity of FREAC proteins. The FREAC-2 clone that we have used in the transfection experiments does not encode the full-length protein; some amino acids are missing from the amino terminus. Thus, we cannot exclude the possibility that a full-length FREAC-2 protein would exhibit other characteristics. However, this does not change the fact that the powerful COOH-terminal activation domains present in both FREAC-1 and FREAC-2 exhibit differential activation properties. Whereas both proteins potently activate a reporter construct in a heterologous cell type (COS-7), only FREAC-1 is capable of activating the CC10 promoter in a lung cell line (H441). The behavior of the FREAC-1(1-326) deletion mutant also shows that in these two contexts, the mechanisms of activation by FREAC-1 are distinct.
A comparison of the activities produced by the different deletion mutants of FREAC-1 shows that the region necessary for the cell-specific activation of CC10 is located between amino acids 118 and 325. This coincides with the part where FREAC-1 and FREAC-2 are most divergent. It appears that FREAC-1 and FREAC-2 have evolved to perform different biological tasks while retaining the specificity for the same DNA sites and the same organ-specific expression.
The binding sites for FREAC proteins in the CC10 promoter have been shown to bind proteins present in nuclear
extracts from lung (Stripp et al., 1992). HNF3 and
HNF3
, which are expressed in many endodermal tissues including
lung (Clevidence et al., 1994; Bingle et al., 1995),
have been reported to be able to bind to these sites (Bingle and
Gitlin, 1993; Sawaya et al., 1993). However, cotransfections
with HNF3
and HNF3
showed no (Sawaya and Luse, 1994) or low
(Bingle and Gitlin, 1993; Bingle et al., 1995) transactivation
of the CC10 promoter.
The FREAC site in the SPB promoter has also been shown to bind HNF3 and HNF3
(Bohinski et al., 1994) and, at least in a non-lung cell line
(HepG2), does the binding of HNF3
, as well as HFH-8, mediate
transcriptional activation (Clevidence et al., 1994). This
picture agrees well with our observation that truncated FREAC proteins
repress the SPB promoter but not the CC10 promoter in
H441 cells. Thus, SPB and CC10 appear to differ not
in their ability to bind the different factors but in the way they
respond.
It is clear that a number of forkhead proteins are capable
of binding to the FREAC sites in the CC10 promoter in
vitro and in vivo. In addition to FREAC-1, FREAC-2, HNF3, and HNF3
, two other
forkhead genes, HFH-1 and HFH-4, are also expressed
in the lung (Clevidence et al., 1993, 1994; Hackett et
al., 1995). A more detailed analysis of temporal and spatial
expression patterns will be required to understand each protein's
role in regulating pulmonary genes. The example of FREAC-1 and FREAC-2
and their different effects on the CC10 promoter emphasizes
the importance of context in transcription factor function. It also
illustrates how interactions mediated by parts of the proteins distinct
from the DNA binding domain can provide specificity. This gives us a
clue to how the members of this large transcription factor family may
exert their distinctive functions and how cross-talk could be avoided
between proteins with overlapping DNA binding specificity.