tbCPSF30 Depletion by RNA Interference Disrupts Polycistronic RNA Processing in Trypanosoma brucei*

Edward F. Hendriks, Ammar Abdul-Razak and Keith R. Matthews {ddagger}

From the School of Biological Sciences, Division of Biochemistry, 2.205 Stopford Building, University of Manchester, Oxford Road, Manchester M13 9PT, United Kingdom

Received for publication, March 7, 2003 , and in revised form, April 15, 2003.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Gene expression in eukaryotes requires the post-transcriptional cleavage of mRNA precursors into mature mRNAs. In Trypanosoma brucei, mRNA processing is of particular importance, since most transcripts are derived from polycistronic transcription units. This organization dictates that regulated gene expression is promoter-independent and governed at the posttranscriptional level. We have identified tbCPSF30, a protein containing five CCCH zinc finger motifs, which is a homologue of the cleavage and polyadenylation specificity factor (CPSF) 30-kDa subunit, a component of the machinery required for 3'-end formation in yeast and mammals. Using gene silencing of tbCPSF30 by RNA interference, we demonstrate that this gene is essential in bloodstream and procyclic forms of T. brucei. Interestingly, tbCPSF30-specific RNA interference results in the accumulation of an aberrant tbCPSF30 mRNA species concomitant with depletion of tbCPSF30 protein. tbCPSF30 protein depletion is accompanied by the accumulation of unprocessed tubulin RNAs, implicating tbCPSF30 in polycistronic RNA processing. By genome data base mining, we also identify several other putative components of the T. brucei cleavage and polyadenylation machinery, indicating their conservation throughout eukaryotic evolution. This study is the first to identify and characterize a core component of the T. brucei CPSF and show its involvement in polycistronic RNA processing.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The African trypanosome (Trypanosoma brucei) is the causative agent of sleeping sickness in humans and nagana in cattle. Trypanosomes are evolutionarily ancient organisms in which numerous unusual aspects of eukaryotic biology have been discovered, for example the possession of glycosomes, extensive antigenic variation mediated by genome recombination, and the RNA editing of mitochondrial transcripts. Also, the trypanosomatid family of protozoan parasites does not use the normal eukaryotic arrangement for gene expression (for a review, see Ref. 1). Instead, genes are organized into polycistronic transcription units, whereby many genes may be transcribed by an upstream promoter. Precursor RNAs are then processed into mRNA by the addition of a 39-nucleotide capped RNA (spliced leader) through a trans-splicing event (2, 3) and by cleavage and polyadenylation. A consequence of this arrangement is that the regulation of genes within such transcription units is not governed by promoter activity but by mRNA processing and stability. These organisms, therefore, represent a regulatory extreme in which the genome is almost exclusively controlled post-transcriptionally. In consequence, T. brucei is an interesting model for the study of the regulation of gene expression at the RNA level.

Analyses of RNA processing in trypansomatids have largely focused on trans-splicing. Trans-splicing has been shown to be mechanistically similar to cis-splicing of introns in yeast and higher eukaryotes, and the basic components of both processes are conserved (for a review, see Ref. 4). In contrast, little is known about the process of trypanosome mRNA 3'-end formation, and no recognizable conserved motifs, such as the AAUAAA sequence of higher eukaryotes, are present upstream of trypanosome polyadenylation sites. Evidence also suggests that maturation of T. brucei polycistronic pre-mRNAs involves a temporal and mechanistic relationship between trans-splicing and 3'-end formation. It has been shown that the pyrimidine-rich sequences immediately upstream of the {alpha}-tubulin trans-splice site are a major determinant for {beta}-tubulin mRNA 3'-end formation (5). Therefore, the sequence signals important for trans-splicing of the downstream gene are believed to contribute to 3'-end formation of the upstream gene in a polycistron. This is consistent with the possibility that the polypyrimidine tract is recognized by both the trans-splicing and polyadenylation machineries, either sequentially or simultaneously (6). The nematode Caenorhabditis elegans also undergoes trans-splicing during mRNA maturation, and, as in trypanosomes, this organism is believed to couple trans-splicing and polyadenylation to process precursor RNAs. Although several studies have concentrated on the identification of the components of the trans-splicing machinery in both T. brucei and C. elegans (7, 8), there have been no studies in either of these organisms concerning the cleavage and polyadenylation machinery to date. Thus, the mechanistic links between the trans-splicing machinery and 3'-end formation in polycistronic RNA processing have only been the result of investigating polyadenylation as a consequence of perturbations in the signals/machinery for trans-splicing (5, 6, 9).

A major goal of investigating the mechanisms of pre-mRNA 3'-end processing is the identification and functional characterization of the different trans-acting factors involved in the reaction. In a previous study, we identified a family of proteins in T. brucei, possessing a CCCH zinc finger motif (10), characteristic of RNA-binding proteins. In this study, we have used genome data base mining to identify a protein containing five CCCH fingers and show it to be the cleavage and polyadenylation specificity factor 30-kDa subunit of T. brucei (tbCPSF30). Here, we specifically perturb tbCPSF30 by RNA interference and show that tbCPSF30 is essential for T. brucei survival in both the mammalian bloodstream and insect procyclic forms of the parasite. Interestingly, unprocessed tubulin transcripts accumulate when tbCPSF30 is depleted by RNA interference (RNAi),1 as has been seen previously when trans-splicing is disrupted. This implies that the integrity of the polyadenylation machinery as well as the trans-splicing machinery is required for polycistronic RNA processing. The significant sequence conservation of yeast, worm, fly, zebrafish, mammalian, and now T. brucei CPSF30 homologues, suggests that all of these are members of a family of proteins conserved from the earliest eukaryotes to vertebrates. Moreover, identification of other core component homologues of the yeast and mammalian CPSF in the T. brucei CPSF complex, by data base mining, suggests a basic conservation of the CPSF complex among eukaryotes with diverse mechanisms for regulating gene expression.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Trypanosome Cultures and Transfection—Monomorphic bloodstream form T. brucei rhodesiense EATRO 2340 were cultured in HMI-9 at 37 °C, 5% CO2. Procyclic form T. brucei 427 cells were cultured in SDM-79 at 27 °C. For RNAi analysis, cultured T. brucei 427 "single marker T7 RNAP/TETR" bloodstream form cells (11) (SMB) were used. These cells have been engineered to express the T7 polymerase and a tetracycline repressor. RNAi analysis in procyclic forms was carried out in T. brucei 427 29-13 cells, which have also been engineered to express the T7 polymerase and the tetracycline repressor protein (11).

Transfection of both bloodstream and procyclic form parasites was carried out as described in Ref. 10 using 10 µg of NotI (pHD451) or EcoRV (p2T7i) linearized DNA electroporated into 500 µl of cells at 4 x 107 cells ml1. Transfected cells were recovered overnight in 10 ml of HMI-9 at 37 °C, 5% CO2 (bloodstream forms), or SDM-79 medium at 27 °C (procyclic forms) before being diluted to 1 x 105 cells ml1 and subjected to drug selection in 24-well plates. Drug concentrations used for selection were hygromycin (2 µg ml1 for bloodstream forms, 20–50 µg ml1 for procyclic forms), G418 (2.5 µg ml1 for bloodstream forms, 15 µg ml1 for procyclic forms), or phleomycin (0.5–2.5 µg ml1 for bloodstream forms, 5 µg ml1 for procyclic forms). Selected cells were cloned by limiting dilution under drug selection.

Transgenic Plasmid Constructs—The trypanosome expression vector pHD451 (12) was modified to contain the TY1 epitope tag fused to the C terminus of genes cloned into the HindIII and BamHI sites. This vector was designated pHD451-TY1. The tbCPSF30 coding region was amplified by PCR using primers CPSF30.F and CPSF30.R (CPSF30.F is CC AAG CTT ATG TTT ACT GAC AAC GCT GCC C; CPSF30.R is GGG ATC CCT GCC TTC CCG TTG CAT CAC) and cloned into HindIII/BamHI-digested pHD451-TY1. This plasmid was then used to direct tbCPSF30 transgene expression in either bloodstream or procyclic form parasites that had been previously engineered to express the tetracycline repressor by stable transfection with the expression construct pHD449 (12).

The tbCPSF30 RNAi construct was made using the vector p2T7i (13), which allows for tetracycline-inducible expression of double-stranded RNA from a T7 promoter in T. brucei 427 SMB or 427 29-13 procyclic cells. The tbCPSF30 coding region was amplified using PCR primers CPSF30.F and CPSF30.R and cloned into the HindIII and BamHI sites of p2T7i. Further details of all constructs used in this paper are available upon request.

Total RNA Isolation, cDNA Synthesis, and Northern Analysis— Trypanosome RNA was prepared using a Qiagen RNA-easy kit (by the manufacturer's method), and Northern blotting was performed as described previously (14). Blots were hybridized at 60 °C with riboprobes labeled with digoxygenin (Roche Applied Science), and stringency washes were at 60 °C, 0.1x SSC. Blot detection was by chemiluminescence using CDP-star as a reaction substrate. The {alpha}/{beta} tubulin intergenic region riboprobe was generated from a construct previously engineered in pBluescript II (5).

Anti-peptide Antibody Production, Protein Isolation, and Western Blotting—An anti-peptide antibody to T. brucei CPSF30 was generated using two peptide antigens comprising the sequences LIERADDPSFNKNATC and QQWGGHRRGDATGRQ (see Fig. 1A). Both were coupled to keyhole limpet hemocyanin and used to immunize rabbits (Eurogentec, Belgium), the resulting serum being affinity-purified against each of the peptide antigens. Preimmune serum and immune serum were screened against whole cell trypanosome proteins. Preimmune serum did not show any reaction with T. brucei proteins.



View larger version (46K):
[in this window]
[in a new window]
 
FIG. 1.
A T. brucei sequence homologue of CPSF30. A, a ClustalW (42) alignment of the tbCPSF30 amino acid sequence from trypanosomes, cow (43), zebrafish (44), fly (19), and yeast (43). Asterisks, colons, and dots label residues that are identical, conserved, or semiconserved across all compared. Peptides used to make the anti-peptide antibody to tbCPSF30 are italicized (see "Experimental Procedures"). B, the search CCCH motif (PROSITE (45)) from tbZFP2 (10) used to identify tbCPSF30 (five CCCH motifs) and the signatures of each CCCH domain in tbCPSF30 are shown. C, a schematic representation of the domain structure of CPSF30 homologues from cow, zebrafish, fly, yeast, and T. brucei. Black boxes, CCCH motifs; gray boxes, CCHC zinc knuckles.

 

Proteins were prepared as described in Ref. 15 and resolved on 11% polyacrylamide gels. Blotted proteins were incubated with either the anti-TY1 epitope tag monoclonal antibody (BB2; diluted 1:20) or with the affinity-purified rabbit anti-tbCPSF30 antibody (diluted 1:20). Western blots were processed using horseradish peroxidase-conjugated secondary antibody and visualized by chemiluminescence (Amersham Biosciences).

Cell Growth, Microscopy, and Computational Analysis—tbCPSF30 RNAi was induced by the addition of tetracycline (1 µg ml1), and growth was monitored over a period of 2 weeks. At intervals, ~2 x 106 cells were harvested, spread onto microscope slides, and then air-dried for 10 min. The cells were then fixed in methanol at –20 °C for at least 20 min. The cells were rehydrated in PBS, and then slides were incubated with 4,6-diamidine-2, phenylindole for 5 min. Finally, the cells were washed in PBS three times and mounted in MOWIOL (Harlow Chemical Co., Kent, UK) containing phenylene diamine (1 mg ml1). Slides were examined on a Zeiss Axioscop 2 microscope, and images were captured using Scion Image version 1.62. Figures were processed using Adobe Photoshop 6.0. The tbCPSF30 locus was mapped using the T. brucei data bases at the Sanger Institute (available on the World Wide Web at www.sanger.ac.uk/Projects/T_brucei/) and at the Institute for Genomic Research (available on the World Wide Web at www.tigr.org/tdb/mdb/tdbd/index.shtml). These same databases were screened, using Blast analysis (16), against both the yeast and mammalian proteins known to be involved in 3'-end formation.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
tbCPSF30 —As part of our interest in the family of proteins containing the RNA-binding CCCH zinc finger, we searched the trypanosome genome data base for genes encoding proteins containing this motif. As a search motif, we used the CCCH zinc finger sequence from tbZFP2, a molecule implicated in T. brucei cellular differentiation (10). This identified a sequence predicted to encode part of a protein containing five CCCH zinc finger motifs as well as two zinc knuckle motifs conforming to the CX2CX4HX4C structure (Fig. 1A). tbZFP2 has a CX8CX5CX3H zinc finger, typical of the family, whose best studied member is Tristetraprolin (TTP, also known as TIS11 and NUP475) (17, 18). In contrast, the five CCCH motifs in the newly identified gene had a degenerate spacing between the first two coordinating cysteine residues in the sequence (CX8CX4CX3H and CX7CX5CX3H; see Fig. 1B). Consequently, the gene, although a member of the tbZFP family of proteins as defined by the presence of the CCCH motif (Fig. 1B), represents a distinct subclass of this family.

The complete gene sequence was obtained by a combination of clone walking (within the data base) and PCR between the intervening sequence of contiguous clones. The resulting predicted sequence was then verified by sequencing the entire gene isolated from genomic DNA by PCR amplification. tBlastx analysis of the complete coding region demonstrated strong homology to the protein sequence for the cleavage and polyadenylation specificity factor 30-kDa subunit (CPSF30) from several organisms (34% identity and 65% similarity to yeast, 35% identity and 58% similarity to cow) (Fig. 1A). In addition to the conservation of the overall zinc finger motif structure (Fig. 1C), there were several further highly conserved regions of identity within the molecule and members of the CPSF30 family. On the basis of its strong homology to CPSF30 from several organisms, the gene was designated as the T. brucei cleavage and polyadenylation specificity factor 30-kDa subunit, tbCPSF30.

tbCPSF30 Is Expressed in Bloodstream and Procyclic Life Cycle Stages—In Drosophila, the CPSF30 homologue (Clipper, or CLP) is not ubiquitously expressed; it is absent during embryonic development (19). Since tbCPSF30 and CLP shared the greatest structural similarity (see Fig. 1C), the expression pattern of tbCPSF30 was assessed during the trypanosome developmental cycle. Total RNA was prepared from bloodstream and procyclic form parasites and hybridized with a riboprobe made to the coding region of tbCPSF30. This analysis revealed that tbCPSF30 was expressed at both these life cycle stages in similar abundance (Fig. 2A). The message was ~1100 nucleotides, in agreement with the predicted size of the coding region (831 bp) plus ~200–300 nucleotides of 5'- and 3'-untranslated regions. Southern analysis showed that tbCPSF30 was a single copy gene (data not shown).



View larger version (59K):
[in this window]
[in a new window]
 
FIG. 2.
tbCPSF30 is constitutively expressed in T. brucei. A, Northern analysis of tbCPSF30 expression in monomorphic bloodstream (Bs) and procyclic forms (Pc). The message is expressed at the same level in each life cycle stage and is ~1100 nucleotides in size. Relative RNA loadings are indicated by the panel showing ethidium bromide-stained RNA samples (EtBr). B, Western analysis of tbCPSF30 expression in monomorphic bloodstream SMB and procyclic 29-13 forms, using the affinity-purified anti-peptide antibody to tbCPSF30 ({alpha}tbCPSF30). Two bands were always detected with this antibody; the reason for this is not known. The protein is ~32–34 kDa in size and is equally expressed in bloodstream and procyclic forms. Preimmune tbCPSF30 serum did not detect any proteins (data not shown). Equal cell numbers were used to prepare each sample, and equivalent loading was confirmed by analysis of the Ponceau S-stained blot (not shown).

 

In the case of the Drosophila CLP, the protein and RNA expression pattern do not coincide due to developmental regulation at the post-transcriptional level (20). To verify that the expression pattern observed for tbCPSF30 at the RNA level also reflected protein expression, an antibody was generated to two peptide antigens (LIERADDPSFNKNATC and QQWGGHRRGDATGRQ) derived from the C terminus of tbCPSF30, away from the conserved five CCCH zinc fingers and two zinc knuckle motifs. Protein samples were prepared from both bloodstream and procyclic forms and subjected to Western analysis with the specific tbCPSF30 anti-peptide antibody. The purified tbCPSF30 antibody detected two bands of ~32–34 kDa (Fig. 2B). This double band was always observed in several different parental or wild type cell lines, although the relative proportion of each did vary between samples. We believe that both bands are tbCPSF30 and that they represent different forms of the protein (i.e. they may represent a processed or post-transcriptionally modified form of the protein). This has not been further examined; however, it is interesting to note that in vitro purification of mammalian polyadenylation complexes also indicated the presence of two forms of CPSF30 (28 and 30 kDa) (21). Moreover, both bands disappear upon tbCPSF30-specific transcript ablation (see below). Fig. 2B confirms that tbCPSF30 is expressed equally in both bloodstream and procyclic life cycle stages, contrasting with the developmental regulation demonstrated by Drosophila CLP.

Gene Silencing of tbCPSF30 Causes Cell Death—The polyadenylation machinery comprises a complex of proteins of precise stoichiometry. We therefore assessed the functional consequences of ectopic overexpression and RNAi-mediated depletion of tbCPSF30. Initially, we used the trypanosome expression vector pHD451 (12) to generate bloodstream and procyclic form lines that overexpressed a TY-1 epitope-tagged copy of tbCPSF30. Transgenic tbCPSF30-TY could be clearly detected in both life cycle stages (Fig. 3A), being overexpressed 2-fold in bloodstream forms and 3–5-fold in the procyclic form with respect to the endogenous protein (data not shown). These cell lines were morphologically identical to the parental lines and grew at an identical rate (Fig. 3B). Thus, no adverse effects result from tbCPSF30 overexpression in either life cycle stage.



View larger version (43K):
[in this window]
[in a new window]
 
FIG. 3.
Ectopic overexpression of tbCPSF30 does not generate detectable phenotype. A, Western analysis of bloodstream and procyclic parental forms (Bs449 and Pc449) expressing ectopic tbCPSF30, probed with the antibody BB2 specific for the TY1 epitope tag (top panel). +, induction with tetracycline. The bottom panel shows the same protein samples probed with an antibody to trypanosome {alpha}-tubulin (TAT) to indicate loading. B, growth analysis of tbCPSF30 bloodstream (Bs) or procyclic (Pc) trypanosomes expressing ectopic tbCPSF30 under the control of a tetracycline-inducible promoter grown in the presence (+) and absence (–) of tetracycline. The induced parental strains Bs449 and Pc449 (expressing the tetracycline repressor alone) are also shown. Cells were monitored for 8 days, being maintained at between 2 x 105 and 3–4 x 106 by passage. All cultures were grown in the presence of selective drugs.

 

Further functional analysis of the gene product was carried out using gene silencing by RNAi (22). The tbCPSF30 coding region was cloned into the vector p2T7i (13) and transfected into bloodstream SMB cells and procyclic 29-13 cells, which express the tetracycline repressor and T7 RNA polymerase, allowing tetracycline-inducible transcript ablation by RNAi. Transfected cells were induced with 1 µgml1 tetracycline, and cell growth was monitored for a period of 12–14 days for both bloodstream and procyclic tbCPSF30 RNAi cell lines as well as for untransfected controls (Fig. 4, A and B). Cell morphology was also monitored during this period (Fig. 4, C and D). Upon induction with tetracycline, there was a striking growth phenotype in both bloodstream and procyclic tbCPSF30 RNAi cell lines but not in the control populations. Specifically, cells were able to proliferate for only 2–3 days, after which many dead cells were detectable in the induced bloodstream and procyclic populations. By 6 days postinduction, tbCPSF30 RNAi procyclic cell lines were dead or dying as shown in the growth profile (Fig. 4B) and the culture image (Fig. 4D). Similarly, the bloodstream population stopped proliferating by days 3–5, although some surviving cells were always detectable among a background of lysed cells (middle right panel in Fig. 4C). Interestingly, by day 6–8, the bloodstream cells had resumed growth, and at day 12–14, their proliferation was indistinguishable from uninduced lines or the parental SMB line (Fig. 4, A and C). We predicted that the outgrowth of these cells represented selection for those unable to undergo tbCPSF30 RNAi.



View larger version (67K):
[in this window]
[in a new window]
 
FIG. 4.
tbCPSF30 is required for viability in bloodstream and procyclic form T. brucei. A and B show the growth of bloodstream and procyclic form tbCPSF30 RNAi cells compared with the control bloodstream SMB and procyclic 29–13 cell lines. In each case, cells were grown either in the presence (+) or absence (–) of tetracycline. Cells were monitored for 2 weeks, being maintained at between 2 x 105 and 3–4 x 106 by passage. All cultures were grown in the presence of selective drugs. C, phase-contrast images of SMB bloodstream cells (left panels) and tbCPSF30 RNAi bloodstream cells (right panels) at 0, 4, and 12 days after the induction of RNAi with tetracycline. D, 29-13 procyclic forms (left panels) and tbCPSF30 RNAi procyclic forms (right panels) at 0, 6, and 14 days after induction of RNAi. The cell images in C and D are derived from the same populations shown in Fig. 4, A and B. Since the procyclic tbCPSF30 RNAi cells were no longer growing by day 14, this image shows cells from the total remaining culture volume.

 

To assess the efficacy of RNAi in each population, we assayed the level of tbCPSF30 RNA at time points after induction with tetracycline. It was expected that the RNA levels for tbCPSF30 would show rapid ablation upon induction with tetracycline in bloodstream and procyclic forms. Surprisingly, however, Northern analysis showed that in both induced bloodstream and procyclic cell lines, an RNA of aberrant size was detected with the tbCPSF30 strand-specific riboprobe (Fig. 5). This RNA was slightly larger than the tbCPSF30 mRNA detected in uninduced or parental cell lines, and it was also increased in abundance when compared with normal tbCPSF30 mRNA levels (Fig. 5, arrows). This unexpected result was reproducible in several independently derived bloodstream and procyclic form cell lines. Significantly, the altered tbCPSF30 RNA was observed only in induced RNAi cell lines and disappeared upon outgrowth of the RNAi-resistant bloodstream forms when the correct sized tbCPSF30 transcript reappeared (Fig. 5, day 12 samples). We excluded the possibility that this novel transcript represented double-stranded RNA transcribed from the tbCPSF30 RNAi vector by hybridization with an antisense-specific riboprobe. This revealed that no detectable levels of tbCPSF30 double-stranded RNA were present in either the bloodstream or procyclic RNAi cell lines (data not shown). All of these facts strongly suggested that the observed tbCPSF30 RNA was tbCPSF30 RNAi-specific and did not represent a cryptic tbCPSF30 RNA derived from the tbCPSF30 RNAi 2T7i construct.



View larger version (66K):
[in this window]
[in a new window]
 
FIG. 5.
RNAi of tbCPSF30 results in the appearance of a novel tbCPSF30 RNA. Left panels, Northern analysis of RNA samples taken from induced (+) or uninduced (–) SMB or tbCPSF30 RNAi bloodstream cells prepared 0, 2, and 12 days postinduction with tetracycline. Right panel, Northern analysis of RNA samples taken from induced (+) or uninduced (–) 29-13 procyclic forms and tbCPSF30 RNAi procyclic forms (3 days postinduction with tetracycline). In each case, an arrow indicates the aberrant RNA species. Relative loadings are indicated by the panel showing ethidium bromide-stained RNA (EtBr).

 

To directly link the observed growth phenotype with tbCPSF30 expression, we analyzed the protein levels for this molecule. Fig. 6 shows protein samples from induced and uninduced bloodstream tbCPSF30 RNAi cell lines probed with the tbCPSF30 anti-peptide antibody. These samples show ~80–90% reduction in the level of tbCPSF30 protein by day 3–4 upon induction with tetracycline. This confirmed that gene silencing was operating in an efficient and inducible manner. Consistent with the observation that the bloodstream cells return to normal growth by day 10, the levels of tbCPSF30 protein returned to wild type levels by day 14 (Fig. 6). From this analysis, it is clear that there is a strict correlation between the presence of tbCPSF30 protein and the ability of T. brucei to survive in culture. It can therefore be concluded that tbCPSF30 is essential for viability in both the bloodstream and procyclic forms of T. brucei.



View larger version (43K):
[in this window]
[in a new window]
 
FIG. 6.
tbCPSF30 protein expression in tbCPSF30 RNAi bloodstream cells (grown in the presence (+) and absence (–) of tetracycline) compared with control SMB bloodstream cells. tbCPSF30 protein is depleted in tbCPSF30 RNAi bloodstream cells by day 3–4. However, by day 14, the protein levels return to wild type levels (as in the SMB population), coincident with return of the cells to normal growth. Equal cell numbers were used to prepare each sample, and equivalent loading was confirmed by analysis of the Ponceau S-stained blot (not shown).

 

tbCPSF30 Is Required for Polycistronic RNA Processing—To investigate the consequences of tbCPSF30 depletion on trypanosome polycistronic RNA processing, we examined the resolution of pre-mRNA in the bloodstream and procyclic tbCPSF30 RNAi lines. In the case of the bloodstream tbCPSF30 RNAi cells, RNA was prepared 2 days postinduction, and in the case of the procyclic tbCPSF30 RNAi cells, samples were taken 3 days postinduction. This corresponds to the period before cell death in each population. To assay RNA processing, each RNA sample was hybridized with a riboprobe detecting the {alpha}/{beta}-tubulin intergenic region. This probe detects the mature {alpha}-tubulin mRNA and unprocessed RNAs spanning the {alpha}/{beta}-tubulin dicistron. It has been reported (5, 23) that perturbation of the trans-splicing machinery results in an accumulation of dicistronic {alpha}/{beta}-tubulin RNA due to disruption of processing of the tubulin gene pre-mRNA. Our results in Fig. 7A clearly show the presence of an additional transcript in both the bloodstream and procyclic tbCPSF30 RNAi-induced cell lines, which is consistent with the size of a larger unprocessed, tubulin pre-mRNA (dicistron). Significantly, this additional transcript is not seen in either the wild type or uninduced cell lines for either of the life cycle stages of the parasite.



View larger version (97K):
[in this window]
[in a new window]
 
FIG. 7.
Processing of polycistronic RNA is compromised in tbCPSF30 depleted cells. A, left panel, Northern analysis of control bloodstream (BS) SMB cells and tbCPSF30 RNAi after 2 days in the presence (+) or absence of tetracycline (–). RNA samples were probed with the {alpha}/{beta}-tubulin intergenic region. The probe will detect monocistronic {alpha}-tubulin as well as the {alpha}/{beta}-tubulin dicistronic RNA. The same RNA analysis was performed on procyclic (PC) cell lines 3 days post-induction (right). Loading is shown by the EtBr panel. B, Northern analysis on a time course of RNA samples from bloodstream SMB cells and tbCPSF30 RNAi cells, 0, 2, and 12 days postinduction with tetracycline. These samples were probed with the {alpha}/{beta}-tubulin intergenic region. This shows that the growth phenotype (Fig. 4, A and C) and RNAi effect correlate with the ability to process polycistronic tubulin RNA. Relative loading is shown by the EtBr panel. C, Northern analysis of RNA from SMB bloodstream cells allowed to overgrow and die. These cells were assessed under the light microscope (in terms of cell death; data not shown). Day 0 represents an overgrown culture in which cells are "viable" and intact. After 2 days of further culture, all cells were completely dead. RNA samples were probed with the {alpha}/{beta}-tubulin intergenic region in order to determine the extent of polycistronic tubulin RNA in each sample. In no case was there an accumulation of unprocessed RNA. Loading is shown by the EtBr panel.

 

Further confirmation of the specificity of this result was provided by analysis of the bloodstream samples that were tbCPSF30 RNAi-resistant (i.e. those isolated 12 days after induction of tbCPSF30 RNAi). In bloodstream tbCPSF30 RNAi samples taken 0, 2, and 12 days postinduction with tetracycline, the increase in the unprocessed {alpha}/{beta}-tubulin intergenic region was specific to the samples with depleted tbCPSF30, with the outgrowing RNAi-resistant line behaving as wild type cells (Fig. 7B). This confirms that the perturbation of the processing of the tubulin polycistronic RNA was a direct result of the gene silencing of tbCPSF30.

In order to eliminate the possibility that the appearance of polycistronic RNA was simply the result of a general "dying cell" phenomenon, we allowed the parental RNAi bloodstream line (SMB) to reach maximal growth levels in culture (typically 5 x 106 cells/ml). At this stage, cells are still intact and motile, although if left for 48 h without passage, these cultures overgrow and rapidly die. Therefore, these cultures were used as a population of cells undergoing several mechanisms (starvation, cell cycle defects, cytotoxic effects) resulting in cell death. Fig. 7C shows RNA from the cultures hybridized with a probe for the {alpha}/{beta}-tubulin intergenic region. It is clear that in these dying cells there was no specific increase in unprocessed tubulin RNA. Rather, the accumulation of polycistronic RNA was specific to the depletion of tbCPSF30, demonstrating that this molecule contributes to the efficient processing of tubulin transcripts in T. brucei.

Although trans-splicing is ubiquitous for all genes in T. brucei, it was recently discovered that at least one gene (poly(A)-polymerase (PAP)) also undergoes cis-splicing. This raised the question as to whether this RNA processing event might also be disrupted in tbCPSF30-depleted cells. Thus, cis-splicing was assayed in these cells using RT-PCR across the PAP intron. No change in cis-splicing efficiency for the PAP transcript was detectable (data not shown).

Components of the T. brucei RNA Processing Machinery— Although there have been numerous studies of the components of both yeast and mammalian cleavage and polyadenylation complexes (for reviews, see Refs. 24 and 25), to date there have been no studies on the CPSF or associated factors in T. brucei. Since the T. brucei genome is organized into polycistrons, it is interesting to speculate that the core components of the polyadenylation and cleavage machinery might be different from other eukaryotes using monocistronic transcription. Consequently, by genome data base mining, we searched for components of the cleavage and polyadenylation machinery conserved in T. brucei (Table I). This revealed the presence of the previously cloned poly(A)-polymerase (26), a poly(A)-polymerase-binding protein homologue (PAPB1), and at least one other component of the cleavage and polyadenylation complex, CPSF73 (Table I). On the basis of sequence homology, other more putative component homologues of either yeast or mammalian proteins included CPSF100, CPSF160, CstF-50, CstF68, PFS2, MPE1, HRP1, and PAN2/3 (Table I). The PAP, CPSF30, CPSF73, CPSF100, and CPSF160 form core components of the cleavage and polyadenylation complex in yeast and mammals. Thus, our survey indicates that several components of the cleavage and polyadenylation machinery are already detectable in the incomplete T. brucei genome data base. This suggests that even in this evolutionarily ancient protozoan parasite, with a very usual mechanism of gene expression, the basic machinery for the processing of transcripts is similar to other lower eukaryotes as well as higher organisms.


View this table:
[in this window]
[in a new window]
 
TABLE I
T. brucei sequence homologues of the cleavage and polyadenylation machinery

 


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Gene regulation is vital to cell viability, growth, and development in all eukaryotic organisms. One aspect of gene regulation is the requirement for pre-mRNA processing into mature RNA species. This aspect is particularly important in trypanosomes whose genes are organized into polycistronic transcription units. Here, we report the first identification of a core component of the T. brucei cleavage and polyadenylation complex, tbCPSF30, and demonstrate that depletion of this protein following RNAi causes aberrant processing of tubulin polycistronic RNAs at two stages in the parasite life cycle. Following this phenotype, cell death occurs, indicating either that tbCPSF30 is itself essential in T. brucei or that inefficient or inappropriate RNA processing upon tbCPSF30 depletion is lethal. tbCPSF30 is, therefore, a key component in trypanosome gene expression.

tbCPSF30 RNAi induction resulted in a specific increase in the level of a tbCPSF30 RNA, the first report showing an RNAi phenotype of this sort. Importantly, the appearance of this novel RNA precisely correlated with the ability of the cells to undergo tbCPSF30-specific RNAi (as assessed by cell death and protein depletion) and was lost upon outgrowth of RNAi-resistant bloodstream forms. The existence of this RNA can be rationalized in terms of recent understanding of the mechanism of RNA interference (27). This process is believed to be a cytoplasmic phenomenon effected by the action of siRNAs on mRNA, this being mediated by a complex of proteins conserved among species able to undergo RNAi. Recently, it has been suggested that RNAi operates at the level of the translational apparatus, whereby polyribosomal RNA is the target of degradation (28). In this scenario, the novel tbCPSF30 RNA that accumulates might represent nontranslated RNA protected from degradation by RNAi. This is directly supported by our observation that the aberrant RNA does not result in tbCPSF30 protein, which is specifically depleted at the same time that the novel RNA is maximally expressed. One interesting model would be that the novel tbCPSF30 RNA is compartmentalized within the cell, for example being contained within the nucleus, and accumulates as the cell attempts to compensate for tbCPSF30 protein depletion.

In eukaryotes, the transcription of specific genes and the formation of mature mRNAs are processes that occur in the nucleus. Prior to nuclear export, mRNAs require several posttranscriptional modifications, which are tightly coupled. The precise mechanism that couples these post-transcriptional events is still poorly understood. In Leishmania and Trypanosoma, there is strong evidence that supports the model that polycistronic pre-mRNA polyadenylation requires active trans-splicing or, minimally, recognition of the 3' splice acceptor site (5, 6, 29). Furthermore, studies in C. elegans recently showed a physical interaction that couples the trans-splicing and cleavage–polyadenylation machineries via the cleavage stimulation factor 64-kDa subunit (30). Interestingly, all of these models have been proposed as a result of studies on trans-splicing and the analysis of polyadenylation placed in that context. In our study, depletion of tbCPSF30 has for the first time shown the consequence of disrupting a component of the polyadenylation machinery. This demonstrated a dramatic increase in the level of unprocessed {alpha}/{beta}-tubulin RNA, revealing that tbCPSF30 is involved in the maturation of polycistronic pre-mRNAs. Since the precursor RNAs require both trans-splicing and polyadenylation for maturation, it is evident that this accumulation can only occur if both processes are inhibited. This suggests that the efficiency of trans-splicing is dependent upon the integrity of the polyadenylation machinery as well as vice versa.

The cis-splicing machinery is similar in composition to that required in trans-splicing and, further, is mechanistically linked to 3'-end formation in higher eukaryotes (3134). Moreover, it has been shown that CPSF30 is essential for the cis-splicing of single-intron containing pre-mRNAs in vivo using the influenza NS1A protein (35). This led us to investigate the effect of tbCPSF30 depletion on the cis-splicing of T. brucei PAP, the only trypanosome gene known to contain an intron. However, we were unable to detect any difference in the ability to cis-splice the PAP gene in cells depleted for tbCPSF30. Although we cannot exclude a subtle effect on the efficiency of splicing, this suggests that CPSF30 is not essential for cis-splicing in T. brucei.

Although the control signals governing the overall process of 3'-end formation differ in higher and lower eukaryotes, several studies have shown functional similarity between the individual components of the machinery (3638). However, there are subtle but potentially important differences between the architecture of CPSF30 in different organisms. Of particular interest is the lack of the zinc knuckle CCHC motif in yeast compared with those organisms with one knuckle (zebrafish, mammals) and those with two (Drosophila, trypanosomes). It has been suggested that yeast utilize, by interaction, either the zinc knuckle in the uncharacterized PFS1 (36, 39, 40) or in the recently identified MPE1 (37) to compensate for the function of this part of the CPSF30 molecule from other organisms. It is interesting therefore that CPSF30 from trypanosomes seems to contain both these functional domains in one protein, despite the early divergence of these organisms from the eukaryotic lineage. This structural change between the molecules may also explain the failure of our attempts to functionally complement a yeast mutant with tbCPSF30 (data not shown).

Of the characterized components of the polyadenylation machinery in other organisms, only CPSF30 contains a known RNA binding domain, and this is thought to contribute to recognition of the polyadenylation signal sequence. Some debate has arisen over the endonuclease responsible for the first part of the reaction (41). The CPSF30 homologue in Drosophila (Clipper or CLP) specifically cleaves RNA hairpins, and this endonucleolytic activity resides in the region containing five copies of the CCCH zinc finger motif (19). The cleavage activity of the yeast protein also resides in a region containing the N-terminal CCCH zinc finger motif (39). Our data showing the requirement for tbCPSF30 in polycistronic RNA cleavage would support the theory that in vivo the CPSF30 is the endonuclease responsible for this initial reaction in 3'-end formation.

The identification of a trypanosome homologue of CPSF30 and the results of data mining the available trypanosome genome information suggest that the basic components of the T. brucei 3'-end RNA processing complex(es) are also similar to S. cerevisiae and mammalian systems. This observation is particularly significant, given that T. brucei diverged very early in eukaryotic evolution and exhibits a gene organization that necessitates that 3'-end formation and transcript termination be uncoupled. Clearly, organisms with very diverse mechanisms for gene expression conserve core elements of the machinery for cleavage and polyadenylation. Dissection of this complex will be fundamental to understanding the evolution of eukaryotic gene expression mechanisms and may provide new targets for the control of this important group of protozoan pathogens.


    FOOTNOTES
 
The nucleotide sequence(s) reported in this paper has been submitted to the GenBankTM/EBI Data Bank with accession number(s) AY239358 [GenBank] .

* This work was funded by the Wellcome Trust. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

{ddagger} A Wellcome Trust University Fellow. To whom correspondence should be addressed. Tel.: 44-161-275-5083; Fax: 44-161-275-5082; E-mail: keith.matthews{at}man.ac.uk.

1 The abbreviations used are: RNAi, RNA interference; PAP, poly(A)-polymerase. Back


    ACKNOWLEDGMENTS
 
We thank Prof. Elisabetta Ullu and Dr. Frederick van Deursen for critical comments on the manuscript. Steve Whittaker provided excellent technical support. We are grateful to Prof. Walter Keller for yeast strains and plasmids and Dr. Barry Wilkinson for assistance with yeast complementation experiments. We are grateful to the Sanger Institute for permission to use the incomplete T. brucei genome data base.



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Clayton, C. E. (2002) EMBO J. 21, 1881–1888[Free Full Text]
  2. Murphy, W. J., Watkins, K. P., and Agabian, N. (1986) Cell 47, 517–525[Medline] [Order article via Infotrieve]
  3. Sutton, R. E., and Boothroyd, J. C. (1986) Cell 47, 527–535[Medline] [Order article via Infotrieve]
  4. Agabian, N. (1990) Cell 61, 1157–1160[Medline] [Order article via Infotrieve]
  5. Matthews, K. R., Tschudi, C., and Ullu, E. (1994) Genes Dev. 8, 491–501[Abstract]
  6. LeBowitz, J. H., Smith, H. Q., Rusche, L., and Beverley, S. M. (1993) Genes Dev. 7, 996–1007[Abstract]
  7. Palfi, Z., Gunzl, A., Cross, M., and Bindereif, A. (1991) Proc. Natl. Acad. Sci. U. S. A. 88, 9097–9101[Abstract]
  8. Thomas, J., Lea, K., Zucker-Aprison, E., and Blumenthal, T. (1990) Nucleic Acids Res. 18, 2633–2642[Abstract]
  9. Kuersten, S., Lea, K., MacMorris, M., Spieth, J., and Blumenthal, T. (1997) RNA 3, 269–278[Abstract]
  10. Hendriks, E. F., Robinson, D. R., Hinkins, M., and Matthews, K. R. (2001) EMBO J. 20, 6700–6711[Abstract/Free Full Text]
  11. Wirtz, E., Leal, S., Ochatt, C., and Cross, G. A. (1999) Mol. Biochem. Parasitol. 99, 89–101[CrossRef][Medline] [Order article via Infotrieve]
  12. Biebinger, S., Wirtz, L. E., Lorenz, P., and Clayton, C. (1997) Mol. Biochem. Parasitol. 85, 99–112[CrossRef][Medline] [Order article via Infotrieve]
  13. LaCount, D. J., Barrett, B., and Donelson, J. E. (2002) J. Biol. Chem. 277, 17580–17588[Abstract/Free Full Text]
  14. Tasker, M., Wilson, J., Sarkar, M., Hendriks, E., and Matthews, K. (2000) Mol. Biol. Cell. 11, 1905–1917[Abstract/Free Full Text]
  15. Tasker, M., Timms, M., Hendriks, E., and Matthews, K. (2001) Mol. Microbiol. 39, 272–285[CrossRef][Medline] [Order article via Infotrieve]
  16. Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990) J. Mol. Biol. 215, 403–410[CrossRef][Medline] [Order article via Infotrieve]
  17. Lai, W. S., Carballo, E., Strum, J. R., Kennington, E. A., Phillips, R. S., and Blackshear, P. J. (1999) Mol. Cell. Biol. 19, 4311–4323[Abstract/Free Full Text]
  18. Carballo, E., Lai, W. S., and Blackshear, P. J. (1998) Science 281, 1001–1005[Abstract/Free Full Text]
  19. Bai, C., and Tolias, P. P. (1996) Mol. Cell. Biol. 16, 6661–6667[Abstract]
  20. Bai, C., and Tolias, P. P. (1998) Nucleic Acids Res. 26, 1597–1604[Abstract/Free Full Text]
  21. Veraldi, K. L., Edwalds-Gilbert, G., MacDonald, C. C., Wallace, A. M., and Milcarek, C. (2000) RNA 6, 768–777[Abstract/Free Full Text]
  22. Ngo, H., Tschudi, C., Gull, K., and Ullu, E. (1998) Proc. Natl. Acad. Sci. U. S. A. 95, 14687–14692[Abstract/Free Full Text]
  23. Muhich, M. L., and Boothroyd, J. C. (1988) Mol. Cell. Biol. 8, 3837–3846[Medline] [Order article via Infotrieve]
  24. Wahle, E., and Ruegsegger, U. (1999) FEMS Microbiol. Rev. 23, 277–295[CrossRef][Medline] [Order article via Infotrieve]
  25. Zhao, J., Hyman, L., and Moore, C. (1999) Microbiol. Mol. Biol. Rev. 63, 405–445[Abstract/Free Full Text]
  26. Mair, G., Shi, H., Li, H., Djikeng, A., Aviles, H. O., Bishop, J. R., Falcone, F. H., Gavrilescu, C., Montgomery, J. L., Santori, M. I., Stern, L. S., Wang, Z., Ullu, E., and Tschudi, C. (2000) RNA 6, 163–169[Abstract/Free Full Text]
  27. Hannon, G. J. (2002) Nature 418, 244–251[CrossRef][Medline] [Order article via Infotrieve]
  28. Kennerdell, J. R., Yamaguchi, S., and Carthew, R. W. (2002) Genes Dev. 16, 1884–1889[Abstract/Free Full Text]
  29. Ullu, E., Matthews, K. R., and Tschudi, C. (1993) Mol. Cell. Biol. 13, 720–725[Abstract]
  30. Evans, D., Perez, I., MacMorris, M., Leake, D., Wilusz, C. J., and Blumenthal, T. (2001) Genes Dev. 15, 2562–2571[Abstract/Free Full Text]
  31. Niwa, M., Rose, S. D., and Berget, S. M. (1990) Genes Dev. 4, 1552–1559[Abstract]
  32. Niwa, M., and Berget, S. M. (1991) Genes Dev. 5, 2086–2095[Abstract]
  33. Niwa, M., and Berget, S. M. (1991) Gene Expr. 1, 5–14[Medline] [Order article via Infotrieve]
  34. Niwa, M., MacDonald, C. C., and Berget, S. M. (1992) Nature 360, 277–280[CrossRef][Medline] [Order article via Infotrieve]
  35. Li, Y., Chen, Z. Y., Wang, W., Baker, C. C., and Krug, R. M. (2001) RNA 7, 920–931[Abstract/Free Full Text]
  36. Preker, P. J., Ohnacker, M., Minvielle-Sebastia, L., and Keller, W. (1997) EMBO J. 16, 4727–4737[Abstract/Free Full Text]
  37. Vo, L. T., Minet, M., Schmitter, J. M., Lacroute, F., and Wyers, F. (2001) Mol. Cell. Biol. 21, 8346–8356[Abstract/Free Full Text]
  38. Dichtl, B., and Keller, W. (2001) EMBO J. 20, 3197–3209[Abstract/Free Full Text]
  39. Barabino, S. M., Ohnacker, M., and Keller, W. (2000) EMBO J. 19, 3778–3787[Abstract/Free Full Text]
  40. Zhao, J., Kessler, M., Helmling, S., O'Connor, J. P., and Moore, C. (1999) Mol. Cell. Biol. 19, 7733–7740[Abstract/Free Full Text]
  41. Zarudnaya, M. I., Kolomiets, I. M., and Hovorun, D. M. (2002) IUBMB Life 54, 27–31[Medline] [Order article via Infotrieve]
  42. Thompson, J. D., Higgins, D. G., and Gibson, T. J. (1994) Nucleic Acids Res. 22, 4673–4680[Abstract]
  43. Barabino, S. M., Hubner, W., Jenny, A., Minvielle-Sebastia, L., and Keller, W. (1997) Genes Dev. 11, 1703–1716[Abstract]
  44. Gaiano, N., Amsterdam, A., Kawakami, K., Allende, M., Becker, T., and Hopkins, N. (1996) Nature 383, 829–832[CrossRef][Medline] [Order article via Infotrieve]
  45. Hofmann, K., Bucher, P., Falquet, L., and Bairoch, A. (1999) Nucleic Acids Res. 27, 215–219[Abstract/Free Full Text]