Hyperphosphorylated C-terminal Repeat Domain-associating Proteins in the Nuclear Proteome Link Transcription to DNA/Chromatin Modification and RNA Processing*

Sherry M. Carty and Arno L. Greenleaf{ddagger}

From the Department of Biochemistry, Duke University Medical Center, Durham, North Carolina 27710


    ABSTRACT
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Using an interaction blot approach to search in the human nuclear proteome, we identified eight novel proteins that bind the hyperphosphorylated C-terminal repeat domain (phosphoCTD) of RNA polymerase II. Unexpectedly, five of the new phosphoCTD-associating proteins (PCAPs) represent either enzymes that act on DNA and chromatin (topoisomerase I, DNA (cytosine-5) methyltransferase 1, poly(ADP-ribose) polymerase-1) or proteins known to bind DNA (heterogeneous nuclear ribonucleoprotein (hnRNP) U/SAF-A, hnRNP D). The other three PCAPs represent factors involved in pre-mRNA metabolism as anticipated (CA150, NSAP1/hnRNP Q, hnRNP R) (note that hnRNP U/SAF-A and hnRNP D are also implicated in pre-mRNA metabolism). Identifying as PCAPs proteins involved in diverse DNA transactions suggests that the range of phosphoCTD functions extends far beyond just transcription and RNA processing. In view of the activities possessed by the DNA-directed PCAPs, it is likely that the phosphoCTD plays important roles in genome integrity, epigenetic regulation, and potentially nuclear structure. We present a model in which the phosphoCTD association of the PCAPs poises them to act either on the nascent transcript or on the DNA/chromatin template. We propose that the phosphoCTD of elongating RNA polymerase II is a major organizer of nuclear functions.


The C-terminal repeat domain (CTD),1 a non-catalytic "tail" on the largest subunit of RNA polymerase, is a unique structure made up of as many as 52 repeats of the consensus heptamer YSPTSPS (1). That each seven-residue repeat contains five hydroxyl-containing side chains suggests that the CTD could be a good target for phosphorylation, and indeed, in elongating RNAP II the CTD is hyperphosphorylated (24); in preinitiating RNAP II, on the other hand, the CTD is not phosphorylated (e.g. see Ref. 5). The amino acid composition of the CTD also renders it very hydrophilic, and it is largely unstructured in aqueous solution (6, 7); in mammals a fully stretched out CTD could potentially extend some 1200 Å away from the globular body of the polymerase (whose diameter is ~150 Å, Ref. 8) (see e.g. Ref. 9). Its unusual properties render the CTD well suited to function as an extended scaffold to which transcription factors and other proteins could bind; indeed, a growing body of data indicates that such a scaffolding role is the principal function of the CTD. Experiments focusing on the hyperphosphorylated CTD characteristic of elongating RNAP II have revealed that a number of RNA processing factors bind to the phosphoCTD (e.g. see Refs. 10 and 11). Binding to the phosphoCTD tethers such factors to the transcription machinery where they can carry out their functions more efficiently (e.g. see Refs. 1216); furthermore, certain factors or processes are stimulated by phosphoCTD binding (e.g. see Refs. 1719). In addition, a pool of non-transcribing RNAP II carries a phosphoCTD and is associated with certain transcription and processing factors in potential assembly areas called "transcriptosomes" (see Ref. 20). RNA processing proteins known to bind directly to the phosphoCTD include the following: subunits of capping enzyme (2123), subunits of 3' end formation factors (24, 25), splicing factors (9), splicing factor-like proteins (2629), and an apparent small nucleolar RNA 3' end factor (30). A few other proteins of disparate function have been shown to bind directly to the phosphoCTD, including the prolyl isomerase ESS1 (31), the histone acetyltransferase p300/CBP-associated factor (32), and the KRAB/Cys2-His2 zinc finger protein ZNF74 (33). Significantly, however, the currently known phosphoCTD-associating proteins (PCAPs) were not, for the most part, discovered in a systematic way, and it is therefore very likely that numerous PCAPs remain to be discovered. Because discovering additional PCAPs promised to generate new insights into phosphoCTD function, we began a systematic search for novel human PCAPs in the nuclear proteome.

To carry out the search for new human PCAPs we exploited the ability of yeast CTD kinase I (3436) to hyperphosphorylate a recombinant CTD fusion protein, using the kinase to generate a radiolabeled, exhaustively phosphorylated probe (37). The [32P]phosphoCTD probe was then used in an interaction blot (Far-Western) approach to identify proteins in HeLa nuclear extracts that bind directly to the phosphoCTD. Numerous PCAPs were detected in the extract, which was then fractionated to purify the proteins for identification by mass spectrometry. Following this strategy we have so far succeeded in identifying eight previously unknown human PCAPs. One subset of the new PCAPs consists of known or suspected RNA processing factors as anticipated. Excitingly, another subset of the new PCAPs consists of enzymes with activities directed toward DNA or chromatin rather than toward pre-mRNA. Because the enzymes include DNA topoisomerase I, DNA (cytosine-5) methyltransferase 1, and poly(ADP-ribose) polymerase-1, our findings hint that the phosphoCTD of RNAP II plays previously unsuspected roles in several DNA-centered processes, including modifications involved in epigenetic regulation. As is the case for the RNA-directed PCAPs, binding of these DNA-directed proteins to the phosphoCTD should poise them to efficiently carry out their functions. Accordingly, we propose that the phosphoCTD of elongating RNAP II is a major organizer of diverse nuclear processes.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Suppliers—
MacroPrep High S and MacroPrep High Q resins were from Bio-Rad. O-Phospho-L-serine and O-phospho-L-serine-agarose were from Sigma. Hybond-C Extra was from Amersham Biosciences. [32P]ATP was from ICN Biomedicals.

Salt Extraction of HeLa Nuclear Proteins—
HeLa nuclei were prepared, extracted with 0.15 M NaCl, and centrifuged (38). Trial extractions of this nuclear pellet were carried out by sequential homogenization at increasing NaCl concentrations in HGE++ (25 mM Hepes, pH 7.6, 15% glycerol, 0.1 mM EDTA + phenylmethylsulfonyl fluoride (1:1000 dilution of saturated solution in isopropanol) + 10 mM NaF). After each homogenization and centrifugation (150,000 x g for 60 min at 4 °C) the supernatant was kept ("extract"), and the pellet was extracted with the next salt step. For the preparative extraction, nuclear pellet (25 ml, from 0.15 M NaCl extraction) was homogenized on ice in a final volume of 200 ml of 0.5 M NaCl in HGE++ and centrifuged (150,000 x g for 60 min at 4 °C). The supernatant, termed the 0.5 M NaCl extract of the nuclear pellet, was stored at -80 °C.

High S Column of 0.5 M NaCl Extract of Nuclear Pellet—
An aliquot (140 mg of protein in 33 ml) of the 0.5 M NaCl extract of the nuclear pellet was subjected to chromatography on a 5-ml High S column as described previously (26). The flow-through was at 0.1 M NaCl, and the salt steps were at 0.25, 0.5, 0.75, and 1.0 M NaCl.

Phosphoserine Columns of NaCl High S Eluates—
Five ml of the 0.5 M NaCl High S eluate were diluted to 0.0165 M with HGE and applied to a 0.5-ml Ser(P) column equilibrated in 0.0165 M NaCl, HGE. The column was eluted stepwise with 3 x 0.5 ml of 0.01, 0.05, 0.1, 0.2, 0.4, 0.6, and 1 M Ser(P) in HGE. The 0.75 M NaCl High S eluate (5 ml) was identically subjected to Ser(P) column chromatography. Samples were analyzed on duplicate 4–15% SDS gels; one was Coomassie-stained, and one was analyzed by phosphoCTD interaction blotting.

Q Column of 0.1 M Ser(P) Eluate—
The 0.1 M Ser(P) fraction was diluted to 0.075 M in HGE and applied to a 0.5-ml Q column equilibrated with 0.075 M NaCl, HGE. Proteins were eluted stepwise with 3 x 0.5 ml of 0.075, 0.1, 0.2, 0.3, 0.4, and 1 M NaCl. Fractions were gel-analyzed as before.

PhosphoCTD Interaction Blotting—
Proteins were separated on duplicate 4–15% precast SDS gels (Bio-Rad) and either stained with Coomassie Blue or electroblotted to nitrocellulose. Membranes were then incubated overnight at 4 °C in phosphate-buffered saline, 0.2% Tween 20, 3% dry milk, 10 mM NaF, phenylmethylsulfonyl fluoride, 2 mM dithiothreitol. Blots were probed in the above solution with 32P-labeled hyperphosphorylated ß-galactosidase-yeast CTD (37) for 4 h at 4 °C. Note that each batch of phosphoCTD probe was incubated with yeast CTDK-I until complete mobility shift of the fusion protein was achieved (34, 37). Membranes were washed in phosphate-buffered saline, 0.2% Tween 20 for 4 x 10 min at 4 °C and then exposed overnight to film or a PhosphorImager screen.

Identification of PCAPs—
Well separated protein bands (Coomassie-stained gels) representing phosphoCTD-associating proteins (interaction blots), as indicated in the text and figures, were excised and subjected to in-gel trypsinization and MALDI mass spectrometry analysis (at either the Keck Foundation Biotechnology Resource Lab, Yale University or the Laboratory for Protein Microsequencing and Proteomic Mass Spectrometry, University of Massachusetts Medical School, Worcester Foundation Campus, Shrewsbury, MA (www.umassmed.edu/proteomic/)). The masses of the tryptic peptides revealed the identities of the putative PCAPs as summarized in Table I. In several cases, MS/MS analysis of selected peptides confirmed the identity or provided the identity (see Table I and Fig. 5). The relevant software used at Yale was principally ProFound (see prowl.rockefeller.edu/cgi-bin/ProFound) and PeptideSearch (see www.narrador.embl-heidelberg.de/GroupPages/PageLink/peptidesearchpage.html). The relevant software used at the University of Massachusetts was principally MS-Fit and MS-Tag (see prospector.ucsf.edu/) and MOWSE (see www.hgmp.mrc.ac.uk/Bioinformatics/Webapp/mowse/).


View this table:
[in this window]
[in a new window]
 
TABLE I PCAPs identified by interaction blotting/mass spectrometry

 


View larger version (21K):
[in this window]
[in a new window]
 
FIG. 5. Examples of mass spectra used for protein identification. A, MALDI spectrum of CA150. Parameters used for MS-Fit were: average mass, 500 ppm mass tolerance; Met(O) and <Gln permitted; molecular mass, 1–250 kDa. Search of NCBI protein data base yielded 22/24 matches with human CA150. B, MS/MS analysis (postsource decay spectrum) of a peptide with m/z = 1715 from tentatively identified hnRNP U. MS-Tag identified this peptide (K)SSGPTSLFAVTVAPPGAR(Q) as a segment of hnRNP U. C, MS/MS analysis of a peptide with m/z = 915 from a sample that MALDI indicated was a mixture of peptides. MS-Tag identified this peptide (R)GFGFVLFK(E) as a segment of hnRNP D.

 
Recombinant PCAPs and Antibodies—
DNMT1 was obtained from New England Biolabs (catalog no. M0230). Human topoisomerase I was a gift from T. Hsieh (Duke Medical Center). Yeast topoisomerase I was a gift from M. Bjornsti (St. Jude’s, Memphis, TN). Human hnRNP D (isoform M27) (39) was a gift from N. Maizels (University of Washington). Antibodies against NSAP1 were a gift from C. Astell (University of British Columbia).


    RESULTS
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Extracting PCAPs from the Nuclear Pellet—
We sought a compartment of the nuclear proteome likely to be enriched for proteins that interact directly with the phosphoCTD of elongating RNAP II. Because elongating RNAP II is tightly associated with its template and is found in the pelleting fraction of nuclei extracted with low salt (40), we decided to use as starting material a centrifugally prepared pellet from HeLa cell nuclei. Because this material is generated at roughly physiological salt concentrations, it seemed likely that many PCAPs bound to the phosphoCTD in the living cells should remain bound during preparation of the nuclear pellet. Indeed, interaction blot (Far-Western) analysis of the proteins in the nuclear pellet confirmed that it was a rich source of PCAPs (Fig. 1). To dissociate the PCAPs from the phosphoCTD and solubilize them for fractionation, we extracted the pellet with increasing salt concentrations. Interaction blot analysis indicated that 0.5 M NaCl was effective in solubilizing the majority of PCAPs (Fig. 1), and we picked that concentration for our standard protocol.



View larger version (107K):
[in this window]
[in a new window]
 
FIG. 1. PhosphoCTD-associating proteins in the nuclear pellet. Equivalent amounts of the nuclear pellet (NP), salt extracts of the pellet (0.5 and 1.0 M), and residual pellet (P) were subjected to SDS-PAGE in duplicate gels; one gel was stained (Coomassie Blue), and one was electroblotted to nitrocellulose. The nitrocellulose was subjected to phosphoCTD interaction blotting (see "Experimental Procedures") and autoradiography. Positions of size markers are indicated on the left (kDa).

 
Fractionating the PCAPs—
To make possible the identification of individual PCAPs by mass spectrometry, we fractionated the nuclear extract with a series of column chromatography steps. At each step, column fractions were monitored by interaction blot analysis to identify fractions enriched for PCAPs. PCAP-enriched fractions were subjected to additional column chromatography steps; when a PCAP was pure enough, it was identified by excision from a stained gel followed by mass spectrometric analysis of trypsin fragments (see "Experimental Procedures," and e.g. see Ref. 31). The fractionation scheme is summarized diagrammatically in Fig. 2.



View larger version (30K):
[in this window]
[in a new window]
 
FIG. 2. Summary of the fractionation scheme used to separate PCAPs. FT, flow-through; P-Ser, phosphoserine.

 
High S column chromatography of the nuclear extract separated PCAPs into several fractions with the majority of the strong phosphoCTD binding activities in the 0.5 and 0.75 M salt steps (Fig. 3). To further purify the proteins possessing phosphoCTD binding activity, the two High S fractions were separately applied to phosphoserine columns, which were eluted with steps of increasing Ser(P) (Fig. 2).



View larger version (73K):
[in this window]
[in a new window]
 
FIG. 3. PhosphoCTD-associating proteins in the High S column fractions. Equivalent amounts of fractions eluted from the High S column were analyzed by SDS-PAGE (A) and phosphoCTD interaction blotting (B). NaCl concentrations of elution steps are indicated above the lanes. M, markers (kDa); OP, column onput.

 
Chromatography of the High S 0.5 M salt step fraction on a Ser(P) column cleanly separated two major PCAPs into the 0.1 M salt step fraction (Fig. 4A, lane 4, ~115-kDa species) and the 0.2 + 0.4 M salt step fractions (~100-kDa species, lanes 5 and 6). Note the specificity of the interaction blot results; most of the major protein species (Stained Gel) do not show any binding to the phosphoCTD probe (Interaction Blot). The proteins representing the predominant PCAP in the 0.1 M step and the 0.2 M step fractions were excised from a stained gel (Fig. 4A) and identified by mass spectrometry (see "Experimental Procedures," Table I, and Fig. 5). The PCAP in the 0.1 M step was identified as poly(ADP-ribose) polymerase (PARP); the PCAP in the 0.2 M step was identified as DNA topoisomerase I (Topo I). Other, fainter bands were observed in the interaction blot for both the 0.2 and 0.1 M salt step fractions. To identify some of these additional PCAPs, the 0.1 M salt step fraction of the Ser(P) column was applied to a High Q column (see Fig. 2 and below).



View larger version (108K):
[in this window]
[in a new window]
 
FIG. 4. PhosphoCTD-associating proteins in phosphoserine column fractions. A, equivalent amounts of fractions eluted from the Ser(P) column of the High S 0.5 M step fraction (Fig. 2) were analyzed by SDS-PAGE and phosphoCTD interaction blotting. Phosphoserine concentrations of elution steps are indicated above the lanes. M, markers (kDa); OP, onput. B, analysis as in A of fractions eluted from the Ser(P) column of the High S 0.75 M step fraction (Fig. 2). Dots indicate bands that were excised and identified. P-Ser, phosphoserine.

 
The High S 0.75 M step fraction was likewise fractionated by chromatography on a Ser(P) column (see Figs. 2 and 4B). The major PCAP in the onput (Interaction Blot, OP lane) appeared to be Topo I (~100 kDa); this species was recovered in the 0.2 and 0.4 M salt step fractions as expected from results of Fig. 4A. Several other PCAP bands were observable in the 0.1 M salt step (Fig. 4B, lane 4); three of these (marked with dots) were excised from a stained gel (more heavily loaded than that in Fig. 4B) and identified. The ~200-kDa PCAP was identified as DNA (cytosine-5) methyltransferase 1 (DNMT1; Table I and below). The ~105-kDa PCAP and the ~90-kDa PCAP were both found to be hnRNPs, namely hnRNP U and hnRNP R, respectively (see Table I, Fig. 5, and below).

Proteins in the 0.1 M salt step of the first Ser(P) column (Ser(P) column on left in Fig. 2; onput = 0.5 M step of the High S column) were fractionated by chromatography on High Q resin (Fig. 2). PCAPs were found in the 0.2 M salt step fraction and in the flow-through of the High Q column (data not shown). The phosphoCTD binding properties of the most prominent PCAPs in the 0.2 M step fraction, CA150, have been previously described (see Fig. 1 in Carty et al. (26)); its identification is further discussed below (see Fig. 5). The two other major PCAPs in this fraction (see Fig. 1 in Carty et al. (26)) were identified as hnRNP R (also found in a Ser(P) column fraction, above) and NSAP1, a protein related to hnRNP R (see Fig. 5, Table I, and below). Interestingly, the flow-through contained another phosphoCTD-associating hnRNP, hnRNP D (see Fig. 5, Table I, and below).

Identifying the PCAPs—
Protein bands in excised gel slices were subjected to trypsin digestion followed by mass spectrometry (see "Experimental Procedures"). We initially categorized the proteins as belonging to three different identification confidence classes depending on the mass spectra and their analyses. Confidence class A contained proteins identified by both MALDI and MS/MS analyses. A MALDI spectrum illustrating a class A protein, CA150, is presented in Fig. 5A. Twenty-four mass values from the spectrum were used to conduct a MS-Fit search of the National Center for Biotechnology Information (NCBI) protein data base (parameters in legend for Fig. 5); the search yielded human CA150 as the number one hit with 22 matches. This identification was confirmed by MS/MS analysis of one peptide (m/z = 833) and independent MS-Tag search (not shown). Three other proteins were identified similarly: PARP, NSAP1, and hnRNP R (i.e. essentially unambiguous identification by the original MALDI spectrum followed by confirmation for one or more peptides by MS/MS analysis; see Table I).

Another three proteins were identified either by MALDI analysis or by MS/MS analysis, and they were placed in confidence class B. For example, the MALDI analysis of hnRNP U was suggestive but not conclusive (Table I). However, a peptide of m/z 1715 was subjected to MS/MS (Fig. 5B), and MS-Tag unambiguously identified it as a segment of hnRNP U. In a second case, the original MALDI results indicated that a mixture of proteins had been analyzed, but MS/MS on a peptide of m/z 915 (Fig. 5C) revealed a unique sequence that MS-Tag found to be part of hnRNP D. Finally, MS/MS was not applied to topoisomerase I, but the protein was clearly identified by analysis of its MALDI spectrum using ProFound and PeptideSearch (not shown). A third confidence class was represented by DNMT1 for which MS/MS was not carried out, and the MALDI analyses were suggestive but not conclusive (Table I; but see below).

In summary, our interaction blot analysis of the HeLa nuclear pellet revealed a number of easily detectable proteins that bind directly to the hyperphosphorylated CTD. Applying two or three simple fractionation steps to the pellet extract allowed us to obtain material suitable for identification by mass spectrometry, and we identified eight of the phosphoCTD-associating proteins (see Table I and Figs. 2 and 5). The PCAPs fall into two major classes: proteins implicated principally in RNA processing and proteins involved in DNA transactions. The first class was expected, whereas the second class was unanticipated.

Confirming the Identity and PhosphoCTD Binding of PCAPs—
To confirm the assigned identities, we carried out additional experiments with most of the PCAPs. First, of course, we have already published experiments on CA150 demonstrating both that recombinant segments of the protein (primarily FF domain-containing segments) bind the phosphoCTD and that antibodies to CA150 co-precipitate preferentially RNAP II0 (RNA polymerase II carrying a hyperphosphorylated CTD) from nuclear extracts (26). We have now similarly utilized recombinant proteins or specific antibodies to confirm the phosphoCTD binding and identity of additional PCAPs.

In the case of DNA topoisomerase I, we confirmed phosphoCTD binding for the human enzyme and for Topo I from yeast. Recombinant human Topo I was subjected to interaction blot analysis; the results clearly show that it binds to the phosphoCTD (Fig. 6A). For yeast, we also tested purified recombinant Topo I. As seen in Fig. 6A, the yeast enzyme binds the phosphoCTD. Control experiments with non-phosphorylated CTD showed no binding to the topoisomerases (data not shown). Therefore we conclude that our identification of the PCAP in lane 5 of Fig. 4A was correct and that topoisomerase I binds directly to the phosphoCTD.



View larger version (60K):
[in this window]
[in a new window]
 
FIG. 6. Recombinant/purified proteins bind the phosphoCTD. A, purified recombinant human (lane 1) and yeast (lane 2) DNA topoisomerase I were analyzed by PCTD interaction blotting. Protein amounts on the blot were estimated to be ~200 ng for lanes 1 and 2 and 500 ng for lane 3 (bacterial ß-galactosidase (ß-gal)). B, human DNMT1 (lanes 2 and 3), bacterial HhaI (lane 1) and the FF5 domain of human CA150 (lane 4; fused to glutathione S-transferase) were analyzed by PCTD interaction blotting. Protein amounts in lanes 1, 2, and 4 were roughly equal. C, mammalian PARP (lanes 2; band at "113" kDa; ~30% pure) and bacterial ß-galactosidase (lanes 1) were subjected to SDS-PAGE, electroblotted to nitrocellulose, and either stained (India Ink) or incubated with phosphoCTD probe. The amount of ß-galactosidase was estimated to be ~0.8 µg. D, aliquots of hnRNP D were analyzed by SDS-PAGE and either Coomassie-stained (lane 1) or subjected to phosphoCTD interaction blotting (lane 2).

 
To confirm phosphoCTD binding of DNMT1 we obtained purified recombinant protein (New England Biolabs, catalog no. M0230) and compared its binding to that of a related enzyme from bacteria, HhaI DNA methyltransferase (New England Biolabs, catalog no. M0217). HhaI methyltransferase is not expected to bind the phosphoCTD as bacterial RNA polymerases do not contain a CTD. As shown in Fig. 6B, recombinant human DNMT1 binds the phosphoCTD (lanes 2 and 3), whereas the bacterial methyltransferase does not (lane 1). In addition to the interaction blot analysis, preliminary experiments carried out in collaboration with S. Pradahn (New England Biolabs)2 also suggest that DNMT1 is associated selectively with RNAP II0 in nuclear extracts, reminiscent of the situation for CA150.

For the third DNA/chromatin-modifying PCAP, PARP, we obtained purified bovine enzyme (BIOMOL, catalog no. SE-165; 30–50% pure) and tested its phosphoCTD binding capability. As shown in Fig. 6C, purified PARP (113 kDa) binds the phosphoCTD (lanes 2). Several significant contaminants in the preparation do not bind (lanes 2) nor does a ß-galactosidase control protein present in higher amounts than PARP (lanes 1). We conclude that indeed PARP is a phosphoCTD-associating protein.

To investigate one of the hnRNPs identified in our PCAP hunt, we obtained recombinant hnRNP D (a kind gift of N. Maizels) and subjected it to interaction blot analysis. The experiment of Fig. 6D demonstrates that purified hnRNP D indeed binds to the phosphoCTD. A control experiment indicated that the non-phosphorylated CTD does not bind (data not shown). Thus hnRNP D is a phosphoCTD-associating protein.

Two more hnRNPs were identified as PCAPs, hnRNP R and hnRNP U (see Figs. 2 and 4B). We believe these identities will be confirmed by future tests because yet a fourth hnRNP was found among our newly identified PCAPs, namely NSAP1 (Fig. 2). Although NSAP1 was first identified as a protein that binds the NS1 protein of minute virus of mice (41), it has more recently been identified as one of three splice variants of hnRNP Q (42) (other synonyms for Q splice variants are SYNCRIP and GRY-RBP; Refs. 4244). We obtained antibodies against NSAP1 and used them to test the assigned identity of the phosphoCTD-binding protein. We found that the phosphoCTD-binding protein band detected by interaction blot analysis also reacted with antibody specific to NSAP1 (data not shown).


    DISCUSSION
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The Approach
We have demonstrated that interaction blotting with a [32P]phosphoCTD probe is an efficient way to detect proteins that bind directly to the phosphoCTD and that after modest purification the proteins can be identified via mass spectrometry. Applying this approach to a nuclear pellet from HeLa cells, a starting material that contains chromatin-bound, elongating RNAP II (and in fact >90% of the total RNAP II; see Ref. 32), we have identified eight novel human PCAPs to date. For most of the PCAPs, we have verified phosphoCTD binding capability by follow-up tests using recombinant and/or purified proteins. The known properties of the new PCAPs have suggested novel and unanticipated functions for the phosphoCTD in coordinating diverse nuclear processes.

The probe we used to detect human PCAPs was the yeast CTD hyperphosphorylated with yeast CTDK-I. The yeast domain, as used, consists of 25 repeats of the consensus CTD heptamer YSPTSPS with only eight non-consensus amino acids along its entire 175-residue length (37). Note that the human CTD is twice as long (52 repeats) and that it contains numerous positively charged residues in its distal (C-terminal) half (e.g. see Ref. 1). Such charged, non-consensus repeats are not found in the yeast CTD, and our probe therefore is an analogue of the proximal half of the mammalian CTD. The kinase we used, CTDK-I (3436), is homologous to human P-TEFb (45) and to other biochemically uncharacterized human open reading frames (46). In view of the sequence relatedness and because CTDK-I and P-TEFb are both involved in phosphorylating RNAP II for elongation (47, 48), the pattern of phosphates on our probe is expected to be very similar to that generated on the consensus repeats of the human CTD by P-TEFb or other elongation-related CTD kinases. Current in vivo data suggest that elongating RNAP II is predominantly phosphorylated on Ser-2 of the heptad repeats (4951).

As for any proteomics interaction search, questions of binding specificity and in vivo relevance of binding arise. In our studies, the specificity of binding is extremely high in that only a very small percentage of nuclear proteins in any one fraction (stained gels) binds the phosphoCTD (interaction blots) (e.g. see Figs. 1, 3, and 4). This degree of binding selectivity recapitulates the specificity observed earlier (9, 26, 31, 52). One possible source of artifactual interaction might be adventitious binding of the negatively charged phosphoCTD to protein domains that normally bind other negatively charged polymers such as DNA or RNA. A published study, in addition to data contained herein (below), serve as good indications that this possibility is not a significant problem. Morris and Greenleaf (9) found that probing bacterial extracts with the phosphoCTD revealed no PCAPs (see Fig. 3, lane 13 of Ref. 9), although such extracts contain hundreds of DNA- and RNA-binding proteins. This finding strongly suggests that the large number of eukaryotic homologues of these bacterial nucleic acid-binding proteins also will not bind the phosphoCTD. Further, two results in the current study contain excellent controls that help dissipate concern about this type of fortuitous interaction. First, whereas bacterial DNA methyltransferase HhaI is a close structural relative of the catalytic domain of human DNMT1 and both enzymes presumably bind DNA very similarly, only the human enzyme binds the phosphoCTD (Fig. 6B). Second, although full-length Topo I binds the phosphoCTD (Fig. 6A), this binding is not mediated by the catalytic (DNA binding) domain but rather by the non-catalytic N-terminal domain.3 Thus, although some DNA or RNA binding domains may ultimately be found to also bind the phosphoCTD, we conclude that nonspecific phosphoCTD binding by such proteins does not contribute significantly to our PCAP discovery efforts.

The question of functional relevance of the binding interactions revealed by PCAP identification is also important to consider. The following examples illustrate that, in all cases of which we are aware, combined characterization of both the in vitro binding properties and in vivo behavior of PCAPs yields results that invariably support the functional relevance of the PCAP-phosphoCTD interaction. In particular, the majority of these proteins display binding that depends not on the extent of CTD phosphorylation but on a particular pattern of CTD phosphorylation, a pattern present on the CTD when the PCAPs bind RNAP II in vivo. The best extant examples are from the genetically tractable yeasts. As one example, capping guanylyltransferase (Pce1) and capping triphosphatase (Pct1) from fission yeast bind to CTD repeats phosphorylated either uniquely at Ser-5 or doubly at Ser-2 + Ser-5; however, they bind weakly or not at all to CTD repeats phosphorylated on Ser-2 (53). These findings are consistent with current data indicating that RNAP II at the 5' ends of transcription units is preferentially phosphorylated on Ser-5, whereas RNAP II at the middle or 3' ends of transcription units is preferentially phosphorylated on Ser-2 (50). Also consistent with these two sets of data, capping enzyme cross-links to 5' ends of genes much better than to the middle or 3' ends of genes (50, 54). Furthermore, genetic studies reveal synthetic lethal interactions between genes for capping enzyme and a CTD kinase with Ser-5 specificity (55). Another good example comprises cleavage/polyadenylation factors. Budding yeast Pcf11, a constituent of cleavage factor IA, shows specificity opposite to that of capping enzymes, binding to CTD repeats containing Ser-2 phosphate but not to repeats containing Ser-5 phosphate (56). This binding specificity parallels the shift from Ser-5 to Ser-2 phosphorylation as RNAP II progresses from the 5' to 3' end of a transcription unit; also in parallel, RNAP II at the 3' end of a gene is associated with more Pcf11 than RNAP II at the 5' end of a gene (56). Furthermore, a Pcf11-interacting protein (Pti1) shows genetic interactions with Ctk1, the catalytic subunit of the CTD kinase principally responsible for Ser-2 phosphorylation of elongating RNAP II (see Ref. 51).4 A third example is the yeast prolyl isomerase Ess1, which potentially associates with RNAP II at many points along entire transcription units; Ess1 binds to CTD repeats with multiple phosphates, but its binding is relatively insensitive to phosphorylation pattern (31, 52). Additional examples of phosphorylation pattern-specific binding come from studies of mammalian PCAPs, although these investigations, for the most part, are less complete than the yeast studies due to a lack of extensive in vivo functional correlates. As one example, a splicing-like factor discovered in the Corden laboratory (57), SCAF8, has been shown to bind preferentially to a CTD phosphorylated at both Ser-2 and Ser-5, whereas another factor, SCAF1, binds preferentially to a CTD phosphorylated at either Ser-2 or Ser-5. Presumably these binding preferences will be found to reflect functional features of these proteins. Overall, although the binding specificity of only a few PCAPs has been determined to date, it is clear that CTD binding of most PCAPs depends on the pattern of CTD phosphorylation rather than merely on the presence of phosphates (negative charges). Moreover, where the binding properties of a PCAP have been determined they are in complete accord with the known or inferred in vivo function of that PCAP. Accordingly, we predict that future characterizations of the human PCAPs identified in this study will reveal that these proteins bind the phosphoCTD in vivo in functionally relevant ways.

The sensitivity of the interaction blot technique as performed with the phosphoCTD makes this approach quite useful. Because the ability to detect a specific PCAP depends both on its abundance in the fraction being tested and on its affinity for the phosphoCTD under interaction blot conditions, the exact nature of the starting material and its treatment prior to probing have a large influence on which PCAPs are detected. Thus, faint bands in the total extract can become strong bands in later chromatographic fractions. The repeat nature of the CTD makes it an unusual probe; because it contains multiple repeats (i.e. multiple potential binding sites for a PCAP), the effective binding to the immobilized proteins is expected to be tighter than that for a single repeat (9). This feature may enhance the detection efficacy of the method.

Our approach has already generated numerous PCAPs that will be of some interest for future characterizations. Detailed studies of the binding interactions between the phosphoCTD-interacting domain of a PCAP and differently phosphorylated CTD peptides will be an essential part of determining how the two partners interact and how those interactions relate to the functions of the proteins in vivo (see above and see Refs. 17, 52, 56, and 57). Although the phosphoCTD-interacting domains of most of the new PCAPs remain to be identified, CA150 was found to interact with the phosphoCTD principally through its so-called FF motifs (26); this observation was the first described specific function for an FF "domain." It will be enlightening to identify the phosphoCTD-interacting domains from additional PCAPs and to characterize and compare motifs mediating phosphoCTD-PCAP interactions.

The Proteins
Topoisomerase I—
The first DNA enzyme we identified as a PCAP was DNA topoisomerase I. It has been known for years that Topo I is present at sites of actively transcribed chromatin (58); one of its principal functions at those sites is thought to be preventing RNA/DNA heteroduplex formation by removing transient supercoils that arise as a consequence of transcription (59, 60). However, because Topo I also functions at sites of activity of other nuclear RNAPs (61), it is perhaps surprising that it binds the phosphoCTD (PCTD), a feature unique to RNAP II. There may be more than one reason for the association of Topo I with the PCTD. The great length of most RNAP II transcription units might necessitate a mechanism to ensure that Topo I will be near the transcribing polymerase to solve topological problems generated by extensive RNAP II translocation (see also Ref. 62). Also, it may be that recruitment of Topo I to the polymerase is important for roles not related to the catalytic activity of Topo I (63, 64). In any case, it is striking that Hsieh and colleagues (65) demonstrated that only a non-catalytic N-terminal segment of Topo I is required to target a fused reporter protein (ß-galactosidase) to sites of active transcription in vivo.

DNA (Cytosine-5) Methyltransferase 1—
Another DNA-modifying enzyme among the PCAPs we identified is DNMT1, which adds a methyl group to the C-5 position of cytosines in CpG dinucleotides (66). DNMT1 is a large protein with many "regulatory" domains attached to a C-terminal catalytic domain (e.g. see Refs. 67 and 68). Although the actual in vivo functions of DNMT1 are not universally agreed upon, it seems clear that one of its roles is to methylate mobile DNA elements, ensuring their transcriptional silence (69). However, the mechanisms by which specific CpGs in the mammalian genome are chosen for methylation, by DNMT1 or by other DNMTs, are not understood. Most CpGs in euchromatic regions of the human genome reside in introns (with many of these in mobile elements), and these CpGs are normally methylated (70); a smaller fraction of CpGs resides in CpG islands around certain promoters, and these CpGs are normally not methylated (71, 72). We believe it noteworthy that most normally methylated CpGs in euchromatin are in regions of the genome traversed by RNAP II0 (e.g. the introns). We propose the following explanation: DNMT1 is recruited to the PCTD of elongating RNAP II and thereby assumes a position from which it can scan the large fraction of the human genome that passes through RNAP II (see Fig. 7). Attached to the polymerase, DNMT1 adds methyl groups to unmethylated CpGs as they pass by (e.g. those found in a newly acquired mobile element). Once methylated because of this transcriptional coupling, CpGs internal to transcription units are normally maintained in a methylated state during DNA replication as generally supposed (see also Ref. 73 for a relationship between transcription and methylation).



View larger version (117K):
[in this window]
[in a new window]
 
FIG. 7. Working model of PCAPs bound to PCTD of elongating RNAP II0. RNAP II0 (red with pink phosphates) is traversing the template (DNA, brown; meant to represent both DNA and chromatin) and synthesizing a pre-mRNA transcript (green). The model depicts one subset of PCAPs (brownish tints) below the phosphoCTD interacting with DNA/chromatin and another subset (greenish tints) interacting (or poised to interact) with the RNA transcript or RNA-bound factors. FBP11 is a mammalian counterpart of yeast Prp40, and SF1 is a mammalian counterpart of yeast branchpoint-binding protein (see Refs. 9 and 14). Placement of proteins along the phosphoCTD is arbitrary. Note that, although not explicitly indicated, our results also fit nicely with the idea of a transcription factory in which the DNA template is drawn through a stationary polymerase (62); indeed, the phosphoCTD and some of its PCAPs may play important roles in the structure of such factories (see e.g. Ref. 16).

 
Poly(ADP-ribose) Polymerase-1—
PARP-1 has been implicated in both DNA and RNA transactions. Its activity is important for genome integrity, and it has a role or roles in dealing with DNA damage (e.g. see Ref. 74). In response to certain DNA damage, for example, it covalently modifies chromosomal proteins, including histones (e.g. see Refs. 75 and 76). PARP also has been shown to play a role in transcriptional regulation, notably participating in activation involving NF-{kappa}B and human T cell leukemia virus Tax protein (7779) but also participating in repression (80). Strikingly, in its role as transcriptional activator, the catalytic activity of PARP is dispensable (a finding paralleling that for Topo I, see above). Ironically, PARP was found decades ago to be one of the originally described general transcription factors, TFIIC (81); however, the functional significance of this finding was generally discounted until recently. Additionally, PARP has been found as a component of a complex involved in Ig class switching (82); remarkably, this DNA recombination-related functional implication is shared with another new PCAP, hnRNP D (see below). It is likely that several of the many activities of PARP will be found to be influenced by or dependent on its PCTD binding.

hnRNP D—
Like PARP, hnRNP D has been linked to both RNA and DNA transactions. It plays a role in mRNA turnover (as AUF1) in the cytoplasm (83, 84), but it also is part of a transcription factor in the nucleus (39). Intriguingly, hnRNP D binds G quartet DNA structures, and this property is the basis for the suggestion that hnRNP D plays a role in Ig switch recombination (85). HnRNP D associates with nucleolin (85), which is also found in the putative switch recombinase activity along with PARP (see above). Our demonstration of the phosphoCTD binding capability of hnRNP D adds a new perspective to views of its roles and possible mechanisms by which those roles are accomplished.

CA150—
CA150 was initially described in a transcriptional context, and in vivo results indicate that it can modulate transcription elongation on some genes (86, 87). On the other hand, in domain structure and in known protein-protein interactions, it bears striking similarities to the splicing factor Prp40 (FBP11/HYPC) (see e.g. Ref. 9). As mentioned, we showed earlier that the segment of CA150 principally responsible for PCTD binding is the FF motif-containing C-terminal part of the protein (26); the corresponding part of Prp40 similarly binds the PCTD (9). Furthermore, Prp40 binds to the branchpoint-binding protein, whereas CA150 binds to the human counterpart of that protein, SF1 (88). Our results suggest that its association with the PCTD stations CA150 in a position from which it can play roles in both transcription elongation and splicing.

hnRNP U—
Like several other newly recognized PCAPs, hnRNP U (SAF-A) has been implicated in processes involving both DNA and RNA. In terms of RNA transactions, for example, hnRNP U has been shown to associate with and affect the activity of the glucocorticoid receptor (89, 90). Interestingly, HnRNP U has been associated with the process of CTD phosphorylation previously, but the earlier work implicated U protein as a TFIIH-binding inhibitor of CTD phosphorylation (91); it is not yet clear how our results, and the PCTD binding of hnRNP U, will mesh with the previous study. In terms of DNA transactions, hnRNP U (SAF-A) has been found to be a nuclear matrix protein that binds to scaffold attachment regions of DNA (92, 93). In view of the matrix association of hnRNP U, it is almost certainly not coincidental that another protein associated with the matrix is in fact RNA polymerase II0, the hyperphosphorylated form of RNAP II (9496). It will be instructive to unravel the significance of these interrelated properties of hnRNP U and RNAP II.

NSAP1 and hnRNP R—
NSAP1 and hnRNP R are two PCAPs that share about 80% sequence identity. NSAP1 was identified by yeast two-hybrid screening as a HeLa protein that binds to the NS1 regulatory protein of the minute virus of mice (41). It has also been identified as a component of a complex involved in mRNA turnover along with hnRNP D (97). NSAP1 has more recently been identified as one of three splice variants of hnRNP Q (42) (other synonyms for Q splice variants are SYNCRIP and GRY-RBP; Refs. 4244). Remarkably, hnRNP Q proteins have been found to bind to SMN protein, mutations in which cause spinal muscular atrophy (42, 98) probably by disturbing spliceosome formation and splicing. Furthermore hnRNP R also associates with the SMN complex (42, 98). Their associations with SMN suggest that hnRNP Q and hnRNP R may be involved in spliceosome formation, and immunodepletion experiments directly implicate Q proteins in splicing (42); this implication is consistent with the presence of hnRNP Q3 (GRY-RBP) in spliceosomes (99). That hnRNPs Q and R interact with both the phosphoCTD and the splicing-involved SMN complex is provocative given that the SMN complex has been shown previously to display interactions with RNAP II (100). Our new findings, although not addressing when the factors associate with RNAP II during its functional cycle (see e.g. Refs. 20 and 101), are significant in suggesting new physical links between the splicing and transcription machineries mediated through hnRNPs Q/R and the SMN complex. This suggestion is consistent with and extends the proposal that spliceosome assembly occurs co-transcriptionally and is facilitated by tethering the interacting components to the phosphoCTD of elongating RNAP II (9, 14).

Implications
The connections revealed by our findings, between the phosphoCTD and several factors involved in diverse events in the nucleus, suggest that the phosphoCTD physically links a large number of nuclear processes to the transcription apparatus. The variety of these processes greatly exceeds that previously known to be CTD-linked. Our observations and their implications suggest that the phosphoCTD plays a major role in both physical and functional organization of the nucleus (see Fig. 7). In terms of RNA processing, our findings supplement and expand previously known connections to the phosphoCTD. Furthermore, our results implicate the phosphoCTD in a set of DNA transactions not previously known to be transcriptase-linked. Beyond providing new insights into nuclear organization, our results also point out the potential medical relevance of the phosphoCTD and its protein interactions. Many of the PCAPs we have identified have been linked to processes that are disturbed in diseases. As one example, our hypothesis that the transcription connection of DNMT1 is normally involved in establishing methylation patterns on RNAP II-transcribed stretches of DNA can be logically extended to propose that some altered methylation patterns, such as those observed in cancer cells (e.g. see Refs. 102 and 103), are related to this transcription connection. Another example is found in the case of NSAP1/hnRNP Q. If hnRNP Q tethers SMN complex to the phosphoCTD, as our results suggest, it is then easy to suppose that the spinal muscular atrophy-causing mutations in SMN protein, which delete the segment of SMN that binds hnRNP Q (42), may result in inappropriate uncoupling of SMN from elongating RNAP II0 with attendant disastrous consequences. A third example is PARP, mutations in which result in the inability to recover from genotoxic stress at least partly because of defects in DNA repair (75). If one function of PARP is to scan for DNA damage from the vantage point of a phosphoCTD attachment, it is easy to see how absence of this enzyme, or possibly just the loss of its ability to bind the phosphoCTD, could lead to problems with genome stability. These few examples should suffice to indicate the potential scientific and medical importance of delving further into this unusual transcriptase tail and the mechanisms by which its many protein interactions influence nuclear structure and function.


    ACKNOWLEDGMENTS
 
We thank Paul Modrich and members of his laboratory (Diana Martik, Greg Runyon, Jochen Genschel, and Shuntai Wang) for nuclear pellet and experimental advice, Caroline Astell for antiserum against NSAP1, Mary-Ann Bjornsti for yeast Topo I, Tao-shih Hsieh and Jianhong Wu for the blot of human Topo I, Nancy Maizels for hnRNP D, and Sriharsa Pradhan for DNMT1 and for communicating unpublished results. We thank Jan Jones, Hemali Phatnani, and David Skaar for advice on the manuscript, and we acknowledge Caroline Chromey for preparation of Fig. 7. We thank Timothy Bestor for helpful discussions.


    FOOTNOTES
 
Received, May 22, 2002, and in revised form, August 6, 2002.

Published, MCP Papers in Press, August 8, 2002, DOI 10.1074/mcp.M200029-MCP200

1 The abbreviations used are: CTD, C-terminal repeat domain; CTDK-I, CTD kinase I; PCAP, phosphoCTD-associating protein; phosphoCTD, hyperphosphorylated CTD; PCTD, phosphoCTD; RNAP, RNA polymerase; hnRNP, heterogeneous nuclear ribonucleoprotein; PARP, poly(ADP-ribose) polymerase; MALDI, matrix-assisted laser desorption ionization; MS/MS, tandem mass spectrometry; Topo, topoisomerase; DNMT, DNA methyltransferase; SAF-A, scaffold attachment factor A; SMN, survival motor neuron. Back

2 S. Pradahn, G. Kim, S. Carty, and A. Greenleaf, unpublished data. Back

3 S. M. Carty, A. L. Greenleaf, H. Phatnani, J. Wu, and T. Hsieh, in preparation. Back

4 D. A. Skaar and A. L. Greenleaf, submitted. Back

* This work was supported in part by a grant-in-aid from the American Heart Association (to A. L. G.) and by National Institutes of Health Tumor Immunology Training Grant 5T32CA09058 (to S. M. C.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

{ddagger} To whom correspondence should be addressed: Dept. of Biochemistry, Duke University Medical Center, Box 3711, Durham, NC 27710. Tel.: 919-684-4030; Fax: 919-684-8885; E-mail: arno{at}biochem.duke.edu


    REFERENCES
 TOP
 ABSTRACT
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Corden, J. L. (1990) Tails of RNA polymerase II. Trends. Biochem. Sci. 15, 383 –387[CrossRef][Medline]

  2. Dahmus, M. (1996) Reversible phosphorylation of the C-terminal domain of RNA polymerase II. J. Biol. Chem. 271, 19009 –19012[Free Full Text]

  3. Weeks, J. R., Hardin, S. E., Shen, J., Lee, J. M., and Greenleaf, A. L. (1993) Locus-specific variation in phosphorylation state of RNA polymerase II in vivo: correlations with gene activity and transcript processing. Genes Dev. 7, 2329 –2344[Abstract]

  4. O’Brien, T., Hardin, S., Greenleaf, A. L., and Lis, J. T. (1994) Phosphorylation of RNA polymerase II C-terminal domain and transcriptional elongation. Nature 370, 75 –77[CrossRef][Medline]

  5. Dahmus, M. E. (1995) Phosphorylation of the C-terminal domain of RNA polymerase II. Biochim. Biophys. Acta 1261, 171 –182[Medline]

  6. Cagas, P. M., and Corden, J. L. (1995) Structural studies of a synthetic peptide derived from the carboxyl-terminal domain of RNA polymerase II. Proteins 21, 149 –160[Medline]

  7. Bienkiewicz, E. A., Moon Woody, A., and Woody, R. W. (2000) Conformation of the RNA polymerase II C-terminal domain: circular dichroism of long and short fragments. J. Mol. Biol. 297, 119 –133[CrossRef][Medline]

  8. Gnatt, A. L., Cramer, P., Fu, J., Bushnell, D. A., and Kornberg, R. D. (2001) Structural basis of transcription: an RNA polymerase II elongation complex at 3.3 Å resolution. Science 292, 1876 –1882[Abstract/Free Full Text]

  9. Morris, D. P., and Greenleaf, A. L. (2000) The splicing factor, Prp40, binds the phosphorylated carboxyl-terminal domain of RNA polymerase II. J. Biol. Chem. 275, 39935 –39943[Abstract/Free Full Text]

  10. Bentley, D. (1999) Coupling RNA polymerase II transcription with pre-mRNA processing. Curr. Opin. Cell Biol. 11, 347 –351[CrossRef][Medline]

  11. Bentley, D. (2002) The mRNA assembly line: transcription and processing machines in the same factory. Curr. Opin. Cell Biol. 14, 336 –342[CrossRef][Medline]

  12. Greenleaf, A. L. (1993) Positive patches and negative noodles: linking RNA processing to transcription? Trends Biochem. Sci. 18, 117 –119[CrossRef][Medline]

  13. Corden, J. L., and Patturajan, M. (1997) A CTD function linking transcription to splicing. Trends Biochem. Sci. 22, 413 –416[CrossRef][Medline]

  14. Goldstrohm, A. C., Greenleaf, A. L., and Garcia-Blanco, M. A. (2001) Co-transcriptional splicing of pre-messenger RNAs: considerations for the mechanism of alternative splicing. Gene (Amst.) 277, 31 –47[CrossRef][Medline]

  15. Proudfoot, N. J., Furger, A., and Dye, M. J. (2002) Integrating mRNA processing with transcription. Cell 108, 501 –512[Medline]

  16. Maniatis, T., and Reed, R. (2002) An extensive network of coupling among gene expression machines. Nature 416, 499 –506[CrossRef][Medline]

  17. Ho, C. K., and Shuman, S. (1999) Distinct roles for CTD Ser-2 and Ser-5 phosphorylation in the recruitment and allosteric activation of mammalian mRNA capping enzyme. Mol. Cell 3, 405 –411[Medline]

  18. Hirose, Y., Tacke, R., and Manley, J. L. (1999) Phosphorylated RNA polymerase II stimulates pre-mRNA splicing. Genes Dev. 13, 1234 –1239[Abstract/Free Full Text]

  19. Cho, E. J., Rodriguez, C. R., Takagi, T., and Buratowski, S. (1998) Allosteric interactions between capping enzyme subunits and the RNA polymerase II carboxy-terminal domain. Genes Dev. 12, 3482 –3487[Abstract/Free Full Text]

  20. Gall, J. G. (2000) Cajal bodies: the first 100 years. Annu. Rev. Cell Dev. Biol. 16, 273 –300[CrossRef][Medline]

  21. McCracken, S., Fong, N., Rosonina, E., Yankulov, K., Brothers, G., Siderovski, D., Hessel, A., Foster, S., Program, A. E., Shuman, S., and Bentley, D. L. (1997) 5'-Capping enzymes are targeted to pre-mRNA by binding to the phosphorylated carboxy-terminal domain of RNA polymerase II. Genes Dev. 11, 3306 –3318[Abstract/Free Full Text]

  22. Cho, E. J., Takagi, T., Moore, C. R., and Buratowski, S. (1997) mRNA capping enzyme is recruited to the transcription complex by phosphorylation of the RNA polymerase II carboxy-terminal domain. Genes Dev. 11, 3319 –3326[Abstract/Free Full Text]

  23. Yue, Z., Maldonado, E., Pillutla, R., Cho, H., Reinberg, D., and Shatkin, A. J. (1997) Mammalian capping enzyme complements mutant Saccharomyces cerevisiae lacking mRNA guanylyltransferase and selectively binds the elongating form of RNA polymerase II. Proc. Natl. Acad. Sci. U. S. A. 94, 12898 –12903[Abstract/Free Full Text]

  24. McCracken, S., Fong, N., Yankulov, K., Ballantyne, S., Pan, G., Greenblatt, J., Patterson, S. D., Wickens, M., and Bentley, D. L. (1997) The C-terminal domain of RNA polymerase II couples mRNA processing to transcription. Nature 385, 357 –361[CrossRef][Medline]

  25. Barilla, D., Lee, B. A., and Proudfoot, N. J. (2001) Cleavage/polyadenylation factor IA associates with the carboxyl-terminal domain of RNA polymerase II in Saccharomyces cerevisiae. Proc. Natl. Acad. Sci. U. S. A. 98, 445 –450[Abstract/Free Full Text]

  26. Carty, S. M., Goldstrohm, A., Suñe, C., Garcia-Blanco, M. A., and Greenleaf, A. L. (2000) Protein-interaction modules that organize nuclear function: FF domains of CA150 bind the phospho-CTD of RNA polymerase II. Proc. Natl. Acad. Sci. U. S. A. 97, 9015 –9020[Abstract/Free Full Text]

  27. Yuryev, A., Patturajan, M., Litingtung, Y., Joshi, R. V., Gentile, C., Gebara, M., and Corden, J. L. (1996) The C-terminal domain of the largest subunit of RNA polymerase II interacts with a novel set of serine/arginine-rich proteins. Proc. Natl. Acad. Sci. U. S. A. 93, 6975 –6980[Abstract/Free Full Text]

  28. Bourquin, J. P., Stagljar, I., Meier, P., Moosmann, P., Silke, J., Baechi, T., Georgiev, O., and Schaffner, W. (1997) A serine/arginine-rich nuclear matrix cyclophilin interacts with the C-terminal domain of RNA polymerase II. Nucleic Acids Res. 25, 2055 –2061[Abstract/Free Full Text]

  29. Tanner, S., Stagljar, I., Georgiev, O., Schaffner, W., and Bourquin, J. P. (1997) A novel SR-related protein specifically interacts with the carboxy-terminal domain (CTD) of RNA polymerase II through a conserved interaction domain. Biol. Chem. 378, 565 –571[Medline]

  30. Steinmetz, E. J., Conrad, N. K., Brow, D. A., and Corden, J. L. (2001) RNA-binding protein Nrd1 directs poly(A)-independent 3'-end formation of RNA polymerase II transcripts. Nature 413, 327 –331[CrossRef][Medline]

  31. Morris, D. P., Phatnani, H., and Greenleaf, A. L. (1999) Phospho-CTD binding and the role of a prolyl isomerase in pre-mRNA 3' end formation. J. Biol. Chem. 274, 31583 –31587[Abstract/Free Full Text]

  32. Cho, H., Orphanides, G., Sun, X., Yang, X. J., Ogryzko, V., Lees, E., Nakatani, Y., and Reinberg, D. (1998) A human RNA polymerase II complex containing factors that modify chromatin structure. Mol. Cell. Biol. 18, 5355 –5363[Abstract/Free Full Text]

  33. Grondin, B., Cote, F., Bazinet, M., Vincent, M., and Aubry, M. (1997) Direct interaction of the KRAB/Cys2-His2 zinc finger protein ZNF74 with a hyperphosphorylated form of the RNA polymerase II largest subunit. J. Biol. Chem. 272, 27877 –27885[Abstract/Free Full Text]

  34. Lee, J. M., and Greenleaf, A. L. (1989) A protein kinase that phosphorylates the C-terminal repeat domain of the largest subunit of RNA polymerase II. Proc. Natl. Acad. Sci. U. S. A. 86, 3624 –3628[Abstract]

  35. Lee, J. M., and Greenleaf, A. L. (1991) CTD kinase large subunit is encoded by CTK1, a gene required for normal growth of Saccharomyces cerevisiae. Gene Expr. 1, 149 –167[Medline]

  36. Sterner, D. E., Lee, J. M., Hardin, S. E., and Greenleaf, A. L. (1995) The yeast carboxyl-terminal repeat domain kinase CTDK-I is a divergent cyclin-cyclin-dependent kinase complex. Mol. Cell. Biol. 15, 5716 –5724[Abstract]

  37. Morris, D. P., Lee, J. M., Sterner, D. E., Brickey, W. J., and Greenleaf, A. L. (1997) Assaying CTD kinases in vitro and phosphorylation-modulated properties of RNA polymerase II in vivo. Methods Companion Methods Enzymol. 12, 264 –275[CrossRef]

  38. Longley, M. J., Pierce, A. J., and Modrich, P. (1997) DNA polymerase delta is required for human mismatch repair in vitro. J. Biol. Chem. 272, 10917 –10921[Abstract/Free Full Text]

  39. Dempsey, L. A., Hanakahi, L. A., and Maizels, N. (1998) A specific isoform of hnRNP D interacts with DNA in the LR1 heterodimer: canonical RNA binding motifs in a sequence-specific duplex DNA binding protein. J. Biol. Chem. 273, 29224 –29229[Abstract/Free Full Text]

  40. Wuarin, J., and Schibler, U. (1994) Physical isolation of nascent RNA chains transcribed by RNA polymerase II: evidence for cotranscriptional splicing. Mol. Cell. Biol. 14, 7219 –7225[Abstract]

  41. Harris, C. E., Boden, R. A., and Astell, C. R. (1999) A novel heterogeneous nuclear ribonucleoprotein-like protein interacts with NS1 of the minute virus of mice. J. Virol. 73, 72 –80[Abstract/Free Full Text]

  42. Mourelatos, Z., Abel, L., Yong, J., Kataoka, N., and Dreyfuss, G. (2001) SMN interacts with a novel family of hnRNP and spliceosomal proteins. EMBO J. 20, 5443 –5452[Abstract/Free Full Text]

  43. Mizutani, A., Fukuda, M., Ibata, K., Shiraishi, Y., and Mikoshiba, K. (2000) SYNCRIP, a cytoplasmic counterpart of heterogeneous nuclear ribonucleoprotein R, interacts with ubiquitous synaptotagmin isoforms. J. Biol. Chem. 275, 9823 –9831[Abstract/Free Full Text]

  44. Lau, P. P., Chang, B. H., and Chan, L. (2001) Two-hybrid cloning identifies an RNA-binding protein, GRY-RBP, as a component of apobec-1 editosome. Biochem. Biophys. Res. Commun. 282, 977 –983[CrossRef][Medline]

  45. Zhu, Y., Pe’ery, T., Peng, J., Ramanathan, Y., Marshall, N., Marshall, T., Amendt, B., Mathews, M. B., and Price, D. H. (1997) Transcription elongation factor P-TEFb is required for HIV-1 tat transactivation in vitro. Genes Dev. 11, 2622 –2632[Abstract/Free Full Text]

  46. Liu, J., and Kipreos, E. T. (2000) Evolution of cyclin-dependent kinases (CDKs) and CDK-activating kinases (CAKs): differential conservation of CAKs in yeast and metazoa. Mol. Biol. Evol. 17, 1061 –1074[Abstract/Free Full Text]

  47. Price, D. H. (2000) P-TEFb, a cyclin-dependent kinase controlling elongation by RNA polymerase II. Mol. Cell. Biol. 20, 2629 –2634[Free Full Text]

  48. Lee, J. M., and Greenleaf, A. L. (1997) Modulation of RNA polymerase II elongation efficiency by C-terminal heptapeptide repeat domain kinase I. J. Biol. Chem. 272, 10990 –10993[Abstract/Free Full Text]

  49. Patturajan, M., Conrad, N. K., Bregman, D. B., and Corden, J. L. (1999) Yeast carboxyl-terminal domain kinase I positively and negatively regulates RNA polymerase II carboxyl-terminal domain phosphorylation. J. Biol. Chem. 274, 27823 –27828[Abstract/Free Full Text]

  50. Komarnitsky, P., Cho, E. J., and Buratowski, S. (2000) Different phosphorylated forms of RNA polymerase II and associated mRNA processing factors during transcription. Genes Dev. 14, 2452 –2460[Abstract/Free Full Text]

  51. Cho, E. J., Kobor, M. S., Kim, M., Greenblatt, J., and Buratowski, S. (2001) Opposing effects of Ctk1 kinase and Fcp1 phosphatase at Ser 2 of the RNA polymerase II C-terminal domain. Genes Dev. 15, 3319 –3329[Abstract/Free Full Text]

  52. Myers, J., Morris, D., Greenleaf, A., and Oas, T. (2001) Phosphorylation of RNA polymerase II CTD fragments results in tight binding to the WW domain from the yeast prolyl isomerase Ess1. Biochemistry 40, 8479 –8486[CrossRef][Medline]

  53. Pei, Y., Hausmann, S., Ho, C. K., Schwer, B., and Shuman, S. (2001) The length, phosphorylation state, and primary structure of the RNA polymerase II carboxyl-terminal domain dictate interactions with mRNA capping enzymes. J. Biol. Chem. 276, 28075 –28082[Abstract/Free Full Text]

  54. Schroeder, S. C., Schwer, B., Shuman, S., and Bentley, D. (2000) Dynamic association of capping enzymes with transcribing RNA polymerase II. Genes Dev. 14, 2435 –2440[Abstract/Free Full Text]

  55. Rodriquez, C. R., Cho, E.-J., Keogh, M. C., Moore, C. L., Greenleaf, A. L., and Buratowski, S. (2000) Kin28, the TFIIH-associated CTD kinase, facilitates the recruitment of mRNA processing machinery to RNA polymerase II. Mol. Cell. Biol. 20, 104 –112[Abstract/Free Full Text]

  56. Licatalosi, D. D., Geiger, G., Minet, M., Schroeder, S., Cilli, K., McNeil, J. B., and Bentley, D. L. (2002) Functional interaction of yeast pre-mRNA 3' end processing factors with RNA polymerase II. Mol. Cell 9, 1101 –1111[Medline]

  57. Patturajan, M., Wei, X., Berezney, R., and Corden, J. L. (1998) A nuclear matrix protein interacts with the phosphorylated C-terminal domain of RNA polymerase II. Mol. Cell. Biol. 18, 2406 –2415[Abstract/Free Full Text]

  58. Gilmour, D. S., and Elgin, S. C. (1987) Association of topoisomerase I with transcriptionally active loci in Drosophila. Natl. Cancer Inst. Monogr. 1987, 17 –21

  59. Wu, H. Y., Shyy, S. H., Wang, J. C., and Liu, L. F. (1988) Transcription generates positively and negatively supercoiled domains in the template. Cell 53, 433 –440[Medline]

  60. Drolet, M., Phoenix, P., Menzel, R., Masse, E., Liu, L. F., and Crouch, R. J. (1995) Overexpression of RNase H partially complements the growth defect of an Escherichia coli delta topA mutant: R-loop formation is a major problem in the absence of DNA topoisomerase I. Proc. Natl. Acad. Sci. U. S. A. 92, 3526 –3530[Abstract]

  61. Zhang, H., Wang, J. C., and Liu, L. F. (1988) Involvement of DNA topoisomerase I in transcription of human ribosomal RNA genes. Proc. Natl. Acad. Sci. U. S. A. 85, 1060 –1064[Abstract]

  62. Cook, P. R. (1999) The organization of replication and transcription. Science 284, 1790 –1795[Abstract/Free Full Text]

  63. Merino, A., Madden, K. R., Lane, W. S., Champoux, J. J., and Reinberg, D. (1993) DNA topoisomerase I is involved in both repression and activation of transcription. Nature 365, 227 –232[CrossRef][Medline]

  64. Shykind, B. M., Kim, J., Stewart, L., Champoux, J. J., and Sharp, P. A. (1997) Topoisomerase I enhances TFIID-TFIIA complex assembly during activation of transcription. Genes Dev. 11, 397 –407[Abstract]

  65. Shaiu, W. L., and Hsieh, T. S. (1998) Targeting to transcriptionally active loci by the hydrophilic N-terminal domain of Drosophila DNA topoisomerase I. Mol. Cell. Biol. 18, 4358 –4367[Abstract/Free Full Text]

  66. Leonhardt, H., and Bestor, T. H. (1993) Structure, function and regulation of mammalian DNA methyltransferase. EXS (Basel) 64, 109 –119[Medline]

  67. Margot, J. B., Aguirre-Arteta, A. M., Di Giacco, B. V., Pradhan, S., Roberts, R. J., Cardoso, M. C., and Leonhardt, H. (2000) Structure and function of the mouse DNA methyltransferase gene: Dnmt1 shows a tripartite structure. J. Mol. Biol. 297, 293 –300[CrossRef][Medline]

  68. Bestor, T. H. (2000) The DNA methyltransferases of mammals. Hum. Mol. Genet. 9, 2395 –2402[Abstract/Free Full Text]

  69. Walsh, C. P., Chaillet, J. R., and Bestor, T. H. (1998) Transcription of IAP endogenous retroviruses is constrained by cytosine methylation. Nat. Genet. 20, 116 –117[CrossRef][Medline]

  70. Yoder, J. A., Walsh, C. P., and Bestor, T. H. (1997) Cytosine methylation and the ecology of intragenomic parasites. Trends Genet. 13, 335 –340[CrossRef][Medline]

  71. Walsh, C. P., and Bestor, T. H. (1999) Cytosine methylation and mammalian development. Genes Dev. 13, 26 –34[Abstract/Free Full Text]

  72. Bird, A., Tate, P., Nan, X., Campoy, J., Meehan, R., Cross, S., Tweedie, S., Charlton, J., and Macleod, D. (1995) Studies of DNA methylation in animals. J. Cell Sci. Suppl. 19, 37 –39[Medline]

  73. Bender, C. M., Gonzalgo, M. L., Gonzales, F. A., Nguyen, C. T., Robertson, K. D., and Jones, P. A. (1999) Roles of cell division and gene transcription in the methylation of CpG islands. Mol. Cell. Biol. 19, 6690 –6698[Abstract/Free Full Text]

  74. Burkle, A. (2000) Poly(ADP-ribosyl)ation: a posttranslational protein modification linked with genome protection and mammalian longevity. Biogerontology 1, 41 –46[CrossRef][Medline]

  75. Burkle, A. (2001) Physiology and pathophysiology of poly(ADP-ribosyl)ation. Bioessays 23, 795 –806[CrossRef][Medline]

  76. Ziegler, M., and Oei, S. L. (2001) A cellular survival switch: poly(ADP-ribosyl)ation stimulates DNA repair and silences transcription. Bioessays 23, 543 –548[CrossRef][Medline]

  77. Hassa, P. O., Covic, M., Hasan, S., Imhof, R., and Hottiger, M. O. (2001) The enzymatic and DNA binding activity of PARP-1 are not required for NF-{kappa}B coactivator function. J. Biol. Chem. 276, 45588 –45597[Abstract/Free Full Text]

  78. Anderson, M. G., Scoggin, K. E., Simbulan-Rosenthal, C. M., and Steadman, J. A. (2000) Identification of poly(ADP-ribose) polymerase as a transcriptional coactivator of the human T-cell leukemia virus type 1 tax protein. J. Virol. 74, 2169 –2177[Abstract/Free Full Text]

  79. Meisterernst, M., Stelzer, G., and Roeder, R. G. (1997) Poly(ADP-ribose) polymerase enhances activator-dependent transcription in vitro. Proc. Natl. Acad. Sci. U. S. A. 94, 2261 –2265[Abstract/Free Full Text]

  80. Miyamoto, T., Kakizawa, T., and Hashizume, K. (1999) Inhibition of nuclear receptor signalling by poly(ADP-ribose) polymerase. Mol. Cell. Biol. 19, 2644 –2649[Abstract/Free Full Text]

  81. Slattery, E., Dignam, J. D., Matsui, T., and Roeder, R. G. (1983) Purification and analysis of a factor which suppresses nick-induced transcription by RNA polymerase II and its identity with poly(ADP-ribose) polymerase. J. Biol. Chem. 258, 5955 –5959[Free Full Text]

  82. Borggrefe, T., Wabl, M., Akhmedov, A. T., and Jessberger, R. (1998) A B-cell-specific DNA recombination complex. J. Biol. Chem. 273, 17025 –17035[Abstract/Free Full Text]

  83. Wilson, G. M., Sutphen, K., Moutafis, M., Sinha, S., and Brewer, G. (2001) Structural remodeling of an A + U-rich RNA element by cation or AUF1 binding. J. Biol. Chem. 276, 38400 –38409[Abstract/Free Full Text]

  84. Loflin, P., Chen, C. Y., and Shyu, A. B. (1999) Unraveling a cytoplasmic role for hnRNP D in the in vivo mRNA destabilization directed by the AU-rich element. Genes Dev. 13, 1884 –1897[Abstract/Free Full Text]

  85. Dempsey, L. A., Sun, H., Hanakahi, L. A., and Maizels, N. (1999) G4 DNA binding by LR1 and its subunits, nucleolin and hnRNP D, a role for G-G pairing in immunoglobulin switch recombination. J. Biol. Chem. 274, 1066 –1071[Abstract/Free Full Text]

  86. Sune, C., Hayashi, T., Liu, Y., Lane, W. S., Young, R. A., and Garcia-Blanco, M. A. (1997) CA150, a nuclear protein associated with the RNA polymerase II holoenzyme, is involved in Tat-activated human immunodeficiency virus type 1 transcription. Mol. Cell. Biol. 17, 6029 –6039[Abstract]

  87. Sune, C., and Garcia-Blanco, M. A. (1999) Transcriptional cofactor CA150 regulates RNA polymerase II elongation in a TATA-box-dependent manner. Mol. Cell. Biol. 19, 4719 –4728[Abstract/Free Full Text]

  88. Goldstrohm, A. C., Albrecht, T. R., Sune, C., Bedford, M. T., and Garcia-Blanco, M. A. (2001) The transcription elongation factor CA150 interacts with RNA polymerase II and the pre-mRNA splicing factor SF1. Mol. Cell. Biol. 21, 7617 –7628[Abstract/Free Full Text]

  89. Eggert, M., Michel, J., Schneider, S., Bornfleth, H., Baniahmad, A., Fackelmayer, F. O., Schmidt, S., and Renkawitz, R. (1997) The glucocorticoid receptor is associated with the RNA-binding nuclear matrix protein hnRNP U. J. Biol. Chem. 272, 28471 –28478[Abstract/Free Full Text]

  90. Eggert, H., Schulz, M., Fackelmayer, F. O., Renkawitz, R., and Eggert, M. (2001) Effects of the heterogeneous nuclear ribonucleoprotein U (hnRNP U/SAF-A) on glucocorticoid-dependent transcription in vivo. J. Steroid Biochem. Mol. Biol. 78, 59 –65[CrossRef][Medline]

  91. Kim, M. K., and Nikodem, V. M. (1999) hnRNP U inhibits carboxy-terminal domain phosphorylation by TFIIH and represses RNA polymerase II elongation. Mol. Cell. Biol. 19, 6833 –6844[Abstract/Free Full Text]

  92. Romig, H., Fackelmayer, F. O., Renz, A., Ramsperger, U., and Richter, A. (1992) Characterization of SAF-A, a novel nuclear DNA binding protein from HeLa cells with high affinity for nuclear matrix/scaffold attachment DNA elements. EMBO J. 11, 3431 –3440[Abstract]

  93. Gohring, F., and Fackelmayer, F. O. (1997) The scaffold/matrix attachment region binding protein hnRNP-U (SAF-A) is directly bound to chromosomal DNA in vivo: a chemical cross-linking study. Biochemistry 36, 8276 –8283[CrossRef][Medline]

  94. Fisher, P. A., Lin, L., McConnell, M., Greenleaf, A., Lee, J. M., and Smith, D. E. (1989) Heat shock-induced appearance of RNA polymerase II in karyoskeletal protein-enriched (nuclear "matrix") fractions correlates with transcriptional shutdown in Drosophila melanogaster. J. Biol. Chem. 264, 3464 –3469[Abstract/Free Full Text]

  95. Warren, S. L., Landolfi, A. S., Curtis, C., and Morrow, J. S. (1992) Cytostellin: a novel, highly conserved protein that undergoes continuous redistribution during the cell cycle. J. Cell Sci. 103, 381 –388[Abstract/Free Full Text]

  96. Bregman, D. B., Du, L., van der Zee, S., and Warren, S. L. (1995) Transcription-dependent redistribution of the large subunit of RNA polymerase II to discrete nuclear domains. J. Cell Biol. 129, 287 –298[Abstract]

  97. Grosset, C., Chen, C. Y., Xu, N., Sonenberg, N., Jacquemin-Sablon, H., and Shyu, A. B. (2000) A mechanism for translationally coupled mRNA turnover: interaction between the poly(A) tail and a c-fos RNA coding determinant via a protein complex. Cell 103, 29 –40[Medline]

  98. Rossoll, W., Kroning, A. K., Ohndorf, U. M., Steegborn, C., Jablonka, S., and Sendtner, M. (2002) Specific interaction of Smn, the spinal muscular atrophy determining gene product, with hnRNP-R and gry-rbp/hnRNP-Q: a role for Smn in RNA processing in motor axons? Hum. Mol. Genet. 11, 93 –105[Abstract/Free Full Text]

  99. Neubauer, G., King, A., Rappsilber, J., Calvio, C., Watson, M., Ajuh, P., Sleeman, J., Lamond, A., and Mann, M. (1998) Mass spectrometry and EST-database searching allows characterization of the multi-protein spliceosome complex. Nat. Genet. 20, 46 –50[CrossRef][Medline]

  100. Pellizzoni, L., Charroux, B., Rappsilber, J., Mann, M., and Dreyfuss, G. (2001) A functional interaction between the survival motor neuron complex and RNA polymerase II. J. Cell Biol. 152, 75 –85[Abstract/Free Full Text]

  101. Sacco-Bubulya, P., and Spector, D. L. (2002) Disassembly of interchromatin granule clusters alters the coordination of transcription and pre-mRNA splicing. J. Cell Biol. 156, 425 –436[Abstract/Free Full Text]

  102. Warnecke, P. M., and Bestor, T. H. (2000) Cytosine methylation and human cancer. Curr. Opin. Oncol. 12, 68 –73[CrossRef][Medline]

  103. Baylin, S. B., and Herman, J. G. (2000) DNA hypermethylation in tumorigenesis: epigenetics joins genetics. Trends Genet. 16, 168 –174[CrossRef][Medline]