The Structure of the Zinc Finger Domain from Human Splicing Factor ZNF265 Fold*

Craig A. Plambeck {ddagger} §, Ann H. Y. Kwan § , David J. Adams {ddagger} ||, Belinda J. Westman § , Louise van der Weyden {ddagger} ||, Robert L. Medcalf **, Brian J. Morris {ddagger} and Joel P. Mackay § {ddagger}{ddagger}

From the {ddagger}Basic and Clinical Genomics Laboratory, School of Medical Science and Institute for Biomedical Research, University of Sydney, New South Wales 2006, Australia, the §School of Molecular and Microbial Biosciences, University of Sydney, New South Wales 2006, Australia, and **Monash University, Department of Medicine, Box Hill Hospital, Box Hill, Victoria 3128, Australia

Received for publication, February 24, 2003 , and in revised form, March 25, 2003.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Identification of the protein domains that are responsible for RNA recognition has lagged behind the characterization of protein-DNA interactions. However, it is now becoming clear that a range of structural motifs bind to RNA and their structures and molecular mechanisms of action are beginning to be elucidated. In this report, we have expressed and purified one of the two putative RNA-binding domains from ZNF265, a protein that has been shown to bind to the spliceosomal components U1-70K and U2AF35 and to direct alternative splicing. We show that this domain, which contains four highly conserved cysteine residues, forms a stable, monomeric structure upon the addition of 1 molar eq of Zn(II). Determination of the solution structure of this domain reveals a conformation comprising two stacked {beta}-hairpins oriented at ~80° to each other and sandwiching the zinc ion; the fold resembles the zinc ribbon class of zinc-binding domains, although with one less {beta}-strand than most members of the class. Analysis of the structure reveals a striking resemblance to known RNA-binding motifs in terms of the distribution of key surface residues responsible for making RNA contacts, despite a complete lack of structural homology. Furthermore, we have used an RNA gel shift assay to demonstrate that a single crossed finger domain from ZNF265 is capable of binding to an RNA message. Taken together, these results define a new RNA-binding motif and should provide insight into the functions of the >100 uncharacterized proteins in the sequence data bases that contain this domain.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The zinc finger architecture has provided a diverse range of structures and functions (1), including recognition of protein, DNA, and RNA targets. Whereas the structural basis for interactions between DNA and classical zinc fingers has been well characterized (24), RNA-zinc finger interactions are less well understood, with only limited structural information on the conformation of RNA-binding domains available. Available structures have indicated that in many cases, RNA-binding domains are not completely folded in the absence of their target RNA sequence (5). Conformational changes have been observed to occur in both protein and the RNA target upon binding (59), and even completely unstructured domains have been found to fold upon interacting with RNA (10). It has therefore been suggested that rigid proteins may be unfavorable for RNA-binding and flexible structures may increase specificity (5).

In recent years, the functions of many RNA-binding proteins that regulate the process of RNA splicing have been delineated, enhancing our understanding of splice site recognition, spliceosomal coordination, and alternative splicing. Typically, the accessory proteins involved in splicing (SR proteins) contain an N-terminal RNA-binding domain and a C-terminal protein-binding or RS domain that is rich in Arg and Ser residues.

The human SR protein, ZNF265 (or ZIS) (11), can bind to the splicing factors U1-70K and U2AF35 (components of the E-complex that forms early in the splicing process) and is able to stimulate alternative splicing (12). It has also been shown to immunoprecipitate with splicing factors in association with mRNA and to co-localize in the nucleus with components of the transcriptosome (12), indicating that it plays a role in mRNA processing. ZNF265 is a multidomain protein, and is highly conserved in many organisms, including mouse (Zfp265) (13), rat (Zfp265; formerly Zis) (14), and Xenopus (C4SR) (15). Like other SR proteins, ZNF265 contains a C-terminal RS domain, as well as a nuclear localization sequence and a glutamic acidrich domain. However, in contrast to other SR proteins, the N terminus of ZNF265 does not contain any of the known RNA recognition motifs, but rather contains 8 Cys residues spaced appropriately to give rise to two putative zinc finger domains, each with a CX2/4CX10CX2C topology.

Domains with homology to the putative zinc fingers of ZNF265 have been observed in a wide range of proteins, including ubiquitin ligases such as Mdm2 (16, 17) and RanBP2 (18, 19), as well as RNA-binding proteins such as EWS (20). However, little structural information is available on any of these proteins. In addition, the putative zinc finger domains appear to be capable of carrying out different functions depending on context. In RanBP2, the eight tandem zinc finger domains are essential for exportin-1 binding (21), whereas the double zinc finger domain in C4SR, the Xenopus homolog of ZNF265, binds to RNA (15).

In the present study, we show that the conserved Cys-containing motifs in ZNF265 are genuine zinc-binding domains and we have determined the solution conformation of the first zinc finger of ZNF265, using NMR methods. The structure appears to constitute a variant of the zinc ribbon class of zinc-binding domains (1). In addition, we demonstrate that the ZNF265 zinc finger domain can bind cyclin B1 mRNA, indicating that this fold has apparently been adapted for both RNA-binding and protein-binding. Such adaptability appears to be common among zinc-binding motifs.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Sequence Analysis—All sequence analysis and alignments were performed using the software pileup and prettybox (Australian Genome Information Service).1

Expression and Purification—The first putative zinc finger region of ZNF265 is located at the N-terminal end of the protein. A construct encoding residues 1–40 of human ZNF265 (accession code NP_005446 [GenBank] ) was therefore cloned into the Escherichia coli expression vector pGEX 4T-3 (Amersham Biosciences), creating a C-terminal fusion with glutathione S-transferase (GST).2 Note that DNA sequencing revealed that all 10 isolated clones contained aspartic acid at position 32, rather than asparagine (the residue listed in the GenBankTM sequence). It is likely that the sequence variation indicates either a single nucleotide polymorphism or perhaps a sequencing error in the original paper (11); only one cDNA was sequenced in this initial report. The fusion protein was expressed in the host strain DH5{alpha} grown in Luria-Bertani broth. Cells were grown at 37 °C, and expression of ZNF265-F1 was induced at an A600 nm of 0.6 by the addition of isopropyl-{beta}-D-thiogalactopyranoside (0.5 mM). After a further 3 h, cells were harvested by centrifugation, and the cell pellet was stored at –80 °C prior to use.

Cell pellets were resuspended in sonication buffer containing Tris (50 mM, pH 8), NaCl (50 mM), Triton X-100 (1%), phenylmethylsulfonyl fluoride (0.5 mM), and {beta}-mercaptoethanol (2 mM). Cells were lysed by sonication and centrifuged. The supernatant was loaded onto a glutathione-SepharoseTM 4B (Amersham Biosciences) column pre-equilibrated in sonication buffer. The column was washed and the 45-residue ZNF265-F1 peptide (containing an additional Gly-Ser at the N terminus, arising from the thrombin recognition site and a Gly-Pro-Ile at the C terminus because of cloning via a pGEMTM-T Easy intermediate) cleaved from the column with thrombin (5 units/1 ml of beads) at room temperature overnight. The eluted peptide was purified to homogeneity by reverse phase high performance liquid chromatography, and its identity confirmed using electrospray mass spectrometry (Mtheor = 4995.6 Da; Mobs = 4996.0 Da).

Analytical Ultracentrifugation—Sedimentation equilibrium experiments were performed using a Beckman model XL-A analytical ultra-centrifuge equipped with an An-60 Ti rotor. ZNF265-F1 was prepared by dissolving lyophilized peptide in H2O containing 1 mM tris(2-carboxyethyl)phosphine and 1.7 mM ZnSO4 and the pH was adjusted to 5.8 using 0.1 M NaOH. Protein concentrations were determined in the XL-A by reference to a blank containing this buffer alone. Data were collected in two-sector cells as absorbance (280 nm) versus radius scans (0.001 cm increments, 10 averages) at 36,000, 43,000, and 56,000 rpm. Scans were collected at 3-h intervals and compared to determine when the samples reached equilibrium. Analysis of the data was carried out using the NONLIN software (22) and the best model and final parameters were determined by examination of the residuals derived from fits to several models. The partial specific volume of ZNF265-F1 (0.708 ml g1) and the density of the buffer (0.999 g ml1) were determined from the amino acid sequence (23) using the program SEDNTERP (www.bbri.org/rasmb/rasmb.html) (24).

NMR Spectroscopy—ZNF265-F1 was prepared for NMR by dissolving ~2.5 mg of peptide in H2O/D2O (95:5, 500 µl) containing tris(2-carboxyethyl)phosphine (5 mM) and ZnSO4 (1.5 mM), giving a ~1 mM sample. The sample was prepared at low pH to avoid protein aggregation, and the pH was then increased to 5.8 using 0.1 M NaOH. The sample was placed in a 5-mm outer diameter, susceptibility matched microcell (Shigemi). For experiments in D2O, the sample was lyphophilized and reconstituted in a microcell in 500 µl of 99.96% D2O.

All NMR experiments used for determination of the structure of ZNF265-F1 were carried out at 298 K on a 600-MHz Bruker DRX600 spectrometer, equipped with a 5-mm triple resonance probe and three-axis pulsed field gradients. Water suppression was achieved using pulsed-field gradients and the WATERGATE sequence (25). The following homonuclear two-dimensional spectra were recorded on the ZNF265-F1 sample: DQFCOSY (26), TOCSY (27) ({tau}m = 35 and 70 ms), and NOESY (28) ({tau}m = 50 and 200 ms). Acquisition times were 33 and 131 ms for each of these experiments in t1 and t2, respectively.

Spectra were processed using XWINNMR (Bruker) on a Silicon Graphics O2 work station. The 1H frequency scale of all spectra was directly referenced to sodium 3-trimethylsilyl propionate-2,2,3,3-d4 at 0.00 ppm. Two-dimensional NMR spectra were processed using Lorentz-Gauss window functions in the directly detected dimension and shifted square sine bell functions in the indirectly detected dimension. Polynomial baseline corrections were applied to the processed spectra in F2. Spectra were analyzed using the program XEASY (29).

Structure Calculations—NOE-derived distance restraints were obtained from intensities of cross-peaks in the two-dimensional 1H NOESY spectra and calibrated using the CALIBA module of DYANA (30). Dihedral angle restraints for {phi} angles were set to –60 ± 40o for 3JHN{alpha} < 6 Hz and –120 ± 40o for 3JHN{alpha} > 8 Hz, based on the use of the INFIT algorithm (31) that is part of the XEASY package. In addition, a restraint of –100 ± 80o for {phi} angles was applied for residues in which the intraresidue H{alpha}-HN NOE was clearly weaker than the NOE between H{alpha} and HN of the following residue (32). The {beta} methylene protons of the two Pro residues were stereospecifically assigned from NOESY spectra on the basis that H{alpha} is always closer to H{beta}3 than H{beta}2 (33). Both X-Pro bonds were identified as being in the trans-conformation on the basis of strong NOEs between the H{delta} values of each Pro residue and the H{alpha} protons of the preceding residues (34). No hydrogen bond restraints were used in the structure calculations.

Initially, structure calculations were performed in DYANA using a distance geometry/simulated annealing protocol (30) and additional NOEs were assigned iteratively based on earlier sets of structures. No assumption about zinc coordination was made at this stage and preliminary structures indicated the four cysteines (Cys15, Cys20, Cys31, and Cys34) were possible zinc ligands. To include additional distance and angle constraints that maintain the tetrahedral bonding geometry and appropriate bond lengths with the zinc atom (35), as well as to incorporate ambiguous restraints, subsequent structural refinement was performed in CNS (36) using the package ARIA1.2 (37, 38). Manually assigned NOEs in combination with the remaining ambiguous NOEs were included in the ARIA structure calculations and the latter NOEs were iteratively assigned by the program.

Briefly, eight cycles of 20 structures each and a last cycle of 500 structures were performed in the final refinement. Manually assigned NOEs were included in iteration zero as soft restraints. In each cycle the seven lowest energy structures from the previous iteration were used to extract additional NOE restraints with a tolerance of 0.02 ppm in the F1 dimension and 0.02 ppm in the F2 dimension. If a restraint was violated by more than a predefined target value in over 50% of the seven structures, it was discarded. The violation target value was progressively reduced from 1000 Å in iteration zero to 0.1 Å in iteration eight. Ambiguous distance restraints were treated as described (38) and the peak volume cut off was gradually reduced from 1.01 in the first iteration to 0.80 in the last. Dihedral angle restraints and stereospecific assignments as detailed above were also included in the calculations. The final assignments made by ARIA were checked and corrected manually where necessary. Overall, 4 (1.2%) non-sequential inter-residue NOEs for which no consistent assignment could be determined were excluded from the final calculations.

Calculations were carried out in the simplified all-hydrogen PARALLHDG5.3 force field with non-bonded interactions modeled by PROLSQ force field (39); floating chirality assignment (40) was used for all methylene and isopropyl groups for which no stereospecific assignment could be made. Finally, the 100 lowest energy structures were refined in a 9-Å shell of water molecules (41). The 20 conformers with the lowest value of Etot were visualized and analyzed using the programs MOLMOL (42), PROCHECK (43), and WhatIf (44). The family of 20 low-energy structures (accession code 1N0Z) have been deposited in the Protein Data Bank. Chemical shift and restraint lists that were used in the structure calculations have been deposited in BioMagRes-Bank (accession code 5667).

RNA Electrophoretic Mobility Shift Assay (REMSA)—Expression plasmids harboring the full-length cyclin B1 cDNA were used to transcribe the cyclin RNA probe in vitro. Following linearization with BamHI, 1 µg of template was incubated for 2 h at 37 °C in the presence of 50 µCi of [{alpha}-32P]UTP (DuPont), 10 µM UTP, 0.5 mM ATP, 0.5 mM GTP, 0.5 mM CTP, 20 units of RNase inhibitor (Promega), and 50 units of T7 RNA polymerase. RNA probes were purified on a 4% polyacrylamide-urea denaturing gel, eluted overnight at room temperature in a solution containing 500 mM NH4CH3COO and 1 mM EDTA, ethanol precipitated at –80 °C, and resuspended in water (100 cps/µl) as previously described (45).

For the binding assays, 0.1–5.0 µg of affinity purified GSTZNF265-F1 protein was preincubated with 150 µg of heparin (Sigma) for 10 min at room temperature in CEB buffer before addition of the RNA probe (200 cps). After 30 min incubation at room temperature, samples were subjected to RNase T1 digestion (1 µl; 2 units; Roche Applied Science) for 15 min at room temperature. Samples were then electrophoresed through a 4% native polyacrylamide gel and protein-RNA complexes were visualized by autoradiography.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Homology of the Zinc Finger Domain of ZNF265—To investigate the conservation of the zinc finger domains of ZNF265, a Blast analysis was conducted using the amino acid sequence of the first zinc finger domain. Fig. 1 shows that in addition to being nearly identical to each other, both of the zinc fingers of human ZNF265 are completely conserved across a number of eukaryotic species, including rat and mouse ZFP265 and the RNA-binding protein C4SR from Xenopus laevis. Furthermore, these domains show homology to the zinc fingers found in Drosophila sol (46), as well as in RNA-binding proteins from Arabidopsis thaliana, suggesting ancient origins for this zinc finger motif, and therefore an important function(s). More than 100 proteins carry domains of this type, and these proteins exhibit a variety of domain structures and functions. Proteins may contain between one and eight copies of these domains, which may be arranged either in tandem clusters or spread throughout the protein. In some cases, no other recognizable sequence motifs have been identified.



View larger version (33K):
[in this window]
[in a new window]
 
FIG. 1.
Domain structure and sequence alignment of ZNF265. A, schematic diagram of ZNF265 indicating the domain structure of the protein. B, sequence lineup of the cysteine-rich regions of ZNF265 and related proteins. Conserved residues are indicated with light gray and gray shading, whereas cysteines identified as putative zinc ligands are indicated with a dark gray box. Non-native residues introduced to the sequence as a result of cloning artifacts are in italics. Numbers shown beside sequences are for full-length protein sequences.

 

Despite this diversity, two functional themes are apparent. Several of the proteins (such as RanBP2) are involved in either ubiquitination or SUMOylation, whereas many others appear to be involved in RNA metabolism. Thus, in a number of cases, the putative zinc-binding motif is directly C-terminal to a known RNA recognition motif, such as an RRM domain (47), and proteins such as ZNF265 contain other elements associated with splicing, such as RS domains (48).

The Cysteine-rich Region of ZNF265 Forms a Monomeric Zinc-dependent Structure—Sequence analysis performed at the time of the initial discovery of rat ZFP265 ("Zis" (14)) identified the putative zinc finger domains of this protein, although no experimental verification was made. We therefore subcloned DNA encoding the first such domain from ZNF265 (ZNF265-F1) into the pGEX-4T-3 vector. The protein was expressed as a GST fusion and purified using affinity chromatography and reversed-phase high performance liquid chromatography. Onedimensional 1H NMR spectra, recorded at pH 5.8 in both the absence and presence of 1 molar eq of Zn(II) (Fig. 2A), showed a substantial increase in chemical shift dispersion in the presence of Zn(II). This increase is consistent with the formation of persistent secondary structure and demonstrates that these domains are indeed genuine zinc finger domains.



View larger version (21K):
[in this window]
[in a new window]
 
FIG. 2.
ZNF265-F1 is a monomeric zinc-binding domain. A, one-dimensional 1H NMR spectra of ZNF265-F1 in the absence (top) and presence (bottom) of 1 molar eq of Zn(II). The substantial increase in chemical shift dispersion in the presence of Zn(II) indicates the formation of stable secondary structure. Spectra were recorded at pH 5.8 and 298 K, on a 600 MHz spectrometer (sample concentration was 0.5 mM). B, sedimentation equilibrium data. Data were recorded at 43,000 rpm (25 °C). The lower graph displays a plot of absorbance at 280 nm versus radius (r2/2; cm2), whereas the upper graph illustrates the residual deviations resulting from the fit of an ideal single species model to the data.

 

To determine the aggregation state of ZNF265-F1, we used sedimentation equilibrium methods (Fig. 2B). Data were recorded on two different loading concentrations at three speeds, and these data were well fitted by a model incorporating a single, ideal species with a molecular mass of 4.9 kDa (with 95% confidence intervals of 4.5 and 5.2 kDa; Mtheor = 5,061 Da). Thus ZNF265-F1 is monomeric in solution under these conditions.

Determination of the Structure of ZNF265-F1—The high quality of the 1H NMR spectra of ZNF265-F1 in the presence of Zn(II) (Fig. 2A) allowed us to determine the three-dimensional solution structure of this domain. 1H resonance assignments were achieved by using standard two-dimensional homonuclear NMR experiments. Using DYANA (30), NOEs were iteratively assigned and a preliminary was fold determined; no zinc ion was included, and the zinc coordination site was not restrained during these calculations. On the basis of these structures, it was clear that the four cysteine thiol groups (Cys15, Cys20, Cys31, and Cys34) comprised the zinc coordination sphere. Subsequent calculations, carried out using the ARIA protocol (37, 38) implemented in CNS (36), therefore included a zinc ion, together with constraints defining tetrahedral coordination. The remaining ambiguous NOEs were introduced iteratively in ARIA in an automated manner. The final ensemble of the 20 lowest energy structures is based on a total of 709 unambiguous intra- and inter-residue distance restraints derived from two-dimensional NOESY spectra and 22 backbone dihedral restraints. The total number of experimental restraints was 731. Of these, 347 were meaningful inter-residues restraints, representing an average of 12.8 constraints per structured residue (i.e. residues 12–38). From the final ARIA calculations, the 20 lowest energy structures were chosen to represent the solution structure of ZNF265-F1 (Fig. 3A). The structures display good covalent geometry and good nonbonded contacts. Analysis of the structures with PROCHECK (43) shows that, for residues that exhibit {phi} angle order parameters >0.6, over 99.6% of non-glycine and non-proline residues falls into the most favored or additionally allowed regions of the Ramachandran plot. Structural statistics for the ensemble are given in Table I.



View larger version (38K):
[in this window]
[in a new window]
 
FIG. 3.
The solution structure of ZNF265-F1. A, ensemble of best 20 structures of ZNF265-F1 (backbone atoms only). Structures are superimposed over the backbone atoms (C{alpha}, C', and N) of residues 12–38 (residues 2–11 and 39–43, which are unstructured, are omitted for clarity). The zinc chelating side chains and the zinc atom are shown. B, ribbon diagram of one of the lowest energy structures of ZNF265-F1 showing elements of secondary structure as recognized in the program MOLMOL. Both diagrams are shown in wall-eyed stereo format. C, diagram showing the crossing of the two {beta}-hairpins that form the zinc-binding site. The hairpins are angled at ~80° relative to each other. D, diagram showing the distribution of conserved residues in the first CF domain of ZNF265. Residues that are highly conserved are shown in space filling representation and are labeled. The most highly conserved residues appear to be involved in maintaining the fold of the domain, whereas others lie on the protein surface, indicating a functional role.

 

View this table:
[in this window]
[in a new window]
 
TABLE I
Structural statistics for ZNF265-F1

 

ZNF265-F1 Forms a Zinc Ribbon—Residues 12–38 of ZNF265-F1 adopt a compact, globular fold that incorporates a single zinc ion. The fold consists of two distorted {beta}-hairpins that each provide two of the zinc ligating cysteines (Fig. 3B); the Zn(II) ion is effectively sandwiched between the two hairpins, which cross each other at an angle of ~80° (Fig. 3C). The first {beta}-hairpin is formed by residues Asp12-Asn24 (strands 1 and 2 comprise residues Trp13-Cys15, and Asn22-Asn24, respectively) and displays backbone hydrogen bonds between the HN proton of Cys15 and the carbonyl oxygen of Asn22, as well as between the HN proton of Asn24 and the carbonyl oxygen of Trp13. In addition there are two hydrogen bonds to the S{gamma} atom of Cys20, from the backbone amide protons of Asn22 and Gly21 (2.1 and 3.1 Å in the lowest energy structure, respectively); and an additional hydrogen bond to the S{gamma} atom of Cys15, from the backbone amide protons of Asp17 (3.6 Å). These hydrogen bonds are characteristic of the rubredoxin knuckle, which is very common in zinc-binding domains (49). Residues Cys15-Lys18 form a type VIII {beta}-turn, judging from backbone {phi} and {varphi} angles (although no (i,i+3) hydrogen bond is observed), whereas residues Asp17-Cys20 form an interlinked type I {beta}-turn (50). The side chain carbonyl oxygen of Asp17 forms hydrogen bonds with the HN protons of Lys18 and Lys19, as is commonly observed in type I turns (50).

The second {beta}-hairpin consists of residues Thr29-Lys38 but is not recognized by the Kabsch and Sander (51) secondary structure prediction algorithm (as implemented in MOLMOL; 42) in any of the final conformers. A closer manual inspection of the structures, however, reveals that many of the hydrogen bonding patterns observed across the first {beta}-sheet are also maintained here. For example, hydrogen bonds can be observed between the H{gamma} proton of Ser30 and the side chain carbonyl oxygen of Glu37, the HN proton of Lys38 and the carbonyl oxygen of Thr29, and between the HN proton of Cys31 and the carbonyl oxygen of Arg36. This hairpin is recognized in the family of conformers that are obtained prior to water refinement, but the slight loosening of the ensemble that takes place in the water refinement puts the hydrogen bond parameters just outside the tolerances set by the Kabsch and Sander (51) algorithm. Consequently, we have not shown this hairpin in Fig. 3. The two strands of this "hairpin" are connected by residues Cys31-Cys34, which form a half-knuckle with a hydrogen bond linking the S{gamma} atom of Cys31 to the backbone amide proton of Arg33. A small hydrophobic core is formed by Trp13, Cys15, Pro16, Asn22, Asn24, Cys31, and one face of the aliphatic side chain of Lys40.

A search of known protein structures, using ZNF265-F1 as a template and the program DALI (52), did not reveal any structural homologs, indicating that ZNF265-F1 constitutes a new protein fold. Because the fold consists of two hairpins crossing each other, we have termed this new fold a crossed finger or CF domain. However, DALI searches can yield false negatives when dealing with small domains (1). Inspection of the structure reveals that it falls into the zinc ribbon class of zinc-binding domains, as defined by Krishna et al. (1). Members of the zinc ribbon class all possess two {beta}-hairpins, but otherwise display significant sequence and structural variability. Typically, a third {beta}-strand adjoins one of the hairpins. Many classical zinc ribbon proteins are involved in the transcriptional/translational machinery and are capable of interacting with either DNA or RNA (1). Overlays of ZNF-F1 with typical zinc ribbons (e.g. the ribosomal protein L37E; PDB number 1JJ2 [PDB] ) give RMSDs of 2–3 Å.

Within the family of crossed finger domains, a number of amino acids, other than the zinc ligands, are highly conserved (Fig. 1). Fig. 3D illustrates the distribution of these conserved amino acids on the structure of ZNF265-F1. Two of these, Trp13 and Asn24, are >90% buried, and are likely to be important for maintaining the folded structure of the CF domain. Indeed, the buried side chain amide protons of Asn24 form hydrogen bonds with the backbone carbonyl oxygens of Trp13 and Ile14. Lys38 also packs against Trp13 and probably contributes to fold stability, whereas the remaining most highly conserved residues, Asp12, Lys18, Arg27, Arg28, and Phe25, are largely exposed. These latter residues may therefore serve a functional role.

Could the CF be a Novel RNA-binding Domain?—Analysis of the surface of the ZNF265-F1 structure reveals a number of conserved amino acids that are presumably not involved in specifying the folded conformation of the domain. Two lysines (Lys18 and Lys19) and three arginine (Arg27, Arg28, and Arg33) residues that lie within the well defined portion of the domain and are well conserved are highlighted in Fig. 4A together with the exposed phenylalanine residue Phe25. Strikingly, the spatial arrangement of a number of these residues is remarkably similar to the RNA-binding surface identified in the double stranded RNA-binding module of PKR (Fig. 4B). A large body of structural (53) and mutagenic (5456) data has shown that a solvent-exposed phenylalanine, together with several lysine and arginine residues are used to contact RNA.



View larger version (71K):
[in this window]
[in a new window]
 
FIG. 4.
Putative RNA-binding surface of the ZNF265 CF domain. A, structure of ZNF265-F1, showing putative RNA-binding surface of the ZNF265 CF domain, with a number of the most highly conserved residues displayed in space-filling representation. B, structure of the double stranded RNA-binding module of PKR (53); residues implicated in RNA binding are displayed in space-filling representation and are colored to match the spatially related residues from ZNF265. Unstructured end terminal residues are omitted for clarity. The Protein Data Bank accession code for the PKR structure is 1QU6 [PDB] .

 

The ZNF265 Crossed Finger Domain Can Bind mRNA—The structural homology of the CF domain to a known RNA-binding protein suggested to us that this domain might function in RNA recognition. Interestingly, crossed finger domains are found in a number of other proteins that have either been shown directly to be involved in RNA metabolism or contain domains with RNA-related functions. The double-CF domain of C4SR, the Xenopus homolog of ZNF265, has been shown to stably interact with a cyclin B1 mRNA transcript (15). To determine whether ZNF265 could bind to the same cyclin transcript, a REMSA analysis were performed using an in vitro transcribed 32P-labeled full-length cyclin B1 transcript and purified ZNF265 protein. As shown in Fig. 5, addition of increasing concentrations of ZNF265 to the cyclin transcript produced a protein-RNA complex (indicated by arrow). This was seen clearly when using 5 µg of purified material (lane 4), but was also evident at a lower concentration (i.e. 100 ng) after prolonged exposure of the film. These results indicate that ZNF265 is able to bind to the cyclin transcript.



View larger version (43K):
[in this window]
[in a new window]
 
FIG. 5.
ZNF265 binds to cyclin mRNA. REMSA showing the binding of GST-ZNF265-F1 to a cyclin B1 mRNA transcript. Increasing concentrations of GST-ZNF265-F1 (0, 0.1, 1.0, and 5.0 µg; lanes 1–4, respectively) were added to the radiolabeled cyclin transcript and the mixtures subjected to RNase T1 digestion as described under "Experimental Procedures." The concentration dependent binding of GSTZNF265-F1 is clearly observed by the appearance of a retarded protein-RNA complex at the highest concentration of GST-ZNF265-F1 (arrow; lane 4).

 


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The Structure of ZNF265-F1—The crossed finger domain seems to be a subclass of the zinc ribbon fold, and it adds to the already large family of zinc-binding domains in which the zinc probably serves only a structural (rather than a catalytic) role (1). There are at least eight classes of such domains for which structural information is available (1), and the structures that they form are diverse. A recurring theme among these structures is the fact that the zinc ligands are divided into two pairs in the amino acid sequence. The spacing between the two ligands within a pair is generally 2 to 5 amino acids, whereas the sequence connecting the pairs may be up to 50 amino acids in length. It is clear that these H/C-X2–5-H/C motifs take up a limited number of conformations, and presumably form a stable macrocyclic structure when bound to a zinc ion. Motifs containing histidine residues often take up a helical conformation (e.g. in classical zinc fingers), whereas C-X2–5-C sequences typically form part of a turn or take up an irregular conformation. Given the diversity of folds that contain these motifs, it is interesting to speculate about why they have apparently arisen so frequently during evolution. It is possible that when two H/C-X2–5-H/C units arise by random sequence variation and are spaced less than ~50 amino acids apart, there is a relatively high probability that this sequence will be able to ligate a zinc ion, thereby forming some kind of persistent structure (high at least compared with the chance of persistent structure forming in the absence of directing influences like metal-binding motifs).

At least three of the classes of zinc-binding domains have been demonstrated to bind specifically to RNA. The retroviral nucleocapsid domain that is essential for RNA packaging comprises a C-X2-C-X4-H-X4-C zinc-binding domain (57). The zinc ribbon fold is found in many ribosomal proteins and recognizes RNA, and a number of proteins containing classical zinc fingers, such as TFIIIA, also recognize RNA (58). Of these, structural alignments indicate that the zinc ribbon resembles most closely the ZNF265 domain.

ZNF265 as an RNA-binding Protein—The REMSA data presented here indicate that the ZNF265 CF domains are capable of interacting with an mRNA transcript from the cyclin B1 gene. These data are in accord with the previous demonstration by Ladomery et al. (15) that the zinc finger region from X. laevis C4SR, a ZNF265 sequence homolog (~98% similarity in the double CF domain region), can stably bind the same RNA message.

Several themes have emerged from studies of RNA-binding proteins to date and at least two of these are reflected in ZNF265. First, the importance of both basic and exposed aromatic amino acids for RNA recognition has become clear (5). The CF domains of ZNF265 contain many basic residues (seven in ZNF265-F1), as well as an exposed phenylalanine (Phe25) and, remarkably, a number of these residues combine to form a surface that is rather similar to that observed in unrelated RNA-binding proteins such as PKR. Indeed, the use of a surface phenylalanine for the recognition of a ribonucleotide base is rather common among RNA-binding proteins; the transcriptional termination factor Rho is another example of this phenomenon (59).

Second, RNA recognition motifs (and double stranded RNA-binding modules) are often found in pairs separated by a flexible linker; and large conformational changes, such as the freezing of a flexible linker between two RNA-binding domains, often occur upon RNA binding (47, 60). This is in contrast to DNA-binding zinc fingers that typically have much shorter linkers, usually of seven amino acids between adjacent fingers (61). ZNF265 contains two CF domains separated by a 24-residue linker, and interestingly PKR has two double stranded RNA-binding modules that are connected by a 22-residue flexible linker, consistent with the observed RNA-binding function of ZNF265. It has previously been suggested that widely spaced zinc fingers may allow greater flexibility in binding site selection (62, 63).

We have shown that the zinc fingers of ZNF265 form an RNA-binding motif, and the protein therefore conforms to the standard architecture of SR protein splicing factors (64, 65). It has been shown that ZNF265 has a role in directing alternative splicing and is able to bind to both U1-70K and U2AF35 (12). However, the binding to these proteins alone is insufficient for directing alternative splicing, because U1-70K and U2AF35 bind at their respective sites in all introns. Thus, to enable splice site selection the zinc finger domain of ZNF265 may selectively bind RNA targets; we are currently in the process of defining the sequence specificity for the ZNF265-RNA interaction.

The CF Domain in Other Proteins—Sequence analysis of the proteins that contain CF domains indicates a likely role for many CF-containing proteins in RNA metabolism. Whereas a number of these proteins have no known function, at least 30 contain other motifs such as the well characterized RNA recognition motif, or RRM (66), further linking this domain to RNA metabolism. The presence of multiple CF domains in the nuclear transport factor RanBP2 is particularly interesting. RanBP2 is a 358-kDa nucleoporin that is involved in nuclear export (18, 67). The presence of CF domains in this protein may indicate that it is involved not only in the export of protein cargo but also in shuttling RNA in and/or out of the nucleus. It should be noted, however, that few of the positively charged residues found in ZNF265-F1 are conserved in either the RanBP2 or the other ubiquitin ligase CF domains. This may argue for the adaptation of the CF fold for more than one function, in the same way that many other classes of zinc-binding domains appear now to be capable of a variety of roles (68).

In summary, a combination of structural, REMSA, and sequence analysis data have shown that the zinc-binding domains of the splicing factor ZNF265 are zinc ribbons that bind to RNA, most likely via a mechanism that bears some resemblance to the binding modes of other RNA-binding proteins. These data point to an interesting convergence of function among a wide variety of proteins involved in RNA metabolism.


    FOOTNOTES
 
The atomic coordinates and structure factors (code 1QU6 [PDB] ) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).

* This work was supported in part by a grant from the Australian Research Council. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

Supported by an Australian Postgraduate Award. Back

|| Recipients of National Health and Medical Research Council (NHMRC) C. J. Martin fellowships. Present address: The Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom. Back

{ddagger}{ddagger} To whom correspondence should be addressed: School of Molecular and Microbial Biosciences, University of Sydney, NSW 2006, Australia. Tel.: 61-2-9351-3906; Fax: 61-2-9351-4726; E-mail: j.mackay{at}mmb.usyd.edu.au.

1 See www.angis.org.au. Back

2 The abbreviations used are: GST, glutathione S-transferase; REMSA, RNA electrophoretic mobility shift assay; CF, crossed finger domain; r.m.s.d., root mean square deviation. Back


    ACKNOWLEDGMENTS
 
We thank Michael Ladomery for providing the cyclin B1 clone, Kayesh Fairley for input into the chemical shift assignments, and Dr. Bill Bubb for expert maintenance of the DRX600 NMR spectrometer at the University of Sydney.



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Krishna, S. S., Majumdar, I., and Grishin, N. V. (2003) Nucleic Acids Res. 31, 532–550[Abstract/Free Full Text]
  2. Foster, M. P., Wuttke, D. S., Radhakrishnan, I., Case, D. A., Gottesfeld, J. M., and Wright, P. E. (1997) Nat. Struct. Biol. 4, 605–608[Medline] [Order article via Infotrieve]
  3. Wuttke, D. S., Foster, M. P., Case, D. A., Gottesfeld, J. M., and Wright, P. E. (1997) J. Mol. Biol. 273, 183–206[CrossRef][Medline] [Order article via Infotrieve]
  4. Searles, M. A., Lu, D., and Klug, A. (2000) J. Mol. Biol. 301, 47–60[CrossRef][Medline] [Order article via Infotrieve]
  5. Draper, D. E. (1999) J. Mol. Biol. 293, 255–270[CrossRef][Medline] [Order article via Infotrieve]
  6. Battiste, J. L., Tan, R., Frankel, A. D., and Williamson, J. R. (1994) Biochemistry 33, 2741–2747[Medline] [Order article via Infotrieve]
  7. Tan, R., and Frankel, A. D. (1994) Biochemistry 33, 14579–14585[Medline] [Order article via Infotrieve]
  8. Chen, L., and Frankel, A. D. (1994) Biochemistry 33, 2708–2715[Medline] [Order article via Infotrieve]
  9. Puglisi, J. D., Chen, L., Blanchard, S., and Frankel, A. D. (1995) Science 270, 1200–1203[Abstract]
  10. Markus, M. A., Hinck, A. P., Huang, S., Draper, D. E., and Torchia, D. A. (1997) Nat. Struct. Biol. 4, 70–77[Medline] [Order article via Infotrieve]
  11. Nakano, M., Yoshiura, K., Oikawa, M., Miyoshi, O., Yamada, K., Kondo, S., Miwa, N., Soeda, E., Jinno, Y., Fujii, T., and Niikawa, N. (1998) Gene (Amst.) 225, 59–65[CrossRef][Medline] [Order article via Infotrieve]
  12. Adams, D. J., van der Weyden, L., Mayeda, A., Stamm, S., Morris, B. J., and Rasko, J. E. (2001) J. Cell Biol. 154, 25–32[Abstract/Free Full Text]
  13. Adams, D. J., van der Weyden, L., Kovacic, A., Lovicu, F. J., Copeland, N. G., Gilbert, D. J., Jenkins, N. A., Ioannou, P. A., and Morris, B. J. (2000) Cytogenet. Cell Genet. 88, 68–73[CrossRef][Medline] [Order article via Infotrieve]
  14. Karginova, E. A., Pentz, E. S., Kazakova, I. G., Norwood, V. F., Carey, R. M., and Gomez, R. A. (1997) Am. J. Physiol. 273, F731–F738[Medline] [Order article via Infotrieve]
  15. Ladomery, M., Marshall, R., Arif, L., and Sommerville, J. (2000) Gene (Amst.) 256, 293–302[CrossRef][Medline] [Order article via Infotrieve]
  16. Momand, J., Zambetti, G. P., Olson, D. C., George, D., and Levine, A. J. (1992) Cell 69, 1237–1245[Medline] [Order article via Infotrieve]
  17. Honda, R., Tanaka, H., and Yasuda, H. (1997) FEBS Lett. 420, 25–27[CrossRef][Medline] [Order article via Infotrieve]
  18. Yokoyama, N., Hayashi, N., Seki, T., Pante, N., Ohba, T., Nishii, K., Kuma, K., Hayashida, T., Miyata, T., Aebi, U., Fukui, M., and Nishimoto, T. (1995) Nature 376, 184–188[CrossRef][Medline] [Order article via Infotrieve]
  19. Pichler, A., Gast, A., Seeler, J. S., Dejean, A., and Melchior, F. (2002) Cell 108, 109–120[Medline] [Order article via Infotrieve]
  20. May, W. A., Gishizky, M. L., Lessnick, S. L., Lunsford, L. B., Lewis, B. C., Delattre, O., Zucman, J., Thomas, G., and Denny, C. T. (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 5752–5756[Abstract]
  21. Singh, B. B., Patel, H. H., Roepman, R., Schick, D., and Ferreira, P. A. (1999) J. Biol. Chem. 274, 37370–37378[Abstract/Free Full Text]
  22. Johnson, M. L., Correia, J. J., Yphantis, D. A., and Halvorson, H. R. (1981) Biophys J 36, 575–588[Abstract]
  23. Perkins, S. J. (1986) Eur. J. Biochem. 157, 169–180[Abstract]
  24. Hayes, D. B., Laue, T., and Philo, J. (1994) SEDNTERP: Interpretation of Sedimentation Data, Version 1.X, Boston Biomedical Research Institute, Boston, MA
  25. Piotto, M., Saudek, V., and Sklenar, V. (1992) J Biomol. NMR 2, 661–665[Medline] [Order article via Infotrieve]
  26. Piantini, U., Sorensen, O. W., and Ernst, R. R. (1982) J. Am. Chem. Soc. 104, 6800–6801
  27. Bax, A., and Davis, D. G. (1985) J. Magn. Reson. 65, 355–360
  28. Kumar, A., Ernst, R. R., and Wüthrich, K. (1980) Biochem. Biophys. Res. Commun. 95, 1–6[Medline] [Order article via Infotrieve]
  29. Bartels, C., Xia, T.-H., Billeter, P., Guntert, P., and Wuthrich, K. (1995) J. Biomol. NMR 5, 1–10[Medline] [Order article via Infotrieve]
  30. Guntert, P., Mumenthaler, C., and Wuthrich, K. (1997) J. Mol. Biol. 273, 283–298[CrossRef][Medline] [Order article via Infotrieve]
  31. Szyperski, T., Guntert, P., Otting, G., and Wuthrich, K. (1992) J. Magn. Reson. 99, 552–560
  32. Scanlon, M. J., Naranjo, D., Thomas, L., Alewood, P. F., Lewis, R. J., and Craik, D. J. (1997) Structure 5, 1585–1597[Medline] [Order article via Infotrieve]
  33. Kline, A. D., Braun, W., and Wuthrich, K. (1988) J. Mol. Biol. 204, 675–724[Medline] [Order article via Infotrieve]
  34. Wuthrich, K. (1986) NMR of Proteins and Nucleic Acids, Wiley, New York
  35. Neuhaus, D., Nakaseko, Y., Schwabe, J. W., and Klug, A. (1992) J. Mol. Biol. 228, 637–651[Medline] [Order article via Infotrieve]
  36. Brunger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J. S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T., and Warren, G. L. (1998) Acta Crystallogr. Sect. D 54, 905–921[CrossRef][Medline] [Order article via Infotrieve]
  37. Nilges, M., Macias, M. J., O'Donoghue, S. I., and Oschkinat, H. (1997) J. Mol. Biol. 269, 408–422[CrossRef][Medline] [Order article via Infotrieve]
  38. Nilges, M. (1997) Fold Des. 2, S53–57[Medline] [Order article via Infotrieve]
  39. Linge, J. P., and Nilges, M. (1999) J. Biomol. NMR 13, 51–59[CrossRef][Medline] [Order article via Infotrieve]
  40. Folmer, R. H., Hilbers, C. W., Konings, R. N., and Nilges, M. (1997) J. Biomol. NMR 9, 245–258[CrossRef][Medline] [Order article via Infotrieve]
  41. Jorgensen, W. L., Chandrasekhar, J., Madura, J., Impey, R., and Klein, M. (1983) J. Chem. Phys. 79, 926–935[CrossRef]
  42. Koradi, R., Billeter, M., and Wuthrich, K. (1996) J. Mol. Graph. 14, 29–32; 51–55
  43. Laskowski, R. A., Rullmannn, J. A., MacArthur, M. W., Kaptein, R., and Thornton, J. M. (1996) J. Biomol. NMR 8, 477–486[Medline] [Order article via Infotrieve]
  44. Vriend, G., and Sander, C. (1993) J. Appl. Crystallogr. 26, 47–60[CrossRef]
  45. Maurer, F., Tierney, M., and Medcalf, R. L. (1999) Nucleic Acids Res. 27, 1664–1673[Abstract/Free Full Text]
  46. Delaney, S. J., Hayward, D. C., Barleben, F., Fischbach, K. F., and Miklos, G. L. (1991) Proc. Natl. Acad. Sci. U. S. A. 88, 7214–7218[Abstract]
  47. Perez-Canadillas, J. M., and Varani, G. (2001) Curr. Opin. Struct. Biol. 11, 53–58[CrossRef][Medline] [Order article via Infotrieve]
  48. Graveley, B. R., and Maniatis, T. (1998) Mol. Cell 1, 765–771[Medline] [Order article via Infotrieve]
  49. Schwabe, J. W., and Klug, A. (1994) Nat. Struct. Biol. 1, 345–349[Medline] [Order article via Infotrieve]
  50. Hutchinson, E. G., and Thornton, J. M. (1994) Protein Sci. 3, 2207–2216[Abstract/Free Full Text]
  51. Kabsch, W., and Sander, C. (1983) Biopolymers 22, 2577–2637[Medline] [Order article via Infotrieve]
  52. Holm, L., and Sander, C. (1997) Nucleic Acids Res. 25, 231–234[Abstract/Free Full Text]
  53. Nanduri, S., Carpick, B. W., Yang, Y., Williams, B. R., and Qin, J. (1998) EMBO J. 17, 5458–5465[Abstract/Free Full Text]
  54. Patel, R. C., Stanton, P., and Sen, G. C. (1996) J. Biol. Chem. 271, 25657–25663[Abstract/Free Full Text]
  55. Clarke, P. A., and Mathews, M. B. (1995) RNA 1, 7–20[Abstract]
  56. McMillan, N. A., Carpick, B. W., Hollis, B., Toone, W. M., Zamanian-Daryoush, M., and Williams, B. R. (1995) J. Biol. Chem. 270, 2601–2606[Abstract/Free Full Text]
  57. De Guzman, R. N., Wu, Z. R., Stalling, C. C., Pappalardo, L., Borer, P. N., and Summers, M. F. (1998) Science 279, 384–388[Abstract/Free Full Text]
  58. Theunissen, O., Rudt, F., Guddat, U., Mentzel, H., and Pieler, T. (1992) Cell 71, 679–690[Medline] [Order article via Infotrieve]
  59. Bogden, C. E., Fass, D., Bergman, N., Nichols, M. D., and Berger, J. M. (1999) Mol. Cell 3, 487–493[Medline] [Order article via Infotrieve]
  60. Hall, K. B. (2002) Curr. Opin. Struct. Biol. 12, 283–288[CrossRef][Medline] [Order article via Infotrieve]
  61. Meagher, M. J., Schumacher, J. M., Lee, K., Holdcraft, R. W., Edelhoff, S., Disteche, C., and Braun, R. E. (1999) Gene (Amst.) 228, 197–211[CrossRef][Medline] [Order article via Infotrieve]
  62. Reuter, G., Giarre, M., Farah, J., Gausz, J., Spierer, A., and Spierer, P. (1990) Nature 344, 219–223[CrossRef][Medline] [Order article via Infotrieve]
  63. Fasano, L., Roder, L., Core, N., Alexandre, E., Vola, C., Jacq, B., and Kerridge, S. (1991) Cell 64, 63–79[Medline] [Order article via Infotrieve]
  64. Manley, J. L., and Tacke, R. (1996) Genes Dev. 10, 1569–1579[CrossRef][Medline] [Order article via Infotrieve]
  65. Fu, X. D. (1995) RNA 1, 663–680[Medline] [Order article via Infotrieve]
  66. Varani, G., and Nagai, K. (1998) Annu. Rev. Biophys. Biomol. Struct. 27, 407–445[CrossRef][Medline] [Order article via Infotrieve]
  67. Wu, J., Matunis, M. J., Kraemer, D., Blobel, G., and Coutavas, E. (1995) J. Biol. Chem. 270, 14209–14213[Abstract/Free Full Text]
  68. Mackay, J. P., and Crossley, M. (1998) Trends Biochem. Sci 265, 1–4[CrossRef]