CCHX Zinc Finger Derivatives Retain the Ability to Bind Zn(II) and Mediate Protein-DNA Interactions*

Raina J. Y. Simpson {ddagger}, Edward D. Cram, Robert Czolij, Jacqueline M. Matthews, Merlin Crossley and Joel P. Mackay §

From the School of Molecular and Microbial Biosciences, G08, University of Sydney, New South Wales 2006, Australia

Received for publication, October 31, 2002 , and in revised form, May 2, 2003.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Classical (CCHH) zinc fingers are among the most common protein domains found in eukaryotes. They function as molecular recognition elements that mediate specific contact with DNA, RNA, or other proteins and are composed of a {beta}{beta}{alpha} fold surrounding a single zinc ion that is ligated by two cysteine and two histidine residues. In a number of variant zinc fingers, the final histidine is not conserved, and in other unrelated zinc binding domains, residues such as aspartate can function as zinc ligands. To test whether the final histidine is required for normal folding and the DNA-binding function of classical zinc fingers, we focused on finger 3 of basic Krüppel-like factor. The structure of this domain was determined using NMR spectroscopy and found to constitute a typical classical zinc finger. We generated a panel of substitution mutants at the final histidine in this finger and found that several of the mutants retained some ability to fold in the presence of zinc. Consistent with this result, we showed that mutation of the final histidine had only a modest effect on DNA binding in the context of the full three-finger DNA-binding domain of basic Krüppel-like factor. Further, the zinc binding ability of one of the point mutants was tested and found to be indistinguishable from the wild-type domain. These results suggest that the final zinc chelating histidine is not an essential feature of classical zinc fingers and have implications for zinc finger evolution, regulation, and the design of experiments testing the functional roles of these domains.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
It is estimated that around 3% of all human gene products contain one or more zinc-binding domains, or zinc fingers (ZnFs).1 These domains are defined by the presence of one or more Zn(II) ions that stabilize the folded structure of the domain. More than 20 different classes of zinc fingers have been described, and these differ in the number of Zn(II) ions bound and in the spacing and identities of the ligands. The Zn(II) ion is normally coordinated by histidine and cysteine residues, although aspartate has been reported in LIM domains (1). The most common configuration is termed the classical (CCHH or C2H2) ZnF, which has the consensus sequence Cys-X2–4-Cys-X12-His-X2–6-His. Over 14,000 of these domains are predicted in current sequence data bases.

The majority of classical ZnFs are found in transcription factors, and a single transcription factor may contain more than 30 ZnF domains. It is well established that many of these domains mediate specific protein-DNA interactions (2), although recent reports have also demonstrated that at least some classical ZnFs can act as protein recognition motifs (for a review, see Ref. 3). The three-dimensional structures of a number of ZnF-DNA complexes have been determined, and a great deal is understood about the roles of individual amino acids in determining DNA sequence specificity (4, 5). These ZnFs bind to DNA using a tandem array of more than one ZnF (often three), where each ZnF contacts three base pairs.

The identification of ZnF motifs in the ever growing number of protein sequences is based primarily on the presence of conserved cysteine or histidine residues and the spacing between them. Further, DNA binding ability is sometimes inferred when three or more contiguous classical ZnFs are identified. This ability to infer function from sequence is becoming increasingly important as the amount of available sequence data increases. Interestingly, an examination of sequence data bases reveals that there are a number of proteins that contain sequences that correspond to one or more classical ZnFs, with the exception that the final zinc-ligating residue in the last ZnF is neither cysteine nor histidine (for examples, see Fig. 1). The question therefore arises as to whether these sequences are capable of folding and forming modules that are functional even in the absence of the typical final zinc-binding residue or whether the mutation leads to proteins that can no longer fold or (for example) bind to DNA.



View larger version (44K):
[in this window]
[in a new window]
 
FIG. 1.
Sequence alignment of variant classical zinc fingers and BKLF. A, amino acid sequences of variant CCHX zinc fingers, with the numbering scheme used in this paper indicated above. Zinc-ligating residues are boxed, and the variable position is indicated by gray shading. B, schematic representation of BKLF. The zinc finger domains are labeled, and the amino acid sequence of wild-type (wt) BF3 is shown below. The histidine that was mutated in this study is indicated with an arrow.

 

In order to address this question, we have investigated the physical and functional properties of a panel of point mutants based on the transcriptional repressor basic Krüppel-like factor/Krüppel-like factor 3 (BKLF) (6). BKLF binds to DNA sequences containing CACCC motifs by means of three characteristic Krüppel-like ZnFs (Krüppel-like fingers are a subset of classical CCHH fingers with significant homology to those found in the archetypal protein Drosophila regulatory protein Krüppel). We chose to study the third or C-terminal zinc finger of BKLF (BKLF-F3, or BF3). We show that the third zinc finger is essential for high affinity DNA binding and that the final zinc-ligating histidine of BKLF-F3 can be substituted with a number of different residues without severely compromising the DNA binding ability of BKLF. Further, the mutant BF3 domains still bind Zn(II) and form substantial secondary structure, although they are clearly not as well ordered as the wild-type domain. Remarkably, the His -> Asn mutant binds Zn(II) with an affinity that is essentially indistinguishable from that of the wild-type BF3.

These results demonstrate that three side chains can be sufficient to bind a Zn(II) ion. Further, our data show that such domains, even when partially folded, can act in concert with other ZnFs to bind DNA. These findings further our understanding of the basic ZnF scaffold and show that attempts to identify DNA-binding ZnFs from amino acid sequence data should not necessarily exclude apparently incomplete ZnF configurations.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Plasmid Construction—The region encoding zinc fingers 1–3 of BKLF (BKLF-F1–3 residues 254–344) was amplified from the murine BKLF cDNA using primers CGGGATCCACCATGGCAAGGAAGCGCAGGATAC (A19) and CGGAATTCAGACTAGCATGTGGCGTT (A866), using the original murine BKLF cDNA clone in the vector pMT2-BKLF (6). The resulting fragment was digested with BamHI and EcoRI and inserted into pGEX-2T to generate in-frame fusions of BF3 with glutathione S-transferase, and the plasmids were transformed into Escherichia coli BL21 (DE3) cells. This plasmid also contains a thrombin cleavage site between the GST and the insert. Mutant derivatives were similarly constructed using primers related to A866 but differing in the codon corresponding to BKLF H341, the final histidine. Single finger constructs corresponding to both wild-type BF3 (residues 316–344) and BF3 mutants were similarly generated, using the 5' primer CGGGATCCATCAAACCTTTCCAGTGTCC (A259) and the same 3' A866 primer used above. The deletion constructs BF1-2 and BF1-3(–HMLV) were generated using the 5' primer A19 and the 3' primers CGGAATTCACCCAGTGTGTTTTCGGAAATG (A877) and CGGAATTCTACTAGCGTTTCCTGTGTAGGGCAA (A876). BF1-21/2 was constructed by digesting the pGEX-2T-BF1-3 plasmid with HindIII and EcoRI, filling in the overhangs with Klenow, and religating. This created a stop site after Ser327. DNA sequencing was carried out in order to confirm the sequences of all constructs.

Electrophoretic Mobility Shift Assays (EMSAs)—In order to compare the effects of the point mutations, EMSAs were carried out. Reactions were set up in a total volume of 30 µl, comprising 0.1 pg of 32P-labeled probe, ~100 ng of recombinant protein, 10 mM Hepes, pH 7.8, 50 mM KCl, 5 mM MgCl2, 1 mM EDTA, and 5% glycerol. The reactions used in the quantitative EMSAs further included 1 mg/ml dI-dC, 5 mg/ml bovine serum albumin, and 0.5% (v/v) Nonidet P-40. In these reactions, 67 pM 32P-labeled DNA and 0–2000 nM wild-type BF3 or 0–4500 nM mutant BF3A protein were used. After incubation on ice for 10 min, the samples were loaded onto a 6% native polyacrylamide gel made up in 0.5x TBE. The gel was then subjected to electrophoresis at 15 V/cm and 4 °C for 2 h, dried, analyzed, and quantified when necessary using a PhosphorImager (Amersham Biosciences). The probes used in the experiments were end-labeled according to standard procedures using polynucleotide kinase (7). The sequence was 5'-TAGAGCCACACCCTGGTAAG-3' (only the top strand is shown). The DNA-binding affinities of the wild-type BF3 and mutant BF3A were estimated by nonlinear least squares analysis as follows. The fraction of DNA complexed to the BF1-3 proteins (fcx) was calculated by using the equation fcx = Icx/(Icx + If), where Icx and If are the intensities of the bands corresponding to peptide-complexed DNA and free DNA, respectively. The DNA-binding affinity, Ka, was obtained by fitting the experimentally derived fcx to the equations,

(Eq. 1)
and

(Eq. 2)
where [D.P] is the concentration of DNA-protein complex, Dtot is the total DNA concentration, Ka is the association constant, and Ptot is the total protein concentration.

Overexpression and Purification of BF3 and Mutants—The wild-type and mutant BF3 proteins were expressed and purified in the same manner. Luria broth was inoculated with transformed E. coli cells at 37 °C. When the A600 reached ~0.6, protein expression was induced with the addition of isopropyl-{beta}-D-thiogalactoside (0.4 mM). After 4 h, the cells were pelleted by centrifugation and stored at –20 °C prior to lysis. The cells were resuspended in lysis buffer (50 mM Tris, 50 mM NaCl, 1% Triton X-100, 1.4 mM phenylmethylsulfonyl fluoride, 1.4 mM {beta}-mercaptoethanol, pH 8.0) and lysed by gentle sonication. The soluble fraction, separated from the insoluble fraction by centrifugation (15,000 rpm, 4 °C, 20 min), was loaded onto glutathione-Sepharose beads. Unbound proteins were washed away from the beads with wash buffer (50 mM Tris, 100 mM NaCl, 10% (v/v) glycerol, 1.4 mM phenylmethylsulfonyl fluoride, 1.4 mM {beta}-mercaptoethanol, pH 8.0), and the beads were equilibrated with thrombin buffer (50 mM Tris, 150 mM NaCl, 2.5 mM CaCl2, pH 8.0). Thrombin was added, and the mixture was incubated for either 2 h at 37 °C or overnight at 25 °C. The eluted peptides were lyophilized, redissolved in water, and further purified by reversed-phase HPLC (using a gradient of 5–95% acetonitrile in 0.1% trifluoroacetic acid). The purified peptides were lyophilized and stored at –20 °C. The identity of each peptide was confirmed using positive ion electrospray mass spectrometry.

Far UV CD Spectropolarimetry—Each HPLC-purified peptide was dissolved in a solution containing TCEP (1 mM) and ZnSO4 (1 mM) to concentrations of either 30 µM (BF3, BF3D, BF3E, and BF3N) or 12 µM (BF3A). The pH of these solutions was ~2.0. A far UV CD spectrum of each peptide was taken, the pH was then adjusted to ~5.5 with the addition of NaOH, and a second spectrum of each peptide was recorded. CD spectra were recorded at 25 °C on a Jasco J-720 spectropolarimeter equipped with a Neslab RTE-111 temperature controller. Spectra were recorded in a 1-mm path length cell with a resolution of 0.5 nm and bandwidth of 1 nm over the wavelength range of 190–250 nm. Each spectrum represented the average of three scans accumulated at a speed of 20 nm min1 with a response time of 1 s.

For the Zn(II) titration experiments, aliquots of a solution containing ZnCl2 (13–20 mM; pH 5.5) were added to solutions of the wild-type BF3 (13 µM) and the mutant BF3N (20 µM), each containing 0.5 mM TCEP, pH 5.5. CD spectra were taken at each point in the titration, allowing 5 min for equilibration after each Zn(II) addition. Spectra were recorded over the wavelength range of 195–200 nm with a resolution of 1 nm and as the average of 50 scans. Spectra were base line-corrected by subtraction of a spectrum of TCEP/ZnSO4 buffer alone.

Values of the association constant for zinc binding (Ka) were determined by plotting the change in ellipticity at a single wavelength against the total Zn(II) concentration. Nonlinear least squares analysis was used to determine the Zn(II)-binding affinities, using the equations,

(Eq. 3)
and

(Eq. 4)
where {theta} represents the observed ellipticity, {theta}P is the ellipticity of the apoprotein, [P.Zn] is the concentration of the protein-Zn(II) complex, Ptot is the total protein concentration, {theta}P.Zn is the ellipticity of the holoprotein, Ka is the association constant for Zn(II) binding, and Zntot is the total Zn(II) concentration. The final values of Ka were obtained by averaging the values determined at 195, 196, 197, 198, 199, and 200 nm.

Analytical Ultracentrifugation—Sedimentation equilibrium experiments were performed using an OptimaTM XL-A analytical ultracentrifuge (Beckman Instruments) equipped with an An-60ti rotor. BF3E (at concentrations of 8.6, 20, and 33.6 µM; all in 1 mM TCEP, 1 mM ZnSO4, pH 5.6) was centrifuged against a matched buffer at 25 °C at 30,000 and 42,000 rpm, using double sector cells. Data were collected as A230 versus radius scans in 0.001-cm increments, and 10 scans were averaged for each data set. Base line correction was achieved by the subtraction of data recorded at 360 nm. Scans were taken at 3-h intervals and compared to ensure that the samples reached equilibrium. Analysis of the data was carried out using the NONLIN software (8), and the final parameters were determined by a nonlinear least squares fit of the data to a single species model. The goodness of fit was determined by examination of the residuals derived from the fit. The partial specific volume was determined from the amino acid sequence (9), and the solvent density was determined to be 0.997 g ml1 using the program SEDNTERP (10).

Nuclear Magnetic Resonance Spectroscopy—Samples of BF3 were prepared by dissolving either 3.5 mg of 15N-labeled BF3 or 11 mg of BF3 in H2O/D2O (95:5) containing 1.5 molar equivalents of both TCEP and ZnSO4. The pH was adjusted to 5.5 using 0.1 and 0.01 M NaOH; this gave sample concentrations of 1 mM for 15N-labeled BF3 and 3 mM for BF3.

For the comparison of wild-type BF3 with BF3E, both peptides were dissolved in a solution containing TCEP (1 mM) and ZnSO4 (1 mM). The pH was adjusted to 5.5, and each solution was supplemented with D2O (5%, v/v) and d4-(trimethylsilyl)propionic acid (1 µl). The final sample concentrations were 100 µM (BF3) and 85 µM (BF3E). The pH of the BF3E sample was subsequently dropped to pH ~2 to obtain the spectrum of the unfolded protein. NMR spectra were recorded on a Bruker DRX600 spectrometer equipped with a triple resonance (HCN) probe and three-axis pulsed-field gradients. NMR spectra used for the comparison of BF3 and BF3E were recorded at 298 K, whereas those used for structure determination were acquired at 280 K. The solvent signal was suppressed using pulsed field gradients. One-dimensional 1H spectra consisted of 128 scans collected as 8,192 complex data points over a spectral width of 7,200 Hz. The following homonuclear two-dimensional experiments were recorded on the unlabeled BF3 sample: TOCSY (11) ({tau}m = 70 ms), DQFCOSY (12), and NOESY (13) ({tau}m = 200 ms). Three-dimensional HNHA (14) and three-dimensional TOCSY-HSQC (15) ({tau}m = 70 ms) experiments were used to assign the 15N,1H HSQC spectra of the 15N-labeled BF3, whereas the HNHA spectrum was used to derive 3JHNH{alpha} coupling constants. Spectra were processed as described previously (16). The 1H frequency scale was referenced to d4-(trimethylsilyl)propionic acid at 0.00 ppm.

Determination of the Structure of BF3—Resonance assignment was carried out using the standard homonuclear sequential assignment method (17). Cross-peaks in the two-dimensional NOESY spectra were integrated in XEASY and converted to upper distance limits using the CALIBA module of DYANA (18) and the default DYANA parameters, except that an empirical correction of 0.5 Å was added to the upper distance limits involving {beta}-methylene groups to account for spin diffusion. Dihedral angle restraints for {varphi} angles were derived from 3JHNH{alpha} coupling constants measured from the HNHA spectrum (14). Residues with positive {varphi} angles were verified using methods as described in Ref. 19.

Initially, structure calculations were performed in DYANA, and additional NOEs were assigned iteratively based on earlier sets of structures. At this stage, structures were calculated without incorporating the Zn(II) atom. Analysis of preliminary structures established that the N{epsilon}2 atoms of both histidine residues were coordinating the Zn(II). This was later confirmed using a 1H-15N HMQC experiment optimized to detect J-couplings in histidine side chains (20). In order to include additional distance and angle constraints that maintain the tetrahedral bonding geometry and appropriate bond lengths with the zinc ion (21), as well as to incorporate ambiguous restraints, subsequent structural refinement was performed in CNS (22) using the package ARIA (23, 24). Manually assigned NOEs in combination with the remaining ambiguous NOEs were included in the ARIA structure calculations, and the latter were iteratively assigned in an automated manner.

Briefly, one cycle of 200 structures was followed by seven cycles of 20 structures each and a final cycle of 1000 structures. Manually assigned NOEs were included in iteration zero as soft restraints. In each cycle, the seven lowest energy structures from the previous iteration were used to extract additional NOE restraints with a tolerance of 0.02 ppm in the F1 dimension and 0.015 ppm in the F2 dimension. If a restraint was violated by more than a predefined target value in over 50% of the seven structures, it was discarded. The violation target value was progressively reduced from 1000 Å in iteration zero to 0.1 Å in iteration eight. 18 of 782 distance restraints were discarded in the final iteration due to these violations. Ambiguous distance restraints were treated as described previously (25), and the peak volume cut-off was gradually reduced from 1.01 in the first iteration to 0.80 in the last. The final assignments made by ARIA were checked and corrected manually where necessary. Calculations were carried out in the simplified all-hydrogen PARALLHDG5.2 force field with nonbonded interactions modeled by the PROLSQ force field (26); floating chirality assignment (27) was used for all methylene and isopropyl groups.

Finally, the 25 lowest energy structures were subjected to water refinement using the standard water refinement protocol supplied by ARIA1.2 (28). The structures were immersed in a 7-Å shell of water molecules and were subjected to a short molecular dynamics simulation taking into account the Lennard-Jones, van der Waals, and electrostatic interactions and based on a slightly modified OPLS force field (29). These water-refined structures were visualized and analyzed using the programs MOLMOL (30), PROCHECK (31), and WhatIf (32). The coordinates of BKLF-F3 have been deposited in the Protein Data Bank under the accession code 1P7A.

Atomic Absorption Spectrometry—BF3 and BF3D samples used in far UV CD analyses were dialyzed against sodium acetate buffer (10 mM, pH 5.4) containing 1 mM dithiothreitol, in order to remove excess Zn(II). The resulting samples were rechecked by CD to confirm that they had remained folded. The Zn(II) content of BF3 (26 µM) and BF3D (19 µM) samples were determined on a Varian SpectrAA 10/20 flame emission spectrometer at 213.9 nm. Concentrations were determined by reference to a standard curve constructed using atomic absorption standard zinc(II) solution (1000 mg/liter in 0.5 mol/liter nitric acid; BDH), diluted with Milli-Q® water to final concentrations of 0.01, 0.05, 0.1, 0.5, and 1 mg/liter.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Many Variants of CCHH Zinc Fingers Exist—A search of the Pfam data base (available on the World Wide Web at pfam.wustl.edu/) using the search pattern X2-C-X1–5-C-X12-H-X3–6-(H/C) reveals 20,646 protein domains classified as CCHH zinc fingers (as of March 10, 2003). Within this set, ~250 domains actually contain a cysteine in place of the final histidine residue. It was demonstrated recently that variant domains of this type could form stable folded structures that closely resemble other classical CCHH zinc fingers (33). More unusual are the 82 sequences that lack either a cysteine or a histidine in the corresponding position (position 23 in Fig. 1A). Of these, 10 sequences, however, do exhibit a cysteine or histidine residue at position 20, 21, or 22 (Fig. 1A), and it is conceivable that this residue acts as the fourth zinc ligand. 71 of the remaining 72 sequences (with the exception of HKR2; Fig. 1A) contain neither cysteine nor histidine in any of the positions 20–28. Interestingly, a number of these CCHX variants appear to be located at the C-terminal end of an array of CCHH fingers.

Given the dogma that four ligating amino acid side chains are required for the formation of a classical zinc finger structure, the existence of these variants led us to ask whether such sequences can form functional zinc finger domains, or if instead they represent vestigial domains that no longer play a role in the function of the parent protein.

CCHX Mutants of BKLF Bind to DNA—In order to address this question, we made a series of single site mutations in the zinc finger region of the transcriptional repressor BKLF (6). At its C terminus, BKLF has three contiguous CCHH fingers (Fig. 1B); this three-finger array binds with high affinity and specificity to oligonucleotides containing the sequence CACCC (6). We generated a fusion protein encompassing the three fingers of murine BKLF (BKLF-F1–3 residues 254–344) C-terminal to glutathione S-transferase (GST). Six mutants were also prepared that harbored substitutions of the final histidine ligand in the third finger, namely aspartate, glutamate, asparagine, glutamine, alanine, and arginine (Fig. 1B). These constructs were termed BF1-3X (where X represents Asp, Glu, Asn, Gln, Ala, and Arg). Aspartate is known to act as a zinc ligand in LIM domains (34), a class of zinc fingers in which two zinc atoms are bound. Glutamate, asparagine, and glutamine were chosen as residues with side chains that could conceivably act as ligands to Zn(II), and alanine and arginine were selected as negative controls. We initially also attempted to make a derivative containing a cysteine, but despite several attempts, we were not successful at generating this protein by normal methods in E. coli, since the protein appears to be cytotoxic; this mutant was therefore not pursued further.

The seven proteins (wild-type BF1-3 and the six mutants) were tested for their ability to bind a typical CACCC box motif, namely the motif in the {beta}-globin promoter (see "Experimental Procedures"). GST alone and probe alone were also included as negative controls in this experiment (Fig. 2A). Remarkably, all of the point mutants retained near native DNA binding ability. This result was unexpected, given the generally accepted view that four zinc ligands are required for the formation of a folded zinc finger domain and the low probability that either arginine or alanine could coordinate Zn(II). Three C-terminal truncation mutants were also tested for their DNA binding ability. The mutant with four amino acids truncated at the C terminus of finger 3 (BF1-3(–HMLV)), including the second zinc-ligating histidine, was able to bind DNA, albeit with a reduced binding affinity (Fig. 2B). In contrast, protein constructs either with 17 amino acids truncated (BF1-21/2) or comprising only fingers 1 and 2 (BF1-2) were not able to bind DNA (Fig. 2B).



View larger version (59K):
[in this window]
[in a new window]
 
FIG. 2.
BKLF mutants bind to DNA. A, the upper panel shows an EMSA testing the ability of GST-BKLF finger fusion proteins to bind to a CACCC box site. In lanes 1–7, minimal amounts of GST fusion protein were used in order to maximize the ability of the assay to detect differences in the binding affinities of GST-BF1-3X derivatives. In lanes 8–10, the DNA binding activity of the deletion mutant BKLF-F1-2 was assessed, but in these lanes significantly more protein was used, so that weak DNA binding activity by the BKLF-F1-2 would not be overlooked. The retarded complexes are indicated with an arrow. The lower panel shows the preparations of the fusion proteins used in the upper panel run on in SDS-PAGE and stained with Coomassie Brilliant Blue, indicating that the fusion proteins are essentially intact and present in similar concentrations in the preparations assayed. B, EMSAs testing the ability of five GST-BKLF constructs to bind to a CACCC site. The constructs are (from left to right) wild-type BF1-3, BF1-3A, BF1-3 missing four C-terminal amino acids that includes the zinc-ligating His341 (BF1-3(–HMKV)), BF1-3 without 17 C-terminal amino acids leaving only the residues for half of the ZnF motif (BF1-21/2), and BF1-2. Protein concentrations used were 760, 500, 230, and 33 nM, except for GST-BF1-3(–HMLV), where higher concentrations of 8.2, 5.3, 2.5, and 0.4 µM were used to show the much reduced binding affinity. C and D, representative quantitative EMSAs of the wild-type BF1-3 (C) and alanine point mutant BF1-3A (D). E, determination of the DNA binding affinity from the EMSA data in C and D. Icx/Itot is the fraction of total intensity in each lane that is found in the shifted band. The fitted Ka values are (2.5 ± 0.4) x 107 M–1 (wild type, gray line) and (2.16 ± 0.22) x 106 M–1 (mutant BF1-3A, black line).

 

In order to quantify the effect of the point mutations on DNA binding, quantitative EMSAs were carried out using the wild-type and the alanine mutant BF1-3A (Fig. 2, C–E). The wild-type protein binds a typical CACCC box site with a Ka of (2.5 ± 0.4) x 107 M–1. Surprisingly, the His -> Ala mutation had a relatively modest effect on DNA binding; the mutant BF1-3A bound DNA with a Ka of (2.16 ± 0.22) x 106 M–1 (Fig. 2E).

The observation that even the BF1-3A and BF1-3(–HMLV) mutants bind DNA is most simply explained by one of two possible models. First, it could be that BF3 folds normally in the context of the mutants, irrespective of whether a fourth zinc ligand is present at position 341 (using the numbering in Fig. 1B). Second, it is possible that the BF1-3 mutants are either partially folded or completely unfolded but that the formation of structure and DNA binding take place concomitantly. A number of recent reports demonstrate simultaneous folding and binding events (35, 36).

BKLF Finger 3 Is a Typical Classical ZnF—The first step that we took in order to delineate the effect of the mutations was to determine the structure of the wild-type BF3 using NMR spectroscopy. Resonance assignment from homonuclear NMR spectra was straightforward, and the structure calculations were carried out using ARIA (23). The 20 structures with lowest overall energies (from the final refinement in water) were used to represent the solution structure of BF3 (Fig. 3A). The structures display good covalent geometry, judging from the small deviations from ideal bond lengths and angles, and good nonbonded contacts, as shown by the low value of the mean Lennard-Jones potential (Table I). The atomic coordinates for this family of conformers have been deposited with the Protein Data Bank.



View larger version (17K):
[in this window]
[in a new window]
 
FIG. 3.
Solution structure of BF3. A, ensemble of the best 25 structures of BF3. Structures are superimposed over the backbone atoms of residues 319–342 (the N-terminal eight, which are unstructured, are omitted for clarity). The zinc-chelating side chains are shown in orange, and the zinc atom is shown in gray. B, ribbon diagram of the lowest energy structure of BF3, showing elements of secondary structure as recognized in the program MOLMOL (30). C, overlay over the backbone atoms (Ca, C', N, Cys321–His341 of BF3) of BF3 with the second zinc finger of Zif268 (45). BF3 is shown in blue, and Zif268-F2 is shown in red.

 

View this table:
[in this window]
[in a new window]
 
TABLE I
Structural statistics for the family of 25 BF3 structures

 

The overall topology of BF3 conforms to the expected fold of classical CCHH zinc finger domains: two short strands of antiparallel {beta}-sheet strands linked by a rubredoxin-like turn are followed by an {alpha}-helix that contains the two Zn(II)-coordinating histidine residues. The short {beta}-sheet encompasses residues 319–321 and 326–328, and hydrogen bonds are formed that involve the backbone amide protons of Phe319, Cys321, and Phe328 and the carboxyl oxygens of Phe328, Arg326, and Phe319, respectively. The {beta}-turn linking these two strands has hydrogen bonds and dihedral angles consistent with the rubredoxin-like turn that is commonly found in zinc-binding domains (see, for example, Refs. 37 and 38). A positive {varphi} angle is often found in the residue following the second zinc-ligating Cys, and we confirmed its existence in BF3 using the methods described in Ref. 19. The {alpha}-helix runs from residue Ser331 to Leu343, although both d{alpha}N(i, i + 3) and d{alpha}N(i, i + 4) NOEs are observed for residues 339–343, suggesting that the conformation of this region lies somewhere between an {alpha}-helix and a 3–10 helix. This phenomenon is also seen in the last 3–4 residues of the {alpha}-helix of the third ZnF of Sp1 (39).

Overall, the structure of BF3 is very similar to other classical zinc fingers. For example, it overlays with the second ZnF of Zif268 with a root mean square deviation of 0.82 Å over the ordered backbone atoms (Fig. 3C) (40).

BF3 Mutants Form Secondary Structure in a Zn(II)-dependent Fashion—Having established that the BF1-3 point mutants were all capable of DNA binding and that the wild-type BF3 domain formed a normal classical zinc finger fold, we sought to determine whether the mutants were able to bind Zn(II). The six mutants were also produced as single finger GST fusion proteins (termed BF3X). These overexpressed GST-BF3X proteins were subjected to glutathione-affinity chromatography, and thrombin was used to release the BF3X domains. However, all six mutant proteins exhibited partial cleavage by thrombin at secondary sites. Useable amounts of wild-type BF3, as well as mutants BF3D, BF3E, BF3N, and BF3A but not mutants BF3R or BF3Q were isolated by reverse phase HPLC. In order to ascertain whether the mutant domains were able to bind Zn(II), far UV CD spectra were recorded. The spectra of wild-type BF3 and each of the mutants at low pH (pH ~2) and in the presence of excess Zn(II) are typical of unstructured polypeptides. Surprisingly, however, an increase of the pH to 5.5 resulted in noticeable increases in the secondary structure content of all domains (Fig. 4A, black lines). This was manifested as a red shift of the minimum and, in the case of BF3, by the presence of positive ellipticity at low wavelengths. Spectra recorded at pH 5.5 in the absence of Zn(II) were similar to the low pH spectra (data not shown).



View larger version (15K):
[in this window]
[in a new window]
 
FIG. 4.
CCHX mutants contain substantial secondary structure. A, far UV CD spectra of both BF3 and BF3X mutants. Reported spectra are the sum of three scans collected at a rate of 20 nm min1 and using a bandwidth of 1 nm and a resolution time of 1 s (25 °C). All spectra are base line-corrected. Spectra recorded at pH 5.5 are shown as black lines, and the spectrum in gray was collected at pH ~2. B, determination of the zinc-binding affinity for BF3 and BF3N from the observed change in ellipticity at 199 nm. The fitted values of Ka are (4.6 ± 1.4) x 104 M (wild type, gray line) and (1.1 ± 0.6) x 105 M (BF3N, black line).

 

To assess more quantitatively whether Zn(II) binding is affected by these point mutations, the Zn(II)-binding affinities of the wild-type BF3 and the BF3N mutant were determined. Zn(II) was titrated into solutions of these domains, and the change in ellipticity at wavelengths between 195 and 200 nm was recorded (Fig. 4B). Surprisingly, nonlinear least squares fitting of the titration data revealed that BF3 and BF3N bound Zn(II) with similar affinities ((4.6 ± 1.4) x 104 M–1 and (1.1 ± 0.6) x 105 M–1, respectively).

BF3E Is Monomeric in Solution and BF3D Binds Zn2+ in a 1:1 Ratio—In order to determine the aggregation state of one of the BF3X mutants, BF3E was subjected to sedimentation equilibrium analysis. Concentration versus radial distance profiles (Fig. 5) were obtained at two different rotor speeds, and nonlinear least squares analysis using the software NONLIN (8) showed that the data fitted very well to an ideal single species model with a molecular mass of 3,500 ± 200 Da. This value is in good agreement with the theoretical mass of 3,638 Da for Zn-BF3E. The Zn(II)/protein ratios for wild-type BF3 and BF3D were determined by atomic absorption spectrometry; both were 1:1.



View larger version (19K):
[in this window]
[in a new window]
 
FIG. 5.
BF3E is monomeric in solution. Sedimentation equilibrium data for BF3E recorded at 42,000 rpm (25 °C). The lower graph displays a plot of absorbance at 230 nm versus r2/2 (cm2), whereas the upper graph illustrates the residual deviations resulting from the fit of an ideal single species model to the data.

 

BF3X Mutants Are Partially Folded in Solution—The one-dimensional 1H NMR spectrum of wild-type BF3 (Fig. 6A) contained sharp and well dispersed signals, which is indicative of a folded protein with a significant degree of tertiary structure. In contrast, the one-dimensional 1H NMR spectrum of the BF3E (Fig. 6B), while exhibiting a reasonable amount of chemical shift dispersion compared with the same protein at pH 2 (Fig. 6C), was much broader. Given that BF3E is monomeric in solution (see above), the broadness indicates that BF3E is undergoing substantial chemical exchange on the microsecond-millisecond time scale.



View larger version (25K):
[in this window]
[in a new window]
 
FIG. 6.
BF3E is probably partially folded. Amide regions of the one-dimensional 1H NMR spectra of BF3 and BF3E. A, wild-type BF3 at pH 5.5. B, BF3E at pH 5.5. C, BF3E at pH ~2.0. Spectra were recorded at 25 °C on a Bruker DRX600 NMR spectrometer.

 


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Zinc finger domains are abundant in the genomes of eukaryotic organisms. They are known to function largely as specific binding motifs that recognize targets such as DNA, RNA, and other proteins. It has generally been observed that zinc-binding domains form compact, stable structures in which the presence of the zinc ion is essential for the formation of that structure. The functions of these domains are generally thought to be dependent on the existence of stable structure, and all known zinc-binding domains (excluding those in metalloenzymes) ligate the zinc ion through four amino acid side chains.

Zinc- and DNA-binding Properties of the Mutants—The transcriptional repressor BKLF contains a tandem array of three classical CCHH zinc fingers that bind with high affinity to CACCC-containing DNA sequences. The affinity of this interaction (3 x 107 M–1) is comparable with those reported for other known three-ZnF constructs binding to their cognate DNA sequence; for example, the three CCHH ZnFs of Sp1 bind DNA with a Ka of 7 x 107 M–1 (41). We have shown here that the third zinc finger of BKLF is capable of making a substantial contribution to DNA binding even when the most C-terminal of its four normal zinc ligands (His341 in Fig. 1B) is mutated. The DNA binding affinity of the three-finger construct BF1-3 is only reduced 10-fold when the last zinc-ligating His mutated is Ala. Indeed, even a truncated construct lacking any residue at the fourth zinc-ligating position (BF1-3(–HMLV)) retains the ability to bind DNA, demonstrating that the DNA binding contributions made by this "broken" finger do not depend on the presence of four zinc ligands. A combination of CD, atomic absorption, UV-visible, and 1H NMR data indicates that the point mutants are capable of binding one molar equivalent of Zn(II), thereby forming substantial secondary structure. Surprisingly, the Zn(II)-binding affinity of the single BF3 finger appears not to be affected by the H -> N mutation. Sedimentation equilibrium data confirm that zinc binding is achieved by a single protein molecule (rather than by, for example, two molecules, each contributing two ligands).

Examination of the sequence of BF3 reveals that there are no other amino acids in the vicinity of His341 that could readily substitute for that residue as the fourth zinc ligand. Whereas it is conceivable that His333 could serve this role, such an arrangement would require a substantial structural rearrangement, and it is unlikely that the resulting structure would be capable of contributing to DNA binding. Taken together therefore, our data indicate that three ligands are sufficient (although not optimal) for Zn(II) binding in BKLF. This conclusion is supported by an assortment of previous reports. For example, a truncated C2H2 ZnF (that is missing its last zinc-ligating histidine) was found to coordinate Co(II) in a tetrahedral manner and with an affinity comparable with that of the intact ZnF (42). In a second study, it was observed that when one of the three classical ZnFs of Zif268 was mutated to a CCHA configuration, the protein retained its ability to bind DNA (43). Finally, Cook et al. (44) reported a mutant of the Saccharomyces cerevisiae transcriptional activator ADR1, in which the second histidine of the C-terminal CCHH finger (in a two-ZnF tandem array) was substituted to a tyrosine. The ability of this mutant to activate transcription was reduced but not abrogated, and it was postulated that three zinc chelators might be sufficient to bind Zn(II) and maintain the protein in its active form.

Conformation of the Mutants—The enhanced susceptibility of these mutants to proteolysis in E. coli, however, suggests that the mutants do not form compact tertiary structures. Both the CD and the 1H NMR data support this conclusion. Whereas considerable amounts of secondary structure were formed upon the addition of Zn(II), CD spectra of the mutants were still somewhat different from the wild-type spectrum. The broad nature of the 1H NMR spectrum of BF3E indicates the existence of interconverting conformers in a chemical exchange process. Given that the zinc-binding affinity for the BF3 His -> Asn mutant is not significantly different from that of the wild-type domain, it is likely that the chemical exchange arises from loose packing of the mutants, much like the molten globule state often discussed in the context of protein folding. However, this partial formation of structure is obviously sufficient to allow the recognition of DNA in the context of the three-zinc finger construct BF1-3, and it is possible that the mutated third finger forms more regular structure concomitantly with DNA binding. Alternatively, the NMR broadening may be a consequence of a change in the kinetics of metal binding; distinguishing these possibilities would be problematic.

The Implications for Other Studies—The identification of ZnFs from genomic sequence data relies in part on the presence of predictably spaced cysteine and histidine residues. Whereas a number of variant CCHX zinc fingers are picked up by automated methods, it is possible that others are not, given the apparent plasticity of the requirements for a functional ZnF. Results such as those presented here may assume increasing importance as the use of sequence data alone to infer protein function becomes more prevalent. Thus, our results indicate that care should be taken before presuming that CCHX zinc fingers found in putative proteins are nonfunctional or vestigial.

These data also cast a question on the routine use of alanine substitution to create null ZnFs in functional studies. Because it had been assumed that the structure and, hence, function of a ZnF depends on the ligation of Zn(II) by four ligands, alanine-substituted ZnFs were generally expected to be nonfunctional. This study indicates that, at least in the context of some sequences, alanine substitution mutants may retain significant residual ZnF structure and activity.

The CCHH zinc finger fold is a common scaffold from which proteins with different DNA binding specificities have been generated. It is the simple structure and small number of residues required to structurally stabilize the domains that makes them particularly versatile and adaptable. The high number of CCHH zinc finger genes in eukaryotic genomes suggests that they may have evolved early in evolution, and an intermediate containing only three zinc ligands that exhibited suboptimal function may have played a role in their evolution. Another interesting possibility is that the presence of an amino acid other than histidine and cysteine as a zinc ligand or the absence of one ligand may constitute a means by which ZnF proteins could be regulated. Whereas the CCHX ZnFs may still recognize DNA, their greater susceptibility to proteolytic degradation might reduce their cellular half-life. This idea might also be pertinent in the design of novel ZnFs as a method in which the bioavailability of designed ZnFs might be controlled.


    FOOTNOTES
 
The atomic coordinates and structure factors (code 1P7A) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).

* This work was supported by a grant from the Australian Research Council. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

{ddagger} Supported by Australian Postgraduate Awards. Back

§ An Australian Research Fellow. To whom correspondence should be addressed. Tel.: 61-2-9351-3906; Fax: 61-2-9351-4726; E-mail: j.mackay{at}mmb.usyd.edu.au.

1 The abbreviations used are: ZnF, zinc finger; EMSA, electrophoretic mobility shift assay; TCEP, tris(2-carboxyethyl)phosphine; HPLC, high pressure liquid chromatography; GST, glutathione S-transferase; BF3, BKLF-F3; TOCSY, total correlation spectroscopy; NOE, nuclear Overhauser effect; NOESY, NOE spectroscopy; HSQC, heteronuclear single quantum correlation spectroscopy. Back


    ACKNOWLEDGMENTS
 
We thank Dr. Bill Bubb for expert maintenance of the DRX600 NMR spectrometer at the University of Sydney and for valuable advice.



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Dawid, I. B., Breen, J. J., and Toyama, R. (1998) Trends Genet. 14, 156–162[CrossRef][Medline] [Order article via Infotrieve]
  2. Luscombe, N. M., Austin, S. E., Berman, H. M., and Thornton, J. M. (2000) Gen. Biol. 1, 001.001–001.037
  3. Mackay, J. P., and Crossley, M. (1998) Trends Biochem. Sci. 265, 1–4[CrossRef]
  4. Wolfe, S. A., Grant, R. A., Elrod-Erickson, M., and Pabo, C. O. (2001) Structure 9, 717–723[CrossRef][Medline] [Order article via Infotrieve]
  5. Pabo, C. O., Peisach, E., and Grant, R. A. (2001) Annu. Rev. Biochem. 70, 313–340[CrossRef][Medline] [Order article via Infotrieve]
  6. Crossley, M., Whitelaw, E., Perkins, A., Williams, G., Fujiwara, Y., and Orkin, S. H. (1996) Mol. Cell. Biol. 16, 1695–1705[Abstract]
  7. Sambrook, J., Fritsch, E. F, and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., pp. 10.59–10.61, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY
  8. Johnson, M. L., Correia, J. J., Yphantis, D. A., and Halvorson, H. R. (1981) Biophys. J. 36, 575–588[Abstract]
  9. Perkins, S. J. (1986) Eur. J. Biochem. 157, 169–180[Abstract]
  10. Hayes, D. B., Laue, T., and Philo, J. (1995) SEDNTERP, 1.05, University of New Hampshire, Durham, NH
  11. Bax, A., and Davis, D. G. (1985) J. Magn. Reson. 65, 355–360
  12. Rance, M., Sorensen, O. W., Bodenhausen, G., Wagner, G., Ernst, R. R., and Wuthrich, K. (1983) Biochem. Biophys. Res. Commun. 117, 479–485[Medline] [Order article via Infotrieve]
  13. Kumar, A., Ernst, R. R., and Wüthrich, K. (1980) Biochem. Biophys. Res. Commun. 95, 1–6[Medline] [Order article via Infotrieve]
  14. Vuister, G., and Bax, A. (1993) J. Am. Chem. Soc. 115, 7772–7777
  15. Marion, D., Driscoll, P. C., Kay, L. E., Wingfield, P. T., Bax, A., Gronenborn, A. M., and Clore, G. M. (1989) Biochemistry 28, 6150–6156[Medline] [Order article via Infotrieve]
  16. Kowalski, K., Czolij, R., King, G. F., Crossley, M., and Mackay, J. P. (1999) J. Biomol. NMR 13, 249–261[CrossRef][Medline] [Order article via Infotrieve]
  17. Wuthrich, K. (1986) NMR of Proteins and Nucleic Acids, Wiley Interscience, New York
  18. Guntert, P., Mumenthaler, C., and Wuthrich, K. (1997) J. Mol. Biol. 273, 283–298[CrossRef][Medline] [Order article via Infotrieve]
  19. Ludvigsen, S., and Poulsen, F. M. (1992) J. Biomol. NMR 2, 227–233[Medline] [Order article via Infotrieve]
  20. Pelton, J. G., Torchia, D. A., Meadow, N. D., and Roseman, S. (1993) Protein Sci. 2, 543–558[Abstract/Free Full Text]
  21. Neuhaus, D., Nakaseko, Y., Schwabe, J. W., and Klug, A. (1992) J. Mol. Biol. 228, 637–651[Medline] [Order article via Infotrieve]
  22. Brunger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J. S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T., and Warren, G. L. (1998) Acta Crystallogr. D Biol. Crystallogr. 54, 905–921[CrossRef][Medline] [Order article via Infotrieve]
  23. Nilges, M. (1995) J. Mol. Biol. 245, 645–660[CrossRef][Medline] [Order article via Infotrieve]
  24. Nilges, M., Macias, M. J., O'Donoghue, S. I., and Oschkinat, H. (1997) J. Mol. Biol. 269, 408–422[CrossRef][Medline] [Order article via Infotrieve]
  25. Nilges, M. (1997) Fold Des. 2, S53–S57[Medline] [Order article via Infotrieve]
  26. Hendrickson, W. A. (1985) Methods Enzymol. 115, 252–270[Medline] [Order article via Infotrieve]
  27. Folmer, R. H., Hilbers, C. W., Konings, R. N., and Nilges, M. (1997) J. Biomol. NMR 9, 245–258[CrossRef][Medline] [Order article via Infotrieve]
  28. Linge, J. P., Williams, M. A., Spronk, C. A., Bonvin, A. M., and Nilges, M. (2003) Proteins 50, 496–506[CrossRef][Medline] [Order article via Infotrieve]
  29. Linge, J. P., and Nilges, M. (1999) J. Biomol. NMR 13, 51–59[CrossRef][Medline] [Order article via Infotrieve]
  30. Koradi, R., Billeter, M., and Wuthrich, K. (1996) J. Mol. Graph. 14, 51–55[CrossRef][Medline] [Order article via Infotrieve]
  31. Laskowski, R. A., Rullmannn, J. A., MacArthur, M. W., Kaptein, R., and Thornton, J. M. (1996) J. Biomol. NMR 8, 477–486[Medline] [Order article via Infotrieve]
  32. Vriend, G., and Sander, C. (1993) J. Appl. Crystallogr. 26, 47–60[CrossRef]
  33. Liew, C. K., Kowalski, K., Fox, A. H., Newton, A., Sharpe, B. K., Crossley, M., and Mackay, J. P. (2000) Structure 8, 1157–1166[CrossRef][Medline] [Order article via Infotrieve]
  34. Bach, I. (2000) Mech. Dev. 91, 5–17[CrossRef][Medline] [Order article via Infotrieve]
  35. Demarest, S. J., Martinez-Yamout, M., Chung, J., Chen, H., Xu, W., Dyson, H. J., Evans, R. M., and Wright, P. E. (2002) Nature 415, 549–553[CrossRef][Medline] [Order article via Infotrieve]
  36. Radhakrishnan, I., Perez-Alvarado, G. C., Parker, D., Dyson, H. J., Montminy, M. R., and Wright, P. E. (1997) Cell 91, 741–752[Medline] [Order article via Infotrieve]
  37. Schwabe, J., and Klug, A. (1994) Nat. Struct. Biol. 1, 345–349[Medline] [Order article via Infotrieve]
  38. Summers, M. F., South, T. L., Kim, B., Hare, D. R. (1990) Biochemistry 29, 329–340[Medline] [Order article via Infotrieve]
  39. Narayan, V. A., Kriwacki, R. W., and Caradonna, J. P. (1997) J. Biol. Chem. 272, 7801–7809[Abstract/Free Full Text]
  40. Wang, B., Grant, R., and Pabo, C. (2001) Nat. Struct. Biol. 8, 589–593[CrossRef][Medline] [Order article via Infotrieve]
  41. Nagaoka, M., Kondo, Y., Uno, Y., and Sugiura, Y. (2002) Biochem. Biophys. Res. Commun. 296, 553–559[Medline] [Order article via Infotrieve]
  42. Merkle, D. L., Schmidt, M. H., and Berg, J. M. (1991) J. Am. Chem. Soc. 113, 5450–5451
  43. Green, A., and Sarkar, B. (1998) Biochem. J. 333, 85–90[Medline] [Order article via Infotrieve]
  44. Cook, W. J., Mosley, S. P., Audino, D. C., Mullaney, D. L., Rovelli, A., Stewart, G., and Denis, C. L. (1994) J. Biol. Chem. 269, 9374–9379[Abstract/Free Full Text]
  45. Pavletich, N. P., and Pabo, C. O. (1991) Science 252, 809–817[Medline] [Order article via Infotrieve]