©1995 by The American Society for Biochemistry and Molecular Biology, Inc.
Co-crystallization of an ETS Domain (PU.1) in Complex with DNA
ENGINEERING THE LENGTH OF BOTH PROTEIN AND OLIGONUCLEOTIDE (*)

(Received for publication, July 24, 1995; and in revised form, August 9, 1995)

Frédéric Pio Chao-Zhou Ni Richard S. Mitchell John Knight Scott McKercher Michael Klemsz (§) Angela Lombardo Richard A. Maki Kathryn R. Ely (¶)

From the Cancer Research Center, La Jolla Cancer Research Foundation, La Jolla, California 92037

ABSTRACT
INTRODUCTION
MATERIALS AND METHODS
RESULTS AND DISCUSSION
FOOTNOTES
ACKNOWLEDGEMENTS
REFERENCES

ABSTRACT

The PU.1 transcription factor is a member of the ets gene family of regulatory proteins. These molecules play a role in normal development and also have been implicated in malignant processes such as the development of erythroid leukemia. The Ets proteins share a conserved DNA-binding domain (the ETS domain) that recognizes a purine-rich sequence with the core sequence: 5`-C/AGGAA/T-3`. This domain binds to DNA as a monomer, unlike many other DNA-binding proteins. The ETS domain of the PU.1 transcription factor has been crystallized in complex with a 16-base pair oligonucleotide that contains the recognition sequence. The crystals formed in the space group C2 with a = 89.1, b = 101.9, c = 55.6 Å, and beta = 111.2° and diffract to at least 2.3 Å. There are two complexes in the asymmetric unit. Production of large usable crystals was dependent on the length of both protein and DNA components, the use of oligonucleotides with unpaired A and T bases at the termini, and the presence of polyethylene glycol and zinc acetate in the crystallization solutions. This is the first ETS domain to be crystallized, and the strategy used to crystallize this complex may be useful for other members of the ets family.


INTRODUCTION

Transcription factors bind to target DNA sequences and regulate important metabolic functions such as cell growth, development, and differentiation. The PU.1 (spi-1, sfpi-1) transcription factor (1) is a member of the ets gene family, a recently discovered family of regulatory proteins. There are now more than 35 members in this family that have been identified in various organisms from Drosophila to humans (reviewed in (2) and (3) ). These molecules play a role in normal development and have been implicated in malignant processes such as erythroid leukemia and Ewing's sarcoma (4) . The Ets proteins share a conserved region of approximately 85 amino acids known as the ETS domain (5) that serves as a DNA-binding domain and recognizes a purine-rich sequence with the core sequence, 5`-C/AGGAA/T-3`.

Ets proteins differ in size and in the relative position of the ETS domain. For example, the domain is found near the carboxyl-terminal end of the molecule in PU.1 ((1) ; see Fig. 1) and the ets-1 and ets-2 proteins(6, 7) , in the middle of the sequence in Erg(8) , and within the amino-terminal region in Elk-1(9) . The remaining sequences in Ets proteins are presumed to form other functional domains such as activation domains or inhibitory domains that mask the DNA binding site (10, 11) . (^1)The ETS domain is sufficient for DNA binding and binds to DNA as a monomer, unlike many other DNA-binding proteins.


Figure 1: Schematic representation of the PU.1 protein. The sequence of the full-length protein encompasses the activation domain, a PEST region, and the ETS domain which is located at the carboxyl-end of the molecule (reviewed in (2) ). The site of phosphorylation (S148) that influences protein-protein interactions is labeled(18) . Below the molecule, the amino acid sequences for the termini of the two recombinant fragments tested for crystallization are listed. The shorter segment extending from residues 168 to 260 was cloned first; however, this fragment was not a stable protein for structural studies. The longer segment corresponded to residues 160 to 272 which is the actual carboxyl terminus of the full-length PU.1 molecule. This protein was extremely soluble and monodisperse in solution. The amino-terminal serine of this fragment results from the cloning strategy and is not part of the wild-type sequence.



Recently, the folding pattern of the DNA-binding domain of Fli-1, an ets family protein, was described by NMR analysis(12) . The domain consists of 3 alpha-helices and a four-stranded antiparallel beta-sheet. Features of this secondary structure (13) as well as that of the murine ets-1 domain (14) are very similar to the winged helix-turn-helix motif in DNA-binding proteins such as CAP (15) and HNF-3(16) . In order to define precisely the protein-DNA contacts, we co-crystallized the ETS domain of the PU.1 transcription factor in complex with cognate DNA.

The PU.1 transcription factor is expressed in hematopoietic cells and specifically in B cells, macrophages, neutrophils, and mast cells(1, 2) . The sequence of PU.1 is identical with the oncogene Spi-1(17) . Spi-1 is activated in the erythroid leukemia induced by spleen focus forming virus. Integration of spleen focus forming virus upstream of the Spi-1/PU.1 gene results in overexpression of the Spi-1/PU.1 protein. This event is associated with the development of erythroid leukemia. The PU.1 molecule has been shown to interact with other nuclear proteins. For example, PU.1 binds to the 3` enhancer sequence of the Ig-kappa gene in complex with a second factor NF-EM5 (PIP)(18, 19) . Formation of the ternary complex of PU.1, NF-EM5, and DNA is dependent on PU.1 binding to the core GGAA sequence and phosphorylation of serine 148 in PU.1(18) . The sites of protein-protein interaction and phosphorylation are immediately adjacent and amino-terminal to the DNA-binding domain.

There are several subfamilies of Ets proteins that appear to have arisen by gene duplication of a primordial gene(3) . The amino acid sequence of PU.1 is the most divergent from ets-1, yet there is 40% sequence homology in the DNA-binding domains of these proteins. Fourteen residues are strictly conserved in the DNA-binding domain when all ETS domains are compared. Here we report a strategy to clone and express a recombinant fragment encompassing the ETS domain of PU.1 for structural studies. Successful co-crystallization with DNA was dependent on the length of the protein fragment and also on the length of the synthetic oligonucleotide bound to the fragment. It has been shown in studies of other DNA-binding proteins (reviewed in (20, 21, 22) ) that alteration of the length of DNA oligonucleotides is important to optimize crystallization of the protein-DNA complex. Recently, an extensive analysis of conditions to produce crystals of the U1A-RNA complex was reported(23) . In that study, varying the length of RNA hairpins as well as utilization of mutant proteins was necessary to produce high quality crystals. The results of the screening of both protein and RNA components were used to propose a general strategy for crystallization of protein-RNA complexes. Since this is the first ETS domain to be crystallized, the details of the selection and production of the protein and DNA components of the complex will be described here. Because of the strong sequence homology of the DNA-binding domains, similar strategies may be useful for successful crystallization of ETS domains from other members of the ets family.


MATERIALS AND METHODS

Cloning and Expression of the PU.1 DNA-binding Domain

The DNA-binding domain of PU.1 was cloned in the pET11 expression vector (24) by polymerase chain reaction amplification of the DNA-binding domain from the full-length mouse PU.1 cDNA as described previously (1) . DNA sequence analysis was used to verify that the sequence of the amplified product was identical with the original clone. For bacterial expression, pET plasmid constructs were used to transform Escherichia coli BL21(DE3)pLysS cells. A preculture of 50 ml of LB medium (25) and ampicillin (100 µg/ml) was inoculated with a single colony from freshly transformed BL21(DE3)pLysS cells bearing the DNA-binding domain insert. After an overnight incubation at 37 °C, this preculture was used to inoculate 7.5 liters of LB-ampicillin media. Cells were grown overnight at 26 °C in an aerated fermentor (Microferm, New Brunswick, NJ). The next morning, 2.5 liters of LB-ampicillin buffered at pH 7.4 with sodium phosphate were added to the culture. Expression of protein was induced at 26 °C with the addition of 1 mM isopropyl-1-thio-beta-D-galactopyranoside. After 4 h, cells were harvested by centrifugation and stored as a paste at -70 °C.

Purification of PU.1 DNA-binding Domain

Cell pellets from 1 liter of culture were resuspended in 200 ml of lysis buffer (20 mM Tris-HCl, pH 7.5, 200 mM NaCl, 2 mM EDTA, and 0.1 mM phenylmethylsulfonyl fluoride). Cells were lysed on ice by sonication, cell debris was cleared by centrifugation at 17,000 rpm and 4 °C for 60 min, and the concentration of sodium chloride in the supernatant was adjusted to 1 M. Polyethyleneimine was added to a final concentration of 0.2% and precipitation proceeded with gentle mixing for 30 min on ice. The precipitate was removed by centrifugation at 15,000 rpm and 4 °C for 30 min. The supernatant solution was dialyzed at pH 7.5 against 20 mM Tris-HCl, 60 mM NaCl, and 0.1 mM phenylmethylsulfonyl fluoride and then centrifuged again before application to CM-Sepharose Fast-Flow resin. The ETS domain was isolated by ion exchange chromatography at 4 °C with a linear NaCl gradient (60 mM to 1.2 M). Fractions containing the DNA-binding domain were pooled and concentrated by ultrafiltration. The domain was purified to homogeneity by gel filtration on a Sephacryl S-100 (Pharmacia) molecular sizing matrix at pH 7.4 in phosphate-buffered saline and 0.02% sodium azide. Purified protein was concentrated to 0.5 mM, quick-frozen, and stored in aliquots at -70 °C.

Purification of Selenomethionine-substituted Protein

In order to produce modified protein for structure solution by multiwavelength anomalous dispersion phasing methods(26) , recombinant PU.1 DNA-binding domain was produced with selenomethionine substituted for methionine. Bacterial cells (E. coli strain B834; Novagen, Inc.) which are auxotrophic for methionine (BL21DE3met-) were used to express the DNA-binding domain. Competent B834 cells were freshly transformed with the pET11 vector containing the domain. For expression of the modified protein, a preculture of 50 ml of LB-ampicillin medium was inoculated with a single colony and incubated at 37 °C. After 16 h, 5 ml of this preculture was used to inoculate 1 liter of M9 medium (25) containing 100 µg/ml ampicillin supplemented with 50 µg/ml selenomethionine (Sigma) and 2 mg/liter each of biotin and thiamine. Cells were grown at room temperature until the absorbance at A reached 0.15, and expression of recombinant protein was induced by the addition of 1 mM isopropyl-1-thio-beta-D-galactopyranoside. After 16 h, cells were harvested by centrifugation and stored at -70 °C. The selenomethionine-substituted protein was purified by procedures described for the native domain. The extent of selenomethionine substitution was evaluated by amino acid analysis and mass spectrometry.

DNA Synthesis and Purification

DNA oligonucleotides of various lengths were synthesized on a 10-µm scale using phosphoramidite chemistry with an Applied Biosystems Model 394 DNA/RNA synthesizer. Derivatized oligonucleotides were synthesized by substituting iodinated uracil phosphoramidites (Glen Research Laboratories) for thymine phosphoramidites. After the last cycle, the oligonucleotides were cleaved from the solid support, and protecting groups on exocyclic amines were removed by treatment with ammonium hydroxide according to manufacturer's protocols before lyophilization. Oligonucleotides were purified by reverse phase HPLC (^2)on a Vydac C4 column at 56 °C using an acetonitrile gradient in 100 mM triethylammonium bicarbonate buffer (pH 8.5). Fractions containing the full-length oligonucleotide were pooled and acetonitrile was removed by dialysis against triethylammonium bicarbonate buffer. The oligonucleotides were desalted in 20% ethanol on Bio-Gel P2 resin (Bio-Rad Laboratories, Inc.), lyophilized twice, and stored in aliquots at -70 °C.

Before co-crystallization, DNA extinction coefficients were calculated for each oligonucleotide strand(27) , and complementary strands were mixed in equimolar ratios in 5 mM Mes, 200 mM NaCl, pH 7.0, to a final concentration of 0.5 mM. Strands were annealed by heating the mixture to 95 °C and slowly cooling over a few hours to 20 °C.

Space Group Determination and X-ray Data Collection

Crystals were characterized for diffraction using a Rigaku RU-200 rotating anode x-ray source with a graphite monochromator operating at 50 kV and 100 mA, two San Diego Multiwire Systems area detectors, and the UCSD data processing programs(28) . Initial characterization and space group determination were performed at room temperature; however, the crystals were sensitive to x-ray exposure. Therefore, all crystals used for this study were cryoprotected in solutions of polyethylene glycol (PEG) and methylpentanediol and immediately frozen in a nylon loop in a cooled nitrogen stream. X-ray data were collected at -145 °C using a cryocooling device and a liquid nitrogen-cooled gas stream (Molecular Structures, Inc.).


RESULTS AND DISCUSSION

Screening of Protein Fragments

Two different recombinant proteins were generated that each include the minimal DNA-binding domain. These fragments are shown in Fig. 1. The two fragments differ in length at both the amino- and carboxyl-terminal ends of the sequence. The amino-terminal sequence and amino acid composition of these fragments indicated that the purified proteins lacked the amino-terminal methionine, probably as a result of proteolytic cleavage by methionyl aminopeptidase(29) .

We first generated a protein of 93 amino acids corresponding to residues 168 to 260 since this region encompassed the minimal DNA-binding domain identified by deletion analysis(1) . After expression and purification, when this fragment was tested by dynamic light scattering, the protein solution was monodisperse (results not shown) which was a preliminary indication that the recombinant molecule was suitable for crystallization trials(30) . However, when the protein was concentrated beyond 5 mg/ml, the fragment formed aggregates and insoluble precipitates. Moreover this fragment was susceptible to proteolytic degradation upon prolonged storage. These observations suggested that the fragment was not folded correctly and that the molecule was not a good candidate for crystallization. After extensive screening, no crystals were obtained with this fragment alone. Only small crystals were observed for this fragment in complex with DNA, and these crystals were difficult to reproduce.

In order to generate a fragment with improved solubility properties, a strategy to alter the length of the molecule was implemented. The design of a construct to produce the longer fragment shown in Fig. 1was based on secondary structure predictions and an alignment of multiple ETS domain sequences. This analysis indicated that the predicted secondary structure of the sequence at the amino-terminal boundary of the short fragment was not consistent for members of the ets family. For PU.1, this region was predicted to form an alpha-helix, while in the majority of other ets family sequences, beta-strands were predicted. Therefore, the amino-terminal sequence of the new construct was extended to the boundary of the PEST domain excluding a region at the end of the PEST region that is a conserved hydrophilic sequence (see Fig. 1). At the carboxyl terminus, the sequence was extended to the end of the full-length PU.1 molecule. The long fragment encoded by this construct corresponded to residues 160 to 272. After expression and purification, this fragment was remarkably soluble up to concentrations of 60 mg/ml and remained monodisperse in solution even at these high concentrations and after prolonged storage at -70 °C. Despite the optimal physical properties of this fragment, it is surprising that the molecule never crystallized alone even with extensive screening using incomplete factorial (31) and sparse matrix (32) crystallization trials.

Co-crystallization with DNA Oligonucleotides

Some DNA-binding proteins crystallize only when complexed to specific cognate oligonucleotides (reviewed in (21, 22) ). In many of the complexes crystallized to date, the ends of the DNA fragments interacted in the crystal lattice to form an extended, distorted DNA helix with base-paired interactions between adjacent DNAs in the crystal lattice. In this respect, the oligonucleotides direct the orientation of the complex in the crystal. The PU.1 DNA-binding domain recognizes a purine-rich sequence having a core sequence of 5`-GGAA-3`. The sequences of the oligonucleotides used in this study were identified by screening random sequence oligonucleotides.^1 A number of oligonucleotides were chemically synthesized each of which included the PU box sequence and differed in length. As shown in Fig. 2, oligonucleotides with termini that provide blunt-ended or overhanging bases were tested for co-crystallization.


Figure 2: Oligonucleotides tested in co-crystallization trials. Each of the oligonucleotides listed was synthesized for co-crystallization with the PU.1 domain. The sequences differ in length and termini flanking a core sequence shown in the box at the top of the figure. The core sequence contains the GGAA recognition sequence for PU.1 (bold). In each oligonucleotide, the lines represent the repetition of this same core sequence. The oligonucleotides were designed to provide both blunt-ended duplex DNA fragments and fragments that have unpaired T or A bases at the termini. The latter were tested because they have the potential for end-to-end stacking in the crystal lattice. The best success with the production of sizable crystals was achieved with two oligonucleotides with a 5`-AT overhang (marked with asterisks). The shorter of the two fragments, i.e. 16 bp in length, was used to produce diffraction-quality crystals. Other oligonucleotides with unpaired termini were designed to permit Hoogsteen base-pairing between DNA fragments within the crystal lattice. Although the PU.1 DNA binding domain bound these DNA fragments, crystals were never obtained for complexes formed with these oligonucleotides.



The quality of the oligonucleotides was critical for successful co-crystallization. In particular, care was taken to achieve >95% homogeneous oligonucleotide by reverse-phase HPLC. The chromatographic separations were run at 56 °C to avoid the formation of secondary structure during purification. Full-length oligonucleotides were eluted from the C4 column with an acetonitrile-triethylammonium bicarbonate gradient. Purification using other gradients or performed on ion exchange resins did not produce oligonucleotides that were adequate for crystallization. After extensive dialysis to remove acetonitrile, each purified oligonucleotide was concentrated by successive lyophilizations from dilute ammonium bicarbonate and was finally desalted in 20% ethanol with a Bio-Gel P2 column. Complete desalting was critical for the formation of large crystals. In fact, DNA heterogeneity or contaminating ions were factors that inhibited crystal growth or produced showers of poorly formed crystals.

Prior to mixing with protein, duplex DNA was annealed by heating to 95 °C and cooling slowly to 20 °C. Molar extinction coefficients were calculated for each strand (22) to ensure that the strands to be annealed were present in equimolar concentrations. Duplex DNA molecules shown in Fig. 2were mixed with freshly thawed PU.1 protein in molar ratios of 2:1 or 1:1 DNA:protein. In each case, complex formation was verified using a gel shift electrophoretic assay (results not shown). DNA binding was tested with both of the protein fragments. Solubility testing and precipitation analyses were also performed with selected complexes before crystallization trials. The solubility of the protein-DNA complexes was diminished relative to the proteins alone, particularly as compared to the longer PU.1 fragment. In fact, some of the complexes precipitated immediately upon mixing. These precipitates could be redissolved by the addition of NaCl or could be prevented if NaCl was present in the protein solution prior to the addition of DNA. Optimal conditions for mixing PU.1 with DNA were carefully defined yet were dependent on the presence of NaCl at concentrations that varied for each complex.

PU.1-DNA complexes were formed with each of the oligonucleotides shown in Fig. 2and each of the two PU.1 fragments. Using UV absorbance measurements at 278 nm for protein components and at 260 nm for DNA samples, the final concentration of the complex was estimated at 0.2 mM to 0.4 mM. These complexes were screened for crystallization using the sparse matrix method(32) , starting with oligonucleotides >20 bp in length. Trials were set up using vapor diffusion and hanging drops. In these initial screens, crystals grew from conditions that are typical for protein-DNA complexes, i.e. neutral pH, polyethylene glycol (PEG), and divalent cations(33) .

For complexes with the short protein fragment, only small crystals were obtained in most of the trials. In one case, somewhat larger crystals were observed when the protein was complexed to a 20-bp blunt-ended oligonucleotide, but these crystals could not be improved by complementary screening with shorter oligonucleotides or DNAs with overhanging bases. In contrast, complexes formed with the longer protein fragment were more amenable to screening. The best crystals for this complex initially formed with a 23-bp oligonucleotide with an AT overhang (see Fig. 2). Crystals of this complex were observed in several drops of the screen. The similarity of conditions in each of these trials suggested that sodium acetate was essential for crystallization. Tests altering the pH and acetate concentration produced larger crystals of the complex (0.2 times 0.1 times 0.05 mm) after 2 months.

In order to improve these crystals, shorter oligonucleotides were designed. Those with the AT overhang were given priority in the screening. When the long protein fragment was complexed with a 16-bp oligonucleotide with an AT overhang, crystals formed readily as expected; however, under the conditions described above, only crystals with an irregular morphology were obtained. With further screening, well-shaped crystals were produced in drops that contained PEG and zinc acetate. It is interesting that a number of the helix-turn-helix proteins have been crystallized from PEG solutions containing acetate ions. For example, the heat shock factor was crystallized from PEG 4000 and ammonium acetate(34) , HNF-3 transcription factor from potassium acetate (without PEG; (16) ), NF-kappaB-50-DNA complex from sodium acetate and PEG 8000(36) , paired homeodomain from ammonium acetate and PEG 1000(37) , and even-skipped homeodomain from potassium acetate and PEG 8000(38) . It appears from this summary that it is a good strategy to test the acetate ion in trials to crystallize helix-turn-helix proteins. Since the presence of zinc acetate produced significant improvement of the PU.1-DNA complex, it is possible that both ions will represent favorable conditions for crystallizing ETS domains. Evaluation of the general utility of these ions awaits the crystallization of other ETS domains.

To our knowledge, this is the first report of a helix-turn-helix protein-DNA complex crystallized in the presence of zinc acetate. In other families of DNA-binding proteins, such as zinc-finger proteins (39) or the diphtheria toxin repressor(40) , zinc ions were necessary for crystallization because these molecules have discrete binding sites for the zinc ions in coordination with residues such as histidines or cysteines. In the case of ETS domains, it is possible that the zinc ions also stabilize the protein structure, but identification of the sites for zinc binding awaits the elucidation of the crystal structure.

The PU.1-DNA complex crystals diffracted to 3.5 Å and were improved further by altering the concentration and molecular weight of the PEG used as precipitant. Lower PEG concentrations reduced twinning and excess nucleation. A dramatic improvement in crystal morphology was achieved by substituting PEG 600 for PEG 8000. For the production of large crystals, 5 µl of complex were mixed on a siliconized coverslip with 5 µl of a reservoir solution containing 100 mM sodium cacodylate, pH 6.5, 3-10% PEG 600, and 200 mM zinc acetate. After mixing, the coverslips were inverted and sealed over the reservoir. Parallelopiped crystals formed at 19 °C in 3 to 5 days. In some cases, macroseeding (41) was used to produce large crystals. Crystals were washed free of mother liquor, dissolved, and subjected to nondenaturing gel electrophoresis to confirm the presence of complex.

Diffraction Analyses

These crystals were strongly birefringent and diffracted to at least 2.3-Å resolution. However, the crystals began to dissolve and crack when stored for more than 1-2 weeks and were very sensitive in the x-ray beam. It is interesting that this instability is frequently reported for protein-DNA complex crystals(21) . Therefore, crystals were flash-frozen before diffraction experiments in cryoprotectant solutions of 8% PEG 600 and 30% methylpentanediol. A single crystal was quickly transferred from the crystallization drop to the cryoprotectant solution, then picked up in a loop and immediately frozen with a cooled nitrogen stream. After freezing, the crystals were extremely stable in the x-ray beam at -145 °C with no significant decay after 2.5 days of data collection. Flash-freezing did not alter the space group nor significantly change the cell dimensions of the crystals.

The crystals of the PU.1-DNA complex belong to the space group C2 with a = 89.1, b = 101.9, c = 55.6 Å, and beta = 111.2°. Assuming a molecular mass for the complex of 22,800 daltons, calculations of the cell dimensions were consistent with V(42) of 2.58 Å/dalton, solvent content of 48%, and two complexes in the asymmetric unit. These calculations were confirmed by experimental measurements of the crystal density(43) . A native data (98% complete) set has been collected at -145 °C to 2.3 Å resolution. The data collection statistics are presented in Table 1. The diffraction pattern displayed strong reflections near 3.5 Å that result from scattering of B-DNA which indicated that the DNA oligonucleotides lie approximately along the b axis.



Heavy Atom Searches

Two approaches are being used to obtain heavy atom substitutions for phase calculation. The first approach is to covalently modify the protein and/or DNA components of the complex prior to crystallization and the second is to soak complex crystals in solutions containing heavy metal compounds. In the first strategy, the long PU.1 domain was prepared as a selenomethionine-substituted protein by expression of the recombinant molecule in bacterial culture with selenomethionine as the sole source of methionine. There are 3 methionines in the long PU.1 fragment, and substitution of the 3 residues by selenomethionine was confirmed by amino acid analysis (data not shown). The extent of substitution was 70-86% complete in different cultures. The modified protein was co-crystallized in complex with DNA. Large diffraction-quality crystals of this complex were produced that are isomorphous with the native crystals.

In order to modify the DNA for heavy atom substitution, halogenated bases (i.e. iodine-substituted uridine for thymine) are suitable for multiple isomorphous replacement methods (e.g.(35) ). Several iodinated oligonucleotides were synthesized chemically and crystallized in complex with the DNA-binding domain. Iodinated oligonucleotides were tested for binding to the PU.1 molecule by gel shift analyses before co-crystallization. Large isomorphous crystals were obtained with several of these modified oligonucleotides. Besides serving as sites for heavy atom substitution, the iodines may also serve as markers to orient the DNA in the crystal lattice. Since the axis of the DNA is known from the strong reflections in the diffraction pattern, the positions of the iodines at different sites on different oligonucleotides should define the direction of the DNA in the first electron density maps.

Finally, crystals of the native complex are being soaked in heavy atom compounds to produce substitutions for multiple isomorphous replacement phase calculations. Diffraction data for complexes with modified protein and/or DNA are being collected using flash-frozen crystals and ultra-low temperature data collection.

Summary

The production of large diffraction quality crystals of the PU.1 ETS domain in complex with DNA was achieved by a strategy that combined varying the length of both the protein and DNA components of the complex. The DNA fragments used in this study were critical to the successful crystallization for several reasons. Apparently, end-to-end stacking of the oligonucleotides is needed for nucleation of crystal growth since the majority of crystals obtained were from complexes with overhanging bases. Furthermore, the length of the oligonucleotide was important since complexes containing longer oligonucleotides, especially those in the range of 20-23 bp, did not diffract strongly, probably as a result of spacious unoccupied volumes in the crystal lattice. It is interesting that the optimal length for the DNA was 16 bp which corresponds to the length of DNA protected from nuclease cleavage in footprint analyses(1) .

While the shorter DNA oligonucleotides were best for crystallization, the longer protein fragment exhibited the ideal physical properties for solubility, DNA binding, and complex crystallization. It is possible that there is an ideal ratio of size of protein to length of DNA for successful crystallization. This ratio relates directly to the shape of the protein component, rather than the oligonucleotide, because the overall shape of the B-DNA is regular and cylindrical. In cases where end-to-end stacking occurs in the crystal, the DNA forms elongated ``fiber-like'' features arranged side-by-side in the lattice. Since the protein component is usually globular, packing of the bound protein within the lattice formed by neighboring DNA oligonucleotides is important for growth of a three-dimensional crystal. With the parameters reported here and homology-based sequence alignments, it may be possible to design similar protein and DNA fragments to crystallize other ETS domains.


FOOTNOTES

*
This work was supported by United States Department of the Army Grant DMD17-94J-4439 (to K. R. E.) and by National Institutes of Health Grant AI20194 (to R. A. M.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore by hereby marked ``advertisement'' in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.

§
Present address: Dept. of Microbiology and Immunology, Indiana University School of Medicine, Indianapolis, IN 46202-5120.

To whom correspondence should be addressed: La Jolla Cancer Research Foundation, 10901 North Torrey Pines Rd., La Jolla, CA 92037. Tel.: 619-455-6480; Fax: 619-455-0181.

(^1)
M. Klemsz and R. A. Maki, unpublished results.

(^2)
The abbreviations used are: HPLC, high performance liquid chromatography; PEG, polyethylene glycol; Mes, 2-(N-morpholino)ethanesulfonic acid; bp, base pair(s).


ACKNOWLEDGEMENTS

We are grateful to Kelly Riddle-Hilde for preparing the manuscript for publication and Muizz Hasham for graphics illustrations. Also, we thank Drs. Craig Hauser and Nuria Assa-Munt for helpful suggestions for bacterial culture and Khanh Nguyen for advice on HPLC chromatography.


REFERENCES

  1. Klemsz, M. J., McKercher, S. R., Celada, A., Van Beveren, C., and Maki, R. A. (1990) Cell 61,113-124 [Medline] [Order article via Infotrieve]
  2. Moreau-Gachelin, F. (1994) Biochim. Biophys. Acta 1198,149-163 [CrossRef][Medline] [Order article via Infotrieve]
  3. Wasylyk, B., Hahn, S. L., and Giovane, A. (1993) Eur. J. Biochem. 211,7-18 [Abstract]
  4. Hromas, R., and Klemsz, M. (1994) Int. J. Hematol. 59,257-265 [Medline] [Order article via Infotrieve]
  5. Karim, F., Urness, L., Thummel, C., Klemsz, M., McKercher, S., Celada, A., Van Beveren, C., Maki, R., Gunther, C., and Nye, J. (1990) Genes & Dev. 4,1451-1453
  6. Watson, D. K., McWilliams, M. J., Lapis, P., Lautenberger, J. A., Schweinfest, C. W., and Papas, T. S. (1985) Proc. Natl. Acad. Sci. U. S. A. 82,7294-7298 [Abstract]
  7. Reddy, E. S. P., and Rao, V. N. (1988) Oncogene Res. 3,239-246 [Medline] [Order article via Infotrieve]
  8. Reddy, E. S., Rao, V. N., and Papas, T. S. (1987) Proc. Natl. Acad. Sci. U. S. A. 84,6131-6135 [Abstract]
  9. Rao, V. N., Huebner, K., Isobe, M., Ar-Rusdi, A., Croce, C. M., and Reddy, E. S. P. (1989) Science 244,66-70 [Medline] [Order article via Infotrieve]
  10. Wasylyk, C., Kerckaert, J.-P., and Wasylyk, B. (1992) Genes & Dev. 6,965-974
  11. Lim, F., Kraut, N., Frampton, J., and Graf, T. (1992) EMBO J. 11,643-652 [Abstract]
  12. Liang, H., Mao, X., Olejniczak, E. T., Nettesheim, D. G., Yu, L., Meadows, R. P., Thompson, C. B., and Fesik, S. W. (1994) Nature Struct. Biol. 1,871-876 [Medline] [Order article via Infotrieve]
  13. Liang H., Olejniczak, E. T., Mao, X., Nettesheim, D. G., Yu, L., Thompson, C. B., and Fesik, S. W. (1994) Proc. Natl. Acad. Sci. U. S. A. 91,11655-11659 [Abstract/Free Full Text]
  14. Donaldson, L. W., Petersen, J. M., Graves, B. J., and McIntosh, L. P. (1994) Biochemistry 33,13509-13516 [Medline] [Order article via Infotrieve]
  15. Schultz, S. C., Shields, G. C., and Steitz, T. A. (1991) Science 253,1001-1007 [Medline] [Order article via Infotrieve]
  16. Clark, K. L., Halay, E. D., Lai, E., and Burley, S. K. (1993) Nature 364,412-420 [CrossRef][Medline] [Order article via Infotrieve]
  17. Moreau-Gachelin, F., Ray, D., Mattei, M. G., Tambourin, R., and Tavitian, A. (1989) Oncogene 4,1449-1456 [Medline] [Order article via Infotrieve]
  18. Pongubala, J. M. R., Van Beveren, C., Nagulapalli, S., Klemsz, M. J., McKercher, S. R., Maki, R. A., and Atchison, M. L. (1993) Science 259,1622-1625 [Medline] [Order article via Infotrieve]
  19. Eisenberg, C. F., Singh, H., and Storb, U. (1995) Genes & Dev. 9,1377-1387
  20. Jordan, S. R., Whitcombe, T. V., Berg, J. M., and Pabo, C. O. (1985) Science 230,1383-1385 [Medline] [Order article via Infotrieve]
  21. Harrison, S. C., and Sauer, R. T. (1994) Curr. Opin. Struct. Biol. 4,1-66
  22. Dock-Bregeon, A.-C., and Moras, D. (1992) in Crystallization of Nucleic Acids and Proteins: A Practical Approach (Ducruix, A., and Geig é , R., eds) pp. 145-174, IRL Press, New York
  23. Oubridge, C., Ito, N., Teo, C.-H., Fearnley, I., and Nagai, K. (1995) J. Mol. Biol. 249,409-423 [CrossRef][Medline] [Order article via Infotrieve]
  24. Studier, F. W., Rosenberg, A. H., Dunn, J. J., and Dubendorff, J. W. (1988) Methods Enzymol. 185,60-89
  25. Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989) Molecular Cloning: A Laboratory Manual , 2nd Ed, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY
  26. Hendrickson, W. A. (1991) Science 254,51-58 [Medline] [Order article via Infotrieve]
  27. Puglisi, J. D., and Tinoco, I. J. (1989) Methods Enzymol. 180,304-325 [Medline] [Order article via Infotrieve]
  28. Howard, A. J., Nielsen, C., and Xuong, N. H. (1985) Methods Enzymol. 114,452-472 [Medline] [Order article via Infotrieve]
  29. Hirel, P.-H., Schmitter, J.-M., Dessen, P., Fayat, G., and Blanquest, S. (1989) Proc. Natl. Acad. Sci. U. S. A. 86,8247-8251 [Abstract]
  30. Ferré-D'Amaré, A. R., and Burley, S. K. (1994) Bio/Technology 7,1157-1161
  31. Carter, C. W., Jr., and Carter, C. W. (1979) J. Biol. Chem. 254,12219-12223 [Medline] [Order article via Infotrieve]
  32. Jancaric, J., and Kim, S.-H. (1991) J. Appl. Crystallog. 24,409-411 [CrossRef]
  33. Aggarwal, A. K. (1990) in Methods: A Companion to Methods in Enzymology (Carter, C. W., Jr., ed) Vol. 1, pp. 83-90, Academic Press, Inc., New York
  34. Harrison, C. J., Bohm, A. A., and Nelson, H. C. M. (1994) Science 263,224-227 [Medline] [Order article via Infotrieve]
  35. Ferré-D'Amaré, A. R., Prendergast, G. C., Ziff, E. B., and Burley, S. K. (1993) Nature 363,38-45 [CrossRef][Medline] [Order article via Infotrieve]
  36. Muller, C. W., Rey, F. A., Sodeoka, M., Verdine, G. L., and Harrison, S. C. (1995) Nature 373,287-288 [Medline] [Order article via Infotrieve]
  37. Xu, W., Rould, M. A., Jun, S., Desplan, C., and Pabo, C. O. (1995) Cell 80,639-650 [Medline] [Order article via Infotrieve]
  38. Hirsch, J. A., and Aggarwal, A. K. (1995) Proteins 21,268-271 [Medline] [Order article via Infotrieve]
  39. Pavletich, N. P., and Pabo, C. O. (1991) Science 252,809-817 [Medline] [Order article via Infotrieve]
  40. Qiu, X., Verlinde, C. L. M. J., Zhang, S., Schmitt, M. P., Holmes, R. K., and Hol, W. G. J. (1995) Structure 3,87-100 [Medline] [Order article via Infotrieve]
  41. Stura, E. A., and Wilson, I. A. (1990) in Methods: A Companion to Methods in Enzymology (Carter, C. W., Jr., ed) Vol. I, pp. 38-49, Academic Press, Inc., New York
  42. Matthews, B. W. (1968) J. Mol. Biol. 33,491-497 [Medline] [Order article via Infotrieve]
  43. Low, B. W., and Richards, F. M. (1952) J. Am. Chem. Soc. 74,1660-1666

©1995 by The American Society for Biochemistry and Molecular Biology, Inc.