Investigation of the Pyrimidine Preference by the c-Myb DNA-binding Domain at the Initial Base of the Consensus Sequence*

(Received for publication, January 31, 1997, and in revised form, April 17, 1997)

Masayuki Oda Dagger , Koji Furukawa Dagger , Kazuhiro Ogata §, Akinori Sarai par , Shunsuke Ishii par , Yoshifumi Nishimura ** and Haruki Nakamura Dagger Dagger Dagger

From the Dagger  Biomolecular Engineering Research Institute, Furuedai, Suita, Osaka 565, § Kanagawa Academy of Science and Technology (KAST), Kanazawa-ku, Yokohama 236,  Department of Structural Biology, School of Medicine, Yokohama City University, Kanazawa-ku, Yokohama 236, par  Tsukuba Life Science Center, Institute of Physical and Chemical Research (RIKEN), Tsukuba, Ibaraki 305, and ** Graduate School of Integrated Science, Yokohama City University, Kanazawa-ku, Yokohama 236, Japan

ABSTRACT
INTRODUCTION
EXPERIMENTAL PROCEDURES
RESULTS
DISCUSSION
FOOTNOTES
REFERENCES


ABSTRACT

The principal determinant of the pyrimidine preference by the c-Myb DNA-binding domain at the initial base of the consensus sequence was investigated by mutation of both the protein and the DNA base pairs, with analysis by a filter binding assay. Amino acid residue 187 was revealed to interact with the pyrimidine base position, as estimated from our previous complex structure. Unexpectedly, since the pyrimidine preference is retained even in the Gly187 mutant, the principal origin of the base specificity should not occur via the direct-readout mechanism, but by an indirect-readout mechanism, namely in the intrinsic "bendability" of the pyrimidine-purine step of the DNA duplex. A significant but rather small positive base pair roll is detectable in the conformation of DNA in complex with the c-Myb DNA-binding domain. Following the conventional chemical rules of the direct-readout mechanism, amino acid mutagenesis at position 187 yielded several new base preferences for the protein.


INTRODUCTION

Specific interactions between proteins and DNA are critical to gene expression and regulation, so a general readout mechanism of the information encoded in DNA has been sought (1-3). However, a number of complex structures of DNA duplexes and proteins determined at atomic resolution have revealed that nature uses a great variety of readout mechanisms (4-11).

In most complex structures, the direct-readout mechanism is mediated by intermolecular hydrogen bond networks and hydrophobic interactions between DNA duplexes and proteins. The interaction modes have been classified into: (i) the intrinsic chemical features of bases and amino acids (1, 3, 12), and (ii) the stereochemical relations between the amino acids and the bases inside the DNA major grooves (12, 13). In contrast, the indirect-readout mechanism works in several systems, such as the trp repressor/operator (4, 5), where the DNA bases are specifically recognized by proteins without the use of particular hydrogen bonds or non-polar contacts. Instead, each sequence-dependent deformation of the DNA conformation stabilizes the characteristic geometry of the phosphate backbone, which directly interacts with the protein through polar contacts (6). Water molecules are often observed to mediate the specific interaction through additional hydrogen bonds (4, 5). One common type of DNA deformation is a steep kink of the duplex (7-9), which substantially contributes to readout of the minor groove (8, 9).

In general, a combination of the direct- and indirect-readout mechanisms results in specific base pair recognition. In other words, both the specific binding affinity and the DNA bending contribute to the free energy of complex formation (6, 14). This situation has made it difficult to determine how each consensus base sequence is recognized by the corresponding protein, even when the precise complex structure is known.

The c-myb gene product (c-Myb) is a transcriptional activator that specifically binds to DNA fragments containing the consensus sequence PyAAC(G/T)G, where Py indicates a pyrimidine (15-17). The DNA-binding domain (DBD)1 of c-Myb consists of three imperfect 51- or 52-residue repeats (designated R1, R2, and R3 from the N terminus) (18-20). The last two repeats, R2 and R3, are sufficient for the recognition of the specific DNA sequences (20, 21). NMR analysis revealed that both R2 and R3 contain three helices, and the third helix in each is a recognition helix (22-24). R2 and R3 are closely packed in the major groove, so that the two recognition helices directly contact each other to bind cooperatively to the specific base sequence.

In the complex of c-Myb R2R3 with the Myb-binding DNA sequence (MBS-I), the consensus A4, the counterpart guanine of C6, and the last G8 directly interact with Asn183 in R3, Lys182 in R3, and Lys128 in R2, respectively (Fig. 1) (23). The strong cooperativity between R2 and R3 originates from the putative polar interactions between the side chains of Glu132 and Asn179, and between those of Arg131 and Asp178. However, it is not clear why the initial Py corresponding to the third base position in the MBS-I fragment is preferred by c-Myb R2R3, although this Py3 is less specific than the other A4, A5, C6, and G8 sites in the consensus DNA sequence (17). In our NMR structure shown in Fig. 1, Ser187 is the only candidate that interacts with the T3 base, and this ability was suggested in our previous paper (23). The hydroxyl group in the Ser side chain could form a hydrogen bond with the O4 oxygen of the T3 base, either directly or through water molecules.


Fig. 1. Specific interaction between the c-Myb R2R3 and the MBS-I fragment (23). The side chains of Lys128, Lys182, Asn183, and Ser187 are indicated by the thick black lines, and the consensus bases Py3(T3), A4, A5, C6, and G8 are indicated by the thick white lines. The black and white thin lines are the DNA double strands. The backbone of the protein is shown by a pipe model. The figure was drawn with the program InsightII (Molecular Simulations Inc., San Diego).
[View Larger Version of this Image (65K GIF file)]

Thus far, the Myb-homologous DBD has been found in over 30 proteins from many species. An alignment of the DBDs shows that the Ser at position 187 is highly conserved in the animal sequences, whereas it is variable in the plant sequences (23, 25).

Here, to investigate the role of Ser187 and the origin of this pyrimidine preference at the third base position, both Ser187 in the c-Myb R2R3 and the third T-A base pair in the 22-mer MBS-I fragment containing the Myb-binding site were substituted by other amino acids and other base pairs, respectively. The interactions between them were examined using a filter binding assay, whose efficiency has already been shown (17, 26, 27). The recognition mechanism will be discussed.


EXPERIMENTAL PROCEDURES

Plasmids and Site-directed Mutagenesis

A DNA fragment encompassing R2R3 (Leu90-Val193) in the DNA-binding domain of c-Myb was amplified by polymerase chain reaction, using pact-c-myb (28) as the template and two synthetic primers, to generate an NcoI site and a BamHI site at the 5'- and the 3'-end of the amplified fragment, respectively. After digestion with NcoI and BamHI, the DNA fragment was cloned into pAR2156NcoI (17) to yield the expression plasmid, pRP23. An additional Met-Glu- sequence was introduced at the N terminus of R2R3. Site-directed mutagenesis was performed by two-step polymerase chain reaction, as described by Higuchi (29). Here the name of each mutant protein is indicated as, for example, C130I/S187G for the simultaneous mutations that replace Cys130 with Ile and Ser187 with Gly.

Protein Expression and Purification

Escherichia coli BL21(DE3) was transformed with the wild type and mutant plasmids (30). Freshly precultivated cells were inoculated into growth medium containing 100 µg/ml ampicillin and were grown at 37 °C. When the culture reached an A600 of about 0.4, isopropyl-1-thio-beta -D-galactopyranoside was added to a final concentration of 0.5 mM. The cells were cultured at 22 °C for another 12 h. The harvested cells were suspended in 50 mM Tris-HCl buffer (pH 7.8) containing 5 mM MgCl2, and were lysed by sonication at 4 °C. After the cell debris was removed by centrifugation, ammonium sulfate was added to the supernatant to 50% saturation. After an incubation at 4 °C for 1 h, the supernatant was dialyzed against 50 mM potassium phosphate buffer (pH 7.5) containing 200 mM NaCl, and was then applied to a phosphocellulose column (Whatman, P11). The purified fractions were pooled, and the buffer was exchanged to 100 mM potassium phosphate buffer (pH 7.5) containing 20 mM KCl. The protein concentrations were determined from UV absorption at 280 nm and were calculated by using the molar absorption coefficient of 3.7 × 104 M-1 cm-1 (17).

CD Measurements

Circular dichroism (CD) spectra were measured at 20 °C on a Jasco J-600 spectropolarimeter equipped with a water-circulating cell holder. The spectra were obtained in 100 mM potassium phosphate buffer (pH 7.5) containing 20 mM KCl, using a 0.2-cm optical path length cell. The protein concentration was 0.1 mg/ml. CD spectra between 200 and 250 nm were obtained using a scanning speed of 20 nm/min, a time response of 1 s, a bandwidth of 1 nm, and an average over 8 scans.

Preparation of Oligonucleotides

The 22-mer oligonucleotide CACCCTAACTGACACACATTCT, containing the Myb-binding site in the simian virus 40 enhancer sequence (MBS-I) (31), and the third base substituted variants were synthesized and purified by high performance liquid chromatography with a C18 reverse-phase column (Fig. 2). The purified DNA was suspended in STE (10 mM Tris-HCl (pH 8.0), 100 mM NaCl, 1 mM EDTA), and complementary strands were annealed and end-labeled with [gamma -32P]ATP (Amersham) using T4 polynucleotide kinase (Toyobo, Osaka, Japan). The labeled DNAs were purified by passage through spin columns (Pharmacia Biotech Inc., HR-300). Here the name of each variant DNA is indicated as, for example, [C3]MBS-I, for the substitution of the T-A base pair at the third position by a C-G base pair.


Fig. 2. Sequences of the cognate 22-mer MBS-I and the third base pair substituted MBS-I fragments. The base numbering follows Ref. 23. The third base pair is denoted in bold letters, and the consensus base sequence is surrounded by a box.
[View Larger Version of this Image (16K GIF file)]

Filter Binding Assay

All filter binding assays for the protein-DNA binding were carried out essentially as described (32-34). [32P]DNA and various amounts of the c-Myb R2R3 mutant proteins were incubated in 100 µl of binding buffer (100 mM potassium phosphate buffer (pH 7.5), 20 mM KCl, 0.1 mM EDTA, 500 µg/ml bovine serum albumin, and 5% (v/v) glycerol) on ice for 30 min. The final concentration of the [32P]DNA in binding buffer was 0.4 nM, which was always a lower concentration than the Kd value. The incubated samples were filtered through a nitrocellulose membrane (Schleicher & Schuell, BA-85, 0.45 µm) in approximately 10 s with suction. The filters were dried and counted by a liquid scintillation counter. The equilibrium dissociation constants Kd were obtained from the binding titration curve, based on the least square fitting to the normalized bound DNA (y) with the protein concentration (x) using the formula, y = x/(x + Kd).


RESULTS

Prior to the mutational analyses of Ser187, the Cys130 in R2, which is the only cysteine residue in the c-Myb R2R3 and is located at a position equivalent to an isoleucine in R3, was replaced with Ile, to facilitate the protein purification and the DNA-binding assay (35). It was reported that this mutation has little effect on DNA binding (36). The affinity of the C130I mutant was also measured in our own assay system, and it was shown to be almost equal to that of the wild type, and to maintain the pyrimidine preference at the third base position (Table I).

Table I. Dissociation constants for the cognate 22-mer MBS-I fragments and the third base-pair substituted variants with the Ser187-substituted mutants


Protein Kd
T3 C3 A3 G3

nM
Wild-typea 3.2 3.7 8.7 25
C130I 5.5 5.7 12 27
C130I/S187G 18 17 33 37
C130I/S187A 9.3 12 22 24
C130I/S187T 15 11 21 36
C130I/S187N 26 37 15 53
C130I/S187Q 36 39 54 39
C130I/S187V 13 8.9 37 34
C130I/S187L 130 73  >= 103 72
C130I/S187K 61 22  >= 103 36
C130I/S187R 43 22  >= 103 21
C130I/S187D  >= 103  >= 103  >= 103  >= 103

a An additional Met-Ala-sequence was introduced at the N terminus of R2R3, which was used in the NMR experiment (23).

A series of 10 amino acids, Gly, Ala, Thr, Asn, Gln, Val, Leu, Lys Arg, and Asp, were introduced into position 187 of the c-Myb R2R3, which is a Ser residue in the wild type. The purity of each mutant protein was about 95%, as monitored by SDS-polyacrylamide gel electrophoresis. All of the mutant proteins have secondary structure contents similar to the wild type, as confirmed by the CD spectra at the far UV region (Fig. 3). The perfect coincidence of all the spectra suggests that the global tertiary structures of the mutant proteins were not deformed.


Fig. 3. Far-UV CD spectra of the wild type c-Myb R2R3 and the 10 Ser187-substituted mutants. All of them are superimposed. The vertical scale is normalized by the molar concentration.
[View Larger Version of this Image (18K GIF file)]

The binding affinities of the mutants to the cognate 22-mer MBS-I fragments and the third base pair substituted variants were analyzed using the filter binding assay, and the results are summarized in Table I. All measurements were repeated at least twice, and typical experimental errors for the Kd value were less than 10%, although the retention efficiency was 20 ± 10% depending on the experimental conditions. From the methylation interference experiments (17) and the NMR analyses (23), the number of bound DNA duplexes per the c-Myb is considered to be one within the concentration used in this assay. As already indicated in the previous experiments (17, 26, 27), the filter binding assay was validated for this investigation.

The C130I/S187G mutant protein binds about one-third less strongly to the cognate MBS-I than the standard C130I mutant. The relative binding free energy change for the replacement of Ser with Gly, calculated from the Kd values, is 0.65 kcal/mol. It should correspond to the free energy derived from the interaction between the Ser side chain and the T3 base. This Gly mutant preferentially binds to both the cognate [T3]MBS-I and the substituted [C3]MBS-I. That is, even when residue 187 has no side chain, the mutant protein prefers the third pyrimidine as well as the wild type and the C130I mutant proteins.

The substitutions of Ser187 by Ala (C130I/S187A), Thr (C130I/S187T), or Val (C130I/S187V) reveal slightly reduced binding affinities, although the sequence specificities are retained like the standard C130I. In contrast, the C130I/S187N mutant preferentially binds to the [A3]MBS-I. The affinity for the A3 base is similar to that of the wild type, although those for the other three bases (T, C, and G) are greatly reduced, by approximately one-half to one-sixth. The specific interaction between the Asn residue and the A3 base closely follows the intrinsic chemical features. Interestingly, for the substitution by Gln, which is one methylene group longer than Asn, the C130I/S187Q mutant loses the preference for the A3 base. Also, in the case of the C130I/S187L mutant, in which Leu is one methylene group longer than Val, the binding affinity is greatly reduced.

The mutant proteins C130I/S187K and C130I/S187R, which introduced basic amino acids into position 187, specifically prefer to bind to the [G3]MBS-I and [C3]MBS-I variants. In contrast, for the substitution of Ser187 by acidic Asp (C130I/S187D), the binding affinity is completely reduced and is no longer sequence-specific.


DISCUSSION

Thus far, many amino acid replacements in the c-Myb R2R3 have been created and assayed by specific DNA binding (37-39), and almost all of their effects have been explained by the specific polar contacts between the R2R3 and the DNA in the three-dimensional structure of the R2R3-DNA complex (23). The current mutational study clearly indicates that residue 187 in R2R3 is also able to interact with the T3 base, as estimated from the geometry of Ser187 in the NMR complex structure (23). This specific DNA-binding mode is very different from the telomeric DNA recognition by the yeast RAP1-DBD (40), whose amino acid sequence is weakly homologous to that of the c-Myb R2R3.

However, the substitution of Ser187 with Gly, Ala, or Val unexpectedly resulted in only about a 3-fold decrease in the binding affinity toward any base, which would be a consequence of a direct-readout mechanism, while the pyrimidine base preference at the third position in the MBS-I fragment was retained. Ser is thought to have weak specificity, because its side chain can act as either a hydrogen bond donor or an acceptor, and thus can bind to any base. Nevertheless, Ser187 of the R2R3 preferentially binds to the pyrimidine bases. If this interaction were attributable only to the direct-readout mechanism, then the substitution of Ser187 should have resulted in an over 100-fold reduction of the binding affinity and a loss of the sequence specificity, like the substitution of Lys128 by Ala (23), and those of Asn136 and Asn186 by Ala (38). These results suggest that the preference of the pyrimidine bases at the third position of MBS-I should occur primarily by an indirect-readout mechanism.

In our previous structural study, no distinct deformation of the global DNA conformation was observed (23). However, when the local bending of the DNA duplex was carefully analyzed in 25 NMR complex structures and the refined average structure (Protein Data Bank codes 1MSF and 1MSE, respectively), significantly positive roll angles were always observed between the third pyrimidine and the fourth purine, as indicated by an arrow in Fig. 4. Characteristic negative slides (-1.1 ± 0.3 Å) were also observed at the same pyrimidine-purine step, corresponding to positive rolling, while the twist angles at this step were 34.1 ± 2.3°, nearly equal to the twist angle in standard B-form DNA.


Fig. 4. The roll angles of individual base pair steps of the MBS-I DNA fragment complexed with the c-Myb R2R3 (23), calculated using a program developed by Babcock et al. (41). Each filled circle with a straight line and an error bar is the average and the standard deviation for the 25 NMR structures (Protein Data Bank code 1MSF), respectively. Each open circle with a dashed line indicates the rolling of the refined average structure (code 1MSE). The arrow indicates the pyrimidine-purine step between T3 and A4. The consensus bases are indicated in bold letters.
[View Larger Version of this Image (21K GIF file)]

Similar significant, positive rolls at pyrimidine-purine steps are general phenomena (42), observed in many complex crystal structures of repressors and homeodomains with the helix-turn-helix motif, as summarized in Table II. In every case, as a part of the consensus base sequence, the base pair roll bends the DNA so that the recognition helix is wrapped by the DNA duplex in the major groove (3, 42). Consequently, a large contact area is created between the recognition helix and the DNA major groove, facilitating the preferable polar contacts between the protein side chains and the DNA phosphate backbone. The local roll in the MBS-I fragment may be associated with the small magnitude of observed bending in long DNA duplexes bound with the c-Myb R2R3 (52). This bending may be enhanced by other regions in the protein, like the transactivation domain.

Table II. Significant roll angles of DNA duplexes at the pyrimidine-purine steps in the protein-DNA crystal structures with the helix-turn-helix motif


Protein-DNA (PDB/resolution) Pyrimidine-purine stepa Rollb (degree) References

CAPc-DNA30 C5CA6C/T26CG27C 38.9 (43)
(1CGP/3.0 Å) C5DA6D/T26DG27D 30.4
434Crod-OR1 C6AA7A/T15BG16B 7.8 (44)
(3CRO/2.5 Å) T16AG17A/C5BA6B 13.8
434Re-OR1 C6AA7A/T15BG16B 6.1 (45)
(2OR1/2.5 Å) T16AG17A/C5BA6B 5.4
434R-OR2 C26AA27A/T15BG16B 7.9 (46)
(1RPE/2.5 Å) T36AG37A/C5BA6B 6.5
434R-OR3 C6AA7A/T15BG16B 1.8 (47)
(1PER/2.5 Å) T16AG17A/C5BA6B 9.0
 lambda Rf-OL1 C6(1)A7(1)/T35(2)G36(2) 9.2 (48)
(1LMB/1.8 Å) T15(1)G16(1)/C26(2)A27(2) 11.3
trpRg-trp O T6IA7I/T14JA15J 8.5 (4)
(1TRO/1.9 Å) T14IA15I/T6JA7J 9.0
T6KA7K/T14LA15L 11.8
T14KA15K/T6LA7L 14.7
trpRg-trp O T9CA10C/T9IA10I 10.0 (5)
(1TRR/2.4 Å) T9FA10F/T9LA10L 9.7
MATa1/alpha 2h-DNA21 T5CG6C/C39CA40C 12.0 (49)
(1YRN/2.5 Å) T15CA16C/T29CA30C 13.3
C17CA18C/T27CG28C 14.4
Oct-1 POUi-DNA15 T205AG206A/C226BA227B 8.2 (50)
(1OCT/3.0 Å) C207AA208A/T224BG225B 11.5
Engrailed HDj-DNA21 T11AA12A/T32BA33B 3.7 (51)
(1HDD/2.8 Å) T15AA16A/T28BA29B 9.6

a The residue and chain identifiers are those as registered in the Protein Data Bank (PDB).
b The roll angles were calculated using the program rna (41).
c E. coli catabolite gene activator protein.
d Phage 434 Cro protein.
e Phage 434 repressor.
f lambda repressor.
g E. coli trp repressor.
h MATa1/MATalpha 2 homeodomain heterodimer.
i Oct-1 POU domain.
j Engrailed homeodomain.

Due to the intrinsic propeller-twist of the DNA base pairs, the pyrimidine-purine step has two stable conformations, with rolling of 0° and around 10° (53, 54), from the physical requirements of the base stacking (55). There is negligible additional free energy cost required for the 10° rolling at the pyrimidine-purine step, even for a free DNA duplex without a protein. This is the physical origin of the so-called "bendability" of kinked DNA duplexes, commonly observed in the minor groove readout mechanism (8, 9). At the other pyrimidine-pyrimidine, purine-purine, and purine-pyrimidine steps, no such tendency toward a strongly bistable step is observed (53). In fact, the binding free energy differences between the pyrimidine bases and the purine bases at the third base position for the current Gly187, Ala187, and Val187 mutants are 0.4 ± 0.1 kcal/mol, as calculated from the dissociation constants in Table I.

Fig. 5 shows the results of the relative binding free energy changes Delta Delta G toward the C130I/S187G mutant: Delta Delta G = Delta Gbind (mutant against the third N base) - Delta Gbind (C130I/S187G against the same third N base), where Delta Gbind = RT ln Kd. Here, the difference was calculated while keeping the same third position base pair. We can now separate the bendability effect from the total binding free energies between the c-Myb R2R3 mutants and the variety of DNA sequences, unless the binding modes vary from the wild type. Each positive and negative free energy corresponds to a decrease and an increase of the binding affinity, depending upon the intrinsic chemical features of the amino acids and the bases, and subtracting the DNA bending effect.


Fig. 5. Relative binding free energy changes (Delta Delta G) toward the C130I/S187G mutant protein binding of the Ser187-substituted R2R3 mutant proteins to the cognate MBS-I and its variants. Each Delta Delta G for the particular base pair was calculated from the Kd values, as described under "Discussion." The side chain volumes (Å3) of the substituted amino acids (56) are indicated in the parentheses below the individual residues.
[View Larger Version of this Image (45K GIF file)]

For the Ala substitution, the binding affinity is increased as compared with Gly187, independent of the bases at the third position, probably due to the hydrophobic contacts. When the side chain volume is larger in the Val substitution, a similar binding affinity to the pyrimidines remains, but the affinity becomes neutral to the purines. Therefore, the volume of space created between residue 187 and the third base may allow at most the Val-pyrimidine pair, but the Val-purine pair would be slightly too large for the space. In fact, other amino acids, such as Leu and Gln, with larger side chain volumes than Val, significantly lack binding affinity, as indicated in Fig. 5. Moreover, the Val-, Leu-, and Gln-substituted mutants always have lower affinities for adenine than for guanine. This is also supported by the fact that the amino N6 of adenine occupies a larger volume than the oxygen O6 of guanine, which should be located at the position nearest to the side chain of residue 187. 

From this consideration of the space volume around residue 187 and the third base, the native and the optimum interaction between Ser187 and T3 should be mediated by water molecules, as long as the binding mode is assumed to be the same in all of the mutant proteins and DNAs. In the Thr mutant, the disposition of the water molecules could be different from that in the wild type, thus yielding a slight decrease in the binding affinity. Since there is no possible conformation on the helix in which the methyl group of the Thr side chain would be able to access the methyl group in T3, as shown in a modeling study, a specific non-polar contact between the Thr mutant and T3 is not expected.

Following the conventional chemical rules for specific binding between amino acids and bases (1, 3, 12), the current Asn mutant specifically binds to the A3 base relative to the other bases, as indicated in Fig. 5. The Asn side chain size is less than that of Val, and there should be enough space for the Asn-adenine pair, resulting in the formation of direct hydrogen bonds with a free energy gain of about 0.5 kcal/mol. In addition, the Lys and Arg mutant proteins prefer to bind to the G3 base. From their intrinsic chemical nature, both basic amino acids can bind to the guanine base almost exclusively by electrostatic interaction. In contrast, these mutant proteins bind to the [A3]MBS-I and [T3]MBS-I bases with only weak affinity, probably because of the bulky side chains of the amino acids, like the Leu mutant. It is interesting that their long side chains seem to interact with the guanine base on the opposite side of C3. The acidic Asp substitution results in a severe reduction of its DNA binding, which is much lower than the Gly substitution, suggesting that the Asp side chain cannot interact with any base, including cytosine, in this geometry. Rather, the negative ionic charge may disturb other specific hydrogen bonds between the protein and the DNA.

The wild type protein and the C130I mutant with Ser187 bind to the cognate DNA most tightly among the mutant proteins, and their Kd values are in the nanomolar order. Generally, transcriptional regulator proteins bind to their target genes with greater affinity (57). These results are consistent with the conservation of Ser in position 187 of c-Myb among animal species (23). In contrast, among plant species, the amino acid in this position varies (25). This suggests that the recognition mode in the plant Myb homologues may be different from that of the c-Myb DBD from animal species. In fact, in the case of the yeast RAP1 domain 1, the corresponding Val409 residue does not interact with the DNA in the complex structure (40), although the free domain structure is similar to that of the c-Myb R3.

In conclusion, the current mutational analysis revealed that the pyrimidine preference of the native c-Myb DBD for the initial base of the consensus sequence originates principally in the intrinsic positive roll at the pyrimidine-purine step of the DNA duplex. For the purine-purine step, as much as 0.4 kcal/mol of additional free energy would be necessary, corresponding to the bendability. When these bending energies are separated, the conventional chemical rules between the amino acids and the bases are distinctively observed in the c-Myb R2R3 mutants.

It is still difficult to extract a definite "recognition code" from the variety of DNA information readout mechanisms. The situation becomes much more complicated when the DNA flexibility is considered. Only a screening technology, such as a phage display library (58-60), would be expected to reveal a novel, specific form of DNA recognition, instead of an artificial molecular design. However, based upon the complex structure and the mutational analysis, one may be able to dissect the sequence specific affinity into the DNA bendability and the specific interaction between the amino acids and the bases. Without this kind of precise analysis, we may never reach a complete understanding of the readout mechanism, nor produce any novel devices for molecular readout.


FOOTNOTES

*   The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Dagger Dagger    To whom all correspondence should be addressed. Tel.: 81-6-872-8212; Fax: 81-6-872-8219.
1   The abbreviations used are: DBD, DNA-binding domain; MBS, Myb-binding DNA sequence.

REFERENCES

  1. Seeman, N. C., Rosenberg, J. M., and Rich, A. (1976) Proc. Natl. Acad. Sci. U. S. A. 73, 804-808 [Abstract]
  2. Lehming, N., Sartorius, J., Kisters-Woike, B., von Wilcken-Bergmann, B., and Müller-Hill, B. (1990) EMBO J. 9, 615-621 [Abstract]
  3. Pabo, C. O., and Sauer, R. T. (1992) Annu. Rev. Biochem. 61, 1053-1095 [CrossRef][Medline] [Order article via Infotrieve]
  4. Otwinowski, Z., Schevitz, R. W., Zhang, R.-G., Lawson, C. L., Joachimiak, A., Marmorstein, R. Q., Luisi, B. F., and Sigler, P. B. (1988) Nature 335, 321-329 [CrossRef][Medline] [Order article via Infotrieve]
  5. Lawson, C. L., and Carey, J. (1993) Nature 366, 178-182 [CrossRef][Medline] [Order article via Infotrieve]
  6. Travers, A. A. (1992) Curr. Opin. Struct. Biol. 2, 71-77
  7. Winkler, F. K., Banner, D. W., Oefner, C., Tsernoglou, D., Brown, R. S., Heathman, S. P., Bryan, R. K., Martin, P. D., Petratos, K., and Wilson, K. S. (1993) EMBO J. 12, 1781-1795 [Abstract]
  8. Kim, Y., Geiger, J. H., Hahn, S., and Sigler, P. B. (1993) Nature 365, 512-520 [CrossRef][Medline] [Order article via Infotrieve]
  9. Schumacher, M. A., Choi, K. Y., Zalkin, H., and Brennan, R. G. (1994) Science 266, 763-770 [Medline] [Order article via Infotrieve]
  10. Klimasauskas, S., Kumar, S., Roberts, R. J., and Cheng, X. (1994) Cell 76, 357-369 [Medline] [Order article via Infotrieve]
  11. Vassylyev, D. G., Kashiwagi, T., Mikami, Y., Ariyoshi, M., Iwai, S., Ohtsuka, E., and Morikawa, K. (1995) Cell 83, 773-782 [Medline] [Order article via Infotrieve]
  12. Suzuki, M., and Yagi, N. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 12357-12361 [Abstract/Free Full Text]
  13. Suzuki, M. (1993) EMBO J. 12, 3221-3226 [Abstract]
  14. Pontiggia, A., Rimini, R., Harley, V. R., Goodfellow, P. N., Lovell-Badge, R., and Bianchi, M. E. (1994) EMBO J. 13, 6115-6124 [Abstract]
  15. Biedenkapp, H., Borgmeyer, U., Sippel, A. E., and Klempnauer, K.-H. (1988) Nature 335, 835-837 [CrossRef][Medline] [Order article via Infotrieve]
  16. Weston, K. (1992) Nucleic Acids Res. 20, 3043-3049 [Abstract]
  17. Tanikawa, J., Yasukawa, T., Enari, M., Ogata, K., Nishimura, Y., Ishii, S., and Sarai, A. (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 9320-9324 [Abstract]
  18. Gonda, T. J., Gough, N. M., Dunn, A. R., and de Blaquiere, J. (1985) EMBO J. 4, 2003-2008 [Abstract]
  19. Klempnauer, K.-H., and Sippel, A. E. (1987) EMBO J. 6, 2719-2725 [Abstract]
  20. Sakura, H., Kanei-Ishii, C., Nagase, T., Nakagoshi, H., Gonda, T. J., and Ishii, S. (1989) Proc. Natl. Acad. Sci. U. S. A. 86, 5758-5762 [Abstract]
  21. Howe, K. M., Reakes, C. F. L., and Watson, R. J. (1990) EMBO J. 9, 161-169 [Abstract]
  22. Ogata, K., Hojo, H., Aimoto, S., Nakai, T., Nakamura, H., Sarai, A., Ishii, S., and Nishimura, Y. (1992) Proc. Natl. Acad. Sci. U. S. A. 89, 6428-6432 [Abstract]
  23. Ogata, K., Morikawa, S., Nakamura, H., Sekikawa, A., Inoue, T., Kanai, H., Sarai, A., Ishii, S., and Nishimura, Y. (1994) Cell 79, 639-648 [Medline] [Order article via Infotrieve]
  24. Ogata, K., Morikawa, S., Nakamura, H., Hojo, H., Yoshimura, S., Zhang, R., Aimoto, S., Ametani, Y., Hirata, Z., Sarai, A., Ishii, S., and Nishimura, Y. (1995) Nat. Struct. Biol. 2, 309-320 [Medline] [Order article via Infotrieve]
  25. Bilaud, T., Koering, C. E., Binet-Brasselet, E., Ancelin, K., Pollice, A., Gasser, S. M., and Gilson, E. (1996) Nucleic Acids Res. 24, 1294-1303 [Abstract/Free Full Text]
  26. Takeda, Y., Sarai, A, and Rivera, V. M. (1989) Proc. Natl. Acad. Sci. U. S. A. 86, 439-443 [Abstract]
  27. Sarai, A., and Takeda, Y. (1989) Proc. Natl. Acad. Sci. U. S. A. 86, 6513-6517 [Abstract]
  28. Nishina, Y., Nakagoshi, H., Imamoto, F., Gonda, T. J., and Ishii, S. (1989) Nucleic Acids Res. 17, 107-117 [Abstract]
  29. Higuchi, R. (1989) in PCR Technology (Erlich, H. A., ed), pp. 61-88, Stockton Press, New York
  30. Studier, F. W., Rosenberg, A. H., Dunn, J. J., and Dubendorff, J. W. (1990) Methods Enzymol. 185, 60-89 [Medline] [Order article via Infotrieve]
  31. Nakagoshi, H., Nagase, T., Kanei-Ishii, C., Ueno, Y., and Ishii, S. (1990) J. Biol. Chem. 265, 3479-3483 [Abstract/Free Full Text]
  32. Riggs, A. D., Suzuki, H., and Bourgeois, S. (1970) J. Mol. Biol. 48, 67-83 [Medline] [Order article via Infotrieve]
  33. Riggs, A. D., Bourgeois, S., and Cohn, M. (1970) J. Mol. Biol. 53, 401-417 [Medline] [Order article via Infotrieve]
  34. Kim, J. G., Takeda, Y., Matthews, B. W., and Anderson, W. F. (1987) J. Mol. Biol. 196, 149-158 [Medline] [Order article via Infotrieve]
  35. Guehmann, S., Vorbrueggen, G., Kalkbrenner, F., and Moelling, K. (1992) Nucleic Acids Res. 20, 2279-2286 [Abstract]
  36. Myrset, A. H., Bostad, A., Jamin, N., Lirsac, P.-N., Toma, F., and Gabrielsen, O. S. (1993) EMBO J. 12, 4625-4633 [Abstract]
  37. Saikumar, P., Murali, R., and Reddy, E. P. (1990) Proc. Natl. Acad. Sci. U. S. A. 87, 8452-8456 [Abstract]
  38. Gabrielsen, O. S., Sentenac, A., and Fromageot, P. (1991) Science 253, 1140-1143 [Medline] [Order article via Infotrieve]
  39. Frampton, J., Gibson, T. J., Ness, S. A., Doderlein, G., and Graf, T. (1991) Protein Eng. 4, 891-901 [Abstract]
  40. König, P., Giraldo, R., Chapman, L., and Rhodes, D. (1996) Cell 85, 125-136 [Medline] [Order article via Infotrieve]
  41. Babcock, M. S., and Olson, W. K. (1994) J. Mol. Biol. 237, 98-124 [CrossRef][Medline] [Order article via Infotrieve]
  42. Suzuki, M., Yagi, N., and Gerstein, M. (1995) Protein Eng. 8, 329-338 [Abstract]
  43. Schultz, S. C., Shields, G. C., and Steitz, T. A. (1991) Science 253, 1001-1007 [Medline] [Order article via Infotrieve]
  44. Mondragón, A., and Harrison, S. C. (1991) J. Mol. Biol. 219, 321-334 [Medline] [Order article via Infotrieve]
  45. Aggarwal, A. K., Rodgers, D. W., Drottar, M., Ptashne, M., and Harrison, S. C. (1988) Science 242, 899-907 [Medline] [Order article via Infotrieve]
  46. Shimon, L. J. W., and Harrison, S. C. (1993) J. Mol. Biol. 232, 826-838 [CrossRef][Medline] [Order article via Infotrieve]
  47. Rodgers, D. W., and Harrison, S. C. (1993) Structure 1, 227-240 [Medline] [Order article via Infotrieve]
  48. Beamer, L. J., and Pabo, C. O. (1992) J. Mol. Biol. 227, 177-196 [Medline] [Order article via Infotrieve]
  49. Li, T., Stark, M. R., Johnson, A. D., and Wolberger, C. (1995) Science 270, 262-269 [Abstract]
  50. Klemm, J. D., Rould, M. A., Aurora, R., Herr, W., and Pabo, C. O. (1994) Cell 77, 21-32 [Medline] [Order article via Infotrieve]
  51. Kissinger, C. R., Liu, B., Martin-Blanco, E., Kornberg, T. B., and Pabo, C. O. (1990) Cell 63, 579-590 [Medline] [Order article via Infotrieve]
  52. Saikumar, P., Gabriel, J. L., and Reddy, E. P (1994) Oncogene 9, 1279-1287 [Medline] [Order article via Infotrieve]
  53. Calladine, C. R., and Drew, H. R. (1984) J. Mol. Biol. 178, 773-782 [CrossRef][Medline] [Order article via Infotrieve]
  54. Nelson, H. C. M., Finch, J. T., Luisi, B. F., and Klug, A. (1987) Nature 330, 221-226 [CrossRef][Medline] [Order article via Infotrieve]
  55. Calladine, C. R. (1982) J. Mol. Biol. 161, 343-352 [CrossRef][Medline] [Order article via Infotrieve]
  56. Zamyatnin, A. A. (1984) Annu. Rev. Biophys. Bioeng. 13, 145-165 [CrossRef][Medline] [Order article via Infotrieve]
  57. Spolar, R. S., and Record, T., Jr. (1994) Science 263, 777-784 [Medline] [Order article via Infotrieve]
  58. Choo, Y., and Klug, A. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 11163-11167 [Abstract/Free Full Text]
  59. Choo, Y., and Klug, A. (1994) Proc. Natl. Acad. Sci. U. S. A. 91, 11168-11172 [Abstract/Free Full Text]
  60. Reber, E. J., and Pabo, C. O. (1994) Science 263, 671-673 [Medline] [Order article via Infotrieve]

©1997 by The American Society for Biochemistry and Molecular Biology, Inc.