Nonpolar Thymine Isosteres in the Ty3 Polypurine Tract DNA Template Modulate Processing and Provide a Model for Its Recognition by Ty3 Reverse Transcriptase*

Daniela Lener, Mamuka Kvaratskhelia and Stuart F. J. Le Grice {ddagger}

From the Resistance Mechanisms Laboratory, HIV Drug Resistance Program, NCI-Frederick, National Institutes of Health, Frederick, Maryland 21702-1201

Received for publication, March 7, 2003 , and in revised form, April 29, 2003.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Despite diverging in sequence and size, the polypurine tract (PPT) primers of retroviruses and long terminal repeat-containing retrotransposons are accurately processed from (+) U3 RNA and DNA by their cognate reverse transcriptases (RTs). In this paper, we demonstrate that misalignment of the Ty3 retrotransposon RT on the human immunodeficiency virus-1 PPT induces imprecise removal of adjacent (+)-RNA and failure to release (+)-DNA from the primer. Based on these observations, we explored the structural basis of Ty3 PPT recognition by chemically synthesizing RNA/DNA hybrids whose (–)-DNA template was substituted with the non-hydrogen-bonding thymine isostere 2,4-difluoro-5-methylbenzene (F). We observed a consistent spatial correlation between the site of T -> F substitution and enhanced ribonuclease H (RNase H) activity ~12–13 bp downstream. In the most pronounced case, dual T -> F substitution at PPT positions –1/–2 redirects RNase H cleavage almost exclusively to the novel site. The structural features of this unusual base suggest that its insertion into the Ty3 PPT (–)-DNA template weakens the duplex, inducing a destabilization that is recognized by a structural element of Ty3 RT ~12–13 bp from its RNase H catalytic center. A likely candidate for this interaction is the thumb subdomain, whose minor groove binding tract most likely contacts the duplex. The spatial relationship derived from T -> F substitution also infers that Ty3 PPT processing requires recognition of sequences in its immediate 5' vicinity, thereby locating the RNase H catalytic center over the PPT-U3 junction, a notion strengthened by additional mutagenesis studies of this paper.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Although reverse transcriptase (RT)1-associated ribonuclease H (RNase H) activity degrades RNA of the RNA/DNA replication intermediate with little sequence specificity, it must precisely remove the tRNA and polypurine tract (PPT) primers of (–)-strand (1) and (+)-strand DNA synthesis (2), respectively, to generate sequences at the 5' and 3' termini of the double-stranded DNA recognized by the integration machinery (38). Since the PPT is most likely embedded in a considerably larger RNA/DNA hybrid, precise hydrolysis at the PPT-U3 junction observed in vitro (9) suggests unique structural features may participate by correctly positioning this junction in the RNase H catalytic center. Our recent chemical footprinting of HIV-1 PPT-containing RNA/DNA hybrids (10) and a comparison with the crystal structure of HIV-1 RT bound to a related duplex (11) support this notion. Nucleic acid in the RT-RNA/DNA co-crystal is distorted 8–14 bp upstream of the PPT-U3 junction, comprising weakly paired, unpaired, and mispaired bases (Fig. 1A). Subsequent chemical footprinting studies (10) revealed that template thymines of this region and thymine +1 (i.e. immediately 3' to the PPT) deviate from standard Watson-Crick base pairing in the absence of the retroviral enzyme. The finding that these naturally occurring HIV-1 PPT distortions are 10–14 bp apart was particularly intriguing, since this approximates the distance between the thumb subdomain and RNase H catalytic center of the heterodimer-associated p66 subunit (1114). Therefore, it is possible that the A-tract-induced HIV-1 PPT distortion plays a role in sequestering RT via an interaction with structural elements at the base of the p66 thumb, thus positioning the RNase H catalytic center over the PPT-U3 junction for correct processing.



View larger version (38K):
[in this window]
[in a new window]
 
FIG. 1.
A, sequence and structural features of the HIV-1 PPT. Data are taken from Sarafianos et al. (11) and Kvaratskhelia et al. (10). In this figure and throughout this paper, the (+)-RNA sequence is presented in lowercase type. The shaded region represents the portion of the PPT-containing RNA/DNA hybrid observed in the RT-RNA/DNA co-crystal. Weakly paired bases of the template and primer are represented by filled boxes, and unpaired bases are enclosed within circles. The sequence has been extended at the 3'-end to indicate the entire PPT sequence and PPT-U3 junction. Within this expanded sequences, template thymines sensitive to KMnO4 oxidation in the absence of RT are indicated by the arrows. +1 represents the first base of (+)-U3 RNA or DNA, and –1 represents the 3'-terminal base of the PPT primer. B, Ty3 PPT sequence and substrates used in the present study. C, Ty3 PPT processing profiles. PPT processing was evaluated after 1, 2, 5, 10, 30, and 60 min (lanes 1–6, respectively). Lane C, no enzyme. The migration positions of the intact Ty3 PPT-R and Ty3 PPT-D substrates are indicated at the top of the gel, and the correct cleavage at the PPT-U3 junction is indicated as PPT-U3 (–1).

 

To examine whether this hypothesis may account for the precision of PPT processing in related long terminal repeat-containing elements, we focused here on the Saccharomyces cerevisiae retrotransposon Ty3. Differences in the primary structure of the HIV-1 and Ty3 RTs (the former is a p66/p51 heterodimer, whereas Ty3 RT is a 55-kDa monomer) as well as in the sequence of their PPTs make the comparison between these two systems very interesting. Our preliminary analysis of Ty3 RT has indicated that its DNA polymerase and RNase H catalytic centers are separated by ~21 bp (15) rather than the 17 bp observed in HIV-1 RT, suggesting a different spatial arrangement of thumb subdomain and RNase H catalytic center. Moreover, the Ty3 PPT differs in both size (12 bp) and sequence (Fig. 1B) from the HIV-1 counterpart. Whereas Ty3 RT processes its PPT with the appropriate precision in vitro (15, 16), we show here that it fails to remove (+)-DNA and imprecisely processes (+)-RNA 3' to the HIV-1 PPT, suggesting co-evolution of enzyme and substrate.

RNA/DNA hybrids, whose (–)-DNA template was both individually and dually substituted with 2,4-difluoro-5-methylbenzene (F) for thymine, were used here to investigate structural features of the Ty3 PPT, mediating its recognition and processing. 2,4-Difluoro-5-methylbenzene is isosteric with thymine but has severely reduced hydrogen bonding capacity (17). Thus, F is a particularly useful tool to study the role of hydrogen bonding and base structure and has been extensively used to evaluate the fidelity of DNA synthesis (1821). However, to date there have been no studies on the recognition of F-substituted RNA/DNA hybrids. In the present study, T -> F substitutions were designed to introduce flexibility and, possibly, structural changes at different positions of Ty3 PPT-containing RNA/DNA hybrids, without changing the sequence of the primer. We show that subtle alterations to the structure of the Ty3 PPT (+)-RNA/(–)-DNA hybrid reposition the RNase H domain, inducing a novel but highly specific cleavage within the U3 region, 12 nt downstream from the site of F insertion. This suggests that correct processing of the (+)-strand primer may proceed through interaction of a structural subdomain of Ty3 RT with sequences immediately 5' to the PPT and –12 bp from the PPT-U3 junction.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Materials—Ty3 RT was expressed and purified as described (16). Unsubstituted DNA oligonucleotides were obtained from Integrated DNA Technologies (Gaithersburg, MD). 2,4-Difluoro-5-methylbenzene was purchased as the phosphoramidite from Glen Research. The DNA oligonucleotides containing single and pairwise F substitutions were synthesized using the Expedite 8909 automated synthesizer (PerkinElmer Life Sciences). RNA oligonucleotides were purchased from Dharmacon (Boulder, CO). All other reagents were of the highest purity and purchased from Sigma.

PPT Selection—To evaluate HIV-1 PPT selection a 55-nt, (–)-strand DNA template (corresponding to nucleotides 9048–9103 of the HIV-1HXB2 genome) was hybridized to a 30-nt, 5'-end-labeled PPT-containing RNA extended by 10 ribonucleotides (HIV-1/PPT-R) or deoxyribonucleotides (HIV-1/PPT-D) beyond the authentic RNase H cleavage site (Fig. 2A). The primer/template mixture was annealed by heating to 90 °C and slow cooling in 10 mM Tris/HCl (pH 7.6), 25 mM NaCl. A reaction mixture containing 50 nM template-primer was prepared in 10 mM Tris/HCl (pH 7.8), 9 mM MgCl2, 80 mM NaCl, 5 mM dithiothreitol. Hydrolysis was initiated by the addition of enzyme to a final concentration of 150 nM in a volume of 80 µl. The reaction mixture was incubated at 37 °C. Ten-µl aliquots were removed at times indicated and mixed with an equal volume of 89 mM Tris borate, pH 8.3, 2 mM EDTA, and 95% (v/v) formamide containing 0.1% (w/v) bromphenol blue and xylene cyanol. Polymerization products were resolved by high voltage denaturing 15% polyacrylamide gel electrophoresis and visualized by phosphor imaging. Quantification was performed using Quantity One software (Bio-Rad).



View larger version (45K):
[in this window]
[in a new window]
 
FIG. 2.
Processing of the HIV-1 PPT by Ty3 RT. A, substrates used in the study. The PPT sequence is enclosed within the shaded box. B, processing of the HIV-1 PPT-R and HIV-1 PPT-D substrates. The enzyme under investigation is indicated below each panel. C, processing of the HIV-1 PPT-D substrate by Ty3 RT in the context of (+)-strand, DNA-dependent DNA synthesis. PPT processing was evaluated after 1, 2, 5, 10, 30, and 60 min (lanes 1–6, respectively). Lane C, no enzyme. The migration positions of the intact HIV-1 PPT-R and HIV-1 PPT-D substrates, the correctly processed PPT (PPT-U3 (–1)), and the extended U3 DNA have been indicated.

 

For Ty3 PPT selection, a 46-nt, (–)-strand, DNA template (corresponding to nucleotides 4780–4809 of the Ty3 genome) was hybridized to a 29-nt PPT-containing RNA; the 13 nt 3' to the authentic cleavage site were either ribonucleotide (Ty3 PPT-R) or deoxyribonucleotide (Ty3 PPT-D; Fig. 1B). The mixture was annealed as described above, and a reaction mixture containing 50 nM template-primer was prepared in 25 mM Tris/HCl (pH 7.8), 9 mM MgCl2, 80 mM NaCl, 5 mM dithiothreitol. Hydrolysis was initiated by the addition of RT to a final concentration of 150 nM in an 80-µl volume. The reaction mixture was incubated at 30 °C. Ten-µl aliquots were removed at the times indicated and processed as above.

PPT Selection with 2,4-Difluoro-5-methylbenzene-substituted Substrates—Ty3 PPT-R RNA primer (Fig. 3A) was hybridized to 46-nt, (–)-strand DNA templates harboring single or dual F substitutions, as indicated. Conditions for template-primer annealing, PPT processing, and sample evaluation were as described above. For all experiments evaluating PPT selection, the hydrolysis products were processed as described under "PPT Selection."



View larger version (26K):
[in this window]
[in a new window]
 
FIG. 3.
Processing of Ty3 PPT variants harboring dual F substitutions in the (–)-DNA template. A, structure of an A:T and A:F base pair. B, Ty3 substrate Ty3 PPT-R, indicating the site of T -> F substitution. C, PPT hydrolysis profiles. PPT processing was evaluated after 30 s and 1, 2, 5, and 10 min (lanes 1–5). Lanes C, no enzyme. The position of the F insertion into the DNA template is indicated below each panel. The position in the adjacent RNA/DNA hybrid at which enhanced RNase H activity was observed is indicated by the open arrows. The migration positions of the intact Ty3 PPT-R substrate and the correctly processed PPT (PPT-U3 (–1)) have been indicated. D, quantification of PPT processing data. For all substrates, hydrolysis at each base pair of the RNA/DNA hybrid from position –1 to +11 was calculated as a percentage of total starting material. The 2-min time point was selected for quantitation, where the reaction was in the linear range.

 

Modification of RNA/DNA Heteroduplexes with KMnO4KMnO4 sensitivity of Ty3 PPT templates harboring a single +2, or dual –1/–2, –5/–7, or –9/–11 T -> F substitution was evaluated by a modification of the protocol of Kvaratskhelia et al. (10). RNA/DNA hybrids were incubated at room temperature for 5 min in 20 mM Tris-HCl, pH 8.0, 100 mM NaCl, and 100 µM MgCl2. The total reaction volume was 20 µl. Reactions were initiated by adding 2 µl of freshly prepared 25 mM KMnO4 solution and terminated after 30 s with 2 µl of 14 M {beta}-mercaptoethanol. After ethanol precipitation, samples were treated with 100 µl of 1 M piperidine for 30 min at 90 °C. Piperidine was removed by vacuum desiccation. Nucleic acids were washed three times with 50 µl of water and vacuum-dried after each resuspension. Samples were finally resuspended in 89 mM Tris borate, pH 8.3, 2 mM EDTA, and 95% formamide containing 0.1% bromphenol blue and xylene cyanol and analyzed by electrophoresis through 15% denaturing polyacrylamide gels. Modification products were visualized by phosphor imaging.

Circular Dichroism Spectra and Thermal Melting Profiles—Equimolar amounts (~25 µM) of RNA primer were hybridized to +2, –1/–2, –5/–7, and –9/–11 F-substituted 46-nt, (–)-strand PPT-containing DNA templates by heating to 90 °C and slow cooling in degassed 10 mM Na2HPO4/NaH2PO4, pH 7.0, 80 mM NaCl. Circular dichroism spectra were recorded at 30 °C with an AVIV 202 spectrophotometer using a 1-mm path length cuvette. Correction for each spectrum was against the respective buffer-only spectrum. Nucleic acid duplexes were scanned from 190 to 300 nm. For measurement of melting temperatures (Tm), 10 µg/ml solutions of the same substrates were analyzed in a Beckman DU 640 spectrophotometer. E260 was measured at 0.2 °C intervals from 30 to 80 °C. The Tm of each hybrid was calculated by the "first derivative" method described by the manufacturer.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Altered Processing of the HIV-1 PPT by Ty3 RT—A clear difference between the PPTs of Ty3 and more extensively studied retroviruses is the presence of contiguous (rA:dT) and (rG: dC) tracts in the latter, which might provide a structural basis for recognition. This difference prompted us to investigate the manner in which Ty3 RT processes its cognate PPT and that of HIV-1, for which a considerable body of literature is available (5, 79, 2224). Fig. 1C indicated that the Ty3 PPT is accurately released from (+) U3 RNA or DNA by Ty3 RT. We next evaluated HIV-1 PPTs extended at the 3' terminus by either (+)-RNA or (+)-DNA (HIV-1 PPT-R or HIV-1 PPT-D, respectively. Fig. 2A). The hydrolysis pattern obtained with HIV-1 RT on a duplex extended with (+)-RNA (Fig. 2B, i) is similar to the one that we (9) and others (7, 8, 22) have reported, namely preferential cleavage at the PPT-U3 RNA junction and minor cleavage on either side (we define positions –1 and +1 as the bases on each side of the processing site). However, Ty3 RT cleaved this HIV-1 substrate with significantly altered specificity. Hydrolysis occurred at three positions of the adjacent RNA/DNA hybrid, centered around +2, whereas the correct site was virtually uncleaved (Fig. 2B, ii). Because RNase H cleaves RNA at the PPT-(+)-DNA junction, we examined hydrolysis of an HIV-1 PPT extended by (+)-DNA at its 3' terminus. In this case, we can only observe cleavage at the PPT-U3 DNA junction, if specific cleavage occurs, or within the PPT itself. The data of Fig. 2B, iii, again show minimal hydrolysis at the PPT-U3 DNA junction. In Fig. 2C, dNTPs were included to examine hydrolysis of the HIV-1 PPT-(+) DNA duplex in the context of DNA synthesis. The results indicated that, despite efficient polymerization from the HIV-1 PPT-D substrate, the removal of U3 DNA was again impaired, even after a 1-h incubation with Ty3 RT, eliminating the possibility that structural features of the HIV-1 substrate prevented binding of Ty3 RT. Following DNA synthesis, the 3'-OH of the extended primer is located 20 bp from the PPT-U3 junction, which would be ideally situated for Ty3 RNase H-mediated hydrolysis (Ty3 RNase H cleaves RNA substrates around 11 and 21 nt from the extremity that directs binding (15)). Similarly, the 5'-end of the HIV-1 PPT-R does not direct hydrolysis, since the same cleavage pattern was observed using a substrate extended by 15 nt at the 5'-end of the PPT. Therefore, altered processing of HIV-1-PPT-R (Fig. 2B, ii) and lack of cleavage of HIV-1-PPT-D by Ty3 RT (Fig. 2B, iii) must reflect recognition of a structural feature assumed by the HIV-1 PPT. In a similar experiment, Ty3 PPT variants were completely and nonspecifically hydrolyzed by HIV-1 RT (data not shown).

Dual T -> F Substitutions of the Ty3 PPT DNA Template Modulate Cleavage Specificity—The results of Fig. 2 suggest that structural features of the Ty3 PPT may contribute to the specificity of processing. Therefore, we used the base analog 2,4-difluoro-5-methylbenzene (F; Fig. 3A), which is isosteric with thymine but fails to hydrogen-bond with adenine (25). F was substituted for several thymines of the DNA template complementary to the PPT (3'-C-T-C-T-C-T-C-T-C-C-T-T-5'). Such a strategy subtly alters the stability of the PPT-containing heteroduplex, at the site of substitution, and allowed us to determine the impact on both the structure of the duplex and cleavage specificity.

Initially, a series of doubly F-substituted Ty3 PPT RNA/DNA hybrids (Fig. 3A) was examined to determine whether localized destabilization of the nucleic acid duplex affected either the kinetics or specificity of processing. Indeed, adjacent F insertion at template positions –1/–2 had a profound effect, redirecting the RNase H catalytic center primarily over position +10/+11 of the non-PPT RNA/DNA hybrid (Fig. 3C, ii). This repositioning of Ty3 RT was also observed with substrates containing dual –5/–7 (Fig. 3C, iii) or –9/–11 substitutions (Fig. 3C, iv), which enhance cleavage at positions +6 and +3 of the RNA/DNA hybrid, respectively. Although less dramatic than the effect observed by a –1/–2 substitution, phosphor imaging and quantification (Fig. 4D) indicated that F-induced +6 and +3 cleavage is equivalent to or exceeds that at the PPT-U3 junction (Fig. 4C, i). Therefore, cleavage at the PPT-U3 junction is affected differently by adjacent or interrupted T -> F substitutions. Adjacent substitutions create a more pronounced local destabilization that will sequester the majority of the enzyme at the new recognition site. Alternatively, the distortion induced by the presence of two adjacent T -> F substitutions might render the PPT-U3 junction uncleavable. Notably, the combined data of Fig. 4, B and C, also show a constant spatial correlation of 12–13 bp between the site of F insertion and that of enhanced RNase H activity.



View larger version (50K):
[in this window]
[in a new window]
 
FIG. 4.
Characterization of F-substituted Ty3 PPT RNA/DNA hybrids. A, melting temperatures. Data for a single T -> F substitution outside the PPT (+2) has been included as a control. B, CD spectra of doubly F-substituted hybrids. C, KMnO4 sensitivity of template thymines of the Ty3 PPT-containing substrate. The Ty3 PPT-R substrate of Fig. 1B was used for these studies. C, PPT-containing RNA/DNA hybrid lacking T -> F substitution. Lane 1, +2 T -> F; lane 2, –1/–2 T -> F; lane 4, –5/–7 T -> F. Lane 5, –9/–11 T -> F. Template nucleotides +14/+15 of the Ty3 PPT-R substrate are single-stranded, whereas template nucleotide +9 lies within an RNA/DNA hybrid. These two positions thus illustrate thymine reactivity in its single and double-stranded environment, respectively. The bar to the right defines the position of the Ty3 PPT RNA/(–) DNA duplex. KMnO4 reactivity 5' to this duplex again represents template thymines in their single-stranded configuration.

 

Characterization of F-substituted Ty3 PPT RNA/DNA Hybrids—Three independent experiments were performed to evaluate if F insertion affected the Ty3 PPT structure (Fig. 4). Since the lack of hydrogen bonding has been correlated with a substantial drop in the Tm of shorter nucleic acid duplexes (18, 20), we determined the melting temperature of the F-substituted RNA/DNA hybrids. Wild type Ty3 RNA/DNA hybrid had a Tm of 69 °C (Fig. 4A). A single T -> F substitution at position +2 (i.e. outside the PPT-containing duplex) reduced this to 65.6 °C, whereas that of substrates harboring dual substitutions varied from 63.0 to 60.5 °C. Thus, whereas T -> F substitutions had the expected consequences on decreasing duplex stability, the Tm of all RNA/DNA hybrids was considerably higher than the temperature at which PPT processing was evaluated (30 °C). Fig. 4B compares the CD spectrum of each doubly substituted RNA/DNA hybrid to the wild type. The spectra were in good agreement with published data on polypurine-containing RNA/DNA hybrids, which assume an intermediate configuration between A-like and B-like (8, 26, 27). Although minor differences were noted in the peak and trough heights at 277 and 210 nm, respectively, mutant substrates differed minimally from the wild type PPT. Finally, in Fig. 4C, we examined the sensitivity of template thymines to chemical modification by KMnO4 following T -> F substitution. Previously, we successfully applied this strategy to the HIV-1 PPT, illustrating that template thymines +1 and –10 to –15 adopted a distorted structure (10). In contrast, very little KMnO4 sensitivity is observed in the wild type Ty3 PPT (lane C), suggesting the absence of preexisting structural perturbations. However, the possibility that these might be induced following enzyme binding could not be excluded. Furthermore, the single +2 F (Fig. 4C, lane 1) or dual –1/–2, –5/–7, and –9/–11 T -> F substitutions (Fig. 4C, lanes 2–4, respectively) were accommodated without altering the structure of neighboring A:T base pairs. Since F is insensitive to KMnO4 oxidation, this prohibited any direct evaluation on the structure of the dF:rA pair. The combined data of Fig. 4 therefore provide a strong argument that T -> F substitution within or adjacent to the Ty3 PPT is not accompanied by global changes in structure but rather a subtle and localized alteration in hydrogen bonding.

Single T -> F Substitutions of the Ty3 PPT RNA/DNA Hybrid—To conduct a more detailed analysis of Ty3 PPT architecture, a second series of RNA/DNA hybrids was prepared containing single T -> F substitutions from positions –1to –11 of the (–) DNA template (Fig. 5A). In each case, processing at the PPT-U3 RNA junction was preserved (Fig. 5B) but was accompanied by an alteration in specificity that again directed cleavage ~12 bp downstream. Introducing F as template nucleotide –1 enhanced cleavage at position +10, and to a lesser extent at position +11 (Fig. 5, B and C, ii). This substitution also decreased cleavage at the PPT-U3 RNA junction and increased cleavage at position +2 and +3 (Fig. 5, B and C, compare i with ii). A –2 substitution likewise resulted in significantly increased cleavage at position +10 without affecting the physiological cleavage site. These results suggest that Ty3 RT partitions between the correct recognition site and a second artificially induced as a consequence of F insertion. A –1 F substitution also promoted increased cleavage at several positions downstream from the junction. (Fig. 5B, ii). The extent of +10 cleavage in the case of both monosubstituted substrates exceeded that at the authentic junction, and was comparable with what was observed with a dual –1/–2 substitution. Introducing T -> F substitutions at the 5'-end of the (–)-DNA template (positions –9 and –11) had a similar effect, inducing RNase H activity ~12–13 bp from the site of insertion (Fig. 5B, iv and v). Although –5 and –7 T -> F substitutions yielded a similar result, the alteration in cleavage specificity was not as pronounced (Fig. 5C, iv and v). These data provide additional support that T -> F substitution of the PPT (–) DNA template induces a subtle alteration in duplex architecture and promotes the interaction with a structural component of Ty3 RT 12–13 bp from the RNase H active site. Moreover, the gradual increase in RNase H activity on the non-PPT RNA/DNA hybrid as the position of F substitution approaches the 3'-end of the (–)-DNA template suggests that regions immediately upstream of the PPT may work in concert with the F-induced destabilization.



View larger version (49K):
[in this window]
[in a new window]
 
FIG. 5.
Processing of Ty3 PPT substrates harboring single T -> F substitutions. A, Ty3 PPT-R substrate, indicating the sites of substitution. B, evaluation of F-substituted PPT substrates. The position of the T -> F substitution is indicated below each panel. Lane notations and incubation times are as in the legend to Fig. 4. For each T -> F substitution of the (–)-DNA template, the position of enhanced RNase H activity is indicated by the open arrow. C, quantification of PPT processing data. For all substrates, hydrolysis at each base pair of the RNA/DNA hybrid from position –1 to +11 was calculated as a percentage of total starting material. The 2-min time point was selected for quantitation.

 

Altering Sequences 5' of the Ty3 PPT Affects Processing— Taken together, the T -> F substitution data of Figs. 3 and 5 imply that this non-hydrogen-bonding thymine isostere introduces flexibility and possibly localized structural changes in the RNA/DNA hybrid. This may serve as a determinant for Ty3 RT binding that directs RNase H cleavage ~12–13 bp from the site of substitution. However, it is important to consider these observations with respect to the features of the wild type PPT-containing duplex that sequester Ty3 RT and result in correct cleavage at the PPT-U3 junction.

Since the Ty3 PPT/(–) DNA hybrid is 12 bp (Fig. 1B), our data suggest that initial sequestration of the retrotransposon polymerase is mediated by sequences adjacent to the 5'-end of the polypurine tract. This possibility was tested by introducing two alterations immediately 5' of the Ty3 PPT, changing the sequence from 5'-rC-rC-rC-rU-3' to either 5'-rC-rU-rC-rU-3' (mutant C -> U) or 5'-rU-rU-rU-rU-3' (mutant 3C -> 3U). The latter sequence closely resembles the U-rich region 5' of the HIV-1 PPT, which has been shown to play a role in its utilization (31, 32). To control for potential destabilization of the duplex resulting from C -> U substitutions, each RNA primer was extended by an additional 7 nucleotides at its 5' terminus (Fig. 6A). As a consequence, the 5' flanking region of the PPT served as an additional substrate for Ty3 RNase H, resulting in precise cleavage at both the 5' and 3' PPT junctions (Fig. 6B, i). This was expected, because our previous data on HIV-1 and Ty3 PPT selection indicated that they are accurately processed at both the 5' and 3' termini from within a larger RNA/DNA hybrid (9, 16). A single base pair substitution, altering the sequence to 5'-rC-rU-rC-rU-3', resulted in slightly increased 5' processing (Fig. 6B, ii). However, altering the sequence upstream of the Ty3 PPT to 5'-rU-rU-rU-rU-3' was accompanied by a significant increase in cleavage at the 5' terminus of the PPT (Fig. 6B, iii). To more accurately assess the effect of these mutations and eliminate the possibility that 3' cleavage events had been obscured, we also radiolabeled the RNA substrates at their 3'-end. RNase H hydrolysis products obtained after 30 s were quantified. The results (Fig. 6C) indicated that cleavage at the PPT-U3 junction of mutant 3C -> 3U represented only ~30% of total cleavage at the PPT termini, compared with ~80% for the WT. This significant change in cleavage specificity suggests that the accuracy and extent of Ty3 PPT processing are influenced by sequences adjacent to its 5'-end.



View larger version (35K):
[in this window]
[in a new window]
 
FIG. 6.
Processing of Ty3 PPT variants altered in their 5' regions. A, sequence of wild type and mutant RNA/DNA hybrids. The PPT is represented within the shaded box. The enclosed and shaded box immediately upstream depicts sequence changes corresponding to the C -> U and 3C -> 3U mutants. B, PPT processing profiles. The nature of the PPT mutant is indicated below each panel. Lane notation is as described in the legend to Fig. 4. C, quantification of PPT processing data after 30 s of incubation. Cleavage at the PPT-U3 junction was quantified as a percentage of total hydrolysis at the 3'- and 5'-ends of the PPT. Hydrolysis at positions –13 to –15 were considered cleavage at the 5'-end of the PPT. In this experiment, the RNA primer was labeled at its 3' terminus.

 


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
The molecular mechanism through which the PPT is recognized and specifically processed remains elusive, despite numerous studies describing the effects of altering either nucleic acid sequence (4, 7, 8, 22) or the structural motifs of RT (23, 24, 28). This is of particular importance, since the fidelity with which the PPT is processed has significant bearing on subsequent steps in replication. Previous studies with HIV-1 and equine infectious anemia virus (9) showed their PPTs were faithfully selected by RT when embedded within a considerably larger DNA/RNA hybrid (~120 bp) (i.e. where nucleic acid termini cannot influence positioning of RT). These and other observations (7, 8) implicate the PPT as an active participant in its recognition and processing. Furthermore, despite differences in nucleic acid sequence and RT architecture, a universal mechanism for PPT selection by the cognate RT might be expected. In this paper, the local flexibility of the Ty3 PPT was varied to determine its effect on (+)-strand primer removal. Introducing the non-hydrogen-bonding thymine isostere, F, consistently induces novel RNase H-mediated cleavage 12–13 bp from the site of substitution. For example, the dual –1/–2 T -> F substitution redirects the RNase H domain almost exclusively to the novel cleavage site (Fig. 3D, ii). Additional mutagenesis studies indicate that altering sequences immediately 5' to the Ty3 PPT (and ~13 bp from the PPT-U3 junction) significantly alters the balance of 5' and 3' processing (Fig. 6). Finally, Ty3 RT hydrolyzes the HIV-1 PPT-R substrate ~13 bp downstream from a region demonstrated by chemical footprinting and x-ray crystallography to contain weakly paired, mispaired, and unpaired bases (10, 11). These independent lines of evidence suggest the coordinated action of two regions of Ty3 RT during PPT selection: the RNase H domain, which must be positioned at the PPT-U3 junction, and a structural motif within or adjacent to the DNA polymerase domain, which interacts with nucleic acid ~13 bp upstream of the processing site.

Although supporting structural evidence for this Ty3 RT motif is presently unavailable, it is possible to extrapolate from crystallographic data for the HIV-1 enzyme (1113) to infer the region of the retrotransposon polymerase that interacts with nucleic acid ~13 bp from its RNase H catalytic center. In all nucleic acid-containing structures of HIV-1 RT, extensive contacts are made between the base of the p66 thumb subdomain and the substrate 3–7 bp behind from the DNA polymerase catalytic center. The RNase H catalytic center of HIV-1 RT must therefore be 10–14 bp downstream from the base of the thumb subdomain. In particular, helix {alpha}H of the thumb is partially embedded within the minor groove of double-stranded DNA. Gly262, Lys263, and Trp266 of helix {alpha}H are part of the minor groove binding track, a motif implicated in correct tracking of the enzyme over nucleic acid (29). Mutagenesis studies suggest that this motif could function as a "sensor" of duplex configuration, detecting base pair alterations introduced by lesions (30). Since crystallographic (11) and chemical footprinting data (10) have both identified distortions within the HIV-1 PPT, recognition of this structure by the minor groove binding track is a plausible mechanism that helps position the RNase H catalytic center directly over the PPT-U3 junction. Indeed, mutations in the minor groove binding track have been correlated with altered specificity of HIV-1 PPT processing (31). It is very likely that Ty3 RT also has a thumb subdomain and a minor groove binding track. This may be the structural element of Ty3 RT that interacts with the F-induced structural perturbation. Indeed, secondary structure analysis has identified a putative thumb subdomain for Ty3 RT and a motif related to the HIV-1 minor groove binding tract.2 In fact, the data of Figs. 3 and 5 indicate that whereas T -> F substitutions are not associated with major structural distortions, the local perturbations they introduce are sufficient to sequester a Ty3 enzyme "scanning" the PPT-containing RNA/DNA hybrid and induce cleavage 12–13 bp downstream. This hypothesis also explains the hydrolysis profile obtained by Ty3 RT on the HIV-1 PPT. Positioning of the Ty3 RNase H catalytic active site over positions +2to +4 (Fig. 2B) locates the thumb subdomain 12–13 bp upstream, around positions –10 to –11. Our chemical footprinting analysis (10) identified distortions between positions –10 and –15 (Fig. 1A). Recognition of such distortion by the putative thumb subdomain of Ty3 RT is consistent with recognition of the T -> F-induced local structural destabilization.

Data of Fig. 6 implicate a cis-acting element (i.e. the short dG:rC block immediately upstream of the (+) strand primer) in Ty3 PPT recognition. Regions 5' of the PPTs of murine leukemia (31) and simian immunodeficiency virus (32) have also been shown to control the efficiency and accuracy with which they are processed. Likewise, in vivo studies with another long terminal repeat-containing retrotransposon, Ty1, have shown that an A:T-rich region immediately upstream of the PPT participates in its selection (33). Thus, despite the absence of sequence homology between these long terminal repeat-containing elements, features of the RNA/DNA hybrid 5' to the Ty3 PPT appear to influence its selection. Although we show here that alteration of this upstream region is associated with changes in PPT processing, the underlying structural basis is not clear. However, since G:C tracts are associated with major groove compression (34), it is possible that subtle differences in groove width immediately preceding the Ty3 PPT serve to "lock" the RT thumb in position, thus ensuring that the RNase H domain is correctly positioned over the biologically relevant processing site. Initial NMR studies with a Ty3 PPT-containing RNA/DNA hybrid in the absence of Ty3 RT have suggested that it may adopt an unusual configuration,3 which would support our postulation. Although our work has exploited the hydrogen-bonding isostere F, several alternative modified nucleosides are now available to better understand the molecular basis of PPT and tRNA primer processing and selection in HIV-1 and Ty3, including 2-aminopurine, 2,6-diaminopurine, purine riboside, and the non-hydrogen-bonding cytosine analog 2-fluoro-4-methylbenzene (35). The latter analog, in combination with F, has been used to probe the structure of the HIV-1 PPT and elucidate the molecular basis of its selection.4 These studies show that introducing F or 2-fluoro-4-methylbenzene into the HIV-1 PPT induces novel cleavage 3–4 nt downstream the insertion site instead of the 12 nt observed for Ty3, suggesting that the molecular bases for Ty3 and HIV-1 PPT selection are different.


    FOOTNOTES
 
* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

{ddagger} To whom correspondence should be addressed. Tel.: 301-846-5256; Fax: 301-846-6013; E-mail: slegrice{at}ncifcrf.gov.

1 The abbreviations used are: RT, reverse transcriptase; RNase H, ribonuclease H; PPT, polypurine tract; HIV, human immunodeficiency virus; F, 2,4-difluoro-5-methylbenzene; PPT-R, 5'-end-labeled PPT-containing RNA extended by 10 ribonucleotides; PPT-D, 5'-end-labeled PPT-containing RNA extended by 10 deoxyribonucleotides; WT, wild type; nt, nucleotide(s). Back

2 D. Lener and S. Le Grice, unpublished observations. Back

3 J. Marino, personal communication. Back

4 J. W. Rausch, J. Qu, H.-Y. Yi-Brunnozzi, E. T. Kool, and S. F. J. Le Grice, submitted for publication. Back


    ACKNOWLEDGMENTS
 
We thank M. Powers (SAIC Frederick) for synthesis of F-substituted DNA oligonucleotides and S. Tarasov (Structural Biophysics Laboratory, NCI-Frederick) for technical assistance and helpful discussions on CD measurements. We also thank G. J. Klarmann for useful suggestions and helpful discussions of the data. We also acknowledge the NCI, National Institutes of Health, CCR Fellows Editorial Board for editing the manuscript.



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Taylor, J. M., and Illmensee, R. (1975) J. Virol. 16, 553–558[Medline] [Order article via Infotrieve]
  2. Sorge, J., and Hughes, S. H. (1982) J. Virol. 43, 482–488[Medline] [Order article via Infotrieve]
  3. Omer, C. A., and Faras, A. J. (1982) Cell 30, 797–805[Medline] [Order article via Infotrieve]
  4. Rattray, A. J., and Champoux, J. J. (1989) J. Mol. Biol. 208, 445–456[Medline] [Order article via Infotrieve]
  5. Pullen, K. A., Ishimoto, L. K., and Champoux, J. J. (1992) J. Virol. 66, 367–373[Abstract]
  6. Smith, J. S., and Roth, M. J. (1992) J. Biol. Chem. 267, 15071–15079[Abstract/Free Full Text]
  7. Pullen, K. A., Rattray, A. J., and Champoux, J. J. (1993) J. Biol. Chem. 268, 6221–6227[Abstract/Free Full Text]
  8. Powell, M. D., and Levin, J. G. (1996) J. Virol. 70, 5288–5296[Abstract]
  9. Rausch, J. W., and Le Grice, S. F. (1997) J. Biol. Chem. 272, 8602–8610[Abstract/Free Full Text]
  10. Kvaratskhelia, M., Budihas, S. R., and Le Grice, S. F. (2002) J. Biol. Chem. 277, 16689–16696[Abstract/Free Full Text]
  11. Sarafianos, S. G., Das, K., Tantillo, C., Clark, A. D., Jr., Ding, J., Whitcomb, J. M., Boyer, P. L., Hughes, S. H., and Arnold, E. (2001) EMBO J. 20, 1449–1461[Abstract/Free Full Text]
  12. Huang, H., Chopra, R., Verdine, G. L., and Harrison, S. C. (1998) Science 282, 1669–1675[Abstract/Free Full Text]
  13. Jacobo-Molina, A., Ding, J., Nanni, R. G., Clark, A. D., Jr., Lu, X., Tantillo, C., Williams, R. L., Kamer, G., Ferris, A. L., Clark, P., Hizi, A., Hughes, S. H., and Arnold, E. (1993) Proc. Natl. Acad. Sci. U. S. A. 90, 6320–6324[Abstract]
  14. Kohlstaedt, L. A., and Steitz, T. A. (1992) Proc. Natl. Acad. Sci. U. S. A. 89, 9652–9656[Abstract]
  15. Rausch, J. W., Grice, M. K., Henrietta, M., Nymark, M., Miller, J. T., and Le Grice, S. F. (2000) J. Biol. Chem. 275, 13879–13887[Abstract/Free Full Text]
  16. Lener, D., Budihas, S. R., and Le Grice, S. F. (2002) J. Biol. Chem. 277, 26486–26495[Abstract/Free Full Text]
  17. Guckian, K. M., Krugh, T. R., and Kool, E. T. (1998) Nat. Struct. Biol. 5, 954–959[CrossRef][Medline] [Order article via Infotrieve]
  18. Moran, S., Ren, R. X., and Kool, E. T. (1997) Proc. Natl. Acad. Sci. U. S. A. 94, 10506–10511[Abstract/Free Full Text]
  19. Barsky, D., Kool, E. T., and Colvin, M. E. (1999) J. Biomol. Struct. Dyn. 16, 1119–1134[Medline] [Order article via Infotrieve]
  20. Morales, J. C., and Kool, E. T. (2000) Biochemistry 39, 2626–2632[CrossRef][Medline] [Order article via Infotrieve]
  21. Dzantiev, L., Alekseyev, Y. O., Morales, J. C., Kool, E. T., and Romano, L. J. (2001) Biochemistry 40, 3215–3221[CrossRef][Medline] [Order article via Infotrieve]
  22. Wohrl, B. M., and Moelling, K. (1990) Biochemistry 29, 10141–10147[Medline] [Order article via Infotrieve]
  23. Powell, M. D., Beard, W. A., Bebenek, K., Howard, K. J., Le Grice, S. F., Darden, T. A., Kunkel, T. A., Wilson, S. H., and Levin, J. G. (1999) J. Biol. Chem. 274, 19885–19893[Abstract/Free Full Text]
  24. Rausch, J. W., Lener, D., Miller, J. T., Julias, J. G., Hughes, S. H., and Le Grice, S. F. (2002) Biochemistry 41, 4856–4865[CrossRef][Medline] [Order article via Infotrieve]
  25. Kool, E. T. (2001) Annu. Rev. Biophys. Biomol. Struct. 30, 1–22[CrossRef][Medline] [Order article via Infotrieve]
  26. Hung, S. H., Yu, Q., Gray, D. M., and Ratliff, R. L. (1994) Nucleic Acids Res. 22, 4326–4334[Abstract]
  27. Ratmeyer, L., Vinayak, R., Zhong, Y. Y., Zon, G., and Wilson, W. D. (1994) Biochemistry 33, 5298–5304[Medline] [Order article via Infotrieve]
  28. Ghosh, M., Williams, J., Powell, M. D., Levin, J. G., and Le Grice, S. F. (1997) Biochemistry 36, 5758–5768[CrossRef][Medline] [Order article via Infotrieve]
  29. Bebenek, K., Beard, W. A., Darden, T. A., Li, L., Prasad, R., Luton, B. A., Gorenstein, D. G., Wilson, S. H., and Kunkel, T. A. (1997) Nat. Struct. Biol. 4, 194–197[Medline] [Order article via Infotrieve]
  30. Latham, G. J., Forgacs, E., Beard, W. A., Prasad, R., Bebenek, K., Kunkel, T. A., Wilson, S. H., and Lloyd, R. S. (2000) J. Biol. Chem. 275, 15025–15033[Abstract/Free Full Text]
  31. Bacharach, E., Gonsky, J., Lim, D., and Goff, S. P. (2000) J. Virol. 74, 4755–4764[Abstract/Free Full Text]
  32. Ilyinskii, P. O., and Desrosiers, R. C. (1998) EMBO J. 17, 3766–3774[Abstract/Free Full Text]
  33. Wilhelm, M., Heyman, T., Boutabout, M., and Wilhelm, F. X. (1999) Nucleic Acids Res. 27, 4547–4552[Abstract/Free Full Text]
  34. Nussinov, R. (1991) Comput. Appl. Biosci. 7, 295–299[Abstract]
  35. Kool, E. T. (2002) Annu. Rev. Biochem. 71, 191–219[CrossRef][Medline] [Order article via Infotrieve]