Intramolecular Higher Order Packing of Parallel Quadruplexes Comprising a G:G:G:G Tetrad and a G(:A):G(:A):G(:A):G Heptad of GGA Triplet Repeat DNA*

Akimasa Matsugami {ddagger}, Tomoyuki Okuizumi {ddagger}, Seiichi Uesugi {ddagger} and Masato Katahira {ddagger} § 

From the {ddagger}Department of Environment and Natural Sciences, Graduate School of Environment and Information Sciences, Yokohama National University, 79-7 Tokiwadai, Hodogaya-ku, Yokohama 240-8501, Japan and the §RIKEN Genomic Sciences Center, Tsurumi, Yokohama 230-0045, Japan

Received for publication, April 9, 2003


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
GGA triplet repeats are widely dispersed throughout eukaryotic genomes and are frequently located within biologically important regions such as gene regulatory regions and recombination hot spot sites. We determined the structure of d(GGA)4 (12-mer) under physiological conditions and founded the formation of an intramolecular parallel quadruplex for the first time. Later, a similar architecture to that of the intramolecular parallel quadruplex was found for a telomere DNA in the crystalline state. Here, we have determined the structure of d(GGA)8 (24-mer) under physiological conditions. Two intramolecular parallel quadruplexes comprising a G:G:G:G tetrad and a G(:A):G(:A):G(:A):G heptad are formed in d(GGA)8. These quadruplexes are packed in a tail-to-tail manner. This is the first demonstration of the intramolecular higher order packing of quadruplexes at atomic resolution. K+ ions, but not Na+ ones, are critically required for the formation of this unique structure. The elucidated structure suggests the mechanisms underlying the biological events related to the GGA triplet repeat. Furthermore, in the light of the structure, the mode of the higher order packing of the telomere DNA is discussed.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Several kinds of triplet repeats are found in the human genome. A link to the occurrence of a certain disease has been established for some of triplet repeats. The CCG, CTG, CAG, and GAA repeats are linked to fragile X syndrome, myotonic dystrophy, Huntington's disease, and Friedreich ataxia, respectively (14). It is suggested that these repeats have unusual structures and that the unusual structures cause the extraordinary expansion of the repeats related to the occurrence of the diseases (5, 6). The GGA triplet repeat is widely dispersed throughout eukaryotic genomes (7). The GGA repeat has been identified in portions of human and mouse cellular DNA that cross-hybridize with the internal direct repeat (IR3) repetitive region of Epstein-Barr virus (8). The GGA repeat, together with the GAA repeat, has also been found in micro satellite DNA belonging to the rat polymeric immunoglobulin receptor gene (9). A fragment of the microsatellite DNA containing both the repeats was suggested to attenuate gene expression at the transcriptional and post-transcriptional levels (10). Moreover, the GGA repeat has been identified in various sequences ranging from that of the mouse WASP gene (11), which is a homologue of the gene mutated in the Wiskott-Aldrich syndrome, to regulatory elements governing cell type-specific expression of neural cell adhesion molecule genes (12). The GGA repeat is frequently located within gene regulatory regions and recombination hot spot sites (13). Thus, the biological significance of the GGA repeat is widely recognized, although the link to a certain disease has not yet been clarified. The GGA repeat is capable of forming variable structures (1418).

We determined the structure of d(GGAGGAGGAGGA) (d(GGA)4) composed of four tandem GGA units under physiological conditions (19). d(GGA)4 folds into an intramolecular quadruplex composed of a G:G:G:G tetrad and a G(:A):G(:A): G(:A):G heptad. Four G-G segments of d(GGA)4 are aligned parallel to each other due to seven successive turns of the main chain at each of the GGA and GAGG segments. This was the first demonstration that DNA can form an intramolecular parallel quadruplex. Later, a similar architecture to that of the intramolecular parallel quadruplex was found for the telomere DNA in the crystalline state (20). We also showed that two quadruplexes of d(GGA)4 form a dimer stabilized through the stacking interaction between the heptads of the two quadruplexes.

Our findings as to GGA triplet repeat DNA, together with the crystallographic result for telomere DNA, indicated the possibility of intramolecular higher order packing of quadruplexes for longer DNA with certain repeating units. To address this point and to elucidate the character of naturally occurring GGA repeat DNA, we studied the structure of d(GGAGGAGGAGGAGGAGGAGGAGGA) (d(GGA)8) under physiological conditions. Here, we present its unique structure. This is the first demonstration of higher order packing of quadruplexes at atomic resolution. The biological events related to GGA triplet repeat DNA can be rationalized in the light of the elucidated structure. Furthermore, our findings provide a support for the hypothetical higher order packing of quadruplexes of telomere DNA and indicate the mode of packing for it.


    EXPERIMENTAL PROCEDURES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
Sample Preparation—d(GGA)8 and mutant oligomers in which each single G residue was replaced by an I residue were prepared as described previously (19). DNA was dissolved in 10 mM sodium phosphate buffer (pH 6.7) containing 3 mM NaN3 and either 0 or 1–50 mM KCl. The DNA concentrations were 1–40 µM for CD and 0.1–1 mM for NMR. 2,2-Dimethyl-2-silapentane-5-sulfonate was used as an internal chemical shift reference. Samples were heated at 95 °C for 5 min, followed by gradual cooling to room temperature prior to the measurements.

CD and NMR Spectroscopies—CD spectra and thermal CD melting curves were recorded with a Jasco J-720 spectropolarimeter. The temperature of the solution was raised from 1 to 95 °C at the rate of 1 °C/min. Changes in CD intensity were monitored at 263 nm. The melting temperature was determined by use of the derivative of the melting curve.

NMR spectra were recorded with Bruker DRX600 and DRX800 spectrometers equipped with a quadruple-resonance probe with X, Y, and Z gradients. The following NMR experiments were used to assign the resonances and to obtain distance and dihedral angle constraints; NOESY,1 TOCSY, 31P-decoupled DQF-COSY, 1H-13C HSQC, 1H-15N HSQC, and 1H-31P HetCor. Spectra were processed with XWIN-NMR (Bruker), NMRPipe (21), and Capp/Pipp/Stapp (22).

Distance and Dihedral Angle Constraints—Interproton distances were calculated from NOESY spectra with mixing times of 50 and 200 ms as described previously (19). In total, 750 distance constraints were obtained.

Dihedral angle constraints for the {beta} torsion angle were derived from 3JH5'-P and 3JH5''-P couplings as described previously (19, 23, 24). Thus, the {beta} torsion angles of G2, G5, G8, G14, G17, and G20 were constrained to 180 ± 20°. For the other residues, {beta} was left unconstrained. Dihedral angle constraints for the {gamma} torsion angles were derived from 3JH4'-H5' and 3JH4'-H5'' couplings as described previously (19, 23, 24). Thus, the {gamma} torsion angles of G2, G4, G5, G7, G8, G10, G14, G16, G17, G19, G20, and G22 were constrained to 60 ± 20°. For the other residues, {gamma} was left unconstrained.

Dihedral angle constraints for the {delta} and endocyclic {nu}0, {nu}1, {nu}2, {nu}3, and {nu}4 torsion angles were derived from 3JH1'-H2', 3JH2''-H3', and 3JH3'-H4' couplings as described previously (19, 2325). The sugar pucker was determined as follows on the basis of the results of analysis of the intensities of the H1'-H2', H2''-H3', and H3'-H4' cross-peaks in the DQF-COSY spectrum: all G residues, except for G13, and A12, C1'-exo conformation; all A residues except for A12 and A24, C3'-endo conformation; A24, O4'-endo conformation; G13, not determined due to overlapping of resonances. The sugar conformations were confirmed independently by the C3' and C4' chemical shift values (23, 24, 26). The {delta} and endocyclic {nu}0{nu}4 torsion angles of all G, except for G13, and A12 residues were moderately constrained, leaving the sugar free to take any conformation without an energy penalty between O4'-endo and C2'-endo including C1'-exo in the pseudorotation cycle. In the same way, those of all A residues, except for A12 and A24, were constrained between C2'-exo and C4'-exo, including C3'-endo, and those of A24 between C4'-exo and C1'-exo, including O4'-endo. That of G13 was not constrained.

Dihedral angle constraints for the {epsilon} torsion angle were derived from 3JH3'-P coupling as described previously (19, 23, 24). Thus, the {epsilon} torsion angles of all residues, except for A24, were constrained to –120 ± 45°. For A24, {epsilon} was left unconstrained. No dihedral angle constraints were used for the {alpha} and {zeta} torsion angles.

Structure Calculation—Structure calculations were carried out using distance and dihedral angle constraints with a simulated annealing protocol supplied with X-PLOR, version 3.8 (27). Neither hydrogen bonding constraints nor planarity constraints were included. Twenty final structures were selected from 100 calculations on the basis of the criteria of the smallest residual energy values. None of them violated the distance constraints by more than 0.3 Å or the dihedral angle constraints by more than 6°. The structures were viewed with Insight II (MSI).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
K+-induced Ordered Structure of d(GGA)8In the absence of K+, several broad imino proton signals were observed for d(GGA)8 (Fig. 1A). This indicates that although d(GGA)8 seems to form some structure in the absence of K+, it is rather unstable. In contrast, 16 sharp signals appeared in the presence of K+ (Fig. 1B), indicating the formation of a certain stable ordered structure. The chemical shift range of 10.7–11.9 ppm for these signals suggests that the structure formed is a G:G:G:G quartet-based quadruplex. The sharp signals did not appear when Na+ was added instead of K+ (Fig. 1C). Thus, K+ is critically required for the formation of this stable structure. This is notable from a physiological point of view, because the K+ concentration, but not the Na+ one, is high, over 100 mM, in the nucleus.



View larger version (16K):
[in this window]
[in a new window]
 
FIG. 1.
Imino proton and CD spectra of d(GGA)8. Imino proton spectra of d(GGA)8, 1 mM, in H2O containing 10 mM sodium phosphate (pH 6.7) and 3 mM NaN3 at 25 °C (A), and those with either 30 mM KCl (B) or 30 mM NaCl (C). CD spectra of d(GGA)8,40 µM, in H2O containing 10 mM sodium phosphate (pH 6.5), 3 mM NaN3, and 100 mM KCl at 5 °C.

 

It is known that a parallel quadruplex gives a positive CD band at 260 nm, while an antiparallel one gives a positive CD band at 295 nm either with or without a positive CD band at 260 nm (28, 29). The CD spectrum of d(GGA)8 in the presence of K+ had a positive CD band at around 260 nm without a positive CD band at around 295 nm (Fig. 1D). This suggests that d(GGA)8 forms a parallel quadruplex. The melting temperature of the structure formed by d(GGA)8 in the presence of K+ was determined to be rather high, 86 °C. This again suggests the formation of a quadruplex, because the melting temperature of a quadruplex is generally very high.

It was also found that the melting temperature does not depend on the concentration of d(GGA)8. This indicates that the structure formed is unimolecular. This conclusion was supported by the result of analysis of the line widths of NMR resonances that are sensitive to molecular weight. The line widths for d(GGA)8 (24-mer) were the same as those for d(GGA)4 (12-mer), which exists as a dimer in solution (19).

Resonance Assignments—The non-exchangeable 1H, 13C, and 31P resonances of d(GGA)8 were assigned as described previously (19) using standard methods (23, 30). As an example, Fig. 2A shows expansion of the NOESY spectrum allowing the sequential assignments of H1' and H6/H8 through H1'(i – 1)-H6/H8(i)-H1'(i) connectivities. H2', H2'', H3', H4', and H5'/H5'' resonances were assigned by means of TOCSY and DQF-COSY spectra. These assignments were confirmed by the sequential H3'(i 1)-P(i)-H4'/H5'/H5''(i) connectivities in the 1H-31P HetCor spectrum, the assignments of 31P resonances being made at the same time (data not shown). The chemical shifts of 13C resonances were obtained from the 1H-13C HSQC spectrum. Exchangeable 1H resonances were assigned as described previously with the use of mutant oligomers in which each single G residue was replaced by an I residue (19).



View larger version (18K):
[in this window]
[in a new window]
 
FIG. 2.
NOESY spectra. A, H6/H8-H1' region of the NOESY spectrum with a mixing time of 200 ms for d(GGA)8 in 2H2O containing 10 mM sodium phosphate (pH 6.7), 3 mM NaN3, and 30 mM KCl at 25 °C. The lines show the H1'(i – 1)-H6/H8(i)-H1'(i) connectivities, the intraresidue cross-peaks being labeled. Cross-peaks a–j are as follows: a, A3H2-A21H1'; b, A6H2-A18H1'; c, A9H2-A15H1'; d, A15H2-A9H1'; e, A18H2-A6H1'; f, A21H2-A3H1'; g, G1H8-G22H1'; h, G7H8-G16H1'; i, G13H8-G10H1'; j, G19H8-G4H1'. Expansion of the NOESY spectrum in H2O at 2 °C, indicating NOEs originating from GNH2 (B) and ANH2 (C). Cross-peaks in B are as follows: a and a', G1NH2-G4H8; b and b', G4NH2-G7H8; c, G7NH2-G10H8; d, G10NH2-G1H8; e, G13NH2-G16H8; f, G16NH2-G19H8; g, G19NH2-G22H8; h, G22NH2-G13H8; a* and a#, G1NH2-A3H8; b* and b#, G4NH2-A6H8; c*, G7NH2-A9H8; e*, G13NH2-A15H8; f*, G16NH2-A18H8; g*, G19NH2-A21H8; i, i', i*, and i#, G1NH2-A3NH2; j, j', j*, and j#, G4NH2-A6NH2; k, G7NH2-A9NH2; l, G13NH2-A15NH2; m and m', G16NH2-A18NH2; n, n', and n*, G19NH2-A21NH2. Cross-peaks in C are as follows: a and a', A3NH2-G1H1'; b and b', A6NH2-G4H1'; c and c', A9NH2-G7H1'; d, A15NH2-G13H1'; e and e', A18NH2-G16H1'; f and f', A21NH2-G19H1'.

 

Formation of Two G:G:G:G Tetrads and Two G(:A):G(:A): G(:A):G Heptads and Their Mutual Arrangement—The formation of the G2:G5:G8:G11 and G14:G17:G20:G23 tetrads was concluded on the basis of GNH/NH2-GH8 (Fig. 2B) and GNH-GNH NOEs; the NOEs commonly observed for both tetrads are shown in Fig. 3A. The formation of the G1(:A3):G4(:A6): G7(:A9):G10 and G13(:A15):G16(:A18):G19(:A21):G22 heptads was concluded on the basis of GNH/NH2-AH8 (Fig. 2B), ANH2-GH1' (Fig. 2C), GNH2-ANH2, and GH8-AH8 NOEs, in addition to GNH/NH2-GH8 and GNH-GNH ones; the NOEs commonly observed for both heptads are shown in Fig. 3B. A12 and A24 are not involved in either the tetrads or the heptads.



View larger version (29K):
[in this window]
[in a new window]
 
FIG. 3.
Schematic representation of the tetrad (A), heptad (B), and the overall structure (C). The same representation is applied for the G14:G17:G20:G23 tetrad and G13(:A15):G16(:A18):G19(:A21):G22 heptad. Key NOEs commonly observed for both the tetrads are shown by solid lines connecting the corresponding protons in A and those commonly observed for both the heptads in B.

 

The arrangement of two heptads was determined on the basis of the strong AH2-AH1'/2'/2'' NOEs between A3 and A21, A6 and A18, and A9 and A15, and the medium to weak GH8-GH1' ones between G1 and G22, G4 and G19, G7 and G16, and G10 and G13 (Fig. 2A). Thus, the overall structure of d(GGA)8 was concluded to be as shown in Fig. 3C.

Structure of d(GGA)8The applied constraints and the structural statistics for the 20 final structures are summarized in Table I. The root mean square deviations (r.m.s.d.s) of the 20 final structures versus the mean structure for all heavy atoms were 0.35 ± 0.13 Å.


View this table:
[in this window]
[in a new window]
 
TABLE I
NMR constraints and structural statistics for d(GGAGGAGGAGGAGGAGGAGGAGGA)

 

Fig. 4A shows a stereo view of the superposition of the 20 final structures. A representative structure with the lowest energy is shown in Fig. 4B, a trace of the sugar-phosphate main chain being indicated by a tube for clarity. Two tetrad and two heptads can be seen (Fig. 4, C–E), as already discussed qualitatively. A12 is located close to the upper heptad, although it is not involved in the heptad. A24 is stacked on G23.



View larger version (43K):
[in this window]
[in a new window]
 
FIG. 4.
The structure of d(GGA)8. A, a stereo view of the superposition of the 20 final structures, being rotated by ~180° from Fig. 3C. G2, G5, G8, and G11 are colored in yellow; G1, A3, G4, A6, G7, A9, and G10 in magenta; G13, A15, G16, A18, G19, A21, and G22 in green; G14, G17, G20, and G23 in light brown; and A12 and A24 in sky blue. The sugar-phosphate main chain is colored in gray and O4' of sugars in red. B, a stereo view of the representative structure with the lowest energy. Stacking between the tetrad and the heptad for G2:G5:G8:G11 and G1(:A3):G4(:A6):G7(:A9):G10 (C) and for G14:G17:G20:G23 and G13(:A15): G16(:A18):G19(:A21):G22 (D). E, stacking between the two heptads.

 

A turn at a G1-G2-A3 trinucleotide segment, which is composed of a sheared G1:A3 base pair and a single G2 residue at the top, is followed by another turn at a G2-A3-G4-G5 segment (Fig. 4B). A further succession of turns is found for each GGA and GAGG segment except for the G11-A12-G13-G14 and G22-G23-A24 segments. In the cases of the canonical B- and A- forms, the backbone dihedral angles are as follows; {alpha}(g), {beta}(t), {gamma}(g+), {epsilon}(t), and {zeta}(g) (31). Deviation from the canonical dihedral angles of {alpha}, {beta}, and {gamma} was found for all A residues. Deviation of {alpha} and {beta} was found for G4, G7, G10, G11, G16, G19, and G22. Deviation of {zeta} was found for almost all G residues. The combination of these deviated dihedral angles results in the successive turns of the backbone.

These successive turns cause the four G-G segments, i.e. the G1-G2, G4-G5, G7-G8, and G10-G11 segments, to be aligned parallel to each other. As a result, the G1–G11 portion of d(GGA)8 forms an intramolecular parallel quadruplex (Figs. 3C and 4B). Similarly, the G13-G14, G16-G17, G19-G20, and G22-G23 segments are aligned parallel to each other, and the G13–G23 portion forms another intramolecular parallel quadruplex. The two quadruplexes are packed in a tail-to-tail manner through stacking between the two heptad planes, the orientation of G-G segments of one quadruplex being opposite to that of the other quadruplex (Figs. 3C and 4B).

The stacking between the tetrad and heptad planes is shown in Fig. 4, C and D. The five-membered ring of one guanine base is stacked on the six-membered ring of the other guanine base for each G-G segment. The A3, A6, A9, A15, A18, and A21 bases are stacked on the G2, G5, G8, G14, G17, and G20 sugars, respectively, which is characteristic of the structure of a GNA trinucleotide loop and consistent with the extreme upfield shift of H4' (2.62–2.84 ppm) and the moderate upfield shift of H3' (4.56–4.66 ppm) of these G residues (32, 33). Stacking of the two heptads is shown in Fig. 4E. The G1, A3, G4, A6, G7, A9, and G10 bases are stacked on the G22, A21, G19, A18, G16, A15, and G13 ones, respectively.

All G residues take on the anti conformation with respect to the glycoside bond, while all A residues, except for A12 and A24, take on the high anti-conformation. All G residues take on the B-form sugar conformation (around C1'-exo), while all A residues, except for A12 and A24, take on the A-form sugar conformation (around C3'-endo). The high anti and C3'-endo conformations of the six A residues are supposed to be preferable for the series of turns and/or for good stacking interactions.

Weak to medium NOEs were observed between A6H2 and G7H8, A9H2 and G10H8, A18H2 and G19H8, and A21H2 and G22H8. These NOEs could be very strong, if A6, A9, A18, and A21 take on the syn conformation, forming hydrogen bonds, i.e. G4NH2-A6N1 and G4N3-A6NH2, G7NH2-A9N1 and G7N3-A9NH2, G16NH2-A18N1 and G16N3-A18NH2, and G19NH2-A21N1 and G19N3-A21NH2, respectively. Therefore, there is the possibility that there is a small fraction of the syn conformation for A6, A9, A18, and A21, being in fast exchange with the major high anti conformation on an NMR time scale.

Comparison of the Structures of d(GGA)8 and d(GGA)4The structures of d(GGA)8 and d(GGA)4 are shown schematically in Fig. 5, A and B. d(GGA)8 forms a monomeric structure, while d(GGA)4 forms a dimeric one. The structure of the G1-G11 portion of d(GGA)8 is similar to that of the upper monomer of d(GGA)4. A12 of d(GGA)8 folds back and is located close to the upper heptad, while that of d(GGA)4 is stacked on G11 and moves away from the center. The main chain of d(GGA)8 goes down further through A12, and G13 is involved in the lower heptad. The structure of the G13-A24 portion of d(GGA)8 is similar to that of the lower monomer of d(GGA)4. As a whole, the monomeric architecture of d(GGA)8 resembles the dimeric architecture of d(GGA)4.



View larger version (39K):
[in this window]
[in a new window]
 
FIG. 5.
Comparison of the structures of d(GGA)8 (24-mer) (A), d(GGA)4 (12-mer) (B), and the telomere DNA, d[AGGG(TTAGGG)3], in the crystalline state (C).

 


    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 
From the viewpoint of symmetry, the formation of an octad composed of four G and four A bases may seem to be natural. A12 of d(GGA)4 (12-mer) does not fold back to the heptad plane, and thus the octad is not formed (Fig. 5B). This was explained by that the terminal A12 of d(GGA)4 does not have a following G residue that would pull it to the heptad plane (19). In the case of d(GGA)8 (24-mer) (Fig. 5A), it is conceivable that A12 may associate with the upper heptad, resulting in the formation of an octad. However, all NMR data for d(GGA)8 were against the existence of the sheared G10:A12 base pair that is needed for the formation of an octad, although the position of A12 was determined to be close to the upper heptad (Fig. 4B). A12 must bridge the upper and lower quadruplexes. It seems that A12 cannot be involved in the octad to play this role due to structural restriction. The terminal A24 does not fold back to the heptad, either, for the same reason as for d(GGA)4. Thus, an octad is not formed for d(GGA)8 either.

From the viewpoint of achieving maximum stacking interactions, the relative arrangement of the two heptads of d(GGA)8 can be rationalized. For example, if the lower heptad is rotated by either 90° or 180°, one A base of each heptad cannot be involved in the stacking interaction, which is energetically less stable and thus unfavorable.

Each monomer of d(GGA)4 forms an intramolecular parallel quadruplex (Fig. 5B). Matsugami et al. (19) first demonstrated that DNA can form the intramolecular parallel quadruplex. Later, an intramolecular parallel quadruplex was found for a telomere DNA in the crystalline state (20). These structures have indicated the intramolecular higher order packing of quadruplexes. In this study, we have demonstrated for the first time that the intramolecular higher order packing of quadruplexes actually occurs and the mode of the packing at atomic resolution is discussed. The intramolecular packing of the two quadruplexes for d(GGA)8 is achieved through the stacking interaction between the heptads of each quadruplex (Fig. 5A). As a result, the two quadruplexes are arranged in a tail-to-tail manner. Alternatively, if the G13(:A15):G16(:A18):G19(:A21): G22 heptad is stacked on the G2:G5:G8:G11 tetrad, with A12 bulging out, then the two quadruplexes are arranged in a head-to-tail manner. However, this does not occur for d(GGA)8, because heptad-heptad stacking in a tail-to-tail manner is energetically much more stable and favorable than heptad-tetrad stacking in a head-to-tail manner.

When the GGA unit is further repeated, formation of the structure shown in Fig. 5A is expected for the other 24-mer region, too. The structures formed for each 24-mer region may further pack through the stacking interaction between the tetrads of each structure.

The GGA triplet repeat is abundant in eukaryotic genomes (7), and is frequently located within biologically important regions such as gene regulatory regions and recombination hot spot sites (1113). Thus, the biological significance of the GGA repeat is widely recognized. The GGA repeat shows considerable genetic polymorphism due to genetic instability (7, 34). This genetic instability is supposed to originate from DNA slippage during DNA replication and/or frequent recombination of the repeats (35). The structure elucidated in this study can be used to rationalize the occurrence of these events. In the course of replication, double-stranded DNA is locally melted to yield single-stranded DNA for use as a template for DNA synthesis. In this case, the GGA strand is expected to form the unique intramolecular structure we found, when the high concentration of K+, over 100 mM, in the nucleus is considered. The structure is very stable, as revealed by its high melting temperature of 86 °C. It was also found for d(GGA)4 that once the unique structure is formed, the addition of the complementary strand, d(TCC)4, is not effective for the formation of a duplex (19). The unique structure remained even in the presence of the complementary strand. The formed structure could cause the slippage during DNA replication, which results in a gain or loss of repeats and thus genetic instability.

Alternatively, intermolecular association of two double-stranded DNAs through the packing of two quadruplexes formed in each DNA, as shown in Fig. 5B, may also occur and facilitate the recombination. This idea is coincident with the fact that the GGA repeat is frequently located in recombination hot spot sites. The facilitated recombination may also be responsible for the genetic instability of the GGA repeat. Furthermore, the intermolecular association at the GGA repeat may play a role in the pairing of homologous chromosomes during meiosis. The idea of pairing through a quadruplex has already been proposed for the G stretch (3740). It is known that the synapsis formation between homologous chromosomes in meiosis usually begins at telomere DNA, but that it sometimes starts at loci inside a chromosome.

It is expected that the unique intramolecular structure shown in Fig. 5A is formed in the course of transcription as well, because DNA is locally melted in this case, too. The formed structure is supposed to be an obstacle for an RNA polymerase to proceed, which could explain the attenuation of gene expression at the transcription level (10). We have revealed that RNA with the related sequence also can form a similar intramolecular parallel quadruplex (36). This RNA structure is very stable, too, the melting temperature being 86 °C (36). Therefore, it is expected that the unique structure in the transcribed mRNA becomes an obstacle for the translation machinery to associate with and/or proceed along mRNA in the course of translation, which may lead to the attenuation of gene expression at the post-transcriptional level (10).

The structure elucidated in this study can also provide a clue as to the higher order packing of the telomere DNA. A quadruplex of telomere DNA only contains the tetrad, not the heptad (Fig. 5C). Therefore, the arrangement of the two quadruplexes in a head-to-tail manner, in which the G26:G32:G38: G44 tetrad of the second quadruplex is stacked on the G4:G10: G16:G22 tetrad of the first quadruplex, with a T23-T24-A25 segment serving as a linker, seems to be likely for d[AGGG-(TTAGGG)7], as proposed by Parkinson et al. (20). Because of the lack of the heptad, the tail-to-tail arrangement of the two quadruplexes, in which the G26:G32:G38:G44 tetrad is stacked on the G2:G8:G14:G20 tetrad, with folding back of the T23-T24-A25 segment, seems less likely, although it is not impossible.


    FOOTNOTES
 
The atomic coordinates and structure factors (code 1OZ8) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).

* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Back

Supported by Grants-in-aid for Scientific Research (numbers 12470487 and 15030214) and the Protein 3000 Project of the Ministry of Education, Culture, Sports, Science and Technology of Japan. To whom correspondence should be addressed. Tel.: 81-45-339-4264; Fax: 81-45-339-4264; E-mail: masakata{at}ynu.ac.jp.

1 The abbreviations used are: NOESY, nuclear Overhauser effect spectroscopy; NOE, nuclear Overhauser effect; TOCSY, total correlation spectroscopy; DQF, double quantum filtered; HSQC, heteronuclear single-quantum coherence spectroscopy; r.m.s.d., root mean square deviation. Back



    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 EXPERIMENTAL PROCEDURES
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Pieretti, M., Zhang, F., Fu, Y. H., Warren, S. T., Oostra, B. A., Caskey, C. T., and Nelson, D. L. (1991) Cell 66, 817–822[Medline] [Order article via Infotrieve]
  2. Mahadevan, M., Tsilfidis, C., Sabourin, L., Shutler, G., Amemiya, C., Jansen, G., Neville, C., Marang, M., Barcelo, J., O'Hoy, K., Leblond, S., Earle-Macdonald, J., De Jong, P. J., Wieringa, B., and Korneluk, R. G. (1992) Science 255, 1253–1255[Medline] [Order article via Infotrieve]
  3. Huntington's Disease Collaborative Research Group (1993) Cell 72, 971–983[Medline] [Order article via Infotrieve]
  4. Campuzano, V., Montermini, L., Molto, M. D., Pianese, L., Cossee, M., Cavalcanti, F., Monros. E., Rodius, F., Duclos, F., Monticelli, A., et al. (1996) Science 271, 1423–1427[Abstract]
  5. Catasti, P., Chen, X., Mariappan, S., Bradbury, M., and Gupta, G. (1999) Genetica 106, 15–36[Medline] [Order article via Infotrieve]
  6. Bowater, R. P., and Wells, R. D. (2001) Prog. Nucleic Acids Res. Mol. Biol. 66, 159–202[Medline] [Order article via Infotrieve]
  7. Beckmann, J. S., and Weber, J. L. (1992) Genomics 12, 627–631[Medline] [Order article via Infotrieve]
  8. Heller, M., Flemington, E., Kieff, E., and Deininger, P. (1985) Mol. Cell. Biol. 5, 457–465[Medline] [Order article via Infotrieve]
  9. Koch, K. S., Gleiberman, A. S., Aoki, T., Leffert, H. L., Feren, A., Jones, A. L., and Fodor, E. J. (1995) Nucleic Acids Res. 23, 1098–1112[Abstract]
  10. Aoki, T., Koch, K. S., and Leffert, H. L. (1997) J. Mol. Biol. 267, 229–236[CrossRef][Medline] [Order article via Infotrieve]
  11. Derry, J. M., Wiedemann, P., Blair, P., Wang, Y., Kerns, J. A., Lamahieu, V., Godfrey, V. L., Wilkinson, J. E., and Francke, U. (1995) Genomics 29, 471–477[CrossRef][Medline] [Order article via Infotrieve]
  12. Hirsch, M. R., Gaugler, L., Deagostini-Bazin, H., Bally-Cuif, L., and Goridis, C. (1990) Mol. Cell. Biol. 10, 1959–1968[Medline] [Order article via Infotrieve]
  13. Wells, R. D., Collier, D. A., Hanvey, J. C., Shimizu, M., and Wohlrab, F. (1998) FASEB J. 2, 2939–2949
  14. Huertas, D., and Azorin, F. (1996) Biochemistry 35, 13125–13135[CrossRef][Medline] [Order article via Infotrieve]
  15. Mishima, Y., Kaizu, H., and Kominami, R. (1997) J. Biol. Chem. 272, 26578–26584[Abstract/Free Full Text]
  16. Usdin, K. (1998) Nucleic Acids Res. 26, 4078–4085[Abstract/Free Full Text]
  17. Kettani, A., Bouaziz, S., Skripkin, E., Majumdar, A., Wang, W., Jones, R., and Patel, D. J. (1999) Structure (Lond.) 7, 803–815[Medline] [Order article via Infotrieve]
  18. Kettani, A., Gorin, A., Majumdar, A., Hermann, T., Skripkin, E., Zhao, H., Jones, R., and Patel, D. J. (2000) J. Mol. Biol. 297, 627–644[CrossRef][Medline] [Order article via Infotrieve]
  19. Matsugami, A., Ouhashi, K., Kanagawa, M., Liu, H., Kanagawa, S., Uesugi, S., and Katahira, M. (2001) J. Mol. Biol. 313, 255–269[CrossRef][Medline] [Order article via Infotrieve]
  20. Parkinson, G. N., Lee, M. P. H., and Neidle, S. (2002) Nature 417, 876–880[CrossRef][Medline] [Order article via Infotrieve]
  21. Delaglio, F., Grzesiek, S., Vuister, G. W., Zhu, G., Pfeifer, J., and Bax, A. (1995) J. Biomol. NMR 6, 277–293[Medline] [Order article via Infotrieve]
  22. Garrett, D. S., Powers, R., Gronenborn, A. M., and Clore, G. M. (1991) J. Magn. Reson. 95, 214–220
  23. Varani, G., Aboul-ela, F., and Allain, F. H. T. (1996) Prog. Nucl. Magn. Reson. Spect. 29, 51–127[CrossRef]
  24. Smith, J. S., and Nikonowicz, E. P. (2000) Biochemistry 39, 5642–5652[CrossRef][Medline] [Order article via Infotrieve]
  25. Katahira, R., Katahira, M., Yamashita, Y., Ogawa, H., Kyogoku, Y., and Yoshida, M. (1998) Nucleic Acids Res. 26, 744–755[Abstract/Free Full Text]
  26. Santos, R. A., Tang, P., and Harbison, G. S. (1989) Biochemistry 28, 9372–9378[Medline] [Order article via Infotrieve]
  27. Brünger, A. T. (1993) X-PLOR Version 3.1: A System for X-ray Crystallography and NMR, Yale University Press, New Haven, CT
  28. Balagurumoorthy, P., Brahmachari, S. K., Mohanty, D., Bansal, M., and Sasisekharan, V. (1992) Nucleic Acids Res. 20, 4061–4067[Abstract]
  29. Catasti, P., Chen, X., Moyzis, R. K., Bradbury, M., and Gupta, G. (1996) J. Mol. Biol. 264, 534–545[CrossRef][Medline] [Order article via Infotrieve]
  30. Wüthrich, K. (1986) NMR of Proteins and Nucleic Acids, John Wiley and Sons Inc., New York
  31. Saenger, W. (1983) Principles of Nucleic Acid Structure, Springer-Verlag Inc., New York
  32. Li, Y., Zon, G., and Wilson, W. D. (1991) Proc. Natl. Acad. Sci. U. S. A. 88, 26–30[Abstract/Free Full Text]
  33. Hirao, I., Kawai, G., Yoshizawa, S., Nishimura, Y., Ishido, Y., Watanabe, K., and Miura, K. (1994) Nucleic Acids Res. 22, 576–582[Abstract]
  34. Hoffman-Liebermann, B., Liebermann, D., Trout, A., Kedes, L. H., and Cohen, S. N. (1986) Mol. Cell. Biol. 6, 3632–3642[Medline] [Order article via Infotrieve]
  35. Hearne, C. M., Ghosh, S., and Todd, J. A. (1992) Trends Genet. 8, 288–294[Medline] [Order article via Infotrieve]
  36. Liu, H., Matsugami, A., Katahira, M., and Uesugi, S. (2002) J. Mol. Biol. 322, 955–970[CrossRef][Medline] [Order article via Infotrieve]
  37. Sen, D., and Gilbert, W. (1988) Nature 334, 364–366[CrossRef][Medline] [Order article via Infotrieve]
  38. Subdquist, W. I., and Klug, A. (1989) Nature 342, 825–829[CrossRef][Medline] [Order article via Infotrieve]
  39. Williamson, J. R., Raghuraman, M. K., and Cech, T. R. (1989) Cell 59, 871–880[Medline] [Order article via Infotrieve]
  40. Blackburn, E. H. (1991) Nature 350, 569–573[CrossRef][Medline] [Order article via Infotrieve]