Hostname: page-component-cd9895bd7-gbm5v Total loading time: 0 Render date: 2024-12-28T21:38:41.907Z Has data issue: false hasContentIssue false

How sequence alterations enhance the stability and delay expansion of DNA triplet repeat domains

Published online by Cambridge University Press:  06 November 2023

Jens Völker
Affiliation:
Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ, USA
Kenneth J. Breslauer*
Affiliation:
Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ, USA The Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA
*
Corresponding author: Kenneth J. Breslauer; Email: [email protected]
Rights & Permissions [Opens in a new window]

Abstract

DNA sequence alterations within DNA repeat domains inexplicably enhance the stability and delay the expansion of interrupted repeat domains. Here we propose mechanisms that rationalise such unanticipated outcomes. Specifically, we describe how interruption of a DNA repeat domain restricts the ensemble space available to dynamic, slip out, repeat bulge loops by introducing energetic barriers to loop migration. We explain how such barriers arise because some possible loop isomers result in energetically costly mismatches in the duplex portion of the repeat domain. We propose that the reduced ensemble space is the causative feature for the observed delay in repeat DNA expansion. We further posit that the observed loss of the interrupting repeat in some expanded DNAs reflects the transient occupation of loop isomer positions that result in a mismatch in the duplex stem due to ‘leakiness’ in the energy barrier. We propose that if the lifetime of such a low probability event allows for recognition by the mismatch repair system, then ‘repair’ of the repeat interruption can occur; thereby rationalising the absence of the interruption in the final expanded DNA ‘product.’ Our proposed mechanistic pathways provide reasoned explanations for what have been described as ‘puzzling’ observations, while also yielding insights into a biomedically important set of coupled genotypic phenomena that map the linkage between DNA origami thermodynamics and phenotypic disease states.

Type
Perspective
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press

Repeat DNA sequences are inherently unstable and prone to undergo expansions, deletions, and chromosome rearrangements

Trinucleotide and higher order nucleotide repeat genomic domains are characterised by relative instability compared to more random sequence domains. This instability frequently is associated with uncontrolled DNA expansion or contraction events, as well as with chromosome rearrangements. These genomic events often correlate with debilitating phenotypic disease states (Ashley and Warren, Reference Ashley and Warren1995; Orr and Zoghbi, Reference Orr and Zoghbi2007). Mirkin and Khristich published a very interesting and insightful review of possible mechanisms and consequences of repeat DNA instability, entitled ‘On the wrong track: molecular mechanisms of repeat mediated genome instability’ (Khristich and Mirkin, Reference Khristich and Mirkin2020). In their scholarly review, the authors critically discuss various models and conceptual frameworks that have been proposed to explain the propensity for repeats to expand or contract.

Interruptions in pre-expanded repeat domains protect repeat DNAs against expansion while being lost in expanded repeat domains

It has been a relatively short time since DNA repeat expansion was proposed as a genotypic cause of phenotypic diseases induced by a threshold level of triplet repeat DNA expansion (Sutherland and Richards, Reference Sutherland and Richards1995). Since then, some progress has been made towards understanding the causal relationship(s) between repeat domain instability and DNA expansion. Nonetheless, crucial features of the mechanisms of how repeat DNA instability leads to DNA expansion still remain unknown. In a subchapter within their review subtitled, ‘Role of repeat interruptions in repeat stability’, Khristich and Mirkin underscore the inability of current models to account for the reduced levels of repeat expansion caused by one or more ‘interruptions’ within the repeat sequence domain. Employing the example of CGG repeats containing AGG interruptions, the authors also note the puzzling apparent loss of the pre-existing AGG interruption in those repeat DNA domains that have become expanded; fascinating yet perplexing observations for which meaningful explanations currently are lacking.

CAA interruptions near the 3′ end of CAG repeats within the Huntingtin gene delay the onset of disease in pre-mutation length Huntington’s patients

The deficiency in defining a mechanistic explanation for the origins of such interruption-induced delay in repeat expansion represents a barrier to the rational design of target-based therapeutic interventions. The need for such a mechanistic understanding is reinforced by a recent review of deep sequencing data from pre-mutation length CAG repeats in Huntington’s disease patients (Wright et al., Reference Wright, Black, Collins, Gall-Duncan, Caron, Pearson and Hayden2020). In this review, Pearson and co-workers note that one or more CAA interruptions of the CAG repeats near the 3′ end of the repeat domain dramatically delay the age of onset of the Huntington’s disease phenotype relative to carriers of the same length composed of uninterrupted CAG repeats. Since the CAA triplet codes for the same amino acid, glutamine, as the CAG triplet it replaces, these authors concluded that the delayed onset of Huntington’s disease must be due to changes in the properties of the repeat sequence itself and not due to the polyglutamine tract in the mutated Huntingtins protein.

Rationalising interruption-induced alterations in repeat DNA properties

Here we propose plausible explanations/mechanisms that can rationalise the impact of repeat interruptions on repeat expansion events, as well as on the puzzling apparent loss of the interrupting repeats after expansion. Our mechanistic proposals, as elaborated on below, are based on insights we have garnered from our published studies on strategically designed, CAG repeat-containing oligonucleotide systems (Völker et al., Reference Völker, Klump and Breslauer2007, Reference Völker, Plum, Gindikin, Klump and Breslauer2014). Such constructs create a dynamic energy landscape shaped by nearly isoenergetic, interchanging, positional isomers that we have dubbed as ‘rollamers’ (Völker et al., Reference Völker, Klump and Breslauer2008, Reference Völker, Gindikin, Klump, Plum and Breslauer2012, Reference Völker, Plum, Gindikin and Breslauer2019; Li et al., Reference Li, Völker, Breslauer and Wilson2014; Völker and Breslauer, Reference Völker and Breslauer2022) (Fig. 1).

Figure 1. (a) Cartoon version of a repeat bulge loop within a larger repeat domain. Such a static representation does not highlight the reality that such repeat bulge loops really are dynamic ensembles of loop isomers as indicated in (b) (Völker et al., Reference Völker, Gindikin, Klump, Plum and Breslauer2012). Note that in these cartoon representations, we present the repeat bulge loop as a ‘unstructured ring’ to account for the (largely unknown) fluctuating microstructures that likely make up the repeat bulge loop ensemble within a given rollamer isomer (Völker et al., Reference Völker, Klump and Breslauer2008).

Disruption of DNA secondary structure as a potential cause for the repeat interruption-induced increase in repeat stability. For quite some time it has been proposed that aberrant secondary structure formation, stabilised by repeat specific intrastrand base pairing interactions, might serve as critical intermediates in the processes that induce repeat instability leading to expansion (Gacy et al., Reference Gacy, Goellner, Juranic, Macura and McMurray1995; Mitas, Reference Mitas1997; Pearson and Sinden, Reference Pearson and Sinden1998; McMurray, Reference McMurray1999; Sinden et al., Reference Sinden, Potaman, Oussatcheva, Pearson, Lyubchenko and Shlyakhtenko2002; Lenzmeier and Freudenreich, Reference Lenzmeier and Freudenreich2003). This recognition also suggests a potential basis for the empirically observed reduction in repeat domain instability in the presence of one or more ‘wrong’ triplets. In this regard, it has been suggested that destabilisation of the repeat secondary structure by the ‘wrong’ triplet results in an unstable secondary structure, one that would not provide the same challenges to the replication and repair machinery as would be created by a ‘correct’ repeat secondary structure; thereby preventing the expansion process from occurring. However, as pointed out by Khristich and Mirkin in their comprehensive review (Khristich and Mirkin, Reference Khristich and Mirkin2020), it is hard to conceive how a single base change, or even a small number of repeat interruptions within very large repeat domains could manifest such an effect, and there are scant data in support of this hypothesis.

Delaney and coworkers have perhaps presented the most relevant dataset in partial support of this perspective by showing that AGG interruptions within a (CGG)n repeat oligonucleotide alter its secondary structure (as defined by susceptibility to structure-specific chemical probes), and the thermodynamic stability of the freely folding ensemble of structures adopted by (CGG)n repeat oligonucleotides (Jarem et al., Reference Jarem, Huckaby and Delaney2010). In this regard it should be noted that oligonucleotides composed only of repeat sequences do not fold into one unique native structure, but rather present an ensemble of interrelated folding forms, a feature that makes unambiguous interpretation of the data difficult. Given the unknown ensemble distributions in their samples, other interpretations of Delaney’s results are possible. Of interest here is the interruption-induced changes in the ensemble, a feature that also plays a role, albeit in a different context, in the proposed mechanism put forward below.

Abasic sites (and mismatches) can be accommodated within repeat DNA secondary structures without loss of secondary structure stability

In our published studies, we used abasic site lesions, inserted site specifically in place of guanine in select CAG repeats that are trapped within a repeat bulge loop conformation by conventional Watson and Crick base-paired domains upstream and downstream of the CAG domain (Völker et al., Reference Völker, Plum, Klump and Breslauer2009). This arrangement significantly restricts the confounding repeat ensemble distribution (Völker et al., Reference Völker, Klump and Breslauer2008), thereby simplifying the analysis. Our measurements revealed the impact of the abasic site lesion within the repeat bulge loop domain to be essentially energetically neutral. This is an unanticipated outcome since we previously have shown the abasic lesion to be one of, if not the most destabilising lesion in duplex DNA; an expectation consistent with abasic sites, in a formal sense, involving the removal of the guanine base, producing a pairing and stacking ‘hole’ in the DNA helix (Vesnaver et al., Reference Vesnaver, Chang, Eisenberg, Grollman and Breslauer1989; Gelfand et al., Reference Gelfand, Plum, Grollman, Johnson and Breslauer1996, Reference Gelfand, Plum, Grollman, Johnson and Breslauer1998; Minetti et al., Reference Minetti, Sun, Jacobs, Kang, Remeta and Breslauer2018). However, contrary to this impact on duplex DNA, we have demonstrated that the repeat self-structure is able to adjust in a compensatory manner that entirely masks the energetic cost of the loss of one putative base pair and the 5′ and 3′ associated base stacking interactions (Völker et al., Reference Völker, Plum, Klump and Breslauer2009). By contrast, introducing an ‘inappropriate’ base (i.e., replacing a C or G with an A to form a AGG or CAA triplet) within the repeat loop is likely to be less destabilising than an abasic site, given the plethora of non-standard, non-Watson and Crick base pairing interactions that have been shown to be only marginally less stable than the canonical Watson and Crick base pair (Nelson et al., Reference Nelson, Martin and Tinoco1981; Aboul-ela et al., Reference Aboul-ela, Koh, Tinoco and Martin1985; Wu et al., Reference Wu, McDowell and Turner1995; SantaLucia, Reference SantaLucia1998; SantaLucia and Hicks, Reference SantaLucia and Hicks2004). Focusing on the sequence examples mentioned earlier, replacement of a CGG repeat by AGG, or CAG by CAA would result in CˑA or AˑC base pairs, if sequence alignment of the kind frequently proposed for repeat slip-outs is maintained. Stable DNA structures composed of AˑC base pairs have been reported (Hunter et al., Reference Hunter, Brown, Anand and Kennard1986; Boulard et al., Reference Boulard, Cognet, Gabarro-Arpa, Le Bret, Sowers and Fazakerley1992; Allawi and SantaLucia, Reference Allawi and SantaLucia1998). Based on this analysis, it is unlikely that a single or even a few interruptions of the repeat sequence would suffice to disrupt repeat DNA self-structure to such an extent that uncontrolled expansion is delayed or absent.

Consequently, based on the body of data available, it is reasonable to conclude that one or a few interruptions to the repeat sequences are insufficient to fatally disrupt repeat secondary structure, and thereby prevent expansion. As a result, one needs to look elsewhere for possible explanations.

Repeat DNA secondary structures are dynamic ensembles, with the ensemble distributions dictated by differential energy levels

We propose that the answer to this conundrum lies in the dynamic nature of repeat DNA slip outs within larger repeat domains. As we have shown, short repeat slip outs within larger repeat domains result in dynamic repeat bulge loops, which we dubbed ‘rollamers.’ Rollamers are dynamic, interconverting, positional isomers that can form in multiple energetically equivalent positions within the larger repeat sequence (Völker et al., Reference Völker, Gindikin, Klump, Plum and Breslauer2012). Such an arrangement of loop distributions is favoured by a Boltzmann entropy gain. Relatively facile interconversions between different repeat loop positions over the entire repeat sequence domain results, at equilibrium, in an ensemble distribution rather than a single repeat loop isomer, with each possible loop position being approximately equally populated. Indeed, we have postulated that the fleeting nature of such bulge loops due to relatively facile loop migration may be one of the contributing factors that cause the DNA replication and repair machineries to erroneously expand (or contract) repeats when they encounter rollameric substrates. In fact, consistent with this expectation, we have shown that loop migration can cause abasic site lesions to escape processing by the critical repair enzyme APE1 (Völker and Breslauer, Reference Völker and Breslauer2022). The presence of an abasic site lesion in place of one of the guanines in a CAG repeat, however, alters the ensemble distributions of the rollamers, since in this construct different loop positions are no longer energetically equivalent (Völker et al., Reference Völker, Plum, Gindikin and Breslauer2019). Under such a circumstance, the system adjusts by altering the relative populations of loop isomer states to minimise the energetic penalty caused by the lesion.

How repeat domain interruptions alter the repeat DNA secondary structure ensemble distributions

Following similar reasoning, we postulate that the presence of an interruption of the repeat sequence also will cause a change in repeat bulge loop rollamer distribution within such interrupted repeat sequences. As schematically shown in Fig. 2 and elaborated on in the figure legend, the altered triplet can either form conventional base pairs within upstream or downstream duplex domains (i.e., shown by Isomers I and II in Fig. 2), or result in the altered triplet partitioning into the repeat bulge loop domain while simultaneously causing formation of a mismatch in the duplex domain (Isomers III, IV and V in Fig. 2). Isomer III represents a special case due to partitioning of the mismatch at the 5′ loop junction, most likely resulting in an enlarged loop domain. In the former case, the energetic impact of the repeat interruption to the repeat bulge loop is indistinguishable from equivalent-sized repeat bulge loop rollamers in uninterrupted repeats. In the latter case, the energetic impact of the interruption is defined by the impact of the base mismatch and potential contributions from the loop modification. As a consequence, the different loop isomers are no longer energetically equivalent, and significant changes in the populations of different loop isomer positions can be expected. For example, the presence of an abasic site within a repeat almost completely inhibits population of those loop isomers where the abasic site partitions into the duplex domain (Völker et al., Reference Völker, Plum, Gindikin and Breslauer2019). A similar, but perhaps less profound, effect would be expected for the energetically less costly mismatches.

Figure 2. Schematic representation of the impact of repeat interruptions on the repeat bulge loop ensemble as shown for the [CAG]8ˑ[CTG]4 system containing a CAA interruption in place of the 6th CAG repeat (Mutated base pair indicated in red letters). The [CAG]8ˑ[CTG]4 complex results in a 4-repeat bulge loop that can be positioned in 5 possible loop positions, identified by roman numerals I-V, in the 5′ to 3′ directions. The colour coding of the CAG repeat segments into yellow (repeats 1–4) and red (repeats 5–8) is meant as a visual aid to help identify which repeat is partitioned into which domain in each of the 5 loop isomers. Loop position determines whether the CAA repeat is part of the duplex or the loop domain with the blue ball highlighting the position of the mutated A in each loop isomer. When the CAA triplet is partitioned into the loop domain, the complementary TTC triplet in the opposing strand forms base pairs with a CAG repeat that is part of the upstream duplex region, resulting in a GˑT mismatch (green ellipsoid). Note that the different loop isomers can be classified into 3 general groups; as defined by the energetic impact of the repeat interruptions on the repeat bulge loop isomer; with the differential energetic impacts dictating the differential loop populations. Group 1: isomers I and II contain a conventionally base paired CAA/GTT triplet in the downstream duplex domain, (potential loop Isomers with the CAA/GTT triplet in the upstream duplex domain are not shown in this example but can be considered essentially equivalent); Group 2: Loop isomers IV and V contain a mismatched triplet CAG/GTT in the upstream duplex domain. Analogous to group 1 above, potential loop Isomers with the mismatch in the downstream duplex domain are not shown, but also can be considered essentially equivalent; and Group 3: Loop isomer III contains the GˑT mismatch at the 5′ junction and we propose that it likely is part of an expanded loop domain. Note that a mismatch at the 3′ junction is possible, depending on the nature of the repeat interruption, as defined by the triplet sequence. An altered base in 1st, 2nd, or 3rd positions has unique impacts at the 3′ and 5′ junctions, but only impacts group 1 and 2 insofar as it alters nearest neighbours in the duplex. Fig. 2 is intended as an illustrative example. Longer repeats, larger slip outs, and repeat interruptions at different positions will produce different loop isomer arrangements, but conceptually they are represented by the three group classifications shown in Fig. 2.

Based on our abasic site data, as discussed above (Völker et al., Reference Völker, Plum, Gindikin and Breslauer2019), we posit that loop isomer positions resulting in a mismatch and altered repeat loop sequence are unlikely to be populated to any significant extent due to the energetic damage such modifications cause. In other words, the interruption of the repeat sequence acts like a (possibly leaky) barrier to rollameric loop distribution. By thermodynamically discouraging population of some potential rollamer positions, the interruption of the repeat sequence causes it to behave as if equivalent to a shorter length repeat domain, particularly as far as the propensity to expand is concerned, which is exactly what has been observed empirically.

The mechanism we propose by which repeat interruptions increase repeat DNA stability represents an example of the importance of the thermodynamic impact of the final state in addition to the commonly considered initial state when one tries to assess biological outcomes.

Low probability, repeat bulge loop isomer states present the possibility for mismatch repair processes to result in the apparent loss of the repeat interruption

Finally, some comments regarding the apparent loss of interrupting triplets such as AGG in an expanded domain alluded to by Khristich and Mirkin in their review (and references therein) (Khristich and Mirkin, Reference Khristich and Mirkin2020). Two possible explanations exist that can be tested by inspection of the expanded sequence. If expansion happens only as a consequence of the restricted/preferred sequence space available to the ‘correct’ repeat bulge loop rollamer, then the expanded domain would only reflect the ‘correct’ repeats, while the altered triplet should still be present near the 3′ or 5′ end of the repeat domain. Alternatively, the thermodynamic argument for reduced repeat expansion propensity, due to single base/triplet interruptions of the repeat sequences, does not exclude the possibility that the repeat bulge loops adopt loop positions where their positional partition results in the upstream duplex domain containing a mismatch, primarily corresponding to isomers IV and V in Fig. 2. Although thermodynamic considerations make population of such loop states low probability events, they would be highly consequential by resulting in repair of the interrupting repeat, hence our use of the term ‘possibly leaky’ barrier. This reasoning is essentially a classic thermodynamic/dynamic argument; namely, that high energy states that are sparsely populated, but do exist, can lead to additional biologically consequential processing pathways if the lifetime of the sparsely populated state is sufficiently long so as to be recognised and processed by the relevant repair enzyme. Such successful repair would remove the interrupting repeat, an outcome consistent with observation.

Given that mismatch repair has been implicated in facilitating repeat expansion events (McMurray, Reference McMurray2008; Iyer et al., Reference Iyer, Pluciennik, Napierala and Wells2015; Schmidt and Pearson, Reference Schmidt and Pearson2016; Iyer and Pluciennik, Reference Iyer and Pluciennik2021), this cascade of events may also trigger expansion of the now interruption-free repeat domain, also consistent with the observed outcomes from the data. The empirical observations of delayed repeat expansions in interrupted repeat sequences are a strong indicator that rollamer isomers for which loop partition result in the formation of mismatches in the repeat duplex domain are low probability events, as otherwise mismatch repair process could be expected to enhance rates of repeat expansion events.

Going Forward: Potential Experimental Assessments of the proposed mechanisms for interruption-induced increase in repeat DNA stability

In closing, we wish to point out that the proposed mechanism, elaborated here, for increased stability of repeat DNA sequences with one or more interruptions, provides testable predictions that allow one to confirm or refute the proposed hypothesis. Notably, determining the thermodynamic consequences of a single interruption of the repeat sequence in various loop positions within a static repeat bulge loop will assess if the consequence of a single base disruption on repeat self-structure stability is indeed negligible, as we suggest above. Furthermore, monitoring loop distribution of a dynamic rollamer system similar to that outlined in Fig. 2, should allow one to determine if the presence of the repeat disruption indeed influences rollamer loop distribution, as hypothesised here; thereby reducing the effective repeat length. Single-molecule studies may prove useful in this regard (Hu et al., Reference Hu, Morten and Magennis2021; Bianco et al., Reference Bianco, Hu, Henrich and Magennis2022). Finally, it may be possible to test the extent to which the mismatch repair system is able to repair the repeat disruption by using a site specifically located mismatch at the repeat /nonrepeat DNA sequence junction in a rollameric system.

Concluding remarks

To summarise, we have presented a novel energetic-based explanation for the puzzling observations remarked upon by Khristish and Mirkin in their review (Khristich and Mirkin, Reference Khristich and Mirkin2020) that interruptions in repeat sequences lead to increased stability and delayed expansion of these domains. Our proposed mechanism also can explain the equally puzzling observed loss of the interrupting repeat when expansion eventually does happen. Specifically, we describe for the first time how interruption of a repeat domain restricts the ensemble space available to dynamic, slip out, repeat bulge loops by introducing energetic barriers to loop migration. We present the novel proposal that these barriers arise because some possible loop isomers result in energetically costly mismatches in the duplex portion of the repeat domain. We further propose for the first time that the reduced ensemble space is the causative feature for the observed delay in repeat DNA expansion. We further propose for the first time that the observed loss of the interrupting repeat in some expanded DNAs may be due to transient occupation of loop isomer positions that result in a mismatch in the duplex stem due to leakiness in the energy barrier. We propose the novel hypothesis that if the lifetime of such a low probability event allows for recognition by the mismatch repair system, then ‘repair’ of the repeat interruption can occur; thereby rationalising the absence of the interruption in the final expanded DNA ‘product.’

We are pleased that Khristch and Mirkin produced such a comprehensive and provocative critical review. In the best of circumstances, such quality reviews serve as intellectual launching pads that motivate the scientific community to focus on explanations for counterintuitive observations. Their presentation of some puzzling biomedical correlative outcomes stimulated us to refocus on our biophysical studies of such systems. Our resulting proposed mechanistic pathways provide novel insights into a biomedically important set of coupled genotypic phenomena that map the linkage between DNA origami thermodynamics and phenotypic disease states.

Open peer review

To view the open peer review materials for this article, please visit http://doi.org/10.1017/qrd.2023.6.

Acknowledgements

The authors would like to thank Drs Craig A. Gelfand and G. Eric Plum for helpful discussions.

Author contribution

J.V. and K.J.B. contributed equally to this work.

Competing interest

The authors declare none.

Financial disclosure

Supported by grants from the NIH GM23509, GM34469, and CA47995 (all to K.J.B.).

References

Aboul-ela, F, Koh, D, Tinoco, I Jr and Martin, FH (1985) Base–base mismatches. Thermodynamics of double helix formation for dCA3XA3G + dCT3YT3G (X, Y = A,C,G,D. Nucleic Acids Research 13(13), 48114824. https://doi.org/10.1093/nar/13.13.4811CrossRefGoogle Scholar
Allawi, HT and SantaLucia, J (1998) Nearest-neighbor thermodynamics of internal A·C mismatches in DNA: Sequence dependence and pH effects. Biochemistry 37(26), 94359444. https://doi.org/10.1021/bi9803729CrossRefGoogle ScholarPubMed
Ashley, CT and Warren, ST (1995) Trinucleotide repeat expansion and human disease. Annual Review of Genetics 29, 703728. https://doi.org/10.1146/annurev.ge.29.120195.003415CrossRefGoogle ScholarPubMed
Bianco, S, Hu, T, Henrich, O and Magennis, SW (2022) Heterogeneous migration routes of DNA triplet repeat slip-outs. Biophysical Reports 2(3), 100070. https://doi.org/10.1016/j.bpr.2022.100070CrossRefGoogle ScholarPubMed
Boulard, Y, Cognet, JAH, Gabarro-Arpa, J, Le Bret, M, Sowers, LC and Fazakerley, GV (1992) The pH dependent configurations of the C.A mispair in DNA. Nucleic Acids Research 20(8), 19331941. https://doi.org/10.1093/nar/20.8.1933CrossRefGoogle Scholar
Gacy, AM, Goellner, G, Juranic, N, Macura, S and McMurray, CT (1995) Trinucleotide repeats that expand in human disease form hairpin structures in vitro. Cell 81(4), 533540. https://doi.org/10.1016/0092-8674(95)90074-8CrossRefGoogle ScholarPubMed
Gelfand, CA, Plum, GE, Grollman, AP, Johnson, F and Breslauer, KJ (1996) The impact of a bistrand abasic lesion on DNA duplex properties. Biopolymers 38(4), 439445.3.0.CO;2-U>CrossRefGoogle ScholarPubMed
Gelfand, CA, Plum, GE, Grollman, AP, Johnson, F and Breslauer, KJ (1998) Thermodynamic consequences of an abasic lesion in duplex DNA are strongly dependent on base sequence. Biochemistry 37(20), 73217327.CrossRefGoogle ScholarPubMed
Hu, T, Morten, MJ and Magennis, SW (2021) Conformational and migrational dynamics of slipped-strand DNA three-way junctions containing trinucleotide repeats. Nature Communications 12(1), 204. https://doi.org/10.1038/s41467-020-20426-3CrossRefGoogle ScholarPubMed
Hunter, WN, Brown, T, Anand, NN and Kennard, O (1986) Structure of an adenine˙cytosine base pair in DNA and its implications for mismatch repair. Nature 320(6062), 552555. https://doi.org/10.1038/320552a0CrossRefGoogle ScholarPubMed
Iyer, RR and Pluciennik, A (2021) DNA mismatch repair and its role in Huntington’s disease. Journal of Huntington’s Disease 10, 7594. https://doi.org/10.3233/JHD-200438CrossRefGoogle ScholarPubMed
Iyer, RR, Pluciennik, A, Napierala, M and Wells, RD (2015) DNA triplet repeat expansion and mismatch repair. Annual Review of Biochemistry 84(1), 199226. https://doi.org/10.1146/annurev-biochem-060614-034010CrossRefGoogle ScholarPubMed
Jarem, DA, Huckaby, LV and Delaney, S (2010) AGG interruptions in (CGG)n DNA repeat tracts modulate the structure and thermodynamics of non-B conformations in vitro. Biochemistry 49(32), 68266837. https://doi.org/10.1021/bi1007782CrossRefGoogle ScholarPubMed
Khristich, AN and Mirkin, SM (2020) On the wrong DNA track: Molecular mechanisms of repeat-mediated genome instability. Journal of Biological Chemistry 295(13), 41344170. https://doi.org/10.1074/jbc.REV119.007678CrossRefGoogle ScholarPubMed
Lenzmeier, BA and Freudenreich, CH (2003) Trinucleotide repeat instability: A hairpin curve at the crossroads of replication, recombination, and repair. Cytogenetic and Genome Research 100(1–4), 724. https://doi.org/10.1159/000072836CrossRefGoogle ScholarPubMed
Li, M, Völker, J, Breslauer, KJ and Wilson, DM (2014) APE1 incision activity at abasic sites in tandem repeat sequences. Journal of Molecular Biology 426(11), 21832198. https://doi.org/10.1016/j.jmb.2014.03.014CrossRefGoogle ScholarPubMed
McMurray, CT (1999) DNA secondary structure: A common and causative factor for expansion in human disease. Proceedings of the National Academy of Sciences of the United States of America 96(5), 18231825.CrossRefGoogle ScholarPubMed
McMurray, CT (2008) Hijacking of the mismatch repair system to cause CAG expansion and cell death in neurodegenerative disease. DNA Repair 7(7), 11211134. https://doi.org/10.1016/j.dnarep.2008.03.013CrossRefGoogle ScholarPubMed
Minetti, CA, Sun, JY, Jacobs, DP, Kang, I, Remeta, DP and Breslauer, KJ (2018) Impact of bistrand abasic sites and proximate orientation on DNA global structure and duplex energetics. Biopolymers 109(8), e23098. https://doi.org/10.1002/bip.23098CrossRefGoogle ScholarPubMed
Mitas, M (1997) Trinucleotide repeats associated with human disease. Nucleic Acids Research 25(12), 22452254.CrossRefGoogle ScholarPubMed
Nelson, JW, Martin, FH and Tinoco, I Jr (1981) DNA and RNA oligomer thermodynamics: The effect of mismatched bases on double-helix stability. Biopolymers 20(12), 25092531. https://doi.org/10.1002/bip.1981.360201204CrossRefGoogle ScholarPubMed
Orr, HT and Zoghbi, HY (2007) Trinucleotide repeat disorders. Annual Review of Neuroscience 30, 575621.CrossRefGoogle ScholarPubMed
Pearson, CE and Sinden, RR (1998) Trinucleotide repeat DNA structures: Dynamic mutations from dynamic DNA. Current Opinion in Structural Biology, 8(3), 321330. https://doi.org/10.1016/s0959-440x(98)80065-1.CrossRefGoogle ScholarPubMed
SantaLucia, J (1998) A unified view of polymer, dumbbell, and oligonucleotide DNA nearest-neighbor thermodynamics. Proceedings of the National Academy of Sciences of the United States of America 95(4), 14601465. https://doi.org/10.1073/pnas.95.4.1460CrossRefGoogle ScholarPubMed
SantaLucia, J and Hicks, D (2004) The thermodynamics of DNA structural motifs. Annual Review of Biophysics and Biomolecular Structure 33, 415440. https://doi.org/10.1146/annurev.biophys.32.110601.141800CrossRefGoogle ScholarPubMed
Schmidt, MHM and Pearson, CE (2016) Disease-associated repeat instability and mismatch repair. DNA Repair 38, 117126. https://doi.org/10.1016/j.dnarep.2015.11.008CrossRefGoogle ScholarPubMed
Sinden, RR, Potaman, VN, Oussatcheva, EA, Pearson, CE, Lyubchenko, YL and Shlyakhtenko, LS (2002) Triplet repeat DNA structures and human genetic disease: Dynamic mutations from dynamic DNA. Journal of Biosciences 27(1 Suppl 1), 5365. https://doi.org/10.1007/bf02703683CrossRefGoogle ScholarPubMed
Sutherland, GR and Richards, RI (1995) Simple tandem DNA repeats and human genetic disease. Proceedings of the National Academy of Sciences of the United States of America 92(9), 36363641.CrossRefGoogle ScholarPubMed
Vesnaver, G, Chang, CN, Eisenberg, M, Grollman, AP and Breslauer, KJ (1989) Influence of abasic and anucleosidic sites on the stability, conformation, and melting behavior of a DNA duplex: Correlations of thermodynamic and structural data. Proceedings of the National Academy of Sciences of the United States of America 86(10), 36143618. https://doi.org/10.1073/pnas.86.10.3614CrossRefGoogle ScholarPubMed
Völker, J and Breslauer, KJ (2022) Differential repair enzyme-substrate selection within dynamic DNA energy landscapes. Quarterly Reviews of Biophysics 55, e1. https://doi.org/10.1017/S0033583521000093CrossRefGoogle Scholar
Völker, J, Gindikin, V, Klump, HH, Plum, GE and Breslauer, KJ (2012) Energy landscapes of dynamic ensembles of rolling triplet repeat bulge loops: Implications for DNA expansion associated with disease states. Journal of the American Chemical Society 134(13), 60336044. https://doi.org/10.1021/ja3010896CrossRefGoogle ScholarPubMed
Völker, J, Klump, HH and Breslauer, KJ (2007) DNA metastability and biological regulation: Conformational dynamics of metastable omega-DNA bulge loops. Journal of the American Chemical Society 129(16), 52725280. https://doi.org/10.1021/ja070258qCrossRefGoogle ScholarPubMed
Völker, J, Klump, HH and Breslauer, KJ (2008) DNA energy landscapes via calorimetric detection of microstate ensembles of metastable macrostates and triplet repeat diseases. Proceedings of the National Academy of Sciences of the United States of America 105(47), 1832618330.CrossRefGoogle ScholarPubMed
Völker, J, Plum, GE, Gindikin, V and Breslauer, KJ (2019) Dynamic DNA energy landscapes and substrate complexity in triplet repeat expansion and DNA repair. Biomolecules 9(11), E709. https://doi.org/10.3390/biom9110709CrossRefGoogle ScholarPubMed
Völker, J, Plum, GE, Gindikin, V, Klump, HH and Breslauer, KJ (2014) Impact of bulge loop size on DNA triplet repeat domains: Implications for DNA repair and expansion. Biopolymers 101(1), 112. https://doi.org/10.1002/bip.22236CrossRefGoogle ScholarPubMed
Völker, J, Plum, GE, Klump, HH and Breslauer, KJ (2009) DNA repair and DNA triplet repeat expansion: The impact of abasic lesions on triplet repeat DNA energetics. Journal of the American Chemical Society 131(26), 93549360. https://doi.org/10.1021/ja902161eCrossRefGoogle ScholarPubMed
Wright, GEB, Black, HF, Collins, JA, Gall-Duncan, T, Caron, NS, Pearson, CE and Hayden, MR (2020) Interrupting sequence variants and age of onset in Huntington’s disease: Clinical implications and emerging therapies. The Lancet Neurology 19(11), 930939. https://doi.org/10.1016/S1474-4422(20)30343-4CrossRefGoogle ScholarPubMed
Wu, M, McDowell, JA and Turner, DH (1995) A periodic table of tandem mismatches in RNA. Biochemistry 34(10), 32043211. https://doi.org/10.1021/bi00010a009CrossRefGoogle ScholarPubMed
Figure 0

Figure 1. (a) Cartoon version of a repeat bulge loop within a larger repeat domain. Such a static representation does not highlight the reality that such repeat bulge loops really are dynamic ensembles of loop isomers as indicated in (b) (Völker et al.,2012). Note that in these cartoon representations, we present the repeat bulge loop as a ‘unstructured ring’ to account for the (largely unknown) fluctuating microstructures that likely make up the repeat bulge loop ensemble within a given rollamer isomer (Völker et al.,2008).

Figure 1

Figure 2. Schematic representation of the impact of repeat interruptions on the repeat bulge loop ensemble as shown for the [CAG]8ˑ[CTG]4 system containing a CAA interruption in place of the 6th CAG repeat (Mutated base pair indicated in red letters). The [CAG]8ˑ[CTG]4 complex results in a 4-repeat bulge loop that can be positioned in 5 possible loop positions, identified by roman numerals I-V, in the 5′ to 3′ directions. The colour coding of the CAG repeat segments into yellow (repeats 1–4) and red (repeats 5–8) is meant as a visual aid to help identify which repeat is partitioned into which domain in each of the 5 loop isomers. Loop position determines whether the CAA repeat is part of the duplex or the loop domain with the blue ball highlighting the position of the mutated A in each loop isomer. When the CAA triplet is partitioned into the loop domain, the complementary TTC triplet in the opposing strand forms base pairs with a CAG repeat that is part of the upstream duplex region, resulting in a GˑT mismatch (green ellipsoid). Note that the different loop isomers can be classified into 3 general groups; as defined by the energetic impact of the repeat interruptions on the repeat bulge loop isomer; with the differential energetic impacts dictating the differential loop populations. Group 1: isomers I and II contain a conventionally base paired CAA/GTT triplet in the downstream duplex domain, (potential loop Isomers with the CAA/GTT triplet in the upstream duplex domain are not shown in this example but can be considered essentially equivalent); Group 2: Loop isomers IV and V contain a mismatched triplet CAG/GTT in the upstream duplex domain. Analogous to group 1 above, potential loop Isomers with the mismatch in the downstream duplex domain are not shown, but also can be considered essentially equivalent; and Group 3: Loop isomer III contains the GˑT mismatch at the 5′ junction and we propose that it likely is part of an expanded loop domain. Note that a mismatch at the 3′ junction is possible, depending on the nature of the repeat interruption, as defined by the triplet sequence. An altered base in 1st, 2nd, or 3rd positions has unique impacts at the 3′ and 5′ junctions, but only impacts group 1 and 2 insofar as it alters nearest neighbours in the duplex. Fig. 2 is intended as an illustrative example. Longer repeats, larger slip outs, and repeat interruptions at different positions will produce different loop isomer arrangements, but conceptually they are represented by the three group classifications shown in Fig. 2.

Review: How sequence alterations enhance the stability and delay expansion of DNA triplet repeat domains. — R0/PR1

Conflict of interest statement

I have no conflict of interest with respect to reviewing this ms.

Comments

Comments to Author: This is an interesting manuscript on an important subject, and after minor revision is certainly appropriate for publication in QRB Discovery. I would point out, however, that I found the ms as written to be very hard to read, even though I am somewhat familiar with the field of triplet expansion mechanisms, and figuring out what specifically the authors were proposing requireda careful pre-reading on my part of the excellent Khristich & Mirkin review (which is heavily cited within the present ms) before I could get any clear understanding of the specific new ideas the authors are putting forward, and how they go beyond those described in the K&M review. This may be alright, since QRB Discovery is intended as a place to publish new and developing mechanistic ideas on significant biophysical problems, and thus reviewing this complex field in detail is clearly beyond the scope of a typical Discovery paper. On the other hand, the authors could make it easier for the general reader by some reorganization of their presentation and perhaps the inclusion of a more accessible molecular mechanisms ‘cartoon’ to introduce their “rollamer” schematic (Scheme 1). Some specific suggestions follow.

1. The Abstract and the two paragraphs that follow introduce the ideas to be considered in general terms, and the section headings are helpful, but then the next paragraph on Huntington Disease seems to be dropped into the ms without any clear rationalization for what points it is supposed to make and without background. I would suggest that this paragraph be moved further back in the ms after some of the ideas it presents have been discussed in more general terms, or perhaps placed near the end to show how the authors' ideas can be applied to specific triplet expansion disease problems.

2. The injection of Scheme 1 into the ms is similarly abrupt, and is probably incomprehensible to the general reader without some preliminary introduction, possibly by drawing some sort of more familiar stick-figure cartoons used to describe triplet-expansion ideas comparable to those used by M&K in their review, to make it clearer how the new mechanistic ideas developed by the authors using their ‘rollamer’ concept fit into prior ideas of triplet expansion mechanisms.

3. It would similarly be helpful if the authors could summarize more specifically in their “Concluding Remarks” what general new concepts or ideas they have introduced into the triplet expansion field and how these ideas go beyond what is presented in the M&K review, and what particular ‘open questions’ defined by M&K their approaches help to solve. Perhaps the Huntington Disease section might fit in here to better illustrate the possibilities for using specific rollamer models approaches to provide ideas for disease therapies.

After the authors have considered the above issues and revised the ms along lines related to the suggestions made above I think this work should be quite suitable for publication in QRB Discovery, and will comprise a significant contribution.

Review: How sequence alterations enhance the stability and delay expansion of DNA triplet repeat domains. — R0/PR2

Conflict of interest statement

Reviewer declares none.

Comments

Comments to Author: In this work the authors offer an energetics-based explanation to the puzzling observation that an altered DNA sequence with a bulge expansion leads to an increase in stability and delays expansion of the affected repeat domains.I would like to emphasize the ingenuity of the offered mechanistic explanation that states that the an altered DNA sequence results in high-energy mismatches in some of the conformational states (“rollamers”) potentially available for sampling by the bulge-looped DNA molecule.Such mismatched reduce the ensemble conformational space available to the DNA while also acting act as recognition sites for DNA repair enzymes.Importantly, the authors outline experimental avenues for testing their proposed explanation.

I strongly recommend publication of this manuscript in Q. Rev. Biophys.My only comment concerns the references.Throughout the text, the references are cited mostly, but not always, in the author-date format.In the References section, however, they are presented in the numbered format.

Decision: How sequence alterations enhance the stability and delay expansion of DNA triplet repeat domains. — R0/PR3

Comments

Comments to Author: Reviewer #1: In this work the authors offer an energetics-based explanation to the puzzling observation that an altered DNA sequence with a bulge expansion leads to an increase in stability and delays expansion of the affected repeat domains.I would like to emphasize the ingenuity of the offered mechanistic explanation that states that the an altered DNA sequence results in high-energy mismatches in some of the conformational states (“rollamers”) potentially available for sampling by the bulge-looped DNA molecule.Such mismatched reduce the ensemble conformational space available to the DNA while also acting act as recognition sites for DNA repair enzymes.Importantly, the authors outline experimental avenues for testing their proposed explanation.

I strongly recommend publication of this manuscript in Q. Rev. Biophys.My only comment concerns the references.Throughout the text, the references are cited mostly, but not always, in the author-date format.In the References section, however, they are presented in the numbered format.

Reviewer #3: This is an interesting manuscript on an important subject, and after minor revision is certainly appropriate for publication in QRB Discovery. I would point out, however, that I found the ms as written to be very hard to read, even though I am somewhat familiar with the field of triplet expansion mechanisms, and figuring out what specifically the authors were proposing requireda careful pre-reading on my part of the excellent Khristich & Mirkin review (which is heavily cited within the present ms) before I could get any clear understanding of the specific new ideas the authors are putting forward, and how they go beyond those described in the K&M review. This may be alright, since QRB Discovery is intended as a place to publish new and developing mechanistic ideas on significant biophysical problems, and thus reviewing this complex field in detail is clearly beyond the scope of a typical Discovery paper. On the other hand, the authors could make it easier for the general reader by some reorganization of their presentation and perhaps the inclusion of a more accessible molecular mechanisms ‘cartoon’ to introduce their “rollamer” schematic (Scheme 1). Some specific suggestions follow.

1. The Abstract and the two paragraphs that follow introduce the ideas to be considered in general terms, and the section headings are helpful, but then the next paragraph on Huntington Disease seems to be dropped into the ms without any clear rationalization for what points it is supposed to make and without background. I would suggest that this paragraph be moved further back in the ms after some of the ideas it presents have been discussed in more general terms, or perhaps placed near the end to show how the authors' ideas can be applied to specific triplet expansion disease problems.

2. The injection of Scheme 1 into the ms is similarly abrupt, and is probably incomprehensible to the general reader without some preliminary introduction, possibly by drawing some sort of more familiar stick-figure cartoons used to describe triplet-expansion ideas comparable to those used by M&K in their review, to make it clearer how the new mechanistic ideas developed by the authors using their ‘rollamer’ concept fit into prior ideas of triplet expansion mechanisms.

3. It would similarly be helpful if the authors could summarize more specifically in their “Concluding Remarks” what general new concepts or ideas they have introduced into the triplet expansion field and how these ideas go beyond what is presented in the M&K review, and what particular ‘open questions’ defined by M&K their approaches help to solve. Perhaps the Huntington Disease section might fit in here to better illustrate the possibilities for using specific rollamer models approaches to provide ideas for disease therapies.

After the authors have considered the above issues and revised the ms along lines related to the suggestions made above I think this work should be quite suitable for publication in QRB Discovery, and will comprise a significant contribution.

Decision: How sequence alterations enhance the stability and delay expansion of DNA triplet repeat domains. — R1/PR4

Comments

No accompanying comment.