an in silico study of the protonated dna triplex: in vivo stability of c+gc in a dna triple helix

4
An in silico study of the protonated DNA triplex: In vivo stability of C + GC in a DNA triple helix Jiwon Jung, Sang Hak Lee * Department of Chemistry, Seoul National University, Seoul 151-747, Republic of Korea article info Article history: Received 12 May 2010 In final form 17 June 2010 Available online 22 June 2010 abstract Protonated DNA base pairs, particularly the Cytosine–Guanine–Cytosine (CGC) triple pairs, were investi- gated its relative energetics by theoretical calculations. The energetic stabilities of C + GC and CG + C are almost the same in the gas phase, although the C + G pair is more stable than CG + by ca. 1150 cm 1 (3.34 kcal/mol). It is to be noted, however, that only C + GC was found in vivo, which seems to indicate different chemical environments for the triple pairs in vivo. Crown Copyright Ó 2010 Published by Elsevier B.V. All rights reserved. 1. Introduction The main function of DNA is the long-term storage of genetic information; thus its structural stability, including a chemical as- pect, plays a vital role in preserving the genetic information through specific base pairing [1]. However, DNA can be modified chemically or structurally by external stimuli, or it can react to environmental toxins [2,3], which can lead to its damage. Usually, most DNA damage can be fixed by repair enzymes in an healthy organism [4,5]. But if much unrepaired damage accumulates, the result might be mutations, which are a causative factor of an inher- ited disease [6]. Actually, Friedreich’s ataxia is a genetic disease caused by DNA structural mutation, especially of the DNA triple helix [7]. In fact, the triple helix structure is generated by a simple chemical change on DNA, such as protonation and deprotonation. H-DNA and sticky DNA among the triplex helices are from the protonation of DNA bases, especially cytosine [7,8]. Protonation and deprotonation are simple chemical reactions; thus they often occur in a chemical environment. However, though the triple helix is caused by pro- tonation, its molecular properties are rarely considered. A DNA triple helix is made by triple base pairing, which is called a triplex. Triplexes can be classified into two types; a (Y)-pyrimi- dine–(R)-purine–(Y)-pyrimidine pairing and a Y–R–R pairing [8]. In the case of Y–R–R, divalent metal cation plays a vital role in energetically stabilizing the triplex [9]. Otherwise, a key factor to stabilize the Y–R–Y type triplex is not a divalent metal ion but a change in intrinsic molecular properties. The previous studies, it has been discovered that there were two requirements to stabilize the Y–R–Y type triplex; the first is cytosine protonation [10,11] and the other is the Hoogsteen base pairing of the third strand [12]. Actually, in an in vivo study [8] it was found that the Y–R–Y type triplex always had a protonated cytosine, which is a prerequisite to the Hoogsteen base pairing. However, it is not easy to explain why only the protonated cyto- sine, though the guanine in the triplex also is able to be protonated as the previous result [13], is essential to stabilize the Y–R–Y type triplex, because there are many considerable factors, such as the pH condition, electrostatic interaction with the phosphate back- bone, and the solvation effect. Actually, only by using experimental methods it is difficult to select which one is the key factor to sta- bilize the Y–R–Y triplex. However, an in silico study can provide the answer to the question of how to stabilize the triplex. Thus we employed the in silico method to figure out the molecular prop- erties, such as the energetics, structure, and vibration, of a DNA tri- ple helix. 2. Computational methods In this study we carried out theoretical calculation for the pro- tonated DNA triplex at the levels of HF(6-31+G * ), B3LYP(6-31+G(d) and 6-311+G(2d,p)) using the GAUSSIAN program suite [14]. Firstly, we obtained the optimized geometry of the gas phase through the full optimization. And from this geometry solvation energies were obtained by re-optimization using solvation models, such as the Onsager model, Polarizable Continuum model (PCM), Con- ductor-like PCM (CPCM) and Integral-Equation-Formalism PCM (IEFPCM) implemented in the GAUSSIAN program suite. At all levels we specified water (dielectric constant, e = 78.39) as the solvent. Solvation energies for various solvents were calculated at the level of B3LYP/6-311+G(2d,p) using IEFPCM. Also, the vibration frequen- cies in water were obtained by using CPCM with B3LYP/6- 311+G(2d,p). 0009-2614/$ - see front matter Crown Copyright Ó 2010 Published by Elsevier B.V. All rights reserved. doi:10.1016/j.cplett.2010.06.045 * Corresponding author. Fax: +82 2 889 5719. E-mail address: [email protected] (S.H. Lee). Chemical Physics Letters 495 (2010) 117–120 Contents lists available at ScienceDirect Chemical Physics Letters journal homepage: www.elsevier.com/locate/cplett

Upload: jiwon-jung

Post on 14-Jul-2016

214 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: An in silico study of the protonated DNA triplex: In vivo stability of C+GC in a DNA triple helix

Chemical Physics Letters 495 (2010) 117–120

Contents lists available at ScienceDirect

Chemical Physics Letters

journal homepage: www.elsevier .com/ locate /cplet t

An in silico study of the protonated DNA triplex: In vivo stability of C+GCin a DNA triple helix

Jiwon Jung, Sang Hak Lee *

Department of Chemistry, Seoul National University, Seoul 151-747, Republic of Korea

a r t i c l e i n f o

Article history:Received 12 May 2010In final form 17 June 2010Available online 22 June 2010

0009-2614/$ - see front matter Crown Copyright � 2doi:10.1016/j.cplett.2010.06.045

* Corresponding author. Fax: +82 2 889 5719.E-mail address: [email protected] (S.H. Lee).

a b s t r a c t

Protonated DNA base pairs, particularly the Cytosine–Guanine–Cytosine (CGC) triple pairs, were investi-gated its relative energetics by theoretical calculations. The energetic stabilities of C+GC and CG+C arealmost the same in the gas phase, although the C+G pair is more stable than CG+ by ca. 1150 cm�1

(3.34 kcal/mol). It is to be noted, however, that only C+GC was found in vivo, which seems to indicatedifferent chemical environments for the triple pairs in vivo.

Crown Copyright � 2010 Published by Elsevier B.V. All rights reserved.

1. Introduction

The main function of DNA is the long-term storage of geneticinformation; thus its structural stability, including a chemical as-pect, plays a vital role in preserving the genetic informationthrough specific base pairing [1]. However, DNA can be modifiedchemically or structurally by external stimuli, or it can react toenvironmental toxins [2,3], which can lead to its damage. Usually,most DNA damage can be fixed by repair enzymes in an healthyorganism [4,5]. But if much unrepaired damage accumulates, theresult might be mutations, which are a causative factor of an inher-ited disease [6].

Actually, Friedreich’s ataxia is a genetic disease caused by DNAstructural mutation, especially of the DNA triple helix [7]. In fact,the triple helix structure is generated by a simple chemical changeon DNA, such as protonation and deprotonation. H-DNA and stickyDNA among the triplex helices are from the protonation of DNAbases, especially cytosine [7,8]. Protonation and deprotonationare simple chemical reactions; thus they often occur in a chemicalenvironment. However, though the triple helix is caused by pro-tonation, its molecular properties are rarely considered.

A DNA triple helix is made by triple base pairing, which is calleda triplex. Triplexes can be classified into two types; a (Y)-pyrimi-dine–(R)-purine–(Y)-pyrimidine pairing and a Y–R–R pairing [8].In the case of Y–R–R, divalent metal cation plays a vital role inenergetically stabilizing the triplex [9]. Otherwise, a key factor tostabilize the Y–R–Y type triplex is not a divalent metal ion but achange in intrinsic molecular properties. The previous studies, ithas been discovered that there were two requirements to stabilizethe Y–R–Y type triplex; the first is cytosine protonation [10,11] and

010 Published by Elsevier B.V. All r

the other is the Hoogsteen base pairing of the third strand [12].Actually, in an in vivo study [8] it was found that the Y–R–Y typetriplex always had a protonated cytosine, which is a prerequisiteto the Hoogsteen base pairing.

However, it is not easy to explain why only the protonated cyto-sine, though the guanine in the triplex also is able to be protonatedas the previous result [13], is essential to stabilize the Y–R–Y typetriplex, because there are many considerable factors, such as thepH condition, electrostatic interaction with the phosphate back-bone, and the solvation effect. Actually, only by using experimentalmethods it is difficult to select which one is the key factor to sta-bilize the Y–R–Y triplex. However, an in silico study can providethe answer to the question of how to stabilize the triplex. Thuswe employed the in silico method to figure out the molecular prop-erties, such as the energetics, structure, and vibration, of a DNA tri-ple helix.

2. Computational methods

In this study we carried out theoretical calculation for the pro-tonated DNA triplex at the levels of HF(6-31+G*), B3LYP(6-31+G(d)and 6-311+G(2d,p)) using the GAUSSIAN program suite [14]. Firstly,we obtained the optimized geometry of the gas phase throughthe full optimization. And from this geometry solvation energieswere obtained by re-optimization using solvation models, suchas the Onsager model, Polarizable Continuum model (PCM), Con-ductor-like PCM (CPCM) and Integral-Equation-Formalism PCM(IEFPCM) implemented in the GAUSSIAN program suite. At all levelswe specified water (dielectric constant, e = 78.39) as the solvent.Solvation energies for various solvents were calculated at the levelof B3LYP/6-311+G(2d,p) using IEFPCM. Also, the vibration frequen-cies in water were obtained by using CPCM with B3LYP/6-311+G(2d,p).

ights reserved.

Page 2: An in silico study of the protonated DNA triplex: In vivo stability of C+GC in a DNA triple helix

PCM

IEFP

CM

Gas

phas

e

+G

CC

G+C

C+G

CC

G+C

C+G

CC

G+C

1325

.323

�13

25.3

17�

1325

.322

�13

25.3

17�

1325

.201

�13

25.2

0212

00.0

1611

92.7

67�

5.56

617

.39

7.46

17.4

57.

4112

.60

5.92

1333

.073

�13

33.0

69�

1333

.073

�13

33.0

68�

1333

.005

�13

33.0

0596

0.72

895

6.52

1�

0.09

715

.80

7.44

15.7

87.

4112

.60

5.92

1333

.445

�13

33.4

41�

1333

.445

�13

33.4

41�

1333

.334

�13

33.3

3486

1.99

585

5.22

20.

506

15.2

97.

3015

.27

7.27

10.2

75.

38

118 J. Jung, S.H. Lee / Chemical Physics Letters 495 (2010) 117–120

3. Results and discussion

Among the several effects that may perturb the triplex, such asstructure and stability, we paid attention to the solvation effect asmany studies have carefully considered the solvation effect to fig-ure out the energetics and the structure for DNA [15], amino acid[16] and protein folding [17]. In fact, all biomolecules were instate of solution, and their function was determined by theirstructure in the solution.

In this Letter, we explain why only cytosine is protonated inthe Y–R–Y triplex of in vivo through an in silico study using thesolvation model. Firstly, according to the previous computationalstudy [13], the energy difference between two protonated dimers(C+G and CG+) was 1154 cm�1 (3.35 kcal/mol), a value obtained atthe level of B3LYP/6-311+G(2d,p), in the gas phase. Otherwise, atthe same level, C+GC(CH+� � �GC) and CG+C(C� � �H+GC) had little en-ergy difference in the gas phase. The energy difference was lessthan 0.01 kcal/mol after basis set superposition error (BSSE) cor-rection. However, there was no drastic structural difference be-tween the two isomers, as shown in Fig. 1a. This point is inaccord with the vibration analysis, in which the frequency ofthe proton stretching vibration was quite similar to each other,2374 cm�1 (C+GC) and 2392 cm�1 (CG+C), respectively.

Interestingly, though there is no energy difference betweenC+GC and CG+C in the gas phase, only the C+GC form is foundin vivo. As previously mentioned, we believe that this result isdue to the solvation energy difference. Thus we employed severalsolvation models such as the Onsager model, PCM, CPCM andIEFPCM, which are implemented in the GAUSSIAN 03 program suite.

C+G

CG+

CG+C

C+GC

C+GC

CG+C

gas phase solution

E

C

G

C

H+

CH+...GC (C

(a)

(b)

+GC)

C HG+C (CG+C)

H+

C

G

C

1154 cm-1862 cm-1

Fig. 1. (a) Optimized geometries of CH+� � �GC(C+GC) and CGH+� � �C(CG+C). (b)Schematic energy diagram for comparing stability in the gas phase and in thesolution. Ta

ble

1Ca

lcul

ated

ener

gies

for

C+G

Can

dCG

+ Ctr

iple

x.

Met

hod

On

sage

rPC

MC

C+G

CC

G+C

C+G

CC

G+C

C

HF/

6-31

+G*

E(h

)�

1325

.205

�13

25.2

03�

1325

.298

�13

25.2

91�

DE

(cm�

1)a

523.

739

1482

.452

Dip

ole

mom

ent

b14

.18

6.62

17.4

58.

13

B3L

YP/

6-31

+G*

E(h

)�

1332

.962

�13

32.9

60�

1333

.073

�13

33.0

68�

DE

(cm�

1)a

525.

201

954.

824

Dip

ole

mom

ent

12.3

06.

1515

.78

7.42

B3L

YP/

6-31

1+G

(2d,

p)E

(h)

�13

33.3

36�

1333

.334

�13

33.4

45�

1333

.441

�D

E(c

m�

1)a

448.

159

855.

224

Dip

ole

mom

ent

11.7

66.

0915

.27

7.27

aD

E(c

m�

1)

=E C

C�

E Cþ

GC

.b

Val

ues

ofdi

pole

mom

ent

are

un

der

un

itof

deby

e.

Page 3: An in silico study of the protonated DNA triplex: In vivo stability of C+GC in a DNA triple helix

0 10 20 30 40 50 60 70 80100

200

300

400

500

600

700

800

900

En

erg

y d

iffe

ren

ce (

cm-1

)

Dielectric constant

Ar

Cyc

lohe

xane

Isoq

uino

line Et

hano

l

Met

hano

l

Wat

er

Fig. 2. Plot of energy differences between C+GC and CG+C obtained at varioussolvents. From the ethanol (e = 24.55) energy difference was saturated.

J. Jung, S.H. Lee / Chemical Physics Letters 495 (2010) 117–120 119

We carried out the theoretical calculation at three different lev-els (HF/6-31+G*, B3LYP/6-31+G* and B3LYP/6-311+G(2d,p)) usingfour kinds of solvation models. In this calculation, we obtainedmeaningful results for the energy difference between C+GC andCG+C. As mentioned above, in the gas phase, there is no energy dif-ference. For the Onsager model, energy differences were observedat approximately 523 cm�1 (1.52 kcal/mol), 525 cm�1 (1.52 kcal/mol) and 450 cm�1 (1.31 kcal/mol) in a water solvent. These valuesare meaningful if we consider the room temperature energy,227 cm�1 (0.65 kcal/mol). Additionally, as summarized in Table 1,the values of the energy difference obtained using other solvationmodels are at least twofold larger than those obtained using the

Rel

ativ

e In

tens

ity (a

. u.)

CG+C

C+GC

Rel

ativ

e In

tens

ity (a

. u.)

0 500 1000 1500 2000 2500 3000 3500 4000

0 500 1000 1500 2000 2500 3000 3500 4000

Wavenumber (cm-1)

Gas Phase

νproton

νproton

Fig. 3. Predicted IR spectra in the gas phase and in the condensed phase. These spectra wthe proton stretching mode in the triplex.

Onsager model. In other models, energy differences were predictedfrom 1482 cm�1 (4.30 kcal/mol) to 855 cm�1 (2.48 kcal/mol). Thesevalues are more reliable than those of Onsager, because the Onsag-er model treats the solute molecule as only one sphere. Thus theobtained energy difference from the Onsager model might under-estimate the solvation effect. Nevertheless, the entire energetictendency, in which C+GC is more stable than CG+C in a solution,was coincident at all solvation models and calculation levels asshown in Fig. 1b.

Furthermore, we carefully investigated the dipole moment oftwo isomers. The dipole moment can be a direct clue to explainthe energetic difference in a solution. In fact a dipole moment doesnot perturb the intrinsic energetics of a molecule, but in a solutionthe dipole moment is a crucial factor to stabilize the molecule be-cause solvation energy is a result of the interaction between themolecular charge distribution and the reaction field of the solvent.Thus the isomer of a larger dipole moment is in a more stable formin a solution, especially in a polar solvent. In this calculation, thedipole moment of C+GC was predicted to be twice as large as thatof CG+C. A difference of the dipole moment also strongly supportsthe idea that the C+GC form was more stable in vivo.

We carried out the calculation for obtaining the energy differ-ence in the various solvent to more clearly investigate thesolvation effect for C+GC and CG+C using IEFPCM. From this calcu-lation, we found that the energy difference was increased as thefunction of the dielectric constant of the solvent increased asshown in Fig. 2. And from e = 24.55 (ethanol), the energy differenceis saturated, which is the typical phenomenon for solvation. Thuswe believe that the solvation effect plays a key role in observingonly the C+GC form in Y–R–Y triplex in vivo unlike in the gas phase.

Fig. 3 shows the simulated IR spectra for two species in the gasphase as well as in water solution. In the gas phase, the energeti-cally similar, vibrational frequencies of the proton stretching mode

0 500 1000 1500 2000 2500 3000 3500 4000

Wavenumber (cm-1)

0 500 1000 1500 2000 2500 3000 3500 4000

Rel

ativ

e In

tens

ity (a

. u.)

CG+C

C+GC

Rel

ativ

e In

tens

ity (a

. u.)

Condensed Phase

νproton

νproton

ere convoluted by the Lorentzian function with 10 cm�1 bandwidth. mproton indicates

Page 4: An in silico study of the protonated DNA triplex: In vivo stability of C+GC in a DNA triple helix

120 J. Jung, S.H. Lee / Chemical Physics Letters 495 (2010) 117–120

of C+GC and CG+C are remarkably similar 2370 cm�1 and2390 cm�1, respectively. However, the proton stretching frequen-cies are blue shift, 2774 cm�1 (C+GC) and 2590 cm�1 (CG+C) in awater solution, which means that both triplex forms exist in a sta-ble state in a solution. In these spectra, we found that the fre-quency shift of the proton stretching mode of C+GC, 409 cm�1,was twice as large as that of CG+C, 200 cm�1. It also directly sup-ports the idea that the C+GC form is energetically more stable insolution.

4. Conclusions

In summary, we carried out theoretical calculation using solva-tion model to elucidate that the solvation effect plays a major rolein stabilizing the protonated DNA triplex. In this calculation, wefound that, in the protonated CGC triplex, C+GC form is energeti-cally more stable than CG+C in the solution. This observationstrongly supports why only the C+GC form was found in vivo. Thus,we concluded that the solvation factor should be mainly consid-ered to figure out the molecular property of the DNA triplex ofin vivo. Additionally, if the solvation effect is studied, includingthe electrostatic interaction between the triplex and the phos-phate-sugar backbone, more light would be shed on the physicaland chemical properties of DNA triple helix in vivo.

Acknowledgements

S.H. Lee thanks Prof. Seong Keun Kim for stimulating discus-sions. This work was supported by Grant No. KSC-2008-S04-0002from Korea Institute of Science and Technology Information.

References

[1] P. Yakovchuk, E. Protozanova, M.D. Frank-Kamenetskii, Nucleic Acids Res. 34(2006) 564.

[2] L.R. Ferguson, W.A. Denny, Mutat. Res. 258 (1991) 123.[3] A.M. Jeffrey, Pharmacol. Ther. 28 (1985) 237.[4] J.H.J. Hoeijmakers, Nature 411 (2001) 366.[5] A. Sancar, L.A. Lindsey-Boltz, K. Unsal-Kacmaz, S. Linn, Annu. Rev. Biochem. 73

(2004) 39.[6] E.C. Friedberg, G.C. Walker, W. Siede, R.D. Wood, R.A. Schultz, T. Ellenberger,

DNA Repair and Mutagenesis, second edn., ASM Press, Washington, 2005.[7] A.M. Gacy et al., Mol. Cell 1 (1998) 583.[8] M.D. Frankkamenetskii, S.M. Mirkin, Annu. Rev. Biochem. 64 (1995) 65.[9] V.A. Malkov, O.N. Voloshin, V.N. Soyfer, M.D. Frankkamenetskii, Nucleic Acids

Res. 21 (1993) 585.[10] P. Rajagopal, J. Feigon, Nature 339 (1989) 637.[11] M.M.W. Mooren, D.E. Pulleyblank, S.S. Wijmenga, M.J.J. Blommers, C.W.

Hilbers, Nucleic Acids Res. 18 (1990) 6523.[12] I. Radhakrishnan, X. Gao, C.D.L. Santos, D. Live, D.J. Patel, Biochemistry 30

(1991) 9022–9030.[13] S.Y. Han, S.H. Lee, J. Chung, H. Bin Oh, J. Chem. Phys. 127 (2007) 245102.[14] M.J. Frisch et al., GAUSSIAN, Inc., Pittsburgh, PA, 2002.[15] D.F. Liu, T. Wyttenbach, M.T. Bowers, J. Am. Chem. Soc. 128 (2006) 15155.[16] K.T. Lee, J. Sung, K.J. Lee, S.K. Kim, Y.D. Park, J. Chem. Phys. 116 (2002) 8251.[17] M.S. Cheung, A.E. Garciat, Proc. Natl. Acad. Sci. USA 99 (2007) 685.