please note - cardiff universityorca.cf.ac.uk/69382/1/haptic driven.pdf · haptic-driven,...

35
This is an Open Access document downloaded from ORCA, Cardiff University's institutional repository: http://orca.cf.ac.uk/69382/ This is the author’s version of a work that was submitted to / accepted for publication. Citation for final published version: Anthopoulos, Athanasios, Pasqualetto, Gaia, Grimstead, Ian John and Brancale, Andrea 2014. Haptic-driven, interactive drug design: implementing a GPU-based approach to evaluate the induced fit effect. Faraday Discussions 169 , pp. 323-342. 10.1039/C3FD00139C file Publishers page: http://dx.doi.org/10.1039/C3FD00139C <http://dx.doi.org/10.1039/C3FD00139C> Please note: Changes made as a result of publishing processes such as copy-editing, formatting and page numbers may not be reflected in this version. For the definitive version of this publication, please refer to the published source. You are advised to consult the publisher’s version if you wish to cite this paper. This version is being made available in accordance with publisher policies. See http://orca.cf.ac.uk/policies.html for usage policies. Copyright and moral rights for publications made available in ORCA are retained by the copyright holders.

Upload: phungphuc

Post on 14-Oct-2018

216 views

Category:

Documents


0 download

TRANSCRIPT

This is an Open Access document downloaded from ORCA, Cardiff University's institutional

repository: http://orca.cf.ac.uk/69382/

This is the author’s version of a work that was submitted to / accepted for publication.

Citation for final published version:

Anthopoulos, Athanasios, Pasqualetto, Gaia, Grimstead, Ian John and Brancale, Andrea 2014.

Haptic-driven, interactive drug design: implementing a GPU-based approach to evaluate the

induced fit effect. Faraday Discussions 169 , pp. 323-342. 10.1039/C3FD00139C file

Publishers page: http://dx.doi.org/10.1039/C3FD00139C <http://dx.doi.org/10.1039/C3FD00139C>

Please note:

Changes made as a result of publishing processes such as copy-editing, formatting and page

numbers may not be reflected in this version. For the definitive version of this publication, please

refer to the published source. You are advised to consult the publisher’s version if you wish to cite

this paper.

This version is being made available in accordance with publisher policies. See

http://orca.cf.ac.uk/policies.html for usage policies. Copyright and moral rights for publications

made available in ORCA are retained by the copyright holders.

Hapti -d ive , i te a tive d ug desig : i ple e ti g a GPU-

ased app oa h to evaluate the i du ed fit effe t.

Atha asios A thopoulosa, , Gaia Pas ualettoa, Ia G i stead a d A d ea B a alea*

a“ hool of Pha a a d Pha a euti al “ ie es, Ca diff U i e sit , Ca diff, CF NB, UK

“ hool of Co pute “ ie e, Ca diff U i e sit , Ca diff, UK, CF AA

* a alea@ a diff.a .uk

A st a t: I this pape , e des i e a h id eta-heu isti s ethod of e e g i i izatio a d

o fo atio al sa pli g a d its appli atio i to ou hapti -d i e ole ula odelli g si ulato .

The p oposed ethod has ee desig ed to suit eal-ti e ole ula do ki g si ulatio s, he e

the ti e-lapse et ee t o su essi e liga d poses is elati el s all. I these situatio s the

e e g i i izatio p o le e o es i easi gl o ple a d haoti . The algo ith is tu ed to

take ad a tage of e e t ad a es i GPU o puti g ith as h o ous ke el e e utio , hi h

has allo ed us to i lude full p otei fle i ilit i the eal-ti e i te a ti e, hapti -d i e

si ulatio s. Fi all , i this pape e ill also dis uss the i ple e tatio of su h high-

pe fo a e o puti g app oa h i ou soft a e, dis ussi g the esults of ou i itial alidatio

studies, highlighti g the ad a tages a d li itatio s of su h i te a ti e ethodolog .

. I t odu tio

The assess e t of p otei fle i ilit upo liga d i di g e ai s o e of the ost halle gi g

aspe ts of o pute -aided d ug desig . I deed, i the last de ade, se e al ethodologies that

atte pt to i i the i du ed-fit effe t ha e ee de eloped a d i ple e ted i se e al soft a e

pa kages, f o the si ple e plo atio of ota e s of p edefi ed a i o a id side hai s to the

p edi tio of a k o e fle i ilit usi g sho t ole ula d a i s u s. I all ases, ole ula

o fo atio s a e updated usi g the app op iate algo ith , depe di g o the deg ee of fle i ilit

that the appli atio offe s. The o je ti e of fi di g sta le o p efe ed o fo atio s fo the

s ste e ui es lo ati g those o fo atio s that lie o i i u poi ts o the e e g su fa e.

Ho e e , this is a o ple ulti-di e sio al p o le . Fo a s ste ith N ato s, its pote tial

e e g is a fu tio of N Ca tesia o-o di ates. The efo e, the ole ula e e g su fa e is athe

haoti a d sea hi g fo the glo al i i u a ot al a s e feasi le. Faili g that, e plo i g the

pote tial e e g su fa e is the a to fi d a set of opti al solutio s lo al i i u s he e the

est solutio a e hose . The sea h fo opti al solutio s a e fa ilitated ith the use of

eta-heu isti s.

Meta-heu isti ethods o e a ide spe t u of algo ith i te h i ues. E olutio a algo ith s

EA a d ge eti algo ith s GA a e ethods a le to ite ati el ge e ate a set of solutio s.

E olutio a algo ith s a e a o gst the ost popula eta-heu isti s a d atte pt to sol e the

opti izatio p o le ge e ati g a populatio of possi le solutio s f o a pa e t utatio .

F o the ge e ated populatio the fittest is hose to ge e ate the e pool of solutio s a d so

fo th. Ge eti algo ith s adopt a si ila app oa h, ith the additio of the o ept of osso e

et ee t o solutio s to eate the e ge e atio . These solutio ge e atio s ill e g aduall

i p o i g ith espe t to the st ateg adopted to o e a oss the e e g la ds ape.

The e plo atio of pote tial e e g su fa es e ui es the adoptio of uphill o es i o de to

e o e f o lo al i i a. “u h st ategies fo uphill o e sele tio i lude si ulated a eali g

“A , Mo te-Ca lo MC , ite ated lo al sea h IL“ , guided lo al sea h GL“ a d a o e.

“i ulated a eali g uses a te pe atu e a ia le to ge e ate a p o a ilit fu tio fo a epti g

solutio s that go uphill o the e e g su fa e. Mo te Ca lo ethods adopt a si ila app oa h to

es ape lo al i i a a epti g uphill o es ased o a p o a ilit fu tio su h as the

Met opolis ite io . IL“ ea hes opti al solutio s ite ati el pe fo i g lo al sea h to ea h

a lo al i i u ; the a it es apes a lo al i i u is pe tu i g the dis o e ed solutio , i

hope that the pe tu atio e ha is ill i g the s ste i to a ell ith a lo e i i u

tha the pa e t o e. GL“ es apes f o lo al i i a g aduall i easi g the glo al i i u

e e g alue. This a , the s ste ill es ape f o a ell if it is t apped, taki g a uphill o e

lose to the i i u , hi h ill esult i to a e la ds ape ith the hope that a ette i i u

solutio ill e dis o e ed.

Va ious eta-heu isti s ha e ee used a e e t ole ula do ki g appli atio s. Autodo k-

Vi a used a h id IL“ st ateg usi g the Met opolis ite io to a ept opti al solutio s a d

p edi t i di g affi it of p otei -liga d s ste s. Va iatio s of the ge eti algo ith ha e ee

i ple e ted i a ious pu lished o ks to p edi t ole ula st u tu es. , “oa es used a

e olutio a st ateg to sol e this p o le a d Cai a d “hao used a h id ethod of EA

o i ed ith “A fo st u tu e p edi tio . Pa adisEO app oa hed the do ki g p o le ith a

pa allel GA i o de to le e age additio al p o essi g po e a d i p o e its pe fo a e a d

effi ie .

I ou g oup e a e de elopi g a i te a ti e ole ula do ki g p og a , hi h allo s the use to

pla e, i eal ti e, a s all ole ule i a iologi al ta get, hile feeli g the ole ula i te a tio s

it esta lishes th ough a hapti de i e. , I ou o igi al s ste the ta get st u tu e as kept igid,

hile the liga d as kept fle i le a d the s ste u de e t o ti uous e e g i i isatio , ia a

steepest des e t algo ith . Re e tl e ha e e plo ed the possi ilit of i ludi g a deg ee of

p otei fle i ilit i the i te a ti e si ulatio , taki g ad a tage of the o putatio al po e of

GPGPUs Ge e al-Pu pose G aphi s P o essi g U it . The ai halle ge i ou app oa h is that

if e a t to a hie e a le el of i te a ti it that p o ides the use ith a atu al e pe ie e, ou

algo ith should e a le to o tai a sta le, lo e e g , p otei /liga d o fo atio ithi a

est i ted ti ef a e. I ou soft a e, the liga d e ei es a sig al to o e at fi ed ti e i te als t

= s, hi h at hes the e de i g ate o s ee . Whe the sig al is e ei ed the liga d ato s

a e t a slated i to thei e positio . O the o pletio of liga d t a slatio , the ai is to

p o ide the s ste liga d a d p otei ith the est ato i st u tu e he the ti e-lapse t

e pi es a d efo e the e hapti poi te sig als to t a slate the liga d to its e t positio

i i izatio le .

I this pape , e p ese t a h id eta-heu isti ethod of e plo i g ole ula e e g su fa es

ith a fo us of sol i g this p o le as a set of i depe de t p otei liga d poses ge e ated i a

est i ted ti ef a e.

. Results a d Dis ussio

2.1 CUDA Streams

The recent generation of Kepler cards (GK110) has introduced new features in GPU computing,

including the ability to run up to 32 kernels asynchronously achieving execution overlaps very

close to that of a parallel execution. This introduces an additional dimension of parallelism in

CUDA and initiates the need of new optimization techniques in CUDA-parallel programming. To be

more specific, considering this new feature, an algorithm designer should not treat each kernel

individually any more, but as part of a wider context able to utilize the graphics card more

efficiently. However, there are some rules and constraints on the way that GPUs parallelize kernel

execution: at the present time, NVIDIA GPUs are able to run 16 scheduled blocks per streaming

multiprocessor (SM). In the case of a K20 card, that gives a total of 208 blocks able to run

concurrently at any time in its 13 SMs. From a resources perspective, the shared memory is

limited at 48kb, so asynchronously scheduled kernels in each stream must not exceed the this

limit. If it does, then the parallel execution is restricted to the number of streams whose total

shared memory usage is below the 48kb limit. Regarding register usage, similar restrictions apply

and if the total number of registers used per thread exceeds the a d’s limit (255 for GK110s) then

some register overspill might take place and the slower performing local memory might be used.

Finally, overlapping streams can only write on independent memory addresses; this is not a major

problem as arrays can be copied in multiple buffers for each stream to operate on independently.

. Pote tial E e g a d Fo e Cal ulatio s

The e e g a d fo e g adie ts o putatio s to fa ilitate the i i a sea h a d

o fo atio updates fo ou p oposed ethod e e pe fo ed usi g the MMFF s fo e-field

a d i ple e ted i CUDA.

E = ∑ E o ded + ∑ E o − o ded (1)

Fi⃗⃗ = -∇Ei (2)

Rega di g the o - o ded i te a tio s, ato s e e geo et i all apped i ed i to a D

load- ala ed i egula g id. , Cal ulatio s the took pla e ith ea h CUDA lo k espo si le fo

p o essi g the ato i i te a tio s et ee ea h ho e ell a d its eigh ou s.

O e the fo e e to is a aila le fo ea h ato i, a s ala alue step- alue eeds to e fou d i

o de to update the s ste i to the e o fo atio . The o fo atio al update is pe fo ed

fo ea h ato i the s ste i e to spa e as follo s:

C′⃗⃗⃗ i = Ci⃗⃗ + λF i (3)

Where C′i⃗⃗⃗⃗ is the update position of an atom i and Ci⃗⃗ is its position before the update.

. The G eed Algo ith I ple e tatio

Gi e that ea h i i izatio le ust e pe fo ed ithi a s ti ef a e, the i itial pla

as to fi d a lo al i i u a d te i ate sea hi g i the hope that the i i u ould

o espo d to a sta le ole ula o fo atio . Fo this easo e used a si ple steepest des e t

“D step i i izatio app oa h. Its ha a te isti is that it takes o l do hill o es the g eed

app oa h , hi h esults i o e gi g i to the fi st lo al i i u it fi ds. The o fo atio

sa pli g that e o tai ed f o this ethod as slo a d so eti es e o eous; it o l helped us

u de sta d so e additio al aspe ts of the p o le e e e t i g to sol e.

A e a ple o fo atio p odu ed the algo ith is p ese ted i Figu e a d it is o ious that

the ato geo et is o g. The ato s i the histidi e a o ati i g should e o-pla a , as the

u ea g oup o the liga d. The test s ste as elati el s all < , ato s a d the e e e o

pe fo a e est i tio s. The “D algo ith as o e gi g efo e t e pi ed a d the s ste ould

ui kl opti ize its o figu atio , ithi a fe i i izatio les, to ea h ofte a e o eous

geo et as de o st ated i Figu e . Fu the i p o e e ts o the s ste ’s o figu atio ould

e eithe eall slo o ot e ide t a d the o fo atio i the po ket a ea ould al a s e o -

ideal. As a fi st thought o e ould suspe t that this ould e a a u a p o le o the fo e-

field al ulatio s. Ho e e , p o i g the liga d i to a diffe e t a ea, i du i g fu the

o fo atio al updates, ould ui kl esto e the p e iousl disto ted a ea a d ause the sa e

p o le s i the a ea of latest i te a tio .

F o the a o e it is o ious that fo the t pe of p otei liga d si ulatio s pe fo ed ou

appli atio , fi di g the fi st pote tial e e g i i u o the ole ula e e g la ds ape, ould

ot gua a tee a easo a le o fo atio a ou d the a ea of i te a tio et ee the t o

ole ules. That ight e e ause the est of the p otei ould e ell i i ized a d o s ala

alue fo E uatio ould e ist, a le to o e all ato s i e to spa e su h that the s ste

ould ea h a lo e e e g alue. What is also o th oti g is that keepi g the liga d stalled i its

positio i o de to allo the s ste to slo l e o e ould ot al a s o k eithe . That ga e the

i p essio that e e e i i izatio le ould pi k up f o al ost he e it left off a d a

e i i u ould ot e fou d. As a esult fu the o fo atio al updates ould eithe ot e

pe fo ed o the ould e too fe ith a e s all la da alue i.e. λ = − hi h ould

ha e a uite i sig ifi a t i pa t o the s ste ’s o figu atio .

I ou atte pts to desig a effi ie t e e g su fa e algo ith e also o side ed Mo te Ca lo

a d si ulated a eali g ethods. Ho e e , gi e the st i t ti e o st ai ts a d the fi ite

a ou t of steps as a esult, pe tu atio s el i g o a p o a ilit ould e pose ou solutio to

the isk of asti g o es i side a si gle alle . This ea s that du i g a s le these

te h i ues ould u de -sa ple the pote tial e e g su fa e, o pa ed to faste sa pli g

te h i ues like ite ati e lo al sea h. Both ethods ould e good a didates to the p o le of

fi di g the est o fo atio fo a p otei liga d pose o a o -i te a ti e appli atio ithout

st i t ti e o st ai ts, ut p o a l this is ot the ase i ou s e a io.

The fi st atte pt to sol e this p o le as to pe fo a shuffle te h i ue. This ea s that at

the egi i g of ea h i i izatio le a d efo e sea hi g fo a lo al i i u , all ato s i

the s ste ould o e alo g thei fo e e to t aje to ies usi g a e tai alue. E pe i e ti g

a d fi e-tu i g ith diffe e t λ alues sho ed that a good alue ould e fou d i the − ag itude. I o de to a oid ig ato o es, a o st ai t as pla ed fo the a i u alue

of displa e e t. This te h i ue i p o ed the situatio , ut aised o e s fo the o e all ualit

of the s ste ’s o fo atio as a esult of the f e ue t shuffli g.

I a se o d atte pt, e de ided to i t odu e a uphill o e app oa h, a epti g a a do

s all uphill step as a t a sie t state to e o e the s ste f o a lo al i i u . I this a e

ould e plo e the pote tial e e g la ds apes fo othe alle s i the hope that a lo e e e g

alue a e fou d. This te h i ue i p o ed the lo al esto atio defe t e ide t i p e ious

e pe i e ts. The pote tial e e g su fa e PE“ e plo atio ould ot al a s esult i a lo e

e e g o figu atio , ut the o figu atio s isualized ithi the i te a tio a ea et ee the

t o ole ules i p o ed sig ifi a tl . This is due to the fa t that e plo i g the e e g la ds ape i

this a t igge ed a highe u e of o fo atio updates. This the aused the ato s ithi

the i te a tio a ea to o e alo g thei fo e e to s a u e of ti es a d e e tuall fo a

sta le o fo atio , ith o e t o d le gths a d a gles o oth ole ules.

These e pe i e ts led us to efo ulate ou o je ti es to seek fo a ethod a le to sea h the

pote tial e e g la ds ape fo a good i i u a d also esto e the a ea of i te a tio et ee

the t o ole ules i a si gle i i izatio le. The e algo ith should e a le to e aluate

a didate solutio s i effi ie t a e . It should also e su e that ithi ea h i i izatio le

the e ould e e ough o fo atio al updates to esto e the a ea of i te a tio .

The esulted i ple e tatio uses a e olutio a algo ith to ge e ate a set of a didate

solutio s to the p o le s ste o fo atio s . The a didates a e e aluated al ulati g thei

pote tial e e g as h o ousl usi g CUDA st ea s, hi h allo s fo a deg ee of pe fo a e

gai . The pote tial e e g alue of ea h a didate is used as a i di ato of fit ess. The fit

a didate is the o e hose pote tial e e g alue is lo e to the latest e o d of pote tial e e g

i i u old i i u . If o hild solutio has a alue lo e tha the old i i u , the the

fittest hild is hose ith ite ia set ou uphill o e st ateg . The fittest hild e o es the

pa e t of the e solutio s ge e atio a d the s ste ’s o figu atio is updated adopti g its o-

o di ates. Fi all afte the o fo atio al update, the fo e e to s fo the e l updated s ste

a e al ulated.

. Fast Su fa e E plo atio a d I du ed Lo al Resto atio

I itiall ou algo ith ould fi d its fi st alle a d ai do hill u til it ea hes a i i u . O

ea hi g a i i u , the pa e t solutio ould e u a le to ge e ate a populatio ith a

a didate solutio hose e e g alue is less tha the old i i u . At this poi t a st ateg fo

es api g the lo al i i a is eeded, hi h ould e to i ediatel t a diffe e t alle i hope

of fi di g a ette i i u a d i du e sta le o fo atio al updates at the sa e ti e. The

pe tu atio st ateg of IL“ Ite ated Lo al “ea h is ideal fo ui k ju pi g f o o e alle to

a othe a epti g a uphill o e he the s ste lies at a lo al i i u . This is a hie ed

hoosi g a s ala λ alue that is a le to pe fo a shuffle ope atio i a si ila a as e plai ed

ea lie .

IL“ i itiall ga e p o isi g esults, ho e e , the e e e t o ai p o le s ith it. The fi st is that

it ould o asio all sho a isual pulsati g effe t, he e ato s ould see to pe fo a fast

os illatio . That as e ause the ti e-lapse of the algo ith ould e pi e i ediatel afte a

pe tu atio a d the s ste o figu atio o e iti g the algo ith ould ot e opti al. This

p o le ould e easil sol ed ith a ghost pa ti les uffe holdi g ato o-o di ates afte a

pe tu atio o e. The post-pe tu atio ge e atio s ould i he it a d update o-o di ates o

the ghost uffe up u til a e i i u is ea hed lo e tha the o e asso iated ith the eal

o-o di ates a a . This sol ed the pulsati g isual effe t, ut i itiated the se o d p o le that

e des i ed, he e IL“ ould ofte fi d a good i i u ithi the fi st fe o es, a d the

a kg ou d sea h follo i g the last isited i i u ould ot e a le to fi d a ette solutio .

This ould lead i to the afo e e tio ed p o le s of too fe o fo atio al updates a d he e

i effi ie t i i izatio les.

Guided lo al sea h is ideal fo i du i g updates o the p i iple that the old i i u is ei g

aised a s all pe e tage at e e solutio e aluatio pe alt fu tio . E e tuall , the

algo ith a es ape f o the ell, ut ill spe d so e ti e i it aiti g fo its pe alt fu tio

to ea h the app op iate le els. E pe i e ts sho ed that this ethod u de pe fo ed o pa ed

to IL“ fo ou si ulatio s as sho i the esults se tio .

Ou fi al solutio a e as a a alga atio of the t o app oa hes. We i t odu ed a pe alt alue

p a d a e e g fu tio E, hi h is updated at e e step i: E i + = E i ∙ + p . The

pe alt alue is usuall i the ag itude a ge of . – . , hi h ea s that o e e

ite atio the old i i u a is slightl aised so that it a allo fo a update of the s ste ’s

eal o-o di ates. This odifi atio ai s to keep the fast t a k of alle e plo atio that IL“ a

offe a d i t odu e e ha ed o fo atio al update a ilities to ou app oa h. We ill efe to

this algo ith as IGL“ Ite ated Guided Lo al “ea h .

I esse e Ite ated Guided Lo al “ea h a hie es the follo i g: it a epts updates fo e e g

alues lose to a e e tl isited lo est e e g o figu atio , fo su se ue t solutio populatio s.

Faili g this, he su se ue t solutio ge e atio s do ot p odu e a update, it p e e ts the

algo ith f o asti g too a ge e atio s ithout pe fo i g a update, hi h i tu

e efits the isual effe t of ou appli atio as ell as the i te a ti g ole ules o figu atio ,

espe iall ithi thei i te a tio zo e.

The i ple e tatio of the a o e te h i ue is pe fo ed usi g t o e e g a ia les:

u e t_lo est a d the glo al_lo est. E e ti e a e i i u is ea hed, oth the

u e t_lo est a d the glo al_lo est take its alue. Whe a lo al i i u is ea hed a d the

e ge e atio a ot p o ide a a didate ith a lo e e e g alue, pe tu atio takes pla e fo

a e ell to e e plo ed a d the u e t_lo est takes the o espo di g e e g alue. While i

the e alle , the glo al_lo est alue is aised at e e e e g e aluatio a s all fa to su h

as . % , hile the u e t_lo est keeps ea hi g lo e alues, u til e ea h the e lo al

i i u . No at this poi t the e a e t o possi le out o es: the fi st is that the e lo al

i i u u e t_lo est , is lo e tha the glo al_lo est a d a o fo atio update is ei g

pe fo ed. Othe ise, a e pe tu atio o e is ei g pe fo ed. Ho e e , the p o a ilit of

fi di g a e i i u this a is di e tl p opo tio al to the u e of e e g e aluatio s

pe fo ed. This a , a g eate atio of o t i uti g o es o pa ed to the total u e of o es

is a hie ed, hi h i tu assists i to sol i g the lo al esto atio p o le .

This algo ith allo s fo ui k e plo atio of ole ula e e g su fa es, e ou agi g o e

o fo atio al updates tha a pu el IL“ st ateg ould pe fo . The isual esult is sig ifi a tl

i p o ed, as the e is apid lo al esto atio o the i pa t a ea.

. As h o ous E e utio

The performance of the above algorithm is important for our application as the potential energy

landscape exploration is constrained to run for a small timeframe t. The more moves we are able

to perform in the landscape, the better the quality of the resulting solution will be. In addition, the

more parent solutions the algorithm generates, the higher the probability of discovering lower

energy minimums. From the above, we understand that performance is vital for our simulations.

Initially, our force and energy kernels as well as a binning method to an irregular grid approach

were designed to minimize execution time on GK104 chipsets. After the introduction of GK110,

concurrent streaming capabilities were enhanced, giving the option of running up to 32 kernels

asynchronously, when the criteria listed in the background section were met.

For this reason, we designed an algorithm to take advantage of asynchronous kernel execution

using CUDA-streams (Hyper-Q according to NVIDIA terminology) and improve the performance of

the existing CUDA functions. In order to achieve peak performance, all the aforementioned

kernels (energy, force, binning) were refactored in order to meet the criteria for concurrent

execution and without sacrificing their individual performance. On the current generation of

graphics cards (GK110), we can now evaluate fit of a whole generation of solutions asynchronously

and almost in parallel for smaller protein-ligand systems. This means that the first energy

evaluation kernel hides the latency of the remaining s-1 evaluation kernels scheduled to run

asynchronously in s streams within a single move (a population evaluation series, followed by a

force calculation for the fittest child), where s<=16. However, if the conditions mentioned in the

background section are not met, then the number of asynchronous streams running concurrently

at any time drops as demonstrated in Figure 2.

I additio , e a hide the late of i i g the ato s i to a D i egula g id fo the ell-list

ge e atio i the fo e g adie ts ke el. This ea s that i i g o e heads a e o lo ge a issue

a d i i g a e safel pe fo ed at e e si gle o e, ith a s all ski alue�′ that ate s fo

the iggest displa e e t that e a ti ipate du i g ea h o e utofflist = ut-off + �′

. Pe fo a e of the Algo ith

The pe fo a e of ou CUDA-st ea s i ple e ted h id app oa h has ee e h a ked

agai st a s h o ous e e utio e sio i CUDA of the a o e algo ith . I o de to pe fo a fai

o pa iso , e h a ks e e s heduled su h that oth e sio s ould pe fo the e a t sa e

a ou t of e e g a d fo e g adie t ke el alls. The esults a e listed i Figu e fo —

o u e t st ea s. F o the figu es e a o se e that the e a e pe fo a e gai s flu tuati g

f o . X - X depe di g solel o the s ste size a d u e of st ea s. Rega di g sha ed

e o usage, oth ou o - o ded e e g a d fo e g adie t ke els a e usi g kB he eas the

o ded e e g a d fo es ke els do ot use sha ed e o . That ea s that ou o u e t

e e utio o e e g e aluatio s at a ti e is est i ted to fi e ke els as a si th ke el ould

o e take the kB sha ed e o pe “M li it.

Ou algo ith a hie es fi e o u e t e e g e aluatio s fo s ste sizes up to , ato s.

This is e ause the e e g ke els di ide e e utio i to lo ks of th eads, he e ea h th ead

is asso iated ith o e ato . This e uates i to a a i u of lo ks pe st ea , hi h is good

e ough to satu ate the li it of fi e o u e t st ea s gi e f o sha ed e o est i tio s

< o u e t lo ks li it . Be o d that li it, the u e of lo ks eeded to p o ess the

e e g ke els ises a o e a d that ea s that CUDA a ot ha dle the total u e of

lo ks s heduled fo as h o ous e e utio s, he e o u e effi ie sta ts dete io ati g.

Fi all , fo s ste s a o e , ato s, o u e al ost a ishes o the e e g ke els as the

u e of lo ks eeded to p o ess o e s ste alo e is lose to the a hite tu e’s li it. Ho e e ,

o u e i the fo es- i i g st ea pai still holds as ou i i g ethod has ee desig ed

usi g a se ies of fast e e uti g ke els a le to u usi g o l a fe lo ks. I additio , o ded

fo es ke els e ui e fe e th ead lo ks a d he e a o t i ute to a ds so e as h o ous

e e utio too. This is h e e i the ase of la ge s ste s , – , ato s the e is still a

s all pe fo a e gai .

Ta les a- de o st ate a o pa iso of the elapsed ti e of e e utio fo the al ulatio s

de o st ated i Figu e et ee CUDA-s h o ous a d CUDA-as h o ous e e utio s. I

additio , it appe ds e e utio ti es fo the se ial i ple e tatio o a o kstatio e uipped ith

a Tesla K- GPGPU a d a d I tel Xeo E - CPUs u i g at . GHz. The se ial e e utio

uses the Ope Ba el li a fo the MMFF s al ulatio s.

. Heu isti A ilit of the Algo ith

Figu es a-d p o ides a s he ati o e ie of the fou algo ith s e aluated i this epo t. The

g aphs depi t the steps take ea h algo ith i o de to o plete a s o putatio al le.

I o de to de o st ate ho IGL“ sol es ou p o le , the sta ti g o figu atio as hose afte

a u of , steps of the g eed algo ith . This a , the sta ti g o fo atio lies at o e

ea a lo al i i u a d Figu es a- de o st ate ho ea h of the th ee p oposed te h i ues

es ape it a d e plo e the la ds ape. Fo these al ulatio s, the pa a ete s used a e o

des i ed.

Eight e uall spa ed alues a e hose i the a ge . - . a d allo ated to ea h

o putatio al st ea ; pe tu atio fo oth GL“ a d ILG“ is happe i g hoosi g the hild solutio

ith i de fou . The e has ee so e e pe i e tatio ith diffe e t pa a ete s fo the

pe tu atio step, ut esults a e si ila . The esse e of pe tu atio is to es ape the u e t

alle a d e plo e the est of the la ds ape a d e ha e o k o ledge he e ea h hild solutio

ill i g us. The pe alt alue as set to . fo oth GL“ a d IGL“ . I IGL“ it is applied e e

ti e a pe tu atio takes pla e a d i GL“ e e ti e a e glo al i i u is ot ea hed. A

good e a ple of the e efits of ou app oa h is de o st ated i Figu es a a d he e IL“ a d

IGL“ a e o pa ed head to head. I steps th ee a d fou oth app oa hes a e pe tu ed a d i

step fi e the p oposed app oa h as a le to a ept a e good solutio slightl ela i g the

eGlo o st ai t glo al_lo est ith the aid of the pe alt fu tio . The easo h GL“ as

dis a ded is e ause it a spe d too u h ti e i the sa e ell aiti g fo the pe alt fu tio

to aise o e the latest dis o e ed e e g le els. We elie e that the p oposed ethod i he its the

e its of oth IL“ a d GL“ fo the solutio of ou p o le . Fi all , the g eed algo ith had

o e ged efo e step as e al ead e plai ed, he e e see a st aight li e depi ti g the est

hild solutio epeatedl fou d o ea h i le p o a l fo λ= . .

A othe set of e pe i e ts as desig ed to sho the flu tuatio of e e g alues usi g the fou

algo ith s i o pa iso . I o de to pe fo this e aluatio , e a t o sets of e pe i e ts. The

fi st set listed i Figu es a-d de o st ates the heu isti a ilit of ou app oa h fo liga ds

positio ed lose e ough to a p otei i o de to t igge i te a tio s. The se o d set of

e pe i e ts listed i Figu es a-d is si ila to the p e ious set, ith the o l diffe e e that it

pe fo s the sa e u e of i i izatio steps fo p otei s i thei i itial pd file st u tu e.

I the si ulatio esults p ese ted i Figu es a a d e a see that the h id ethod a d the

IL“ lea l outpe fo the GL“ a d the g eed app oa h. It is also i te esti g to see that i all fou

figu es the u es of the h id a d IL“ ethods ha i g a si ila t e d, ith the h id ethod

follo i g a o e tu ule t t aje to due to its pe alt fu tio . Figu e sho s a i te esti g

ase as e a see that IL“ is t apped at a i i u , hi h it a ot es ape f o . F o Ta le e

a see that fo the e pe i e t ith F E.pd st u tu e, IL“ ould o l ea h fou i i a a d

he e pe fo fou updates. This e plai s h this ethod a ot pe fo ell o ou

si ulatio s, as this is a o o s e a io. Fou updates a e usuall ot e ough to i g the s ste

a k to a sta le o fo atio . Ou ethod o the othe ha d, usi g the pe alt fu tio a

es ape su h a i i u a d a ies o e plo i g diffe e t e e g la ds apes, fi di g e

i i u s a d updati g the s ste i to ette o fo atio s. Rega di g GL“ o its o , as e

a see it al a s u de pe fo s o pa ed to the h id ethod a d it o l pe fo s ette tha

the IL“ at the F E e pe i e t, he e IL“ gets stu k i a ell.

The fou g aphs i Figu e p ese t si ila eha io . Ou h id ethod, togethe ith IL“ a e the

est pe fo i g o es as the al a s ha e the a ilit to dis o e deepe alle s i o pa iso to

the othe t o ethods.

F o these esults, it is e ide t that the h id app oa h is the ost suita le fo the t pe of

si ulatio s that ou soft a e pe fo s fo its a ilit to fi d a e good pote tial e e g i i u

i du i g o e o fo atio al updates tha it ould follo i g a pu e IL“ st ateg . Also, the GL“

ele e t i it does ot eall ea that the esulti g glo al i i u ould e lo e tha the

o espo di g IL“, o is the opposite t ue. The h id ethod see s p o isi g i sol i g othe

opti izatio p o le s too due to its a ilit to ha ge o e diffe e t e e g la ds apes a d

e aluate se e al diffe e t i i a at a s all te po a pote tial e e g ost.

O e all, the i p o ed h id algo ith as fast a d espo si e i ou si ulatio s. The e e e o

e ide t disto tio s i a of the ole ules st u tu e a d i te a tio s et ee liga d a d p otei

e e odelled a u atel i eal ti e. This algo ith is apa le of si ulati g s ste s of up to

, ato s, ith the i te a ti it ei g i e sel p opo tio al to the s ste size. Fo la ge

s ste s of , – , ato s, the use has to stall the liga d i side a po ket fo the

i i izatio algo ith to take effe t. The o e all o fo atio ualit a i p o e ith the

additio of a lo g a ge ele t ostati s outi e. Fi all , as the algo ith elies al ost solel o

floati g poi t ope atio s a d futu e GPGPU ge e atio s p o ise e e ette FLOP apa ilities, e

ould e pe t ou app oa h to pe fo e e ette o fo th o i g a ds.

. Algo ith P a ti al Appli atio s

To fu the alidate a d assess ou s ste e ha e also a ied out a set of si ulatio s that ai ed

to i i the a tual use of the soft a e. To e aluate the i du ed fit effe t, e ha e sele ted a

se ies of k o iologi al ta get fo hi h a se ies of stal st u tu es a e a aila le Ta le as

apo fo a d o- stallised ith o e o o e liga ds.

Ou app oa h as to e t a t a liga d f o a spe ifi o ple , the pla e it i the apo fo of the

o espo di g p otei usi g the hapti -d i e si ulato . The esults o tai ed f o the do ki g

e e the o pa ed ith the st u tu e of the o ple f o hi h the liga d as e t a ted,

easu i g oth the oot ea s ua e de iatio RM“D of the liga d a d the p otei esidues of

the i di g po ket supe positio fo the diffe e t p otei s as pe fo ed o the hole st u tu e

usi g the a k o e CA as efe e e . Fu the o e, the po ket esidues e e also o pa ed ith

the o espo di g a i o a ids i the apo fo a d thei RM“D al ulated. I this a e , e

ould esti ate of ho u h the p otei had ea ted to the liga d i di g a d if the i du ed-fit

effe t as o pa a le to the a tual stallog aphi st u tu e of the o ple .

The p otei e ha e hose p ese ts th ee s e a ios: CDK p ese ts a fai l igid i di g po ket

a d o l a fe i o side hai o e e ts a e isi le et ee the apo fo a d the

liga d/p otei o ple es. The t o hose alloste i po kets of the HCV N“ p ese t a o e

e ide t i du ed-fit effe t a d, i so e ases, the e is also a lea p otei a k o e o e e t;

the HIV e e se t a s iptase p ese ts o e of the ost d a ati effe ts of i du ed-fit, as the

i di g po ket of the o u leoside e e se t a s iptase i hi ito s NNRTIs is ot e e isi le i

the apo fo . All the si ulatio s e e pe fo ed o a uad o e Ma P o, u i g U u tu

. , e uipped ith a NVIDIA Gefo e a d a Pha to O i hapti de i e. As e o side

the idea of de elopi g a atu al a d i tuiti e i te fa e at the o e of ou soft a e de elop e t

effo ts, e ha e asked a u de g aduate Medi i al Che ist stude t to pe fo the si ulatio s.

. . CDK esults.

Fo CDK e ha e u si ulatio s, o e fo ea h liga d epo ted i Ta le . I te s of liga d

pla e e t, the hapti -d i e si ulatio pe fo ed elati el ell; the liga d RM“D alues a ged

et ee . Å a d . Å. The i di g site esidues RM“D also p odu ed so e e i te esti g

esults: Figu e sho s t o e a ples of the esults o tai ed a d it is possi le to see ho the

p otei has o ed i the si ulatio . I te esti gl , the iggest o e e ts a e see o the

esidues that a e i deed the ost fle i le he the apo a d the o ple st u tu es a e o pa ed.

Figu e sho s ho the diffe e t esidues ha e ha ged a d if thei o e e t is to a d the

o fo atio p ese t i the o ple st u tu e. I this ase, it is possi le to see a o e o ple

s e a io, he e so e esidues o e to a ds the o espo di g positio i the o ple st u tu e

i di ated i g ee a d othe s that o e a a f o that ideal positio i di ated i ed .

Ho e e , e should poi t out that the i te als pa a ete s ep ese ted i Figu e a e ot

a solute alues ut the ep ese t ho fa a a the hapti ge e ated st u tu e is to the

stallised st u tu e, elati e to the RM“D of the stallised st u tu e s the apo st u tu e. As

ost of the esidues i tuall etai the sa e o fo atio i all the diffe e t st u tu es, thei

a tual a solute RM“D-apo/ ryst alues a e e s all. He e, a i o , fa ou a le o

u fa ou a le, o e e t ge e ated f o the hapti -d i e si ulatio ill e flagged as

sig ifi a t. A e eptio is ep ese ted L s , hi h o es o side a l f o the apo

st u tu e to the JVP stal st u tu e RM“D of . Å . I the hapti d i e si ulatio , the

esidue o e to a d the positio p ese t i the stal gi i g a RM“D-hapti / st of . Å

Figu e . I this ase it is lea l e ide t the i du ed-fit effe t ge e ated the hapti -d i e

liga d pla e e t.

. . HCV NS esults.

The t o po kets sele ted fo this ta get Pal a d Thu - p ese t a o e o ious i du ed-fit

effe t, o pa ed to hat has ee o se ed ith CDK . I deed, the esults o tai ed a e o e

i fo ati e Figu e as sig ifi a t o fo atio al ha ges f o the apo st u tu e to oth the

hapti ge e ated st u tu e a d the stal o ple a e e ide t fo a good u e of esidues.

The data p ese ted i Figu e suggests that the e is a ge e al te de of the i di g site

esidues to o e i the di e tio of thei espe ti e o ple o fo atio . I o e spe ifi ase,

e e a ele a t a k o e o e e t is p ese t Figu e . I te esti gl , so e esidues o e

a a f o the ideal fi al positio a d this see s to e happe i g fo t o easo s: the pla e e t

of spe ifi liga ds is ot a u ate the liga ds RM“D a ge et ee . Å a d . Å ; so e

spe ifi esidues should u de go a o side a le o fo atio al ha ge a d the algo ith is ot

et a le to e plo e the full ota e spa e a aila le to the esidue.

. . HIV RT esults.

The HIV e e se t a s iptase p ese ts a e diffe e t s e a io: the NNRTIs po ket is ot

st u tu ed o isi le i the apo fo . The e te t of the i du ed fit effe t, i this ase, is eall

e a ka le. The hapti -d i e pla e e t esults a e sho i Figu e a d the lea l

de o st ate the o ple it of these si ulatio s. The use a push the liga d i side the po ket

a d all the RM“D alues sho s a positi e o e.

To a e tai e te t, this is ot su p isi g, o side i g ho diffe e t the apo st u tu e a d the

stal o ple a e. I deed, e a see that although the hapti -d i e si ulatio is a le to

i du e a su sta tial ea a ge e t i the p otei to a o odate the liga d i a easo a le

a e , the e te t of the p otei st u tu al ha ge upo liga d i di g i ealit is so e te si e

that, o e agai , the algo ith a ot e plo e effi ie tl su h o fo atio al spa e.

. Co lusio s

I this pape e ha e epo ted a h id e olutio a st ateg fo e plo i g ole ula e e g

la ds apes i a s all elapsed ti e pe iod. Ou algo ith as tu ed to take ad a tage of the e t a

pa allelis that CUDA st ea s a p o ide as the fo efield used as also oded i CUDA. The

p oposed ethod o ks ell fo o fo atio al sa pli g i situatio s he e the e it ite io fo

the eta-heu isti is the e pi atio of a e s all ti ef a e. We ha e also tested the algo ith

i a possi le a tual usage s e a io, i ple e ti g it i ou hapti -d i e ole ula odelli g

si ulato . Usi g a se ies of stal st u tu es, e ha e e a i ed ho the full fle i le hapti -

d i e si ulatio as a le to i i a i du ed-fit effe t.

O e all, e elie e the esults o tai ed a e positi e a d e e ou agi g. The hapti -d i e

si ulatio s o u i a e atu al a d i tuiti e fashio . Fu the o e, ou esults sho s that e

a i du e ea i gful a d app op iate o fo atio al ha ges i to a p otei pla i g a liga d

usi g a i te a ti e, eal ti e app oa h. The data o tai ed also sho s so e li itatio s of this

ethod. Although the algo ith pe fo s e t e el ell, it is possi l too o se ati e he

sa pli g the o fo atio al spa e of the p otei esidues of ou se, a o e po e ful GPGPU

a d ould e plo e o e o fo atio al states . This should ot e o side ed al a s a egati e

aspe t, as, fo e a ple, this o e autious e plo atio ould p ese e so e i po ta t st u tu al

i fo atio i a i di g po ket i those ases he the use applies a st o g fo e to the liga d

th ough the hapti de i e. Ho e e , as i the ase of HIV RT, so eti e the ha ges a e i deed

d a ati a d the ould e ui e a algo ith a le to sa ple a o side a l igge o fo atio al

spa e to o tai a a u ate p edi tio .

I o pa iso ith a lassi al a ual ole ula do ki g ethodolog , hi h ofte is pe fo ed

i th ee sepa ate stages pla e e t, e e g i i izatio a d e aluatio , the i teg ated a d

i te a ti e atu e of ou ethodolog allo s the esea he also to e plo e othe aspe ts of the

i di g p o ess. Fo e a ple, the use ould i estigate a d e aluate, to a e tai e te t, the

esults ge e ated app oa hi g the desi ed a ti e site f o diffe e t di e tio s, as the a tual

path take the liga d ould affe t the p otei o fo atio al ha ges. Clea l , a a u ate

esti ate of this p o ess ould e ui e a sig ifi a tl highe o putatio al po e tha the o e

a aila le to ou s ste , hi h u e tl a o l p o ide a s all i sight i this pa t of the i di g

e e tissue, as the esults o tai ed f o the HIV RT suggests.

Ho e e , e e if e elie e ou app oa h ep ese ts a step fo a d i add essi g se e al

i po ta t aspe t of p otei /liga d i di g, the e a e still se e al issues to e add essed. I

pa ti ula , effi ie tl e plo i g the liga d o fo atio al spa e ithi a e sho t ti ef a e a d

i a i te a ti e e i o e t e ai s a halle ge. I ou s ste , the esults fo fle i le liga ds >

ota le o ds ould e e diffe e t ased o the sta ti g o fo atio of the liga d. “ol atio ,

as i a othe do ki g algo ith s, is also diffi ult to e aluate. It is t ue that a i pli it odel

ould e used, ut ofte the i fo atio p o ided e pli it ate ole ules ould e esse tial

to u de sta d the a u ate i di g of a liga d.

I o lusio , e elie e the esults p ese ted he e a e e p o isi g a d e ou agi g. We a e

o taki g the e t step i the alidatio ou s ste a d the i te a ti e app oa h, o pa i g

the hapti -d i e si ulato to othe o e ial a d o - o e ial fle i le do ki g pa kages.

. Refe e es

1 M. A. Lill, Bio he istry, , , .

2 W. “i ko, “. Li de t a d J. A. M Ca o , Che . Biol. Drug Des., , , .

3 C. Blu a d A. Roli, A Co put Surv, , , .

4 D. E. Gold e g, Ge eti algorith s i sear h, opti izatio , a d a hi e lear i g, Readi g,

Mass: Addiso -Wesle Pu . Co, .

5 “. Ki kpat i k, C. D. Gelatt a d M. P. Ve hi, S ie e, , , .

6 N. Met opolis, A. W. Rose luth, M. N. Rose luth, A. H. Telle a d E. Telle , J. Che . Phys.,

, , .

7 R. Battiti, Rea tive sear h a d i tellige t opti izatio , st ed. Ne Yo k: “p i ge , .

8 O. T ott a d A. J. Olso , J. Co put. Che ., , , .

9 J. Fuh a , A. Ru ai ski, H.-P. Le hof a d D. Neu a , J. Co put. Che ., , , .

10 “. M. Lo g, T. T. T a , P. Ada s, P. Da e a d M. L. “ the, J. Co put. Che ., , ,

.

11 C. R. “. B asil, A. C. B. Del e a d F. L. B. da “il a, J. Co put. Che ., , , .

12 W. Cai a d Xa d “hao, J. Co put. Che ., , , .

13 R. Mu t a d D. Oku o , Parallel Co put., , , .

14 N. )o ta,I. J. G i stead, N. J. A is a d A. B a ale, J. Mol. Model., , , . .

15 A. Ri i, A. A thopoulos, A. Massa otti, I. G i stead, a d A. B a ale, Future Med. Che .,

, , .

16 A. A thopoulos, I. G i stead, a d A. B a ale, J. Co put. Che ., , , .

17 “. Pall a d B. Hess, Co put. Phys. Co u ., , , .

18 “ hulze-Gah e , U., De Bo dt, H.L., Ki , “.H. J.Med.Che . 39: -

19 A. E. Gi so , C. E. A is, J. , Be tle , F. T. Bo le, N. J. Cu ti , T. G. Da ies, J. A. E di ott, B.

T. Goldi g, “. G a t, R. J. G iffi , P. Je s u , L. N. Joh so , V. Mesgui he, D. R. Ne ell, M.

E. No le, J. A. Tu ke a d H. J. Whitfield, J. Med. Che ., , ,

20 P. Fu et, T. Me e , A. “t auss, “. Ra uglia a d J. M. Ro deau, J.M. Bioorg. Med. Che .

Lett. , , .

21 T. O. Fis h a , A. H uza, J. “. Du a, L. Ra a atha , T. Ma hood, W. T. Wi dso , H. V. Le,

T. J. Guzi, M. P. D e , K. Pa u h, R. J. Doll, E. Lees, D. Pa , W. “eghezzi a d V. Madiso ,

Biopoly ers, , , .

22 Y. N. Ka g a d J. A. “tu ke , To e Pu lished

23 D. O'Fa ell, R. T o idge, D. Ro la ds a d J. Jage , J. Mol. Biol., , , .

24 F. Velaz uez, “. Ve kat a a , C. A. Les u g, J. Du a, “. B. Rose lu , J. A. Kozlo ski a d

F. G. Njo oge, Org. Lett., , , .

25 G. N. A ilku a , O. “el uti , “. B. Rose lu , Q. )e g, Y. Jia g, T. Y. Cha , H. Pu, L. Wa g,

F. Be ett, K. X. Che , C. A. Les u g, J. Du a, “. Ga alas, Y. Hua g, P. Pi to, M. “a ig ahi,

F. Velaz uez, “. Ve kat a a , B. Vi ul ha , “. Ag a al, E. Fe a i, C. K. Jia g, H. C. Hua g,

N. Y. “hih, F. G. Njo oge a d J. A. Kozlo ski, Bioorg. Med. Che . Lett., , , .

26 K. Va d k, M. D. Cu i gs, O. N a guile, C. W. Boutto , “. Ve de ille, D. M Go a , B.

De ogelae e, K. A sso s, “. Last, K. Ro auts, A. Tah i, P. Lo , L. Hu, D. A. Beau ha p, K.

“i e a d P. Ra oisso , J. Med. Che ., , , .

27 T. A. “ta e s, R. Coulo e, J. Ra ou t, B. Tha o ekha , G. Fazal, “. Goulet, A. Jakalia ,

D. We i , Y. Tsa t izos, M. A. Poupa t, M. Bos, G. M Ke he , L. Thau ette, G. Kukolj a d

P. L. Beaulieu, Bioorg. Med. Che . Lett., , , .

28 P. L. Beaulieu, R. Coulo e, J. Dua , G. Fazal, C. God out, O. Hu ke, A. Jakalia , M. A. Jol ,

O. Lepage, M. Lli as-B u et, J. Naud, M. Poi ie , N. Riou , B. Tha o ekha , G. Kukolj a d T.

A. “ta e s, Bioorg. Med. Che . Lett., , , .

29 R. Es ouf, J. Re , C. Ross, Y. Jo es, D. “ta e s a d D. “tua t, Nat. Stru t. Biol., , ,

.

30 J. Re , R. Es ouf, E. Ga a , D. “o e s, C. Ross, I. Ki , J. Keeli g, G. Da , Y. Jo es a d

D. “tua t, Nat. Stru t. Biol., , , .

31 Mole ula Ope ati g E i o e t . . Che i al Co puti g G oup, Mo t eal,

Ca ada. http:// . he o p. o

-“t ea s a

Proteins K atoms synchronous hyper-Q Serial

1UGM.pdb 2.103 1.8 0.87 127.5

3RDD.pdb 2.786 1.976 1.115 235

3F9E.pdb 4.74 2.508 1.378 647.5

3FAU.pdb 5.917 3.126 1.677 1017.5

2O0O.pdb 7.165 3.074 1.662 1447.5

3O05.pdb 13.053 4.216 2.368 7430

2EAR.pdb 15.528 4.14 2.826 7450

1TBG.pdb 26.927 5.46 4.74 32277.5

-“t ea s

Protein K atoms synchronous hyper-Q serial

1UGM.pdb 2.103 1.272 0.393 89.16667

3RDD.pdb 2.786 1.502 0.511 165

3F9E.pdb 4.74 1.862 0.74 455.8333

3FAU.pdb 5.917 2.307 0.868 719.1667

2O0O.pdb 7.165 2.248 0.941 1015.833

3O05.pdb 13.053 3.022 1.486 5870

2EAR.pdb 15.528 2.999 1.8 5423.333

1TBG.pdb 26.927 3.559 3.062 25385.83

Ta le a- Results fo CUDA-s h o ous, CUDA-as h o ous a d se ial e e utio s.

Ta le a. Nu e of o fo atio al updates fo ea h p otei fo the fou diffe e t algo ith i

app oa hes i Figu es a-d.

Hybrid ILS GLS Greedy

1UGM.pdb 27 24 20 15

3VB3.pdb 26 20 22 11

3F9E.pdb 18 4 17 5

3OOP.pdb 26 20 18 10

Table 2b. Nu er of o for atio al updates for ea h protei for the four differe t algorith i

approa hes i Figures 6a-d.

Hybrid ILS GLS Greedy

3RDD 23 20 25 13

3FAU 25 20 25 15

2O0O 29 22 19 18

3O0P 27 20 25 12

Ta le . C stal st u tu es used i the hapti -d i e si ulatio s

P otei PDB Liga d Na e

CDK

HCL -

G) MBP

JVP LIG

R F “C

R H “CE

R I “CF

R R “C

EK CK

FKL CK

HCV N“ B

NB -

UPH C

U O E

GNW XNC

J JE

J JG

JJU MB

P otei PDB Liga d Na e

CDK

HCL -

G) MBP

JVP LIG

R F “C

HIV RT

RTJ -

VRT NVP F ag e t

Figu e . A s apshot of liga d app oa hi g a p otei usi g a g eed lo al sea h app oa h. The

algo ith is o ti uousl i p o i g afte the i itial i pa t, ut settli g at e o eous

o figu atio s e ide t geo et i disto tio o the p otei a d the liga d

Figu e . Visual p ofili g data fo a o e. As h o ous e e utio of s st ea s he e the total

lo k a d sha ed e o o ditio s a e ot et. O e lappi g is di ided i to st ea g oups that

eet the ha d a e’s lo k a d sha ed e o est i tio s. The kE lo ks ep ese t the o -

o ded e e g ke els, a d the o espo di g o es u de the kF lo k a e the o ded fo es a d

the D g id i i g ke els.

Figu e a-d. Pe fo a e e h a ks et ee CUDA-as h o ous a d CUDA-s h o ous

st ea e sio s. The sa e a ou t of e e g ke els , a d Fo e g adie t ke els

, / st ea s a e e aluated f o oth app oa hes i ea h g aph.

0

5

10

0 20 40 60 80

Pe

rfo

rma

nce

(se

c)

Number of atoms (x1000)

Hyper-Q vs Synchronous

4-Streams (a)

synchronous hyper-Q

0

4

8

0 20 40 60 80

Pe

rfo

rma

nce

Number of atoms (x1000)

Hyper-Q vs Synchronous

8-Streams (b)

synchronous hyper-Q

0

3

6

0 20 40 60 80

Pe

rfo

rma

nce

Number of atoms (x1000)

Hyper-Q vs Synchronous

12-Streams (c)

synchronous hyper-Q

0

3

6

0 20 40 60 80

Pe

rfo

rma

nce

Number of atoms (x1000)

Hyper-Q vs Synchronous

16-Streams (d)

synchronous hyper-Q

Figures 4 a-d. A schematic overview figure of the four algorithms compared in this report,

depicting how a calculation circle of t=33ms is subdivided into 8 steps of a group of energy kernel

evaluation streams for the 1UGM.pdb protein file. The system is already minimized with a local

search method prior to running the experiment.

-2000

-1500

-1000

-500

0

0 2 4 6 8 10

IGLS (a)

eold eglob

-2000

-1500

-1000

-500

0

0 2 4 6 8 10

ILS (b)

eold eglob

-1550

-1500

-1450

-1400

-1350

0 2 4 6 8 10

GLS

eold eglob

-2000

-1500

-1000

-500

0

0 5 10

Greedy (d)

eold eglob

Figu es a-d. Co pa iso of the diffe e t app oa hes ith the liga d pla ed i to a po ket.

-2000

-1500

-1000

-500

0

0 10 20 30 40

De

lta

En

erg

y

Moves

Meta-Heuristics Comparison

1UGM.pdb (a)

Hybrid

ILS

-1500

-1000

-500

0

500

1000

1500

0 10 20 30 40

De

lta

En

erg

y

Moves

Meta-Heuristics Comparison

3F9E.pdb (b)

Hybrid

ILS

-8000

-6000

-4000

-2000

0

0 10 20 30 40

De

lta

En

erg

y

Moves

Meta-Heuristics Comparison

3VB3.pdb (c)

Hybrid ILS GLS Greedy -16000

-12000

-8000

-4000

0

0 10 20 30 40

De

lta

En

erg

y

Moves

Meta-Heuristics Comparison

3OOP.pdb (d)

HYBRID

ILS

Figu es a-d. Co pa iso of the diffe e t app oa hes ith fou u - i i ized p otei s.

-6000

-4000

-2000

0

2000

4000

0 10 20 30 40

3RDD

IGLS ILS GLS Greedy

-6000

-4000

-2000

0

2000

4000

6000

0 10 20 30 40

3FAU

IGLS ILS GLS Greedy

-2000

0

2000

4000

6000

8000

10000

0 10 20 30 40

2O0O

IGLS ILS GLS Greedy

-16000

-14000

-12000

-10000

-8000

-6000

-4000

-2000

0

0 10 20 30 40

3O05

IGLS ILS GLS Greedy

Figu e . “ apshot the “FC liga d a uall do ked i the CDK as e a ples of esidue

o e e ts. The apo st u tu e ep ese ted i g e ; the stallised o ple ep ese ted i a ;

the st u tu e o tai ed f o the si ulatio is i di ated i o a ge.

Figu e . Visual ep ese tatio of ho CDK esidues had o ed du i g the si ulatio . The

pe e tage of st u tu es st u tu es – % hose esidue RM“D- ryst/hapti efe ed as

RM“D /h i the lege d is lo e tha o espo de t esidue RM“D-apo/ ryst efe ed as RM“D

a/ a e sho i g ee hile highe o es a e sho i ed.

Figu e : “ apshot of liga d LIG i the CDK i di g site. L s o es to a d the positio

p ese t i the stallized st u tu e. The apo st u tu e ep ese ted i g e ; the stallised

o ple ep ese ted i a ; the st u tu e o tai ed f o the si ulatio is i di ated i o a ge.

Figu e . Visual ep ese tatio of ho HCV N“ B esidues had o ed du i g the si ulatio . The

pe e tage of st u tu es hose esidue RM“D- ryst/hapti efe ed as RM“D /h i the lege d is

lo e tha o espo de t esidue RM“D-apo/ ryst efe ed as RM“D a/ a e sho i g ee

hile highe o es a e sho i ed.

Figu e . “ apshot of JE i the HCV N“ i di g po ket. His a k o e o e e t is

highlighted. The apo st u tu e ep ese ted i g e ; the stallised o ple ep ese ted i a ;

the st u tu e o tai ed f o the si ulatio is i di ated i pu ple.

Figu e . “ apshot of liga d NVP i the HIV RT. The apo st u tu e ep ese ted i g e ; the

stallised o ple ep ese ted i g ee ; the st u tu e o tai ed f o the si ulatio is i di ated

i lue.