federated hpc clouds applied to radiation therapy

25
Federated HPC Clouds applied to Radiation Therapy A. Gómez , L.M. Carril, R. Valin, J.C. Mouriño, C. Cotelo ISC Cloud‘13, Heidelberg (Germany) Sep. 23-24th, 2013

Upload: andres-gomez

Post on 03-Jul-2015

272 views

Category:

Health & Medicine


0 download

DESCRIPTION

Presentation delivered in the Research Track at ISC CLOUD'13 at Heidelberg (Germany) on Sep. 24th 2013. It describe the Virtual Cluster Architecture developed during BonFIRE project and the reasons to do it. Some proof-of-concept experiments are also presented

TRANSCRIPT

Page 1: Federated HPC Clouds Applied to Radiation Therapy

Federated HPC Clouds applied to

Radiation Therapy

A. Gómez, L.M. Carril, R. Valin,

J.C. Mouriño, C. Cotelo

ISC Cloud‘13, Heidelberg (Germany)

Sep. 23-24th, 2013

Page 2: Federated HPC Clouds Applied to Radiation Therapy

Overview

Context.

Virtual Cluster Architecture.

Experiments on BonFIRE.

Conclusions.

The research leading to these results has received funding from the European Commision's

Seventh Framework Programme (FP7/2007-2013) under grant agreement number 257386

Page 3: Federated HPC Clouds Applied to Radiation Therapy

Context: eIMRT service

CTs Treatment Results

Results

TPS

Second calculation

Personalized: One patient, one treatment

Page 4: Federated HPC Clouds Applied to Radiation Therapy

eIMRT architecture

IaaSSaaS

Workflow based

on Monte Carlo

simulations

Page 5: Federated HPC Clouds Applied to Radiation Therapy

eIMRT WorkfloweIMRT code: Prepares inputs

for BEAMnrc MC. Seconds in

master computer

BEAMnrc MC simulations.

Independent jobs on CEs.

eIMRT code: collects outputs and

prepares inputs for DOSXYZnrc

Seconds in master computer

eIMRT code: collects outputs and generates final output.. Seconds in master computer

DOSXYZnrc MC simulations.

Independent jobs on CEs.

Page 6: Federated HPC Clouds Applied to Radiation Therapy

SaaS issues

Local cluster: – Could not be enough with many clients.

– Interferences between customer’s requests.

– Shared resources: Time-to-solution not guaranteed.

Grid:– Interferences between clients.

– Shared resources: Time-to-solution not guaranteed.

Cloud:– One treatment, one virtual cluster.

– No interferences between treatments, customers.

– But, How to guarantee the time-to-solution in a multi-tenant out-of-control infrastructure?

Page 7: Federated HPC Clouds Applied to Radiation Therapy

IaaS issues for HPC/HTC SaaS

Failures of sites. Needs Fault-tolerant design.

Application Performance Variability between deployments. Needs elasticity.– Different IaaS back-end servers.

– Multi-tenancy. Sharing resources among IaaS

customers.

– Different Cloud providers.

– Evolution of IaaS infrastructure.

J. Schad, et al, Runtime Measurements in the Cloud:

Observing, Analyzing, and Reducing Variance., Proceedings of

the VLDB Endowment, Vol. 3, No. 1, 2010

Page 8: Federated HPC Clouds Applied to Radiation Therapy

Proposal: Autonomous Virtual

Cluster Architecture

Page 9: Federated HPC Clouds Applied to Radiation Therapy

Virtual Cluster Architecture

Page 10: Federated HPC Clouds Applied to Radiation Therapy

Virtual Cluster single site

NFS

Cluster

management:

OGS + custom

scripts

Page 11: Federated HPC Clouds Applied to Radiation Therapy

Virtual Cluster-two sites

Page 12: Federated HPC Clouds Applied to Radiation Therapy

Fault-tolerant VC two sites

Page 13: Federated HPC Clouds Applied to Radiation Therapy

Elasticity Engine

Controls number of CEs based on KeyApplication Performance measurements.

Enlarges the cluster to keep performance and fulfill deadlines.

Decreases size if App. Performance is higher than needed, to decrease costs.

Page 14: Federated HPC Clouds Applied to Radiation Therapy

Proof-of-Concept Experiments

Page 15: Federated HPC Clouds Applied to Radiation Therapy

BonFIRE Infrastructure

Vendor Freq.

(GHz)

Cores RAM

(GB)

Intel 2.33 2*2 4

AMD 1,7 2*12 48

Intel 2,5 2*4 32

Intel 2.93 2*4 24

INRIA: Vendor Freq. (GHz) Cores RAM

(GB)

Intel 3.2 2*2 2

Intel 2.66 2*2 8

AMD 2.6 4*12 196

AMD 2 2 4

Intel I7 2.53 2 4

Intel I7 2.1 4 8

Intel Atom 1 2

AMD

T56N1.65 2 2

HLRS:

Cloud Manager:

OpenNebula 3.0

Page 16: Federated HPC Clouds Applied to Radiation Therapy

DISTRIBUTED VIRTUAL CLUSTER

EXPERIMENT

VCOC, FIRE Engineering Workshop, Ghent, Nov. 6th – 7th 2012

Page 17: Federated HPC Clouds Applied to Radiation Therapy

Application execution. One vs Two sites

VC Conf.: Distributed VC (_dist)

BonFIRE sites:

– INRIA: Master + CEs

– HLRS: CEs

Deployment time decreases.

App:Two sites faster than one site.

But because second site has better

CPUs.

Impact of deployment ~ 10% total

time.

Page 18: Federated HPC Clouds Applied to Radiation Therapy

SPECIFIC DEADLINE OBJECTIVE

EXPERIMENT

VCOC, FIRE Engineering Workshop, Ghent, Nov. 6th – 7th 2012

Page 19: Federated HPC Clouds Applied to Radiation Therapy

Horizontal elasticity

Monitoring application

performance works.

We have modified software

to produce information more

frequently.

Execution with deadline.

Elasticity works.

Page 20: Federated HPC Clouds Applied to Radiation Therapy

FAULT TOLERANCE EXPERIMENT

WITH ELASTICITY

VCOC, FIRE Engineering Workshop, Ghent, Nov. 6th – 7th 2012

Page 21: Federated HPC Clouds Applied to Radiation Therapy

Virtual Cluster

SYNC

Page 22: Federated HPC Clouds Applied to Radiation Therapy

Fault-tolerance

BonFIRE sites:

– HLRS (Master + 4 CEs)

– INRIA (Shadow + 4 CEs)

Demanded performance

(500H/s)

Fault simulated putting HLRS

VMs in CANCEL.

INRIA Shadow took control of

cluster.

Elasticity worked, demanding

more CEs to INRIA.

Page 23: Federated HPC Clouds Applied to Radiation Therapy

CONCLUSIONS

VCOC, FIRE Engineering Workshop, Ghent, Nov. 6th – 7th 2012

Page 24: Federated HPC Clouds Applied to Radiation Therapy

Conclusions

Distributed VC can be used to speed up HTC applications.

Elasticity Engine based on Key Application Performance indicator for HTC works.

High QoS can be provided in VC using distributed VC + elasticity.

BonFIRE provides infrastructure for experiments about new concepts and services on Cloud.

Page 25: Federated HPC Clouds Applied to Radiation Therapy

THANKS

Questions?

[email protected]