https://portal.futuregrid.org science clouds and futuregrid’s perspective june 18 2012 science...

10
https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox [email protected] Director, Digital Science Center, Pervasive Technology Institute Associate Dean for Research and Graduate Studies, School of Informatics and Computing Indiana University Bloomington

Upload: marvin-thomas

Post on 29-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org

Science Clouds and FutureGrid’s Perspective

June 18 2012Science Clouds Workshop

HPDC 2012 DelftGeoffrey Fox

[email protected]

Director, Digital Science Center, Pervasive Technology Institute

Associate Dean for Research and Graduate Studies,  School of Informatics and Computing

Indiana University Bloomington

Page 2: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org 2

Panel Questions• Brief description of the project – goal, testbed

infrastructure, target users/application.• Unique science cloud characteristics - how is/are your

science application(s) distinct from traditional commercial cloud applications?

• Discuss some of the challenges and findings from the project

• Discuss any practical findings that are useful for cloud admins, developers and/or users.

• Discuss future research, development and challenges for wider adoptability of cloud environments for use of science based on your project experience.

Page 3: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org

What is FutureGrid?• FutureGrid modeled on Grid5000• FutureGrid mission is to enable experimental work that advances:

a) Innovation and scientific understanding of distributed computing and parallel computing paradigms,

b) The engineering science of middleware that enables these paradigms,

c) The use and drivers of these paradigms by important applications, and,

d) The education of a new generation of students and workforce on the use of these paradigms and their applications.

• The implementation of mission includes• Distributed flexible hardware

with supported use• Identified IaaS and PaaS “core” software

with supported use• Outreach

• ~4500 cores in 5 sites 2.30%

4.00%

4.00%

4.60%

8.60%

8.60%

14.90%

15.50%

15.50%

15.50%

23.60%

32.80%

35.10%

44.80%

52.30%

56.90%

PAPI

Pegasus

Vampir

Globus

gLite

Unicore 6

Genesis II

OpenNebula

OpenStack

Twister

XSEDE Software Stack

MapReduce

Hadoop

HPC

Eucalyptus

Nimbus

FutureGrid Usage

Page 4: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org

FutureGrid: Inca Monitoring

Page 5: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org 5

5 Use Types for FutureGrid• 220 approved projects June 17 2012– https://portal.futuregrid.org/projects

• Training Education and Outreach (8%)– Semester and short events; promising for small universities

• Interoperability test-beds (3%)– Grids and Clouds; Standards; from Open Grid Forum OGF

• Domain Science applications (31%)– Life science highlighted (18%), Non Life Science (13%)

• Computer science (47%)– Largest current category

• Computer Systems Evaluation (27%)– XSEDE (TIS, TAS), OSG, EGI

• Clouds are meant to need less support than other models; FutureGrid needs more user support …….

Page 6: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org 6

https://portal.futuregrid.org/projects

Page 7: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org 7

Recent Projects

Page 8: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org

Distribution of FutureGrid Technologies and Areas

• 220 Projects• Hard to support

multiple IaaS on same cluster• Dynamically provision

PAPI

Pegasus

Vampir

Globus

gLite

Unicore 6

Genesis II

OpenNebula

OpenStack

Twister

XSEDE Software Stack

MapReduce

Hadoop

HPC

Eucalyptus

Nimbus

2.30%

4.00%

4.00%

4.60%

8.60%

8.60%

14.90%

15.50%

15.50%

15.50%

23.60%

32.80%

35.10%

44.80%

52.30%

56.90%

Education9%

Computer Science

35%

other Domain Science

14%

Life Science15%

Inter-op-erability

3%

Technology Evaluation

24%

Page 9: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org 9

Using Science Clouds in a Nutshell• High Throughput Computing; pleasingly parallel; grid applications• Multiple users (long tail of science) and usages (parameter searches)• Internet of Things (Sensor nets) as in cloud support of smart phones• (Iterative) MapReduce including “most” data analysis• Exploiting elasticity and platforms (HDFS, Object Stores, Queues ..)• Use worker roles, services, portals (gateways) and workflow• Good Strategies:

– Build the application as a service; – Build on existing cloud deployments such as Hadoop; – Use PaaS if possible; – Design for failure; – Use as a Service (e.g. SQLaaS) where possible; – Address Challenge of Moving Data

Page 10: Https://portal.futuregrid.org Science Clouds and FutureGrid’s Perspective June 18 2012 Science Clouds Workshop HPDC 2012 Delft Geoffrey Fox gcf@indiana.edu

https://portal.futuregrid.org 10

Cosmic Comments• Are clouds different from Grids: in principle or in practice?• Does a “modest-size private science cloud” make sense

– Too small to be elastic• Should governments fund use of commercial clouds (or build their own)

– Most science doesn’t have privacy issues motivating some private clouds• Does Cloud + MPI Engine cover the future?• Most interest in clouds from “new” applications such as life sciences• Recent cloud infrastructure (Eucalyptus 3, OpenStack Essex) much improved• More employment opportunities in clouds than HPC and Grids; so cloud

related activities popular with students• Science Cloud Summer School July 30-August 3

– Part of virtual summer school in computational science and engineering and expect over 200 participants spread over 9 sites

• Science Cloud and MapReduce XSEDE Community groups