hank childs, lawrence berkeley lab & uc davis workshop on exascale data management, analysis,...

30
Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The Exascale

Upload: bryce-sparks

Post on 12-Jan-2016

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Hank Childs, Lawrence Berkeley Lab & UC Davis

Workshop on Exascale Data Management, Analysis, and VisualizationHouston, TX2/22/11

Visualization & The Exascale

Page 2: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Outline

Standard operating procedure at the terascale/small petascale

Visualization at the exascale

Page 3: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Disclaimer

To provide context for non-vis people, I am presenting what I I believe is a representative workflow for visualization at the Office of Science Labs right now.

I fully understand that many folks -- including many folks in this room -- are researching innovative techniques that are different from this workflow. Further, these techniques are often even being deployed in a production environment.

Regardless, I think this work flow represents X% of visualization work.

Page 4: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Basic workflow for production visualization

Simulation

Simulation

cycle Satisfied criteriato do restart dump?

Satisfied criteriato do vis dump?

DiskDiskVisualizati

on program

Visualization

program

… acting as a post-processor… acting as a post-processor

Page 5: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Who, What, Where, When, Why, and How Why do we need visualization?

Insight into a simulation When do people run visualization programs?

Whenever they want! (*) Where do visualization programs run?

Either on the supercomputer or on a dedicated vis cluster.

Who’s running visualization programs? Simulation code consumers Simulation code developers Visualization experts

Page 6: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Important “What”: What do they want to

do? Exploration

Arrive at scientific insight Comparison Debugging!

Confirmation Make sure the calculation is running

smoothly Communication

Movies / images for PowerPoint 1D curves for scientific papers

These activities are different and it is possible, if not likely,

that different processing paradigms will be used to accomplish them at the

exascale.

These activities are different and it is possible, if not likely,

that different processing paradigms will be used to accomplish them at the

exascale.

Page 7: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Important How: How do visualization programs work on big data?

P0P1

P3

P2

P8P7 P6

P5

P4

P9

Pieces of data

(on disk)

Read Process Render

Processor 0

Read Process Render

Processor 1

Read Process Render

Processor 2

Parallel visualizationprogram

P0P0 P3P3P2P2

P5P5P4P4 P7P7P6P6

P9P9P8P8

P1P1

Parallel Simulation Code

I refer to this technique as “pure parallelism”.

I refer to this technique as “pure parallelism”.

Page 8: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Pure parallelism

Pure parallelism is data-level parallelism, but… Multi-resolution can be data-level parallelism Out-of-core can be data-level parallelism

Pure parallelism: “brute force” … processing full resolution data using data-level parallelism

Pros: Easy to implement

Cons: Requires large I/O capabilities Requires large amount of primary memory requires big machines

Page 9: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Pure parallelism and today’s tools

VisIt, ParaView, & EnSight primarily employ a pure parallelism + client-server strategy. All tools working on advanced techniques as

well Of course, there’s lots more technology out

there besides those three tools: Parallel VTK VAPOR VISUS and more…

Page 10: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Outline

Standard operating procedure at the terascale/small petascale

Visualization at the exascale How will input data change? Will life be different for our users? Will there be vis clusters? What will they look

like? How will I/O trends impact visualization? What are possible solutions for data movement

problems?

Page 11: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

How does increased computing power affect the data to be visualized?

Large # of time steps

Large ensembles

High-res meshes

Large # of variables/ more physics

Your mileage may vary; some simulations produce a lot of data and some don’t.

Your mileage may vary; some simulations produce a lot of data and some don’t.

Thanks!: Sean Ahern & Ken Joy

Thanks!: Sean Ahern & Ken Joy

Page 12: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Vis clusters

Today’s vis clusters are designed to be proportional to the “big iron”. Typical rule of thumb is that the little iron should

have 10% of the memory of the big iron. (Most vis clusters have fallen short of the 10% rule

for the last few years.) Exascale machine = $200M, ~1/3rd of which is

for memory. Memory for proposed vis cluster = $6.7M.

I personally view it as unlikely that we will have 10%-sized vis clusters. Additional issue with getting data off machine.

Page 13: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

I/O and visualization Pure parallelism is

almost always >50% I/O and sometimes 98% I/O

Amount of data to visualize is typically O(total mem)

FLOPs Memory I/O

Petascale machine

“Exascale machine”

Two big factors: ① how much data you have to read② how fast you can read it

Relative I/O (ratio of total memory and I/O) is key

Page 14: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Trends in I/O

Machine Year Time to write memory

ASCI Red 1997 300 sec

ASCI Blue Pacific 1998 400 sec

ASCI White 2001 660 sec

ASCI Red Storm 2004 660 sec

ASCI Purple 2005 500 sec

Jaguar XT4 2007 1400 sec

Roadrunner 2008 1600 sec

Jaguar XT5 2008 1250 sec

Exascale ~2020 ????

Thanks!: Dave Pugmire, ORNL

Thanks!: Dave Pugmire, ORNL

Page 15: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Why is relative I/O getting slower?

I/O is quickly becoming a dominant cost in the overall supercomputer procurement.

Simulation codes aren’t as exposed. And will be even less exposed with

proposed future architectures.

We need to de-emphasize I/O in our visualization and analysis techniques.

We need to de-emphasize I/O in our visualization and analysis techniques.

Page 16: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Assertion: we are going to need a lot of solutions.

All visualization and analysis work

Multi-res

In situ

Out-of-core

Data subsetting

Do remaining ~5% on SC w/ pure parallelism

Page 17: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Where does visualization fit into the ecosystem? Data movement will be expensive on the

exascale machine. Beneficial to process data in situ.

Current thinking from exascale gurus: you’ll have 1000 threads on a node, some will do physics, some will provide services. Visualization could be a service in this system … or visualization could be done on a separate

node located nearby dedicated to visualization/analysis/IO/etc.

… or a combination of the two.

Page 18: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Where does visualization fit into the ecosystem? Visualization could be a service in this

system… How do visualization routines interact with

the simulation? How do we write routines that can be re-used

across many simulations? Are we prepared to run on nodes with 1000-

way parallelism? Likely with no cache-coherency?

Are our algorithms ready to run on O(1M) nodes?

Page 19: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Where does visualization fit into the ecosystem? … or visualization could be done on a

separate node located nearby dedicated to visualization/analysis/IO/etc. OK, where exactly? Likely still to be issues with running on the

accelerator.

I personally think it is very unlikely we will need to run visualization algorithms at billion way concurrency.

Page 20: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Possible in situ visualization scenariosVisualization could be a service in this system…

… or visualization could be done on a separate node located nearby dedicated to visualization/analysis/IO/etc.

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

Services

Services

VizViz

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

Services

Services

VizViz

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

Services

Services

VizViz

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

Services

Services

VizViz

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

Services

Services

VizViz

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

ServicesServices

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

ServicesServices

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

ServicesServices

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

ServicesServices

One of many nodes

dedicated to vis/analysis/IO

One of many nodes

dedicated to vis/analysis/IO

Accelerator, similar to HW on rest of exascale machine (e.g. GPU)

… or maybe this is a high memory quad-core running Linux!

Specialized vis & analysis resources

Specialized vis & analysis resources

… or maybe the data is reduced and sent to dedicated resources off machine!

… And likely many more configurations

VizViz

VizViz

VizViz

VizViz

We will possibly need to run on:

-The accelerator in a lightweight way-The accelerator in a heavyweight way-A vis cluster (?)& data movement is a big issue…

We will possibly need to run on:

-The accelerator in a lightweight way-The accelerator in a heavyweight way-A vis cluster (?)& data movement is a big issue…

Page 21: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Reducing cells to results (e.g. pixels or numbers) can be hard.

Must to reduce data every step of the way. Example: contour + normals + render

Important that you have less data in pixels than you had in cells. (*)

Easier example: synthetic diagnostics

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

ServicesServices

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

ServicesServices

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

ServicesServices

Physics #1

Physics #1Physics #2

Physics #2

Physics #n

Physics #n

ServicesServices

One of many nodes

dedicated to vis/analysis/IO

One of many nodes

dedicated to vis/analysis/IO

Specialized vis & analysis resources

Specialized vis & analysis resources

Page 22: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

In situ

In situ will almost certainly play a large role in the portfolio … but it is not a panacea.

Pros: No I/O & opportunity to minimize data moved

Cons: Very memory constrained Some operations not possible

Once the simulation has advanced, you cannot go back and analyze it

User must know what to look a priori Expensive resource to hold hostage!

Page 23: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Will life be different for our users?

Page 24: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Multi-resolution techniques

Pros Drastically reduce I/O & memory requirements Confidence in pictures; multi-res hierarchy

addresses “many cells to one pixel issue” Cons

Not always meaningful to process simplified version of the data.

How do we generate hierarchical representations on the exascale machine? What costs do they incur (data movement costs, storage costs)?

Page 25: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Do we have our use cases covered?

Three primary use cases: Exploration Confirmation Communication

Examples:Scientific discoveryDebugging

Examples:Scientific discoveryDebugging

Examples:Data analysisImages / moviesComparison

Examples:Data analysisImages / moviesComparison

Examples:Data analysisImages / movies

Examples:Data analysisImages / movies

Multi-res

In situ

Page 26: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Under-represented topics in this talk. We will have quintillions of data points …

how do we meaningfully represent that with millions of pixels?

We are unusual: we are data consumers, not data producers, and the exascale machine is being designed for data producers

Data is going to be different at the exascale: ensembles, multi-physics, etc. The outputs of visualization software will be

different.

Page 27: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Under-represented topics in this talk, pt 2

We have an extreme SWE challenge: write once, use many

Our community has done little work in hybrid parallelism and it is critical to our future.

Accelerators on exascale machine are likely not to have cache coherency Do all of our algorithms work in a GPU-type

setting? We have a huge investment in CPU-SW. What

now?

Page 28: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Summary

The exascale machine will almost certainly lead to a paradigm shift in the way visualization programs process data. Where to process data and what data to move

will be a central issue. Our algorithms will need to run:

in a light weight manner (e.g. on a few of the thousands of cores on the accelerator)

and/or using all the threads on an accelerator and/or on machines that may not have

accelerators at all.

Page 29: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Backup slides

Page 30: Hank Childs, Lawrence Berkeley Lab & UC Davis Workshop on Exascale Data Management, Analysis, and Visualization Houston, TX 2/22/11 Visualization & The

Summary: exascale impacts

I/O will be slow FLASH drives will hurt vis community

Massive number of threads on a node; unclear if there will be cache coherency May very well look like GPU programming

Data movement huge issue Likely outcome: we get a few threads of the

GPU to do our job. Must fit into ecosystem Trade off on where to do vis.