The Top Six Advantages of CUDA-Ready Clusters
Ian Lumb
Bright Evangelist
GTC Express Webinar
January 21, 2015
“We scientists are time-constrained,” said Dr. Yamanaka. “Our priority is our research, not managing our clusters. Bright [Cluster Manager] is intuitive to use, and with it I can effectively manage my cluster without wasting time writing scripts, or synchronizing management tool revisions. Provisioning is fast and easy too. I prefer this approach over open source toolkits.”
http://www.brightcomputing.com/News-Tokyo-Institute-of-Technology-Gordon-Bell-Prize-Winner-Uses-Bright-Cluster-Manager-to-Develop-Applications-for-One-of-the-Worlds-Fastest-Supercomputers
CUDA-Ready Clusters
1. You focus on coding – not infrastructure & toolchains
2. You’re always in sync – with GPUs + CUDA
3. You cross-develop with confidence and ease
• Maintaining and using highly customized environments
4. You choose and combine in programming GPUs
• CUDA or OpenCL or OpenACC
• … and combine with MPI
5. You have converged HPC + Big Data Analytics
• You have access to Hadoop alongside HPC
6. You seamlessly utilize ‘The Cloud’
• You extend into AWS, deploy OpenStack, …
CUDA-ready clusters are GPU developer-ready
Bright Cluster Manager — CUDA Environment
• Interfaces: Cluster Management Shell, Cluster Management GUI, User Portal
• Transport and security: SSL / SOAP / X509 / IPtables
• Cluster Management Daemon
• Hardware: Disk, Ethernet, Interconnect, IPMI / iLO, PDU, CPU, GPUs, Memory
• Workload managers: Slurm, PBS Pro, Torque/Maui, Torque/MOAB, Grid Engine, LSF
• Functions: Monitoring, Automation, Health Checks, Management, Provisioning
• Development tools: Compilers, Libraries, Debuggers, Profilers
• Operating systems: SLES / RHEL / CentOS / SL
Unified Memory
http://info.brightcomputing.com/Blog/bid/196783/Bright-Cluster-Manager-Integrates-Support-for-CUDA-6
NVIDIA GPU Boost
Modernized monitoring for HPC clusters
http://insidehpc.com/2014/11/monitoring-hpc-clusters-modernized/
Cluster Health Management
Provide a problem-free environment for running jobs
Four elements
1. Cluster management automation
2. Regular health checks
3. Pre-job health checks
4. Hardware stability & performance tests
All elements above are configurable and extensible
Syncing with GPUs + CUDA …
Innovation characterizes the entire history and
evolution of GPU programmability through CUDA
• BUT … introduces challenges and opportunities …
Bright Computing’s approach leverages
• People
• Proactively maintaining business and technical relationships
• Process
• ‘Hands-on engineering’ begins with release candidates
– Preliminary to fully productized implementations
• Product
• Bright Cluster Manager released once or twice per year
– Updates flow continuously …
http://info.brightcomputing.com/blog/cuda-6.5-something-for-nothing
http://www.brightcomputing.com/News-Bright-Cluster-Manager-Adds-Support-for-the-NVIDIA-Tesla-K80-Dual-GPU-Accelerator
Available Versions of the CUDA Toolkit
Using CUDA 6.0
HPC Development Environment
Compilers (GNU, Intel*, AMD, Portland*, etc.)
Debuggers and profilers (GNU, TAU, Allinea, TotalView)
MPI libraries (OpenMPI, MPICH, MPICH-MX, MVAPICH)
Other libraries (threading libraries, OpenMP, Global Arrays, HDF5, IPP, TBB, NetCDF, PETSc, etc.)
Mathematical libraries (ACML, MKL*, FFTW, GMP, GotoBLAS, ScaLAPACK, etc.)
Environment modules
Programming GPUs
CUDA
OpenCL
OpenACC
MPI
Tools
• CUDA-GDB
• nvidia-smi
• CUDA Utility Library
• Examples
• 3rd Party
• Allinea
• Rogue Wave
CUDA Development Environment
HPC and Hadoop
Use GPUs for HPC and Big Data Analytics
Introduce GPUs into Hadoop clusters
Make use of Hadoop services
GPUs in the Cloud? The Top Four Reasons
1. You can realize possibilities using the cloud
• You can scale up and scale out
2. You still realize the promise of GPU programmability
• … via HPC in the cloud
3. Your use of the cloud is transparent
• You’ve found ways to ‘hide’ latency
• Constraints apply for MPI apps
4. Your go-to apps still work in the cloud
http://info.brightcomputing.com/Blog/bid/196290/The-Top-4-Reasons-You-Should-Try-Cloud-Based-GPUs-for-HPC
Scenario I — “Cluster on Demand”
[Figure: head node with node001, node002 and node003]
Cloud Utilization
Scenario II — “Cluster Extension”
[Figure: head node with node001 through node007]
Cloud Utilization
Case Study: TUAT (1)
The Customer
• Engages in materials-science research
• Compares computational models with physical experiments
• High-resolution, 3D phase field modeling at large scales using GPUs
The Challenge
• Make available the latest innovations in GPU technology without distracting focus from research
Case Study: TUAT (2)
The Solution
• Laboratory GPU cluster designed and implemented by HPCTech Corp.
• Bright Cluster Manager deployed by HPCTech
• Use Bright to fully manage the entire CUDA environment – including regular updates
• Use the modules environment via Bright to manage multiple CUDA environments
• Prototype simulations using laboratory HPC cluster
• Includes debugging and tuning code
• Execute large-scale simulations using TSUBAME
• The Results …
[Figure: calculation steps 25000, 150000 and 275000; length scale 51 μm; carbon concentration scale 0.01 to 0.38 wt.%]
Caption: Snapshots of austenite-to-ferrite transformation behavior in Fe-C alloy simulated by a multi-phase-field method. Upper and lower panels show time evolution of ferrite grains and carbon concentration during the phase transformation. The simulation was performed on 512 × 512 × 256 computational grids using 8 GPUs in the lab cluster. (Prof. A. Yamanaka, TUAT)
[Figure: elapsed time (×1000 s, axis from 0 to 5) versus number of GPUs (128, 256, 512)]
Caption: Performance of multiple-GPU computation of multi-phase-field simulation of austenite-to-ferrite transformation in Fe-C alloy. The performance was measured by performing the simulations on the TSUBAME2.5 supercomputer at Tokyo Institute of Technology. The number of computational grids, crystal grains and calculation steps were 512³, 4068 and 10⁵, respectively. (Prof. A. Yamanaka, TUAT, priv. comm.)
http://www.tuat.ac.jp/~yamanaka/
Case Study: TUAT (3)
“We scientists are time-constrained,” said Dr. Yamanaka. “Our priority is our research, not managing our clusters. Bright is intuitive to use, and with it I can effectively manage my cluster without wasting time writing scripts, or synchronizing management tool revisions. Provisioning is fast and easy too. I prefer this approach over open source toolkits.”
Additional Slides
Cluster Health Management
Goal: provide a problem-free environment for running jobs
Four elements
1. Cluster management automation
2. Regular health checks
• Actions that return PASS, FAIL or UNKNOWN
• Can be associated with a settable severity and a message
• Can launch an action based on any response value
3. Pre-job health checks
• Let the workload manager hold the job very briefly
• Check the health of each reserved node
• If unhealthy, take the node offline, inform the system administrator
• Let the workload manager reschedule the job to a different set of nodes
4. Hardware stability & performance tests
• Very wide range of tests
• May include disk overwrites and reboot(s)
All elements above are configurable and extensible
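The health-check elements above can be illustrated with a minimal Python sketch. It is not the Bright Cluster Manager API: the names `Result`, `HealthCheck`, `prejob_check` and `take_offline` are hypothetical, the numeric severity scale is an assumption, and treating any non-PASS result as unhealthy is a design choice made only for this example.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, Dict, List, Optional


class Result(Enum):
    """The three response values a health check can return."""
    PASS = "PASS"
    FAIL = "FAIL"
    UNKNOWN = "UNKNOWN"


@dataclass
class HealthCheck:
    """Hypothetical health check: a probe plus severity, message and actions."""
    name: str
    probe: Callable[[str], Result]   # takes a node name, returns a Result
    severity: int = 10               # settable severity (scale is an assumption)
    message: str = ""                # settable message attached to the check
    # Optional action to launch for each response value.
    actions: Optional[Dict[Result, Callable[[str], None]]] = None

    def run(self, node: str) -> Result:
        try:
            result = self.probe(node)
        except Exception:
            # A probe that cannot answer is reported as UNKNOWN, not FAIL.
            result = Result.UNKNOWN
        action = (self.actions or {}).get(result)
        if action is not None:
            action(node)
        return result


def prejob_check(
    nodes: List[str],
    checks: List[HealthCheck],
    take_offline: Callable[[str], None],
) -> List[str]:
    """Check every reserved node and return the healthy ones.

    Unhealthy nodes are handed to take_offline; the caller (standing in
    for the workload manager) can then reschedule the job elsewhere.
    """
    healthy = []
    for node in nodes:
        results = [check.run(node) for check in checks]
        if all(r is Result.PASS for r in results):
            healthy.append(node)
        else:
            take_offline(node)
    return healthy


if __name__ == "__main__":
    def gpu_probe(node: str) -> Result:
        # Stand-in probe; a real one might query the GPU driver on the node.
        return Result.FAIL if node == "node002" else Result.PASS

    offline: List[str] = []
    check = HealthCheck(name="gpu", probe=gpu_probe, message="GPU not responding")
    print(prejob_check(["node001", "node002", "node003"], [check], offline.append))
    print(offline)
```

In this sketch the job would be held only for as long as `prejob_check` runs, and a node that fails is removed before the job starts rather than after it crashes.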
Bright API