the race towards exascale - hpc advisory council · exascale will be enabled via co-design...

20
The Race Towards Exascale Co-Design Architecture For Next Generation HPC Systems Gilad Shainer HPC Advisory Council China, November 2015

Upload: others

Post on 25-May-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

The Race Towards Exascale Co-Design Architecture For Next Generation HPC Systems

Gilad Shainer

HPC Advisory Council

China, November 2015

Page 2: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

2

The Ever Growing Demand for Higher Performance

2000 202020102005

“Roadrunner”

1st

2015

Terascale Petascale Exascale

Single-Core to Many-CoreSMP to Clusters

Performance Development

Co-Design

HW SW

APP

Hardware

Software

Application

Technology Development

Page 3: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

3

Co-Design Architecture to Enable Exascale Performance

CPU-Centric Co-Design

Limited to Main CPU Usage

Results in Performance Limitation

Creating Synergies

Enables Higher Performance and Scale

Software

In-CPU

Computing

In-Network

Computing

In-Storage

Computing

Page 4: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

4

Exascale will be Enabled via Co-Design Architecture

Software – Hardware

Hardware – Hardware (e.g. GPU-Direct)

Software – Software (e.g. OpenUCX)

Industry – Users – Academia

Standard, Open Source, Eco-System

Programmable, Configurable, Innovative

HW SW

APP Hardware

Software

Application

Page 5: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

5

Breaking the Application Latency Wall

• Today: Network device latencies are on the order of 100 nanoseconds

• Challenge: Enabling the next order of magnitude improvement in application performance

• Solution: Creating synergies between software and hardware

Co-Design Paves the Road to Exascale Performance

10 years ago

~10

microsecond

~100

microsecond

NetworkCommunication

Framework

Today

~10

microsecond

Communication

Framework

~0.1

microsecond

Network

~1

microsecond

Communication

Framework

Future

~0.05

microsecond

Co-Design

Network

Page 6: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

6

The Pizza Example

Processing Processing Processing Transportation

More CPUs – More Pizzas

Same Time per Pizza

Page 7: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

7

Accelerating The Process Via Co-Design

Processing Processing Processing Transportation

Hiding The Baking Time

Faster Pizza Delivery

Page 8: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

8

• The road to Exascale requires order of magnitude performance improvements

• Co-Design architecture enables all active devices to become co-processors

The Road to Exascale – Co-Design System Architecture

Mapping Communication Frameworks on All Active Devices

Page 9: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

9

Co-Design Architecture Depends on Offloading Technologies

Programmability

RDMA GPUDirect Virtualization

Backward and Future Compatibility

Direct Communications

Applications (Innovations, Scalability, Performance)

Software-Defined Network

(SDN)

Co-Design Requires Intelligence Everywhere

Offloading Technologies: Intelligent Components (Interconnect, Storage etc.)

Page 10: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

10

Paving The Road to Exascale

Page 11: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

11

Exascale Co-Design Collaboration

Collaborative Effort

Industry, National Laboratories and Academia

The Next Generation

HPC Software Framework

Page 12: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

12

Collaboration

• Mellanox co-designs network interface and contributes MXM technology

– Infrastructure, transport, shared memory, protocols, integration with OpenMPI/SHMEM,

MPICH

• ORNL co-designs network interface and contributes UCCS project

– InfiniBand optimizations, Cray devices, shared memory

• NVIDIA co-designs high-quality support for GPU devices

– GPU-Direct, GDR copy, etc.

• IBM co-designs network interface and contributes ideas and concepts from PAMI

• UH/UTK focus on integration with their research platforms

Page 13: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

13

What We Don’t Want to Achieve

Page 14: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

14

Different Model - Co-Design Effort

• Co-design effort between national laboratories, academia, and industry

Applications: LAMMPS, NWCHEM, etc.

Programming models: MPI, PGAS/Gasnet, etc.

Middleware: UCX

Driver and Hardware

Co

-Desig

n

Page 15: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

15

UCX Framework Mission

• Collaboration between industry, laboratories, and academia

• Create open-source production grade communication framework for HPC applications

• To enable the highest performance through co-design of software-hardware interfaces

• To unify industry - national laboratories - academia efforts

Performance oriented

Optimization for low-software overheads in

communication path allows near native-level

performance

Community driven

Collaboration between industry, laboratories,

and academia

Production quality

Developed, maintained, tested, and used by

industry and researcher community

API

Exposes broad semantics that target data centric and HPC programming models and

applications

Research

The framework concepts and ideas are

driven by research in academia, laboratories,

and industry

Cross platform

Support for Infiniband, Cray, various shared

memory (x86-64 and Power), GPUs

Co-design of Exascale Network APIs

Page 16: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

16

UCX High-level Overview

Page 17: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

17

UCX Information

Page 18: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

18

GPUDirect RDMA and Sync (3.0 and 4.0)

• Hardware – Hardware co-design

Page 19: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

19

HPC|Music Project

HPC music is an advanced research project about High Performance Computing and Music

Production dedicated to enable HPC in music creation. Its goal is to develop HPC cluster

and cloud solutions that further enable the future of music production and reproduction.

Page 20: The Race Towards Exascale - HPC Advisory Council · Exascale will be Enabled via Co-Design Architecture Software –Hardware Hardware –Hardware (e.g. GPU-Direct) Software –Software

Thank You!

Web: www.hpcadvisorycouncil.com

Email: [email protected]

Facebook: http://www.facebook.com/HPCAdvisoryCouncil

Twitter: www.twitter.com/hpccouncil

YouTube: www.youtube.com/user/hpcadvisorycouncil