Intro to CUDA
TRANSCRIPT
GPU Algorithms
David Hauck
github.com/davidhauck
@david_hauck_mke
davidhauck40.blogspot.com
Graphics Processing Unit
Why?
General Purpose Graphics Processing Unit (GPGPU)
HOST ↔ DEVICE, connected by the PCI Bus

Copy initial data to DEVICE (over the PCI Bus)
Run DEVICE executable
Copy results back to HOST (over the PCI Bus)
Still Running on CPU
GPU is a Resource
MEMORY CONSCIOUSNESS
HOST POINTERS                DEVICE POINTERS
int *a;                      int *d_a;
arr = malloc(size);          cudaMalloc(&d_arr, size);
free(arr);                   cudaFree(d_arr);
memcpy(dest, source, size);  cudaMemcpy(dest, src, size, …);
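A minimal sketch of how these calls mirror each other; the names (arr, d_arr, n) are illustrative and error checking is omitted:

int n = 1024;                      // illustrative element count
size_t size = n * sizeof(int);

int *arr = (int *)malloc(size);    // HOST allocation
int *d_arr;
cudaMalloc(&d_arr, size);          // DEVICE allocation

// ... fill arr, copy it over, run kernels ...

free(arr);                         // HOST cleanup
cudaFree(d_arr);                   // DEVICE cleanup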
1: HOST → DEVICE
2: EXECUTE
3: DEVICE → HOST

Steps 1 and 3 both use cudaMemcpy(); only the direction flag differs:

1: HOST → DEVICE
cudaMemcpy(dest, source, size, cudaMemcpyHostToDevice);
EXECUTION
__global__ void myKernel(int *a){}
myKernel<<<1,1>>>(d_arr);
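Putting the three steps and the kernel launch together, a minimal sketch (illustrative names and sizes, error checking omitted) might look like:

__global__ void myKernel(int *a) {
    // ... operate on a ...
}

int main(void) {
    int n = 1024;                   // illustrative element count
    size_t size = n * sizeof(int);
    int *arr = (int *)malloc(size); // HOST buffer
    int *d_arr;
    cudaMalloc(&d_arr, size);       // DEVICE buffer

    // 1: HOST → DEVICE
    cudaMemcpy(d_arr, arr, size, cudaMemcpyHostToDevice);

    // 2: EXECUTE (one block, one thread here)
    myKernel<<<1, 1>>>(d_arr);

    // 3: DEVICE → HOST
    cudaMemcpy(arr, d_arr, size, cudaMemcpyDeviceToHost);

    free(arr);
    cudaFree(d_arr);
    return 0;
}

The <<<1,1>>> configuration launches a single thread; the next example widens it to one thread per element.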
Let’s do an example
abcd
+
efgh
=
ijkl
threadIdx.x:  0  1  2  3
int index = threadIdx.x;
c[index] = a[index] + b[index];
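A hedged, self-contained version of this example, launching one thread per element; the 4-element size mirrors the abcd/efgh slides, with integers standing in for the letters:

#include <stdio.h>

__global__ void add(const int *a, const int *b, int *c) {
    int index = threadIdx.x;       // 0, 1, 2, 3: one thread per element
    c[index] = a[index] + b[index];
}

int main(void) {
    const int N = 4;
    int a[N] = {1, 2, 3, 4}, b[N] = {5, 6, 7, 8}, c[N];
    int *d_a, *d_b, *d_c;
    size_t size = N * sizeof(int);

    cudaMalloc(&d_a, size);
    cudaMalloc(&d_b, size);
    cudaMalloc(&d_c, size);

    cudaMemcpy(d_a, a, size, cudaMemcpyHostToDevice);
    cudaMemcpy(d_b, b, size, cudaMemcpyHostToDevice);

    add<<<1, N>>>(d_a, d_b, d_c);  // one block, N threads

    cudaMemcpy(c, d_c, size, cudaMemcpyDeviceToHost);
    for (int i = 0; i < N; i++) printf("%d ", c[i]);   // prints 6 8 10 12

    cudaFree(d_a); cudaFree(d_b); cudaFree(d_c);
    return 0;
}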
Let’s invent an ALGORITHM
K-Means Clustering
CODE
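The live code isn't captured in the transcript. As a sketch of how the assignment step of k-means maps onto the GPU — one thread per point, with all names (points, centroids, labels) assumed for illustration:

// Assignment step: each thread labels one point with its nearest centroid.
// Layout assumption: points is n x dim, centroids is k x dim, row-major.
__global__ void assignClusters(const float *points, const float *centroids,
                               int *labels, int n, int k, int dim) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;

    int best = 0;
    float bestDist = 1e30f;
    for (int c = 0; c < k; c++) {
        float dist = 0.0f;                      // squared distance to centroid c
        for (int d = 0; d < dim; d++) {
            float diff = points[i * dim + d] - centroids[c * dim + d];
            dist += diff * diff;
        }
        if (dist < bestDist) { bestDist = dist; best = c; }
    }
    labels[i] = best;
}

The update step (recomputing each centroid as the mean of its assigned points) is a reduction, typically done with atomics or a second kernel; it is omitted here.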
Shared Memory
• ~48 KB of shared memory (see the sketch below)
• Multiple GB of device memory (100x higher latency)
• Access memory in order:
  1 2 3
  4 5 6
  7 8 9
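Since every thread reads the whole centroid table, it is a natural fit for shared memory; a hedged variant of the assignment kernel above, assuming k * dim floats fit in the ~48 KB budget:

__global__ void assignClustersShared(const float *points, const float *centroids,
                                     int *labels, int n, int k, int dim) {
    extern __shared__ float sCentroids[];      // k * dim floats
    for (int j = threadIdx.x; j < k * dim; j += blockDim.x)
        sCentroids[j] = centroids[j];          // cooperative copy into shared memory
    __syncthreads();                           // all threads reach this barrier

    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i >= n) return;
    int best = 0;
    float bestDist = 1e30f;
    for (int c = 0; c < k; c++) {
        float dist = 0.0f;
        for (int d = 0; d < dim; d++) {
            float diff = points[i * dim + d] - sCentroids[c * dim + d];
            dist += diff * diff;
        }
        if (dist < bestDist) { bestDist = dist; best = c; }
    }
    labels[i] = best;
}

// Launch with the dynamic shared memory size as the third <<<>>> argument:
// assignClustersShared<<<blocks, threads, k * dim * sizeof(float)>>>(...);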
Considerations
• Transistors are allocated to arithmetic, not memory. Sometimes it is better to recompute rather than cache.
• Copying to/from the HOST takes a while. Sometimes sequential operations can stay on the GPU.
• Avoid serialization (shared memory bank conflicts).
• Asynchronous memory operations (a sketch follows below).
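As a sketch of the last bullet: cudaMemcpyAsync on a stream lets transfers overlap with other work, provided the host buffer is pinned. The kernel name and sizes reuse the earlier illustrative myKernel:

int n = 1 << 20;                   // illustrative element count
size_t size = n * sizeof(int);

int *h_buf;
cudaMallocHost(&h_buf, size);      // pinned HOST memory: required for
                                   // truly asynchronous copies
int *d_buf;
cudaMalloc(&d_buf, size);

cudaStream_t stream;
cudaStreamCreate(&stream);

// Copy, kernel, and copy-back are queued in order on the stream and can
// overlap with independent work on other streams or on the CPU.
cudaMemcpyAsync(d_buf, h_buf, size, cudaMemcpyHostToDevice, stream);
myKernel<<<(n + 255) / 256, 256, 0, stream>>>(d_buf);
cudaMemcpyAsync(h_buf, d_buf, size, cudaMemcpyDeviceToHost, stream);

cudaStreamSynchronize(stream);     // wait for all queued work to finish
cudaStreamDestroy(stream);
cudaFreeHost(h_buf);
cudaFree(d_buf);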