cuda gpu computing

1 CUDA GPU Computing Advisor ： Cho-Chin Lin Student ： Chien- Chen Lai

Upload: nico

Post on 14-Jan-2016

69 views

Category:

Documents

0 download

Report

Download

Embed Size (px):

DESCRIPTION

CUDA GPU Computing. Advisor ： Cho-Chin Lin Student ： Chien-Chen Lai. Outline. Introduction and Motivation. What is driving the many-cores?. Control. ALU. ALU. ALU. ALU. DRAM. Cache. DRAM. Design philosophies are different. - PowerPoint PPT Presentation

TRANSCRIPT

CUDA GPU Computing

Advisor： Cho-Chin Lin

Student ： Chien-Chen Lai

Outline

Introduction and Motivation

What is driving the many-cores?

Quadro FX 5600

NV35 NV40

G70G70-512

G71

Tesla C870

NV30

3.0 GHzCore 2 Quad3.0 GHz

Core 2 Duo3.0 GHz Pentium 4

GeForce8800 GTX

100

200

300

400

500

600

Jan 2003 Jul 2003 Jan 2004 Jul 2004 Jan 2005 Jul 2005 Jan 2006 Jul 2006 Jan 2007 Jul 2007

Design philosophies are different.

DRAM

Cache

ALUControl

ALU

DRAM

CPU GPU

The GPU is specialized for compute-intensive, massively data parallel computation (exactly what graphics rendering is about).

So, more transistors can be devoted to data processing rather than data caching and flow control

CPU VS. GPU

Jamie and Adam demonstrate the difference between a CPU and GPU.

This is not your advisor’s parallel computer! Significant application-level speedup over

uni-processor executionNo more “killer micros”

Easy entrance An initial, naïve code typically get at least 2-

3X speedup

This is not your advisor’s parallel computer! Wide availability to end users

available on laptops, desktops, clusters, super-computers

Numerical precision and accuracy IEEE floating-point and double precision

Historic GPGPU Constraints

Input Registers

Fragment Program

Output Registers

Constants

Texture

Temp Registers

per threadper Shaderper Context

FB Memory

Dealing with graphics API Working with the corner cases of

the graphics API Addressing modes

Limited texture size/dimension Shader capabilities

Limited outputs Instruction sets

Lack of Integer & bit ops Communication limited

No interaction between pixels No scatter store ability - a[i] = p

CUDA - No more shader functions. CUDA integrated CPU+GPU application C program

Serial or modestly parallel C code executes on CPU Highly parallel SPMD kernel C code executes on GPU

CPU Serial CodeGrid 0

. . .

GPU Parallel Kernel

KernelA<<< nBlk, nTid >>>(args);

Grid 1CPU Serial Code

GPU Parallel Kernel

KernelB<<< nBlk, nTid >>>(args);

CUDA for Multi-Core CPU A single GPU thread is too small for a CPU Thread

CUDA emulation does this and performs poorly CPU cores designed for ILP, SIMD

Optimizing compilers work well with iterative loops Turn GPU thread blocks from CUDA into iterative CPU loops

CUDA Grid

GPU CPU

Compiler

CUDA for Multi-Core CPU

Application C on single core CPU

Time

CUDA on 4-core CPU

Time

Speedup*

CUDA on G80

Time

MRI-FHD ~1000s 230s ~4x 8.5s

CP 180s 45s 4x .28s

SAD 42.5ms 25.6ms 1.66x 4.75ms

MM (4Kx4K) 7.84s** 15.5s 3.69x 1.12s

CUDA-Based GPU Computing Framework for GNU Octaveon-demand.gputechconf.com/gtc/2012/posters/P0213...CUDA-Based GPU Computing Framework for GNU Octave Inspired by Jacket from AccelerEyes

An Introduction to GPU Computing and CUDA Architecturedeveloper.download.nvidia.com/CUDA/training/GTC... · What is CUDA? CUDA Architecture Expose GPU computing for general purpose

GPU-Computing mit CUDA und OpenCL in der Praxis

CUDA-Based GPU Computing Framework for GNU Octavedeveloper.download.nvidia.com/GTC/PDF/GTC2012/Posters/P... · 2012-05-09 · CUDA-Based GPU Computing Framework for GNU Octave Inspired

GPU Computing and CUDA

Introduction to GPU Computing with OpenCL - Nvidiadeveloper.download.nvidia.com/CUDA/training/NVIDIA... · Introduction to GPU Computing with OpenCL. ... //nvdeveloper.nvidia.com/login.asp

GPU Computing with Nvidia CUDA - Department of Electrical ...€¦ · GPU Computing with Nvidia CUDA 1 Analogic Corp. 4/14/2011 David Kaeli, Perhaad Mistry, Rodrigo Dominguez, Dana

GPU Computing with CUDA Lecture 2 - CUDA Memories · GPU Computing with CUDA Lecture 2 - CUDA Memories Christopher Cooper Boston University August, 2011 UTFSM, Valparaíso, Chile

Schulung: Einführung in das GPU-Computing mit NVIDIA CUDA

GPU-Programmierung: OpenCL€¦ · Einsatzgebiete von GPU-Computing Entwicklung von GPU-Computing 2 OpenCL Entwicklung Architektur Spracheigenschaften Vergleich mit CUDA Beispiel

Introduction to CUDA - TUMIntroduction to CUDA Oliver Meister November 7th 2012 Oliver Meister: Introduction to CUDA ... software-side: programming models for GPU computing: CUDA,

Characterizing and Detecting CUDA Program Bugs · ABSTRACT While CUDA has become a major parallel computing platform and programming model for general-purpose GPU computing, CUDA-induced

GPU Computing with CUDA - University of …outreach.sbel.wisc.edu/Workshops/GPUworkshop/2014/2_Advanced.pdfGPU Computing with CUDA ... Shared Memory Use (Dot Product, Matrix Multiplication)

IAP09 CUDA@MIT 6.963 - Lecture 01: GPU Computing using CUDA (David Luebke, NVIDIA)

NVIDIA CUDA Programming Guide - Artificial Intelligence ... · 3.1 GPU Computing Case Studies ... The key to CUDA is the C compiler for the GPU. ... NVIDIA CUDA Programming Guide

Tutorial on GPU computing - Lorena A. · PDF fileTutorial on GPU computing ... •CUDA is a compiler and toolkit for programming NVIDIA GPUs. ... GPU computing = General-purpose on

GPU Computing with CUDA - univ-reims.frcosy.univ-reims.fr/~cjaillet/www/pub/fichiers/enseignement/Info...GPU Computing with CUDA ... • ATI Stream by AMD • CUDA by NVIDIA • OpenCL

GPU Computing with CUDA Lecture 3 - Efficient …GPU Computing with CUDA Lecture 3 - Efficient Shared Memory Use Christopher Cooper Boston University August, 2011 UTFSM, Valparaíso,

GPU Algorithms III/IV Computing with CUDA · GPU Algorithms III/IV Computing with CUDA MADALGO Summer School on Algorithms for Modern Parallel and Distributed Models Suresh Venkatasubramanian

Introduction to GPU computing for statisticicans · CUDA systems GPU computing with R CUDA and our CUDA systems Specs of our CUDA systems I No graphical user interface or remote desktop

GPU Computing with CUDA

GPU Computing with CUDA Lecture 1 - Introduction · GPU Computing with CUDA Lecture 1 - Introduction Christopher Cooper Boston University August, 2011 UTFSM, Valparaíso, Chile 1

GPU Computing with CUDA Lecture 8 - CUDA Libraries - …GPU Computing with CUDA Lecture 8 - CUDA Libraries - CUFFT, PyCUDA Christopher Cooper Boston University August, 2011 UTFSM,

NVIDIA CUDA Software and GPU Parallel Computing Architecturekr.nvidia.com/content/cudazone/download/showcase/... · NVIDIA CUDA Software and GPU Parallel Computing Architecture David

GPU (Graphics Processing Unit) Programming in CUDANVIDIA CUDA Programming Guide) ... CUDA C OpenCL CUDA Fortran ... GPU Computing Applications. Soluzioni alternative a CUDA per GPU

NVIDIA GPU Computing Webinars Further CUDA Optimization

CUDA Without Cuda (CUDA Libraries) - Nvidiadeveloper.download.nvidia.com/CUDA/training/ntrotoCUDALibraries.pdf · CUDA Without Cuda (CUDA Libraries) GPU Computing Webinar 7/16/2011

GPU Computing with Nvidia CUDA - Northeastern University€¦ · GPU Computing with Nvidia CUDA 1 Analogic Corp. 4/14/2011 David Kaeli, Perhaad Mistry, Rodrigo Dominguez, ... binary

GPU Computing with CUDA Lecture 6 - CUDA Libraries - Thrust · GPU Computing with CUDA Lecture 6 - CUDA Libraries - Thrust Christopher Cooper Boston University August, 2011 UTFSM,

[Harvard CS264] 03 - Introduction to GPU Computing, CUDA Basics

NVIDIA GPU Computing Webinars CUDA Memory Optimization

GPU Computing with CUDA Lecture 1 - Introduction › pasi › files › 2011 › 07 › Lecture1.pdf · GPU Computing with CUDA Lecture 1 - Introduction Christopher Cooper ... Graphic

CUDA Libraries and Tools - Nvidia › content › GTC › documents › SC09_CUDA...CUDA Libraries & Tools NVIDIA GPU with the CUDA Parallel Computing Architecture CUDA C OpenCL Direct

Introduction To CUDA · GPU and CUDA • Popular – Over 100 million CUDA enabled GPU sold • Easy to program using CUDA – C and C++ Integration – Sizeable computing libraries

GPU Computing: The Democratization of Parallel Computingskadron/cuda_asplos08_tutorial/1-Intro-overvie… · GPU Computing with CUDA brings data-parallel computing to the masses Over