An Analytical Model for a GPU

Post on 21-Dec-2015


Page 1: An Analytical Model for a GPU. Overview SVM Kernel Behavior: Need for other metrics

An Analytical Model for a GPU

Page 2

Overview

Page 3

SVM Kernel Behavior: Need for other metrics

Page 4

Degree of Parallelism

GPU architecture: each SM executes multiple warps in a time-sharing fashion while one or more warps are waiting for memory values.

This hides the execution cost of the warps that are executed concurrently.

The key question is how many memory requests can be serviced, and how many warps can be executed together, while one warp is waiting for memory values.

Page 5

MWP and CWP

Memory warp: a warp that is waiting for memory values.

Memory warp waiting period: the time from right after a warp sends its memory requests until all memory requests from that warp have been serviced.

CWP (Computation Warp Parallelism): the number of warps that the SM processor can execute during one memory warp waiting period, plus one. The extra one counts the warp that is itself waiting for memory values.

MWP (Memory Warp Parallelism): the maximum number of warps per SM that can access memory simultaneously during one memory warp waiting period.
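
The CWP definition above can be turned into a small calculation. The sketch below is only an illustration under stated assumptions: every warp is taken to have the same computation period, and the result is capped at the number of active warps. The function and parameter names are hypothetical and do not come from the slides.

```python
def cwp_from_cycles(mem_waiting_cycles, comp_cycles_per_warp, active_warps):
    """Estimate CWP from the definition: warps the SM can execute during one
    memory warp waiting period, plus one for the waiting warp itself.

    Assumptions (not from the slides): all warps share the same computation
    period, and CWP cannot exceed the number of active warps on the SM.
    """
    if comp_cycles_per_warp <= 0:
        raise ValueError("comp_cycles_per_warp must be positive")
    # Warps that fit into one waiting period, plus the waiting warp.
    uncapped = mem_waiting_cycles / comp_cycles_per_warp + 1
    return min(uncapped, active_warps)


if __name__ == "__main__":
    # Hypothetical numbers: 400-cycle waiting period, 100-cycle computation period.
    print(cwp_from_cycles(400, 100, active_warps=8))
    print(cwp_from_cycles(400, 100, active_warps=4))
```

With a 400-cycle waiting period and a 100-cycle computation period, four other warps can run while one warp waits, so the uncapped value is five; with only four active warps the cap applies.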

Page 6

Relationship between MWP and CWP: CWP > MWP

What is getting hidden?

Figure: total execution time with (a) 8 warps and (b) 4 warps.

Page 7

What is going on here?

Page 8

Relationship between MWP and CWP: MWP > CWP

Figure: total execution time with (a) 8 warps and (b) 4 warps.

Page 9

Relationship between MWP and CWP: MWP > CWP

Figure: total execution time with 8 warps.

Page 10

Not Enough Warps Running
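
The three situations named so far (CWP > MWP, MWP > CWP, and not enough warps running) can be compared with a rough cycle estimate. The slides' own equations did not survive in this transcript, so the expressions below are an assumed simplification and not the deck's model: memory-bound time is taken as the per-warp memory time serialized over groups of MWP warps, and compute-bound time as one memory latency plus all computation. All names and the sample numbers are hypothetical.

```python
def estimate_exec_cycles(mem_cycles, comp_cycles, n_warps, mwp, cwp,
                         mem_latency, n_mem_insts=1):
    """Rough per-SM cycle estimate (illustrative assumption, not the slides' formula).

    mem_cycles   total memory waiting cycles of one warp
    comp_cycles  total computation cycles of one warp
    n_warps      active warps on the SM
    mem_latency  latency of a single memory request
    n_mem_insts  memory instructions per warp (splits computation into periods)
    """
    comp_period = comp_cycles / n_mem_insts  # one computation period
    if mwp == n_warps and cwp == n_warps:
        # Not enough warps: nothing is left over to hide either cost.
        return mem_cycles + comp_cycles + comp_period * (mwp - 1)
    if cwp >= mwp:
        # Memory-bound: computation is hidden behind memory waiting periods.
        return mem_cycles * n_warps / mwp + comp_period * (mwp - 1)
    # Compute-bound: memory waiting is hidden behind computation.
    return mem_latency + comp_cycles * n_warps


if __name__ == "__main__":
    # Hypothetical numbers: 400 memory cycles and 100 computation cycles per warp.
    print(estimate_exec_cycles(400, 100, n_warps=8, mwp=2, cwp=5, mem_latency=400))
    print(estimate_exec_cycles(400, 100, n_warps=8, mwp=8, cwp=5, mem_latency=400))
    print(estimate_exec_cycles(400, 100, n_warps=2, mwp=2, cwp=2, mem_latency=400))
```

Under these assumptions, raising MWP from 2 to 8 with the same workload moves the estimate from the memory-bound branch to the compute-bound branch, which is the contrast the two "Relationship between MWP and CWP" pages draw.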

Page 11

CPI

PTX instruction set

Page 12

Model

Page 13

Example

Blocks: 80

Threads: 128

5 blocks per SM
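
The example's launch parameters determine how many warps are active on each SM, which is the upper bound for both MWP and CWP. The sketch below works through that arithmetic. It assumes that "Threads" means threads per block and that a warp holds 32 threads (the NVIDIA warp size); the helper names are hypothetical.

```python
import math

WARP_SIZE = 32  # threads per warp on NVIDIA GPUs


def warps_per_block(threads_per_block):
    """Warps needed for one block; a partially filled warp still counts."""
    return math.ceil(threads_per_block / WARP_SIZE)


def warps_per_sm(threads_per_block, blocks_per_sm):
    """Active warps on one SM when blocks_per_sm blocks are resident."""
    return warps_per_block(threads_per_block) * blocks_per_sm


def sms_to_hold_all_blocks(total_blocks, blocks_per_sm):
    """SMs needed for every block to be resident at the same time."""
    return math.ceil(total_blocks / blocks_per_sm)


if __name__ == "__main__":
    # Values from the example: 80 blocks, 128 threads, 5 blocks per SM.
    print(warps_per_block(128))           # 4
    print(warps_per_sm(128, 5))           # 20
    print(sms_to_hold_all_blocks(80, 5))  # 16
```

With 20 active warps per SM, neither MWP nor CWP can exceed 20 for this configuration.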
