www.bsc.es belgrade, 25 september 2014 george s. markomanolis, oriol jorba, kim serradell...

47
www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum Supercomputer

Upload: wilfrid-stafford

Post on 05-Jan-2016

221 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

www.bsc.es

Belgrade, 25 September 2014

George S. Markomanolis, Oriol Jorba, Kim Serradell

Performance analysis Tools: a case study of NMMB on Marenostrum

Supercomputer

Page 2: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

2

Outline

2

Introduction to ParaverExamples with NMMB/BSC-CTMVarious Paraver views

Configuration of Extrae tool

Summary

Page 3: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

3

Tools

3

Since 1991

Based on traces

Open source: http//:www.bsc.es/paraver

Core tools:ParaverExtrae Dimemas

Page 4: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

4

Paraver

4

Every behavioral aspect/metric described as a function of time

Those functions of time can be rendered into a 2D image

Statistics can be computed for each possible value or range of values of that function of time

Page 5: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

5

Extrae

5

BSC instrumentation package

When/WhereParallel programming model runtimeSelected user functionsPeriodic samplesUser events

Additional information Counters

Page 6: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

6

Timelines

6

Representation

Function of time

Colour encoding

Page 7: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

7

Paraver – Generic View

7

Part of the timeline Colours for different events Example for 68 MPI processes 1 hour global domain, 24km, 64 layers, meteo configuration

Page 8: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

8

Paraver – Menu (from BSC Tools presentation)

8

Page 9: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

9

Paraver – Load configuration (from BSC Tools presentation)

9

Page 10: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

10

Paraver – Menu (from BSC Tools presentation)

10

Page 11: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

11

Paraver – Profiles (from BSC Tools presentation)

11

Page 12: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

12

Paraver – Profiles (from BSC Tools presentation)

12

Page 13: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

13

Paraver – Histograms (from BSC Tools presentation)

13

Page 14: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

14

Paraver – Histograms (from BSC Tools presentation)

14

Page 15: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

15

Paraver –View

15

Running and observing the events

Computation

Page 16: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

16

Paraver – Computation View

16

Create a profile view for the following part of the trace

Page 17: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

17

Paraver – Profile View

17

Create a profile view for the following part of the trace

Page 18: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

18

Paraver – Profile View

18

Percentage of MPI callsAverage=98.7% is the

parallel efficiency Maximum = 99.98% is the

communication efficiencyAvg/max = 0.99 is perfect

load balanced only for thispart of the trace

Page 19: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

19

Paraver – Useful Duration

19

Part of the timeline 1 hour global domain, 24km, 64 layers, meteo configuration Green low computation, blue significant computation (useful

duration view)

Page 20: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

20

Paraver – Time histogram

20

For better load balancing is needed to have vertical lines

Page 21: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

21

Paraver – Instructions histogram

21

The computation is not uniform

Page 22: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

22

Paraver – Instructions per cycle (IPC)

22

Efficient computation

Useful efficient computation

Page 23: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

23

Paraver – Useful computation histogram

23

Page 24: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

24

Paraver – Useful time histogram

24

Page 25: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

25

Paraver – Useful IPC histogram

25

Page 26: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

26

Paraver – Useful L2 cache miss hit ratio

26

Per user function

Table

Page 27: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

27

Paraver – MPI calls

27

MPI calls excluding computation

MPI calls with partial communication visualization

Page 28: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

28

Paraver – Total bytes sent

28

Page 29: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

29

Paraver – Max bytes sent

29

Page 30: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

30

Paraver – Percentage of MPI time per user function

30

Page 31: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

31

Paraver – Communication matrix

31

Page 32: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

32

MPI – Send a message

32

Page 33: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

33

Paraver – User functions

33

User functions

Useful user functions

Page 34: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Paraver – Global – 24km - Meteo

Simulation: 02/12/2005

Page 35: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Paraver – Global – 24km – Meteo – between radiations

Page 36: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Paraver – Global – 24km – Meteo – radiation

Page 37: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Communication matrix

Page 38: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Paraver – Global – 24km – Meteo/Dust/Chem

Simulation: 21/05/2010

Page 39: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Paraver – Global – 24km – Meteo/Dust/Chem

Simulation: 21/09/2010

Page 40: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Paraver – (useful) user functions

Page 41: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Paraver – (useful) user functions

Page 42: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Computation load imbalance

Page 43: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Zoom between radiation calls for dust/sea-salt

Page 44: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Extrae

44

How to use:

mpirun … wrapper.sh /path/umo.x

Contents of wrapper.sh file:

export EXTRAE_HOME=/installation_path/export LD_PRELOAD=/installation_path/lib/libmpitrace.soexport LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/installation_path/libsource ${EXTRAE_HOME}/etc/extrae.shexport EXTRAE_CONFIG_FILE=/path/extrae_config.xml$*

Page 45: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

<?xml version=‘1.0’?></trace enabled=“yes”…

<mpi enabled=“yes”> <counters enabled=“no”/></mpi><user-functions enabled=“yes” list=“/path/fucntions_list.txt”> <counters enabled=“yes”/><user-functions>

<counters enabled=“yes”> <cpu enabled=“yes” starting-set-distribution=“1”> <set enabled=“yes” domain=“user” changeat-globalops=“0”> PAPI_TOT_INS,PAPI_TOT_CYC </set> </cpu>

<buffer enabled=“yes”> <size enabled=“yes”> 1000000</size> <circular enabled=“no”></buffer>

…<merge enabled=“yes”…>$TRACE_NAME$</merge></trace>

Extrae – XML file

45

Page 46: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

Summary

46

The performance analysis of an application is a long and sometimes difficult task

We used Extrae/Paraver to analyze our modelPerformance tools are needed more and more!Hardware counters are important to study the computation

phasesLoad imbalance issues are well known to the community but

need to be studiedWe identified some serialization issuesExtrae needs to be properly configured

Page 47: Www.bsc.es Belgrade, 25 September 2014 George S. Markomanolis, Oriol Jorba, Kim Serradell Performance analysis Tools: a case study of NMMB on Marenostrum

www.bsc.es

Thank you!

Questions?

47