ansys fluent performance intel xeon 5675 comparison ansys fluent 12.1 to ansys fluent · pdf...

13
ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT 13.0 Performance Testing Performed at QLogic NETtrack Developer Center

Upload: dangnhu

Post on 13-Mar-2018

568 views

Category:

Documents


10 download

TRANSCRIPT

Page 1: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

ANSYS FLUENT Performance

Intel Xeon 5675 Comparison

ANSYS FLUENT 12.1 to ANSYS FLUENT 13.0

Performance Testing Performed

at

QLogic NETtrack Developer Center

Page 2: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Agenda

• Partners in the Testing

• Test Configuration

• Performance Profile

• Difference in Performance Rating

• User and MPI Utilization Differences

• MPI Function Calls

• Message Rate

• NETtrack Developer Center

• ANSYS Customer Test Drive Program

Month DD, YYYYQLogic Confidential2

Page 3: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Partners in the Testing

• The information contained within this document was generated in QLogic’s

NDC laboratory

• Participating parties: ANSYS, Dell, Intel and QLogic

• Cluster Resources: NETTrack Developer Center (NDC) Dell C6100 – Sapphire II

• For more info regarding participants, please refer to the following sites:

www.qlogic.com, www.Dell.com, www.ansys.com, www.Intel.com

Month DD, YYYY3 QLogic Confidential

Page 4: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

NDC – Test Configuration

QLogic Confidential4

Server: Dell PowerEdge C6100 (Sapphire-II)

CPU: 2 - Intel Xeon X5675 @ 3.07 GHz (Hex Core)

Mem: 48GB (12x4G) DDR3 @ 1333MHz

OS: RHEL 5.5 / Kernel 2.6.18-194.el5

App: ANSYS FLUENT 12.1.11 and 13.0

MPI: Platform MPI

HCA: QLE7340 - QDR InfiniBand

Stack: OFED 1.5.2

Switch: 12300 - 36-port QDR InfiniBand

F/W: 6.0.2.1.11

Settings: 4K MTU 1 Virtual Lane

Page 5: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Performance Profiles

Month DD, YYYYQLogic Confidential5

Page 6: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Performance Profile

ANSYS FLUENT 12.1 vs. ANSYS FLUENT 13.0

• 14% average performance improvement with ANSYS FLUENT 13.0 across the seven benchmark tests

• Greatest performance improvement shown with Sedan (k-eps Turbulence model with Pressure-Based Coupled Implicit solver). Significant gains due to new hybrid parallel implementation in ANSYS FLUENT 13.0

• Least performance gains shown with Truck_14M and Truck_111M benchmark tests, both tests use DES Turbulence model with Segregated Implicit solver

6 QLogic Confidential

0

2000

4000

6000

8000

10000

12000

14000

Aircraft_2M Sedan_4m Eddy_417k Truck_14M Truck_Poly_14M Truck_111m

ANSYS FLUENT Comparison16-nodes/192-cores

FLUENT r12.1

FLUENT r13.0

11%53%

7%

1% 13%

-2%

Page 7: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Profile: User and MPI Overhead

16-Nodes/192-Cores

• ANSYS FLUENT 13 achieves a 14%

better average rating across the

seven tests. In producing these better

results:

• Utilizes an average of 2% less time in

computing

• Increased MPI communications by an

average of 4%

QLogic Confidential7

Benchmark Name Usage v12.1.11 v13.0 % Change

aircraft_2m Rating 9818.2 10922.9 11%

User Time 39.33% 35.82% -9%

MPI time 60.67% 64.18% 6%

sedan_4m Rating 7912.1 12134.8 53%

User Time 49.62% 49.31% -1%

MPI time 50.38% 50.69% 1%

eddy_417k Rating 12272.7 13110.8 7%

User Time 30.51% 27.81% -9%

MPI time 69.49% 72.19% 4%

truck_14m Rating 1497.1 1511 1%

User Time 48.00% 38.51% -20%

MPI time 52.00% 61.49% 18%

truck_poly_14m Rating 1346.8 1525.7 13%

User Time 47.76% 40.41% -15%

MPI time 52.24% 59.59% 14%

truck_111m Rating 180.6 176.9 -2%

User Time 30.51% 44.99% 47%

MPI time 69.49% 55.01% -21%

Average Change Rating 31%

User Time -2%

MPI time 4%

Fluent Version

Page 8: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Profile: MPI Functions

Eddy_417

• The following are the most used calls– MPI_Iprobe (stat gathering), MPI_Isend,

MPI_Irecv, MPI_ALLreduce

• The following calls produce the most Overhead: MPI_Recv, MPI_Allreduce,

MPI_Barrier, MPI_Init, MPI_Waitall, MPI_Iprobe, MPI_Isend.

Month DD, YYYYQLogic Confidential8

ANSYS FLUENT 12.1.11 ANSYS FLUENT 13.0

Page 9: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Profile: MPI Functions

Truck_111

• The following are the most used calls– MPI_Iprobe (stat gathering), MPI_Isend,

MPI_Irecv, MPI_ALLreduce

• The following calls produce the most Overhead (73%): MPI_Recv, MPI_Allreduce,

MPI_Waitall.

Month DD, YYYYQLogic Confidential9

ANSYS FLUENT 12.1.11 ANSYS FLUENT 13.0

Page 10: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Profile: Message Rate

Eddy_417K & Truck_111M

Eddy_417K Model

• ANSYS FLUENT 13.0 sends 8% less messages at 192-cores

• Average message size at 192-cores is 15% smaller with 13.0

• 97% of the messages are less than 1K in size

Truck_111M Model

• ANSYS FLUENT 13.0 sends 57% less messages at 192-cores

• Average message size at 192-cores is 66% larger with13.0

• 98% of the messages are less than 65K in size

Month DD, YYYYQLogic Confidential10

Page 11: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Take a

Test

Dri

ve

Page 12: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

QLogic NETtrack Developer Center

Test Drive Program

• Test Drive your models/simulations at the QLogic NETtrack Developer

Center

• Range of Configurations

• Nodes, Cores, Memory

• Interconnects

• Storage

• Determine the Optimal Performance Setup for your Business

Get Started – Contact us at [email protected]

Page 13: ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1 to ANSYS FLUENT · PDF file · 2011-06-24ANSYS FLUENT Performance Intel Xeon 5675 Comparison ANSYS FLUENT 12.1

Month DD, YYYY13 QLogic Confidential