infrastructure optimization for seismic processing (eng)

Performance Boost for Seismic Processing with right IT infrastructure Vsevolod Shabad [email protected] CEO and founder +7 (985) 765-76-03

Upload: netproject

Post on 19-Jan-2017


Page 1: Infrastructure optimization for seismic processing (eng)

Performance Boost for Seismic Processing with the Right IT Infrastructure

Vsevolod Shabad, CEO and founder, [email protected], +7 (985) 765-76-03

Page 2: Infrastructure optimization for seismic processing (eng)

NetProject at a glance

System integrator with a strong oil & gas focus: we build and expand the IT infrastructure for Geological and Geophysical applications

Page 3: Infrastructure optimization for seismic processing (eng)

How to balance an HPC cluster correctly?

Application performance must be limited only by the licenses, at minimal overall cost:
• Paradigm ES 360, GeoDepth, Echos
• Schlumberger Omega
• Landmark SeisSpace ProMAX
• CGG Geocluster

Page 4: Infrastructure optimization for seismic processing (eng)

Cluster components utilization

[Chart: utilization, on a 30% to 100% scale, of CPU, RAM, Network, Local HDD, and Shared File System in three cases]

• Optimal: limited only by CPU
• Suboptimal: nothing limits (excessive resources)
• Suboptimal: limited by Network but not CPU

Cluster performance must be limited only by the CPU!
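The three cases in the chart can be told apart mechanically. A minimal sketch, assuming per-component utilization is available as a fraction between 0 and 1; the `classify` function, the 0.95 saturation threshold, and the sample figures are hypothetical and are not taken from the slide:

```python
def classify(utilization, saturated=0.95):
    """Classify a cluster by which components run at (near) full utilization."""
    # Components at or above the assumed saturation threshold limit performance.
    limiting = [name for name, load in utilization.items() if load >= saturated]
    if limiting == ["CPU"]:
        return "optimal: limited only by CPU"
    if not limiting:
        return "suboptimal: nothing limits (excessive resources)"
    return "suboptimal: limited by " + ", ".join(limiting)


# Hypothetical sample: the network saturates while the CPU still has headroom.
sample = {"CPU": 0.70, "RAM": 0.60, "Network": 1.00,
          "Local HDD": 0.40, "Shared File System": 0.50}
print(classify(sample))
```

In practice the utilization figures would come from the cluster's monitoring subsystem, sampled over a representative workday.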

Page 5: Infrastructure optimization for seismic processing (eng)

Key ideas for optimization

Massive reduction of:
• CPU downtime on compute nodes
• CPU load from auxiliary tasks
• operating costs

[Chart: a typical sample of HPC CPU load during a workday (real case)]

Page 6: Infrastructure optimization for seismic processing (eng)

Candidates for optimization

Compute nodes

File nodes and storage

Management nodes and resource schedulers

Cluster interconnect

Backup and restore subsystem

Monitoring and management subsystem

Cooling subsystem


Page 7: Infrastructure optimization for seismic processing (eng)

Compute nodes optimization

CPU & RAM: optimal choice

CPU offload with RDMA

CPU offload with GPU

Right form factor of servers


Page 8: Infrastructure optimization for seismic processing (eng)

CPU & RAM: optimal choice sample

Typical compute node for Paradigm ES 360

Part                         Haswell-EP (Xeon E5-2600 v3)              Broadwell-EP (Xeon E5-2600 v4)
Server type                  HPE ProLiant XL230a Gen9                  HPE ProLiant XL230a Gen9
CPU                          2x Xeon E5-2650 v3 (20 cores, 2.3 GHz)    2x Xeon E5-2680 v4 (28 cores, 2.4 GHz)
RAM                          160 GB (8 GB/core), PC4-2133              224 GB (8 GB/core), PC4-2400
HDD                          2x 1.8 TB SAS 10K RPM                     2x 1.8 TB SAS 10K RPM
Network                      1x InfiniBand FDR, 4x GigE                1x InfiniBand FDR, 4x GigE
List price, USD              17 456                                    18 366
Performance, TFLOPS          0.74                                      1.08
List price per core, USD     872.8                                     655.9 (–24.8%)
List price per TFLOPS, USD   23 713.4                                  17 081.5 (–28.0%)

Page 9: Infrastructure optimization for seismic processing (eng)

CPU & RAM: non-optimal choice sample

Typical compute node for Paradigm ES 360

Part                         Broadwell-EP (Xeon E5-2600 v4)            Broadwell-EP (Xeon E5-2600 v4)
Server type                  HPE ProLiant XL230a Gen9                  HPE ProLiant XL230a Gen9
CPU                          2x Xeon E5-2680 v4 (28 cores, 2.4 GHz)    2x Xeon E5-2699 v4 (44 cores, 2.2 GHz)
RAM                          224 GB (8 GB/core), PC4-2400              352 GB (8 GB/core), PC4-2400
HDD                          2x 1.8 TB SAS 10K RPM                     2x 1.8 TB SAS 10K RPM
Network                      1x InfiniBand FDR, 4x GigE                1x InfiniBand FDR, 4x GigE
List price, USD              18 366                                    36 396
Performance, TFLOPS          1.08                                      1.55
List price per core, USD     655.9                                     827.2 (+26.1%)
List price per TFLOPS, USD   17 081.5                                  23 499.5 (+37.6%)
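The per-core and per-TFLOPS figures in the two comparisons can be re-derived from the list prices. A minimal sketch: the TFLOPS column is consistent with cores × base clock × 16 double-precision FLOP per cycle per core (AVX2 with FMA), but that formula is an assumption inferred from the numbers, not something the slides state, and the helper names are hypothetical. With this formula the two Broadwell-EP nodes match the slides after rounding, while the Haswell-EP node comes out at about 23 717 USD per TFLOPS.

```python
FLOP_PER_CYCLE = 16  # assumed: double-precision FLOP per cycle per core (AVX2 + FMA)

# (label, total cores, base clock in GHz, list price in USD) from the slides
NODES = [
    ("2x Xeon E5-2650 v3", 20, 2.3, 17456),
    ("2x Xeon E5-2680 v4", 28, 2.4, 18366),
    ("2x Xeon E5-2699 v4", 44, 2.2, 36396),
]


def peak_tflops(cores, ghz):
    """Theoretical peak performance of one node in TFLOPS."""
    return cores * ghz * FLOP_PER_CYCLE / 1000.0


def metrics(cores, ghz, price):
    """Return (price per core, price per TFLOPS), both in USD."""
    return price / cores, price / peak_tflops(cores, ghz)


for label, cores, ghz, price in NODES:
    per_core, per_tflops = metrics(cores, ghz, price)
    print(f"{label}: {peak_tflops(cores, ghz):.2f} TFLOPS, "
          f"{per_core:.1f} USD/core, {per_tflops:.1f} USD/TFLOPS")
```

The comparison shows why the top-bin CPU is the non-optimal choice here: the price roughly doubles while the theoretical peak grows by less than half.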

Page 10: Infrastructure optimization for seismic processing (eng)

Some non-obvious restrictions

HPE ProLiant XL230a Gen9:
• cannot work with a single CPU
  • two identical processors are required
• cannot use 32 GB RDIMM modules for DDR4 RAM
  • only LRDIMM is allowed for 32 GB DDR4 modules
• …

Fujitsu CX400 M1:
• cannot use two InfiniBand HCAs with liquid cooling
  • only one InfiniBand HCA is allowed
• …

Page 11: Infrastructure optimization for seismic processing (eng)

CPU offload with RDMA

Direct buffer-to-buffer data movement that bypasses the CPU and the operating system

Page 12: Infrastructure optimization for seismic processing (eng)

CPU offload with RDMA

RDMA usage in HPC:
• between compute nodes
• between a compute node and a file node (storage)

Page 14: Infrastructure optimization for seismic processing (eng)

RDMA must be supported by the apps!

Seismic processing:
• Paradigm ES 360, GeoDepth, Echos
  • since version 15.5
• Schlumberger Omega, Landmark ProMAX
  • with MPI

File systems:
• IBM Spectrum Scale (ex GPFS)
• Lustre
• BeeGFS (ex FhGFS)

Page 15: Infrastructure optimization for seismic processing (eng)

CPU offload with GPU

Paradigm:
• Echos RTM

Schlumberger:
• Omega RTM, Kirchhoff

Tsunami:
• Tsunami RTM

Full list: http://www.nvidia.co.uk/content/EMEAI/PDF/tesla-gpu-applications/gpu-apps-catalog-eu.pdf

Page 16: Infrastructure optimization for seismic processing (eng)

Sample of GPU efficiency

Statoil custom seismic processing application

NVIDIA GPU (GeForce GTX280)

Source: http://www.idi.ntnu.no/~elster/master-studs/owej/owe-johansen-master-ntnu.pdf

Page 17: Infrastructure optimization for seismic processing (eng)

Right form factor of servers

Rack-optimized 1U:
• HPE ProLiant DL360 Gen9, …

Rack-optimized 2U:
• Fujitsu RX2540 M2, …

High-density servers:
• Lenovo NeXtScale M5, …

Blade servers:
• DELL M1000e, …

Page 18: Infrastructure optimization for seismic processing (eng)

Typical high-density chassis

Chassis                      Number of nodes   Chassis height, RU
Lenovo NeXtScale n1200       12                6
HPE Apollo 2000              4                 2
HPE Apollo a6000             10                5 (+1.5)
Fujitsu Primergy CX400 S2    4                 2
Huawei X6800                 8                 4
DELL PowerEdge FX2           8                 2
DELL PowerEdge C6320         4                 2
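Node density follows directly from the chassis table. A minimal sketch, with the figures copied from the table; the `nodes_per_ru` helper is hypothetical, and the extra 1.5 RU for the HPE Apollo a6000 is counted against its density because the slide lists it without further explanation:

```python
# (chassis, nodes, height in RU, extra RU) from the table; "extra" is the
# "+1.5" the slide lists for the HPE Apollo a6000 without further explanation.
CHASSIS = [
    ("Lenovo NeXtScale n1200", 12, 6, 0.0),
    ("HPE Apollo 2000", 4, 2, 0.0),
    ("HPE Apollo a6000", 10, 5, 1.5),
    ("Fujitsu Primergy CX400 S2", 4, 2, 0.0),
    ("Huawei X6800", 8, 4, 0.0),
    ("DELL PowerEdge FX2", 8, 2, 0.0),
    ("DELL PowerEdge C6320", 4, 2, 0.0),
]


def nodes_per_ru(nodes, height_ru, extra_ru=0.0):
    """Node density: compute nodes per rack unit, counting any extra rack units."""
    return nodes / (height_ru + extra_ru)


densities = {name: nodes_per_ru(n, h, extra) for name, n, h, extra in CHASSIS}
for name, density in sorted(densities.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {density:.2f} nodes/RU")
```

Density alone does not settle the choice: per-node CPU, RAM, and interconnect options differ between chassis and have to be checked against the application requirements.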

Page 19: Infrastructure optimization for seismic processing (eng)

File nodes and storage

Right choice of file system with RDMA and QoS

High-density block storage arrays

Short IOPS caching with SSD

File system overhead reduction

Backup to Redirect-On-Write snapshots

Transparent data migration to tapes


Page 20: Infrastructure optimization for seismic processing (eng)

Right choice of file system

Criterion                             IBM Spectrum Scale (GPFS)   Lustre (+ ZFS)                   BeeGFS   Panasas PanFS   EMC Isilon, Huawei 9000, NetApp FAS (NFS)
RDMA support at client side           Yes                         Yes                              Yes      No              No
Redirect-On-Write snapshot support    Yes                         Yes (with ZFS)                   No       Yes             Yes
Transparent migration to tape         Yes                         No                               No       No              No
Commercial support                    Yes                         “Yes” (no for ZFS-on-Linux)      Yes      Yes             Yes
Installation difficulty               High                        High                             High     Low             Low
Single-thread performance             Moderate                    Low                              High     Low             Low

Page 22: Infrastructure optimization for seismic processing (eng)

Right choice of block storage array

Requirements sample (moderate cluster for SLB Omega):
• I/O throughput: 3 GB/sec on sequential I/O
• usable capacity: 300 TB

Solutions:
• EMC VNX5400: 174 spindles, 39 RU
• NetApp E5600: 69 spindles, 12 RU

Front-end I/O port options:
• SAS, Fibre Channel, 10GbE, InfiniBand

The NetApp E5600 disk array supports iSER connections over an InfiniBand fabric (without FC & SAS)
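The difference between the two solutions is easier to see per spindle and per rack unit. A minimal sketch using only the requirement and solution figures from the slide; decimal units (1 GB = 1000 MB) and the `per_spindle` helper are assumptions made for illustration:

```python
REQUIRED_THROUGHPUT_MB_S = 3000   # 3 GB/sec sequential I/O (decimal units assumed)
REQUIRED_CAPACITY_TB = 300        # usable capacity

# (array, spindles, rack units) from the slide
SOLUTIONS = [("EMC VNX5400", 174, 39), ("NetApp E5600", 69, 12)]


def per_spindle(spindles):
    """Average throughput (MB/s) and capacity (TB) each spindle must deliver."""
    return (REQUIRED_THROUGHPUT_MB_S / spindles,
            REQUIRED_CAPACITY_TB / spindles)


for name, spindles, rack_units in SOLUTIONS:
    mb_s, tb = per_spindle(spindles)
    print(f"{name}: {mb_s:.1f} MB/s and {tb:.2f} TB per spindle, "
          f"{REQUIRED_CAPACITY_TB / rack_units:.1f} TB per RU")
```

The denser array meets the same requirements with fewer than half the spindles and less than a third of the rack space.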

Page 23: Infrastructure optimization for seismic processing (eng)

Short IOPS caching with SSD

Most applicable for:
• metadata
• trace headers

File system support:
• IBM Spectrum Scale:
  • Highly-available write cache (HAWC)
  • Local Read-Only Cache (LROC)
• Panasas ActiveStor:
  • internal Panasas ActiveStor capabilities
• Lustre:
  • no
• EMC Isilon:
  • SmartFlash

Page 24: Infrastructure optimization for seismic processing (eng)

Backup to Redirect-On-Write snapshots

Traditional backup to tapes:
• 500 TB of data, verification after write
• four LTO-7 drives (300 MB/sec each):
  • 500 000 000 / (4 * 300 * 2 * 3600) = 58 hours under ideal conditions

Innovative backup (snapshots):
• 10 minutes regardless of data volume
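The tape backup estimate can be reproduced in a few lines. A minimal sketch: the function name and the `slide_factor` parameter are hypothetical, and the factor 2 is copied from the slide's expression as-is, since the slide does not say what it stands for. Decimal units (1 TB = 1 000 000 MB) are assumed, matching the slide's 500 000 000.

```python
def tape_backup_hours(data_tb, drives, drive_mb_s, slide_factor=1):
    """Hours needed to stream data_tb terabytes to tape at the drives' full rate."""
    data_mb = data_tb * 1_000_000           # decimal units: 1 TB = 1 000 000 MB
    aggregate_mb_s = drives * drive_mb_s    # all drives streaming in parallel
    return data_mb / (aggregate_mb_s * slide_factor * 3600)


# The slide's expression: 500 000 000 / (4 * 300 * 2 * 3600)
as_on_slide = tape_backup_hours(500, drives=4, drive_mb_s=300, slide_factor=2)
# The same data and drives with no extra factor in the denominator
single_pass = tape_backup_hours(500, drives=4, drive_mb_s=300)
print(f"{as_on_slide:.0f} h as on the slide, {single_pass:.0f} h for a single pass")
```

Either way the result is measured in days, against minutes for a snapshot, which is the point of the comparison.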

Page 25: Infrastructure optimization for seismic processing (eng)

Transparent data migration to tapes

Middleware: IBM Spectrum Archive (LTFS)

Drives: IBM TS1150 (360 MB/s, 10 TB per cartridge)

Page 26: Infrastructure optimization for seismic processing (eng)

Resource schedulers

Cluster usage alignment and optimization

Energy aware scheduling

Top-3 systems:
• IBM Spectrum LSF (Platform LSF)
• Altair PBS Pro
• Adaptive Computing MOAB HPC Suite

Page 27: Infrastructure optimization for seismic processing (eng)

Cluster interconnect

Key technology — RDMA

Two options:
• InfiniBand FDR/EDR
• Ethernet 40G/56G/100G (with RoCE)

Page 28: Infrastructure optimization for seismic processing (eng)

Backup and restore subsystem

Protection from logical data damage:
• with Redirect-On-Write snapshots

Protection from physical data damage is not necessary:
• with a dedicated seismic data archive on LTO-7 tapes

Page 29: Infrastructure optimization for seismic processing (eng)

Snapshot technologies comparison

[Chart: Redirect-On-Write (NetApp FAS3240) vs. Copy-On-Write (IBM Storwize V7000)]

Source: NetProject’s comparative testing for JSC NOVATEK (2012)

Page 30: Infrastructure optimization for seismic processing (eng)

Monitoring and management

Dedicated Gigabit Ethernet fabric

Management software:
• Altair PBS Pro + HPE CMU
• IBM Platform Cluster Manager

Page 31: Infrastructure optimization for seismic processing (eng)

Cooling subsystem

Air conditioning:
• applicable to any equipment (up to 15 kW/rack)
• temperature range can be expanded

Liquid cooling:
• requires equipment adaptation or a “liquid cooling rack door”
• allows a high level of data center density

Page 32: Infrastructure optimization for seismic processing (eng)

Temperature range expansion

4% operational savings from cooling for every 1 °C increase in operating temperature (Intel, IDC, Gartner)

Incompatible:
• with NVIDIA GRID
• with CPU > 120 W

ATD module price for Fujitsu Primergy is 26 USD!

Page 33: Infrastructure optimization for seismic processing (eng)

Our product portfolio:

Servers:
• Lenovo, HPE, DELL, Fujitsu, Cisco, Huawei, Intel, Supermicro, Inspur, Sugon, …

Storage:
• NetApp, IBM, HDS, Panasas, Huawei, EMC, …

Network:
• Mellanox, Cisco, Lenovo, Huawei, Brocade

Tape libraries:
• IBM, HPE, Quantum

Resource scheduling tools:
• IBM Platform, Altair, Adaptive Computing

Page 34: Infrastructure optimization for seismic processing (eng)

Company strengths

Strong industry focus on G&G IT infrastructure

Deep knowledge and experience:
• industry specifics and major applications:
  • better than most system integrators and IT infrastructure vendors
• IT infrastructure products and technologies:
  • better than most G&G software vendors

Advanced project management methodology

Structured knowledgebase of past projects

High engineering culture of staff

Deep customer involvement in projects


Page 35: Infrastructure optimization for seismic processing (eng)

Company weaknesses

Strong industry focus on G&G IT infrastructure

Low level of company brand recognition

Lack of overseas market experience

A small number of employees

Limited financial resources


Page 36: Infrastructure optimization for seismic processing (eng)

Company background in brief

Technology experience:
• Networking since 1996
• Storage since 2004
• Servers since 2006
• VDI since 2010
• HPC since 2014

Oil & Gas industry focus since 2012:
• metallurgy and banking focus in 1996-2014

Big Data experience since 2007

ISO 9001:2008 certified since 2012

Page 37: Infrastructure optimization for seismic processing (eng)

Partner reference about NetProject

Lately we have increasingly been asking NetProject for advice on hardware configurations, clusters, workstations, and infrastructure solutions.

The company's team has taken part in designing various data center architectures for a wide range of Paradigm technologies and has proven itself very well.

Serge Levin, Sales Director, Paradigm Geophysical (Russia)

Page 38: Infrastructure optimization for seismic processing (eng)

A proven way to win! Join us!

http://www.netproject.ru/