neethi 1suresh , lois thomas , bipin kumar 1 · dns for large domains: challenges for computation...

9
DNS for large domains: Challenges for computation and storage Neethi Suresh 1 , Lois Thomas 1 , Bipin Kumar 1 Ravi S. Nanjundiah 1,2 and Suryachandra A. Rao 1 1 Indian Institute of Tropical Meteorology, Pune, India 2 Center for Atmospheric and Oceanic Science, Indian Institute of Science, Bangalore, India

Upload: others

Post on 05-Jul-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

DNS for large domains: Challenges for computation and storage

Neethi Suresh1, Lois Thomas1, Bipin Kumar1

Ravi S. Nanjundiah1,2 and Suryachandra A. Rao1

1 Indian Institute of Tropical Meteorology, Pune, India 2 Center for Atmospheric and Oceanic Science,

Indian Institute of Science, Bangalore, India

Page 2: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

15-Dec-17 Neethi Suresh

Motivation

Adapted from E. Bondenschatz, S.P. Malinowski, R.A. Shaw & F. Stratmann,

Science,327, p. 970 (2010).

1 µm

1 mm

10 cm

10 m 1 km

50 km

Page 3: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

Warm air

(environment)

Cloudy (cold)

air

Motivation

Page 4: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

Model Eulerian (Fluid flow equations)

Lagrangian (droplet movement equations)

Finite particle

response time

Periodic BC

: vapor gas constant

: dry air gas constant

v

d

R

R

LSf : turbulent forcing

(form large scale

: li

:

quid water conten

vapor mixing ra

t

tio

)

v

l

q

q

Kumar et al., JAS, 2014, JAMES, 2017, Götzfried et al., JFM 2017

Page 5: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

Domain decomposition on 2D processor topology

Computational Details

Pseudo-spectral method used to convert Partial Differential Equations (PDE) to set of Ordinary Differential Equations (ODE). System ODE is solved by 2nd order Predictor-corrector (time stepping) method.

1 mm

FORTRAN 90 + MPI + OpenMP

Page 6: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

Flowchart FORTRAN 90 + MPI + OpenMP

Page 7: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

Before After

Computational time reduced by 17%.

Code optimization

77 %

23 %

60 %

40%

Page 8: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

Computational details

Four different domains

D1 : 12.8 cm3, D2: 25.6 cm3,

D3: 51.2 cm3 , D4: 102.4 cm3

Scaling `Diff1' is the difference in times with respect to previous

small domain. Similarly, difference in the times with

respect to smallest domain is represented by `Diff2'.

Nt #droplets required for that computational domain.

Total times for 60000 iteration using 1024 cores.

Requirement (for one experiment)

Core: min (16384)

Up to 500 TB

More memory per core/node

Very fast inter-processor connections

6989561 sec 80 days : 2m3

Page 9: Neethi 1Suresh , Lois Thomas , Bipin Kumar 1 · DNS for large domains: Challenges for computation and storage Neethi 1Suresh, Lois Thomas 1, Bipin Kumar Ravi S. Nanjundiah1,2 and

Carried out simulation of cloud micro-physics in different domains. Cope optimization saved 17% total time.

Scaling analysis shows a superliner speed-up. Also optimized

number of cores have been suggested for each domain. Provide projected time and required resources for bigger domain.

Conclusions