data processing for the square kilometre array...

31
Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics Group, Cavendish Lab University of Cambridge Email: [email protected] Project Engineer & Architecture Task Leader SKA Science Data Processor Consortium Streaming 2015 Workshop Indianapolis 27th October 2015

Upload: others

Post on 25-Mar-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Data Processing for

the Square Kilometre

Array Telescope

Bojan Nikolic

Astrophysics Group, Cavendish Lab

University of Cambridge

Email: [email protected]

Project Engineer & Architecture Task Leader

SKA Science Data Processor Consortium

Streaming 2015 Workshop – Indianapolis

27th October 2015

Page 2: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Not covered in this talk

Not covered:

• Science Objectives of the SKA

• Timelines, project programmatic and organisational structure

• Design of the receptors (aperture arrays & dishes) and their associated digital electronics/communications equipment

Also quite a few spare slides on topics covered.

Find me later if you’d like information on these

Page 3: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

THE SQUARE KILOMETRE

ARRAY TELESCOPE

Page 4: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

What is the Square Kilometre Array (SKA) ?

Page 5: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Mid frequency dishes and dense aperture array

concept

Page 6: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Low frequency array and the survey telescope

concept

Page 7: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Phased Aperture array for 40 – 650 MHz

Page 8: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Phased Aperture array: 3 million antennas

Page 9: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Scientific Context – a partner to ALMA, EELT,

JWST

Credit:A.

Marinkovic/XCam/ALMA(ESO/NAOJ/NRAO)

Credit: Northrop Grumman (artists impression)

Credit:ESO/L. Calçada (artists impression)

Credit: SKA Organisation (artists impression)

Page 10: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Scientific Context – a partner to ALMA, EELT,

JWST

Credit:A.

Marinkovic/XCam/ALMA(ESO/NAOJ/NRAO)

Credit: Northrop Grumman (artists impression)

Credit:ESO/L. Calçada (artists impression)

Credit: SKA Organisation (artists impression)

European ELT

• ~40m optical telescope

• Completion ~2025

• Budget ~1.1 bn EUR

JWST:

• 6.5m space near-infrared telescope

• Launch 2018

• Budget ~8 bn USD

ALMA:

• 66 high precision sub-mm antennas

• Completed in 2013

• Budget ~1.5 bn USD

Square Kilometre Array

• Two next generation low-frequency arrays

• Completion ~2022 for Phase 1

• Budget 0.65 bn EUR for Phase 1

Construction

Page 11: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

What will the Square Kilometre Array (SKA) be?

Radio Telescope

Makes Images of the Sky at radio (5m-3cm) wavelengths

~100 more sensitive than current telescopes

Complements ALMA, JWST (successor to Hubble), and E-ELT

Currently in Design

Construction begins 2018

Full operations expected at end 2022

Significant funds already committed by participating countries

Major Engineering Project

Two remote desert sites

>100k receiving elements

Major ICT Project

Subject of this talk!

Page 12: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

SCIENCE DATA PROCESSOR

CONTEXT

Page 13: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

The SKA Observatory – Phase 1 : “SKA1”

Receptors: Aperture Arrays and Dishes

Digital Signal Processing: FPGA + (maybe) ASICS & GPUs

Data Processing: general computing

People, buildings,

roads, ground works,

communications,

electrical power,

maintenance,

transportation,

catering at desert

sites….

Page 14: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

SKA Context Diagram

SKA1 Low:Low Frequency Aperture Array

SKA1 Mid:Dish Antennas with Single-Pixel feeds

LFAA Correlator/

Beam Former

Science Data Processor

Implementation(Australia)

Science Data Processor

Implementation(South Africa)

SKA1 Mid Correlator/

Beam Former

Pulsar Search

Processor(South Africa)

Monitor and Control

Astro

nom

ers

Pulsar Search Processor(Australia)

These are off-

site! (In Perth &

Cape Town)

Page 15: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

COMPUTING IS THE MAJOR PART

OF SKA TELESCOPES BY DESIGN

Role of computing/processing in SKA:

Page 16: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Large “D” – vs – Large “N”

GBT 100-m diameter telescope SKA LFAA prototype array

No 1 aim: collect as many photons as possible

No 2 aim: maximum separation of collectors -> achieve high angular resolution

Page 17: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

X X X X X X

SKY Image

Detect &

amplify

Digitise

& delay

Correlate

Process

( ==

SDP)

Calibrate, grid, FFT

Integrate

s

B

B . s

1 2

Astronomical signal

(EM wave) • Visibility:

V(B) = E1 E2*

= I(s) exp( i w B.s/c )

• Resolution determined by

maximum baseline

qmax ~ l / Bmax

• Field of View (FoV)

determined by the size of

each dish

qdish ~ l / D

Basics of Interferometry – how signals are

combined

Page 18: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

SKA SCIENCE DATA

PROCESSOR OVERVIEW

Page 19: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Key Characteristics of SKA Data Processing

Very large data volumes, all data are processed in each observation

Noisy Data

Corrected for by deconvolution using iterative algorithms (~10 iterations)

Sparse and Incomplete Sampling

Corrected by jointly solving for the sky brightness distribution and for the slowly changing corruption effects using iterative algorithms

Corrupted Measurements

Loosely coupled tasks, large degree of parallelism is inherently available

Multiple dimensions of

data parallelism

Page 20: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

SDP Top-level Components & Key Performance

Requirements -- SKA Phase 1

SDP Local Monitor & Control

High Performance

• ~100 PetaFLOPS

Data Intensive

• ~100 PetaBytes/observation (job)

Partially real-time

• ~10s response time

Partially iterative

• ~10 iterations/job (~3 hour)

Telescope Manager

C

S

P

Regio

nal C

entre

s &

Astro

nom

ers

High Volume & High Growth Rate

• ~100 PetaByte/year

Infrequent Access

• ~few times/year max

Data Processor Long Term

Archive

Delivery

System

Data Distribution

•~100 PetaByte/year from Cape Town & Perth to rest of World

Data Discovery

•Visualisation of 100k by 100k by 100k voxel cubes

Science Data Processor

1 Tera

Byte/s

Page 21: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

The challenges?

• Power efficiency

– Funding agencies more tolerant of cap-ex then power op-ex

• Scalability of software

– Hardware roadmaps indicate h/w will reach requirements

– Demonstrated software scaling is only 1/1000th of requirement

• Project risks

– Inevitable significant interaction between software engineers and

idiosyncratic domain specific knowledge

– Software project

• Extensibility, system scalability, maintainability

– SKA1 is the first “milestone” – expecting significant expansion in

the 2020s

– 50yr observatory lifetime

Page 22: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Relevance of Streaming 2015 technologies

• Multiple Loosely Coupled Task – Yes– Within each iteration & between observation

• BSP – Yes: a few (~10) iterations– Synchronisation at end of each iteration

– Relatively small synchronised state (sky model + calibration)

• Workflow – yes

• Streaming – yes (but not interactive)

• Steering – yes (feed back of calibration solutions to the receptors)

• Data Driven – yes & yes– In the dataflow execution model sense (i.e., not demand-driven)

– In the programming model sense (i.e., a declarative description of how to reduce the data are attached to the data themselves)

Page 23: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Why streaming?

• Detecting transient phenomena

– Need to generate alerts latest about ~10s

after receiving the data

• Impractical to store data for later

processing

– Data rate in ~ 1 TeraByte/s

– -> “stream” but with data chunked by

observation

Page 24: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

ARCHITECTURE HIGHLIGHTS

SKA Science Data Processor

Page 25: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Programming model

• Hybrid programming model:– Dataflow at coarse-grained level:

• About 1 million tasks/s max over the whole processor (-> ~10s – 100s milli second tasks), consuming ~100 MegaByte each

• Static scheduling at coarsest-level (down to “data-island”)– Static partitioning of the large-volume input data

• Dynamic scheduling within data island:– Failure recovery, dynamic load-balancing

• Data driven (all data will be used)

– Shared memory model at fine-grained level e.g.: threads/OpenMP/SIMT-like

• ~100s active threads per shared memory space

• Allows manageable working memory size, computational efficiency

Page 26: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Why?

• Shared memory model essential at fine-grain to control working memory requirements

• Dataflow (but all of these are still to be proven in our application):

– Load-balancing

– Minimisation of data movement

– Handling failure

– Adaptability to different system architectures

Page 27: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Fault Tolerance – “Non-Precious” Data

• Classify arcs in the dataflow graph as precious or non-precious

• Precious data are treated in usual way –failover, restart, RAID, etc.

• Non-precious data can be dropped:– If they are input to a map-type operation then no

output

– If they are input to a reduction then result is computed without them

• Stragglers outputting non-precious data can be terminated after a relatively short time-out

Page 28: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

The non-precious data concept - illustration

FULL DATASET

FREQ 1 FREQ 2 FREQ 3 FREQ NF

SPLIT BY FREQ

GRID

FFT

Single Frequency Image

GRID

FFT

Single Frequency Image

Reduce by pixel-vise addition

GRID

FFT

Single Frequency Image

GRID

FFT

Single Frequency Image

2 Freq Image 2 Freq Image

Reduce by pixel-vise addition

Combined Image

Normalise by 4

Combined Image

FULL DATASET

FREQ 1 FREQ 2 FREQ 3 FREQ NF

SPLIT BY FREQ

GRID

FFT

Single Frequency Image

GRID

FFT

Single Frequency Image

GRID

FFT

Single Frequency Image

GRID

FFT

Single Frequency Image

2 Freq Image 1 Freq Image

Reduce by pixel-vise addition

Combined Image

Normalise by 3

Combined Image

Page 29: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Data Ingress and Interconnect Concept

1st stage

BDN

1st stage

BDN

1st stage

BDN

2nd stage

BDN

2nd stage

BDN

2nd stage

BDN

2nd stage

BDN

2nd stage

BDN

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

Com

pu

te Is

lan

d

CSP

Node

CSP

Node

CSP

Node

CSP

Node

CSP

Node

CSP

Node

CSP

Node

CSP

Node

CSP

Node

Science Archive switch Science Archive switch

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

3rd stage switch

2nd stage

LLN

2nd stage

LLN

2nd stage

LLN

2nd stage

LLN

1st stage

LLN

1st stage

LLN

INGEST LAYER

DELIV

Sto

rag

e P

od

Sto

rag

e P

od

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

25

26

27

28

29

30

31

32

33

34

35

36

37

38

39

40

41

42

43

44

45

46

47

48

LNK/ACT LNK/ACT LNK/ACT LNK/ACT

RESET

PWRRPSDIAG

TEMPFAN

FDX/HDXLNK/ACT

COMBO PORTS

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

20

21

22

23

24

Long Term Storage

High Performance Buffer

Low-Latency Network

Medium Performance Buffer

Bulk Data Network

Archive Network

Page 30: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics

Final Remarks

From SKA-SDP Phase 1 perspective:

• Probably all of the paradigms and ideas we need are available

• Most technologies required for implementing these are available but scattered between many products and platforms

• Need to move towards integrating these together and stable APIs

Page 31: Data Processing for the Square Kilometre Array Telescopebn204/publications/2015/SKA-SDP-Streaming.pdf · Data Processing for the Square Kilometre Array Telescope Bojan Nikolic Astrophysics