astronomy toolkits and data structures

29
Astronomy toolkits and data structures Andrew Jenkins Durham University

Upload: audi

Post on 15-Jan-2016

35 views

Category:

Documents


0 download

DESCRIPTION

Astronomy toolkits and data structures. Andrew Jenkins Durham University. Data requirements of cosmological simulations. Adrian Jenkins Durham University. Talk outline. DiRAC and its major users New astronomical instruments and missions Mock catalogues Millennium simulation and database - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Astronomy toolkits and data structures

Astronomy toolkits and data structures

Andrew Jenkins

Durham University

Page 2: Astronomy toolkits and data structures

Data requirements of cosmological simulations

Adrian Jenkins

Durham University

Page 3: Astronomy toolkits and data structures

3

Talk outline

• DiRAC and its major users• New astronomical instruments and

missions• Mock catalogues• Millennium simulation and database• Future directions for simulations

Page 4: Astronomy toolkits and data structures

DiRAC 2 facility

• Cambridge HPC Service: data analytic cluster

• Cambridge COSMOS shared memory service

• Durham ICC Service: data centric cluster (6720 core - idataPlex)

• Edinburgh 6144 node Bluegene/Q• Leicester IT services: complexity cluster

Page 5: Astronomy toolkits and data structures

DiRAC2 facility used by

Time allocated by RAC. Supports large projects (up to 3 years), and smaller allocations.

• Large users: UKQCD Virgo Consortium (UK) UKMHD

Horizon, Leicester …

Page 6: Astronomy toolkits and data structures

JWST

Launch date: ~2017-8

Cost >$5 billion

EUCLID

Launch date:~2019

Cost ~€500 million

Page 7: Astronomy toolkits and data structures
Page 8: Astronomy toolkits and data structures

Future large surveys• Photometric e.g. Pan-STARRs, DES, LSST, Euclid-VIS

• Spectroscopic e.g. BOSS, BigBOSS, Euclid-NIS

• Multi-wavelength e.g SKA (HI)

Wide-field (>10,000 sq deg), wide redshift (z=0-3)

z-surveys: 10-50 million galaxies imaging surveys ~billions of galaxies

Page 9: Astronomy toolkits and data structures

Why build a mock?

• Test galaxy formation models• Test algorithms - validation• Test processing pipelines• Assess survey performance (FoM)

Large surveys need mocks now!

Page 10: Astronomy toolkits and data structures

Mock catalogues need observables

SFRSFHStellar massCold gas massBlack hole mass

imagesFull SED (UV, Optical, FIR, Radio)Galaxies : stars, gas, AGN

Page 11: Astronomy toolkits and data structures

Euclid OU-LE3 requirements for simulations

CSWG OU-SIM

Cosmologicalsimulators

Instrumentsimulators

Page 12: Astronomy toolkits and data structures

Generic needs from Euclid• Position, redshift

• Emission line properties/spectra Line flux, equivalent width• Broad photometry to AB~24-24.5 Euclid NIR Euclid VIS Pan-STARRS griz DES grizy CFHTLS ugriyz WFCAM ZYJHK SDSS ugriz VISTA-VHS-VIDEO ZYJHKs• Photometric redshifts

Page 13: Astronomy toolkits and data structures

Specific needs: clustering

• 1% P(k) accuracy• Covariance estimates: P(k) etc• Initial conditions for reconstruction• Different cosmologies• Different galaxy formation models

(vary bias)

Page 14: Astronomy toolkits and data structures

Specific needs – clusters of galaxies

• DM haloes M>1.e+13Msun, r(ΔΔ2500, 500,200; velocity dispersion along axes from DM particles

• For each galaxy host halo ID, central or sat?

• Simulated images for cluster detection and mass determination through weak/strong lensimg

Page 15: Astronomy toolkits and data structures

Specific needs: weak lensing

• Galaxies and DM to generate kappa map

• Galaxy shapes with noise (no IA) • Galaxy shapes with IA• Shear at each galaxy position• Image properties: mask, bright stars, chip boundaries,

CCD defects, ghosts, variations in depth & background

Page 16: Astronomy toolkits and data structures

16

Infrastructure required to make mocks

• Require large simulations

• To date these have been simulations of dark matter in large cosmological volumes.

Page 17: Astronomy toolkits and data structures

17

Page 18: Astronomy toolkits and data structures

18

Page 19: Astronomy toolkits and data structures

19

Page 20: Astronomy toolkits and data structures

20

Page 21: Astronomy toolkits and data structures

21

Page 22: Astronomy toolkits and data structures
Page 23: Astronomy toolkits and data structures
Page 24: Astronomy toolkits and data structures

24

Input simulations

• Large N-body simulations

• Approaching a trillion particles

Page 25: Astronomy toolkits and data structures

MXXL simulation

Page 26: Astronomy toolkits and data structures

2626

Page 27: Astronomy toolkits and data structures

27

Future needsSimulations for Euclid multi-trillion particle simulations

Produce multi-petabyte datasets

Data growing faster than network capabilities

Need to scale databases up

Ideally would like to serve the raw simulation data - two or more orders of magnitude larger.

Page 28: Astronomy toolkits and data structures

28

Current and future simulations

Page 29: Astronomy toolkits and data structures

29

Summary

• Cosmological simulations are required to make the best use of observatories and space missions

• The size of the required simulations makes this a Big data problem

• Databases have proved very successful way of presenting processed data

• Making the raw simulation data public desirable - but very challenging given financial constraints.