lecture 1 data assimilation basics
TRANSCRIPT
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 1
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Lecture 1Data Assimilation Basics
H. ElbernRhenish Institute for Environmental Research at the
University of Cologneand
Virt. Inst. for Inverse Modelling of Atmopheric Cjemical Composition
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 2
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Objective of this lecture
Understanding of• the fundamental objectives of data
assimilation and inverse modelling• Theory of data assimilation
– three basic sources of information– combination of information sources– the basic ideas of 4D-var and Kalman-filter– the limiting assumptions and a look forward
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 3
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
General textbook literature for data assimilation• Daley, R., Atmospheric Data Analysis, Cambridge University Press, pp. 457, 1991.
– Well established connection between statistics and practical data assimilation– Meanwhile behind cutting edge operational data assimilation implementations
• Bennet, A., Inverse Methods in Physical Oceanography, Cambridge University Press, pp. 346, 1992.
• Bennet, A., Inverse Modeling of the Ocean and Atmosphere, Cambridge University Press, pp. 234, 2002.
– Mathematically very sound, with focus on oceangraphy– emphasis on science, rather than operational usage
•• Kalnay, E., Atmospheric Modeling, Data Assimilation and Predictability,
Cambridge University Press, pp. 341, 2003, Chap. – Well presented pedagogical introduction to data assimilation theory
• Thiebaux, H.J. and Pedder, M.A. Spatial Objective Analysis with applications in atmospheric science, Academic Press1987.
– Mathematically rigorous formalism with Optimum Interpolation as central algorithm
Additionally, a collection of overview papers:• Special Issue: J. Met. Soc. Japan, Vol. 75, No. 1B, pages 1-138, 1997.
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 4
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Underlying mathematics:
• Jazwinski, A., Stochastic Processes and Filtering Theory, Academic Press,San Diego, pp. 376, 1970.
• Marchuk, G., Numerical solutions of the problems of the problems of thedynamics of the atmosphere and ocean, Gidrometeoizadat, Leningrad, 1974.
• Courant, R. and Hilbert, D. Methods of mathematical physics, Volume 1, New York: Interscience, 1953.
• Courant, R. and Hilbert, D. Methods of mathematical physics, Partial differentialequations, Volume 2, New York: Interscience, 1962.
• Lions, J., Optimal control of systems governed by partial differential equations, Spinger Verlag, Berlin, 1971.
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 5
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Terminology
Inverse ModellingThe inverse modelling problem consists of using
the actual result of some measurements to infer the values of the parameters that characterize the system.
A. Tarantola (2005)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 6
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Objective of atmospheric data assimilation (1)
The ambitious and elusive goal of dataassimilation is to provide a dynamicallyconsistent motion picture of theatmosphere and oceans, in three spacedimensions, with known error bars.
M. Ghil and P. Malanotte-Rizzoli (1991)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 7
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Objective of atmospheric data assimilation (2)• "is to produce a regular,
physically consistent four dimensionalrepresentation of the state of the atmosphere
• from a heterogeneousarray of in situ and remote instruments
• which sample imperfectly and irregularly in space and time.
Data assimilation • extracts the signal from
noisy observations (filtering)
• interpolates in space and time (interpolation) and
• reconstructs state variables that are not sampled by the observation network (completion).“ (Daley, 1997)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 8
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Data assimilation has much of curve fitting!
y
a
b { |1
|0 x
y=ax+b
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 9
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Characteristics in data assimilation,in comparison to remote sensing retrievals
• high dimensional problem: dim (x) > 105
• highly underdetermined system (few observations with respect to model freedom: dim(x)>>dim(y) )
• nonlinear dynamics• constraints by physical laws are insufficient
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 10
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Advanced data assimilation is an application of the principles of Data Analysis.
(As in satellite retrievals) we strive for estimating a latent, not apparent parameter set x.
We dispose of 1. indirect information on the state and processes in terms of
manifest, though insufficient or inappropriate data y, and2. a deterministic model M connecting x and y, in some way.
Bayes’ rule gives access to the probability of state x, given y and M
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 11
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
inversion
radiance dataasimilation
dataretrieval
DA in the satellite data application chainlevel 0
detector tensions
level 1spectra
level 2geolocated
geophysical parameters
level 3 geophysical parameters
on regular grids
level 4 user applications
priordata
geophysicalmodels
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 12
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
PredictabilityWhich credibility can the prediction claim?
Compute probability density function
(in some way by “optimal perturbations”)
ObservabilityBy which observations
“how unambiguous” can the state estimate be?
compute (some norm of)analysis error covariance
matrix
SensitivityHow probable/stable is the simulation/state ?
compute leading singular vectors
discontinuities
Data AssimilationInverse Modelling
estimate most probable stateparameter values
ill p
osed
ness
whi
ch o
bser
vatio
ns?
mos
t pro
babl
epr
edic
tion
optimal perturbations
DA and related algorithms: How can dynamic system understanding be
assessed?
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 13
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Observation configuration optimisation(targeted observations)
Given:
Stabilitymeasure:
Problem: 1. Finde perturbation dx(t0), which triggers the mostuncertain system evolution dx(t)!
Solution:Calculate eigenvectors of
with maximal eigenvectors
L(t0,t) integration operator
2. Infer the observation configuration, which minimizesthe analysis error covariance matrix A (in some suitablesense) : trace, determinant, max. EV.
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 14
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Ad hoc (empirical) approaches (1)Interpolation
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 15
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Ad hoc (empirical) approaches (2)Nudging
Caution: prone to excitement of artificial modes
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 16
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Optimality criteria: Which property can be attributed to our analysis result?
(Need for quantification)• maximum likelyhood:
– maximum of probability density function• minimal variance: l2 norm
– parameters optimal, for which analysis error spread is minimal (for Gaussian/normal and log-normal error distributions), Best Linear Unbiased Estimate (BLUE)
• minimax norm (discrete cases)• maximum entropy
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 17
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Data assimilation: Synergy of Information sourcesexample: 2 data sets with Gaussian distribution
(here: minimal variance= maximum likelihood)
a priori (=prediction or climatology) observation
Bayes’ rule:
xa
Analysis (=estimation) BLUEBLUEBest Linear Unbiased Estimate
xb yoxaxb yoxb
Example: inconsistent data
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 18
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 19
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
A 2 information bits example (1): Assumptions
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 20
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
A 2 information bits example (2): estimation
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 21
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
A 2 information bits example (3): estimation
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 22
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
A 2 information bits example (4): estimation
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 23
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Single Observation, direct inversion
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 24
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Single observation, Minimisation
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 25
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Optimal Interpolation
Notation: Ide, K., P. Courtier, M. Ghil, and A. Lorenc, Unified notation for data assimilation: operational sequential and variational, J. Met. Soc. Jap., 75, 181--189, 1997.
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 26
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Generalized Formulation (1)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 27
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Generalized Formulation (2)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 28
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Generalized Formulation (3)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 29
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Generalized Formulation (4)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 30
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Generalized Formulation (5)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 31
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Kalman filter: basic equations
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 32
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
4D-var
Kalman Filter
Types of assimilation algorithms:“smoother” and filter
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 33
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Why 4D-variational data assimilation?
• provides BLUE (Best Linear Unbiased Estimator). Remark: 3D-var ingested in model does not!
• Potential for “consisteny” within the assimilation intervall (O(1 day))Remark: fast manifold perturbations (=“initialisation problem”) mitigated, but not removed. Inconsistencies at the end of the assimilation windows.
• Allows for extensions to estimate analysis error covariance matrix and temporal correlations (red noise)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 34
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
How does 4D-var work?
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 35
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Derivation of 4D-var (1)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 36
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Derivation of 4D-var (2)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 37
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Derivation of 4D-var (3)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 38
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Derivation of 4D-var (4)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 39
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Derivation of 4D-var (5)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 40
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Derivation of 4D-var (6)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 41
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Derivation of 4D-var (7)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 42
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Collection of assimilation formulavariational data assimilation (3D-var und 4D-var)
Optimal Interpolation (OI)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 43
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
The “Initialisation” / Filtering Problem (1)• Constraints by physical/chemical equations are
insufficient, as they allow for physically/chemically admissible states, but partly very unlikely to occur: – high amplitude gravity waves (“fast
manifolds”)– pronounced chemical imbalances
• Legacy procedure: introduce penalty term for fast changes (Normal Mode Initialisation, NMI)
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 44
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
The “Initialisation” / Filtering Problem (2)
• or filter function (Digital Filter Initialisation, DFI, here with Lanczos-filter ), x model state:
t=0 model start time
t=Mt=-M t=1
Day 5 Lecture 1 Data Assimilation Hendrik Elbern 45
DRAGON ADVANCED TRAINING COURSE IN ATMOSPHERE REMOTE SENSING
Outlook
• So far, only systems with Gaussian errors are treated.
• Sophisticted related algorithms (4D-var, KF) are far from being fully exploited.
• However, limits of Gaussian assumptions are obvious: assimilation for precipitation forecasts and aerosol process poor.
• Further, more general approaches necessary: MCMC, importance resampling, …