multilevel monte carlo methods for uncertainty
TRANSCRIPT
Multilevel Monte Carlo methods for uncertaintyquantification in subsurface flow
Aretha Teckentrup
Department of Mathematical SciencesUniversity of Bath
Joint work with:
Rob Scheichl and Elisabeth Ullmann (Bath),
Julia Charrier (Marseille), Mike Giles (Oxford),
Andrew Cliffe, Minho Park (Nottingham),
Christian Ketelsen and Panayot Vassilevski (LLNL)
May 23, 2013
A. Teckentrup (Bath) MLMC May 23, 2013 1 / 23
Outline
Motivation and model problem: uncertainty quantification inradioactive waste disposal
Standard Monte Carlo
Multilevel Monte Carlo
Multilevel Markov chain Monte Carlo
A. Teckentrup (Bath) MLMC May 23, 2013 2 / 23
Application: WIPP test caseUS Dept Energy Radioactive Waste Isolation Pilot Plant (WIPP) in New Mexico
Cross-section at WIPP
SANTA ROSA FORMATION
D E W E Y L A K E F O R M A T I O N
NA
SH
D
RA
W
S A L A D O F O R M A T I O N
R U S T L E R F O R M A T I O N
Forty NinerTamari sk
Unna med Lower Member
WIP
P-2
5
WIP
P-3
3
H-1
8
WIP
P-2
1SH
AFT
S
WIP
P-1
2
H-1
5
P-1
8
vertical exaggeration 20:1
Phill
ips,
19
87
Magenta Dolomite
Culebra Dolomite
Cross section through the rock at theWIPP site
Crucial to assess the risk ofradionuclides reentering thehuman environment
Culebra Dolomite layer acts asprincipal pathway for transportof radionuclides(2D to reasonable approximation)
A. Teckentrup (Bath) MLMC May 23, 2013 3 / 23
Uncertainty in Groundwater Flow
QUATERNARY
MERCIA MUDSTONE
VN-S CALDER
FAULTED VN-S CALDER
N-S CALDER
FAULTED N-S CALDER
DEEP CALDER
FAULTED DEEP CALDER
VN-S ST BEES
FAULTED VN-S ST BEES
N-S ST BEES
FAULTED N-S ST BEES
DEEP ST BEES
FAULTED DEEP ST BEES
BOTTOM NHM
FAULTED BNHM
SHALES + EVAP
BROCKRAM
FAULTED BROCKRAM
COLLYHURST
FAULTED COLLYHURST
CARB LST
FAULTED CARB LST
N-S BVG
FAULTED N-S BVG
UNDIFF BVG
FAULTED UNDIFF BVG
F-H BVG
FAULTED F-H BVG
BLEAWATH BVG
FAULTED BLEAWATH BVG
TOP M-F BVG
FAULTED TOP M-F BVG
N-S LATTERBARROW
DEEP LATTERBARROW
N-S SKIDDAW
DEEP SKIDDAW
GRANITE
FAULTED GRANITE
WASTE VAULTS
CROWN SPACE
EDZ
c©NIREX UK Ltd.
Modelling and simulation essential to assess repositoryperformance
Darcy’s law for an incompressible fluid → elliptic partialdifferential equations
−∇ · (k∇p) = f
Lack of data → uncertainty in model parameters
Quantify impact of uncertainty on outputs throughstochastic modelling (→ random variables)
A. Teckentrup (Bath) MLMC May 23, 2013 4 / 23
Uncertainty in Groundwater Flow
QUATERNARY
MERCIA MUDSTONE
VN-S CALDER
FAULTED VN-S CALDER
N-S CALDER
FAULTED N-S CALDER
DEEP CALDER
FAULTED DEEP CALDER
VN-S ST BEES
FAULTED VN-S ST BEES
N-S ST BEES
FAULTED N-S ST BEES
DEEP ST BEES
FAULTED DEEP ST BEES
BOTTOM NHM
FAULTED BNHM
SHALES + EVAP
BROCKRAM
FAULTED BROCKRAM
COLLYHURST
FAULTED COLLYHURST
CARB LST
FAULTED CARB LST
N-S BVG
FAULTED N-S BVG
UNDIFF BVG
FAULTED UNDIFF BVG
F-H BVG
FAULTED F-H BVG
BLEAWATH BVG
FAULTED BLEAWATH BVG
TOP M-F BVG
FAULTED TOP M-F BVG
N-S LATTERBARROW
DEEP LATTERBARROW
N-S SKIDDAW
DEEP SKIDDAW
GRANITE
FAULTED GRANITE
WASTE VAULTS
CROWN SPACE
EDZ
c©NIREX UK Ltd.
Modelling and simulation essential to assess repositoryperformance
Darcy’s law for an incompressible fluid → elliptic partialdifferential equations
−∇ · (k∇p) = f
Lack of data → uncertainty in model parameters
Quantify impact of uncertainty on outputs throughstochastic modelling (→ random variables)
A. Teckentrup (Bath) MLMC May 23, 2013 4 / 23
Typical model
Typical simplified model for k is a log–normal random field,k = exp[g], where g is a scalar, isotropic Gaussian field.
To sample from k, use Karhunen-Loeve expansion:
log k(x, ω) ≈J∑j=1
√µjφj(x)Zj(ω),
with Zj(ω) i.i.d. N(0, 1).
A. Teckentrup (Bath) MLMC May 23, 2013 5 / 23
Stochastic modelling
Many reasons for stochastic modelling in earth sciences:I lack of data (e.g. data assimilation for weather prediction)I unresolvable scales (e.g. atmospheric dispersion modelling)
Input: best knowledge about system, statistics of input parameters,measured data with error statistics, etc...
Output: statistics of quantities of interest or of entire state space
ZJ(ω) ∈ RJ Model(M)−→ XM (ω) ∈ RM Output−→ QM,J(ω) ∈ Rrandom input state vector quantity of interest
e.g. ZJ multivariate Gaussian; XM numerical solution of PDE; QM,J
a (non)linear functional of XM
Q(ω) inaccessible random variable s.t. E[QM,J ]M,J→∞−→ E[Q]
A. Teckentrup (Bath) MLMC May 23, 2013 6 / 23
Stochastic modelling
Many reasons for stochastic modelling in earth sciences:I lack of data (e.g. data assimilation for weather prediction)I unresolvable scales (e.g. atmospheric dispersion modelling)
Input: best knowledge about system, statistics of input parameters,measured data with error statistics, etc...
Output: statistics of quantities of interest or of entire state space
ZJ(ω) ∈ RJ Model(M)−→ XM (ω) ∈ RM Output−→ QM,J(ω) ∈ Rrandom input state vector quantity of interest
e.g. ZJ multivariate Gaussian; XM numerical solution of PDE; QM,J
a (non)linear functional of XM
Q(ω) inaccessible random variable s.t. E[QM,J ]M,J→∞−→ E[Q]
A. Teckentrup (Bath) MLMC May 23, 2013 6 / 23
Stochastic modelling
Many reasons for stochastic modelling in earth sciences:I lack of data (e.g. data assimilation for weather prediction)I unresolvable scales (e.g. atmospheric dispersion modelling)
Input: best knowledge about system, statistics of input parameters,measured data with error statistics, etc...
Output: statistics of quantities of interest or of entire state space
ZJ(ω) ∈ RJ Model(M)−→ XM (ω) ∈ RM Output−→ QM,J(ω) ∈ Rrandom input state vector quantity of interest
e.g. ZJ multivariate Gaussian; XM numerical solution of PDE; QM,J
a (non)linear functional of XM
Q(ω) inaccessible random variable s.t. E[QM,J ]M,J→∞−→ E[Q]
A. Teckentrup (Bath) MLMC May 23, 2013 6 / 23
Standard Monte CarloUsually interested in finding the expected value (or higher order moments)of output functional Q.
The standard Monte Carlo estimator for this is
E [Q] ≈ E [QM,J ] ≈1N
N∑i=1
Q(i)M,J := QMC,
where Q(i)M,J is the ith i.i.d sample computed with Model(M).
The mean square error can be shown to equal
E[(QMCh − E[Q]
)2] = V[QMCh ] +
(E[QMC
h ]− E[Q])2
=V[QM,J ]
N︸ ︷︷ ︸sampling error
+(E[QM,J −Q]
)2
︸ ︷︷ ︸model error (“bias”)
⇒ very large N and M!
A. Teckentrup (Bath) MLMC May 23, 2013 7 / 23
Standard Monte CarloUsually interested in finding the expected value (or higher order moments)of output functional Q.
The standard Monte Carlo estimator for this is
E [Q] ≈ E [QM,J ] ≈1N
N∑i=1
Q(i)M,J := QMC,
where Q(i)M,J is the ith i.i.d sample computed with Model(M).
The mean square error can be shown to equal
E[(QMCh − E[Q]
)2] = V[QMCh ] +
(E[QMC
h ]− E[Q])2
=V[QM,J ]
N︸ ︷︷ ︸sampling error
+(E[QM,J −Q]
)2
︸ ︷︷ ︸model error (“bias”)
⇒ very large N and M!
A. Teckentrup (Bath) MLMC May 23, 2013 7 / 23
Standard Monte CarloUsually interested in finding the expected value (or higher order moments)of output functional Q.
The standard Monte Carlo estimator for this is
E [Q] ≈ E [QM,J ] ≈1N
N∑i=1
Q(i)M,J := QMC,
where Q(i)M,J is the ith i.i.d sample computed with Model(M).
The mean square error can be shown to equal
E[(QMCh − E[Q]
)2] = V[QMCh ] +
(E[QMC
h ]− E[Q])2
=V[QM,J ]
N︸ ︷︷ ︸sampling error
+(E[QM,J −Q]
)2
︸ ︷︷ ︸model error (“bias”)
⇒ very large N and M!A. Teckentrup (Bath) MLMC May 23, 2013 7 / 23
Complexity of Standard Monte Carlo
Assuming that
(A1)∣∣E[QM,J −Q]
∣∣ = O(M−α) (model error)
(A2) Cost(Q(i)M,J) = O(Mγ) (PDE solver)
there exist M and N such that the total cost to obtain a mean squareerror
E[(QMC − E[Q])2
]= O(ε2)
is
Cost(QMC) = O(ε−2−γ/α)
Typically α = 1/2, γ = 1: for ε = 10−3 we have Cost = O(1012)!
A. Teckentrup (Bath) MLMC May 23, 2013 8 / 23
Complexity of Standard Monte Carlo
Assuming that
(A1)∣∣E[QM,J −Q]
∣∣ = O(M−α) (model error)
(A2) Cost(Q(i)M,J) = O(Mγ) (PDE solver)
there exist M and N such that the total cost to obtain a mean squareerror
E[(QMC − E[Q])2
]= O(ε2)
is
Cost(QMC) = O(ε−2−γ/α)
Typically α = 1/2, γ = 1: for ε = 10−3 we have Cost = O(1012)!
A. Teckentrup (Bath) MLMC May 23, 2013 8 / 23
Multilevel Monte Carlo [Heinrich, ’01], [Giles, ’07]
The multilevel method works on a sequence of levels, s.t. M` = sM`−1
and J` = sJ`−1, ` = 0, 1, . . . , L, and set Q` = QM`,J` .
Linearity of expectation gives us
E [QL] = E [Q0] +L∑`=1
E [Q` −Q`−1]
Define the following multilevel MC estimator for E[Q]:
QMLL := QMC
0 +L∑`=1
(Q` −Q`−1)MC
Terms are estimated independently, with N` samples on level `.
A. Teckentrup (Bath) MLMC May 23, 2013 9 / 23
Multilevel Monte Carlo [Heinrich, ’01], [Giles, ’07]
The multilevel method works on a sequence of levels, s.t. M` = sM`−1
and J` = sJ`−1, ` = 0, 1, . . . , L, and set Q` = QM`,J` .
Linearity of expectation gives us
E [QL] = E [Q0] +L∑`=1
E [Q` −Q`−1]
Define the following multilevel MC estimator for E[Q]:
QMLL := QMC
0 +L∑`=1
(Q` −Q`−1)MC
Terms are estimated independently, with N` samples on level `.
A. Teckentrup (Bath) MLMC May 23, 2013 9 / 23
Multilevel Monte Carlo [Heinrich, ’01], [Giles, ’07]
The mean square error of the this estimator is
E[(QMLL − E[Q]
)2] = V[QMLL ]︸ ︷︷ ︸
sampling error
+(E[QML
L ]− E[Q])2︸ ︷︷ ︸
model error
=V[Q0]N0
+L∑`=1
V[Q` −Q`−1]N`
+(E[QL −Q]
)2N0 still needs to be large, but samples are much cheaper to obtain oncoarser level
N` (` > 0) much smaller, since V[Q` −Q`−1]→ 0 as M` →∞
A. Teckentrup (Bath) MLMC May 23, 2013 10 / 23
Multilevel Monte Carlo [Heinrich, ’01], [Giles, ’07]
The mean square error of the this estimator is
E[(QMLL − E[Q]
)2] = V[QMLL ]︸ ︷︷ ︸
sampling error
+(E[QML
L ]− E[Q])2︸ ︷︷ ︸
model error
=V[Q0]N0
+L∑`=1
V[Q` −Q`−1]N`
+(E[QL −Q]
)2N0 still needs to be large, but samples are much cheaper to obtain oncoarser level
N` (` > 0) much smaller, since V[Q` −Q`−1]→ 0 as M` →∞
A. Teckentrup (Bath) MLMC May 23, 2013 10 / 23
Complexity of Multilevel Monte Carlo
Assume (A1) (model error O(M−α` )), (A2) (cost/sample O(Mγ` )) and
(A3) V[Q` −Q`−1] = O(M−β` )
with 2α ≥ min(β, γ). Then there exist L and {N`} such that the total
cost to obtain a mean square error
E[(QML
L − E[Q])2]
= O(ε2)
is
Cost(QMLL ) =
O(ε−2) if β > γ
O(ε−2 log(ε)2) if β = γ
O(ε−2−(γ−β)/α) if β < γ
{N`} chosen to minimise cost for a fixed variance
A. Teckentrup (Bath) MLMC May 23, 2013 11 / 23
Convergence analysis of MLMC
Can prove that for typical 2D model problems in subsurface flow, (A1)and (A3) are satisfied with α = 1/2, β = 1. With an optimal linear solver(γ = 1), the computational costs are bounded by:
d MLMC MC
2 O(ε−2) O(ε−4)
A. Teckentrup (Bath) MLMC May 23, 2013 12 / 23
Convergence analysis of MLMC
Can prove that for typical 2D model problems in subsurface flow, (A1)and (A3) are satisfied with α = 1/2, β = 1. With an optimal linear solver(γ = 1), the computational costs are bounded by:
d MLMC MC
2 O(106) O(1012)
0.1 % accuracy −→ ε = 10−3
A. Teckentrup (Bath) MLMC May 23, 2013 12 / 23
Numerical Example (multilevel MC)D = (0, 1)2, Q = ‖p‖L2(D), JL =∞, M0 = 82.
Typical 2D model problem.
Left: Number of samples per level. Right: Total computational cost.
A. Teckentrup (Bath) MLMC May 23, 2013 13 / 23
Can the multilevel idea be extended to Markov chain Monte Carlo?
A. Teckentrup (Bath) MLMC May 23, 2013 14 / 23
Incorporating data - Bayesian approach
Recall:
ZJ(ω) ∈ RJ Model(M)−→ XM (ω) ∈ RM Output−→ QM,J(ω) ∈ Rrandom input state vector quantity of interest
“Prior” in our model was multivariate Gaussian ZJ := [Z1, . . . , ZJ ]:
P(ZJ) h (2π)−J/2∏Jj=1 exp
(−Z2
j
2
)
Usually data Fobs related to outputs (e.g. pressure) also available.To reduce uncertainty, incorporate Fobs → the “posterior”
A. Teckentrup (Bath) MLMC May 23, 2013 15 / 23
Incorporating data - Bayesian approach
Bayes’ Theorem: (RHS computable! Proportionality constant 1/P(Fobs) not!)
πM,J(ZJ)︸ ︷︷ ︸posterior
:= P(ZJ |Fobs) h LM (Fobs |ZJ)︸ ︷︷ ︸likelihood
P(ZJ)︸ ︷︷ ︸prior
Likelihood model (e.g. Gaussian):
LM (Fobs |ZJ) h exp“−‖Fobs − FM (ZJ)‖2
σ2fid,M
”FM (ZJ) ... model response; σfid,M ... fidelity parameter (M -dep.)
A. Teckentrup (Bath) MLMC May 23, 2013 16 / 23
Incorporating data - Bayesian approach
Bayes’ Theorem: (RHS computable! Proportionality constant 1/P(Fobs) not!)
πM,J(ZJ)︸ ︷︷ ︸posterior
:= P(ZJ |Fobs) h LM (Fobs |ZJ)︸ ︷︷ ︸likelihood
P(ZJ)︸ ︷︷ ︸prior
Likelihood model (e.g. Gaussian):
LM (Fobs |ZJ) h exp“−‖Fobs − FM (ZJ)‖2
σ2fid,M
”FM (ZJ) ... model response; σfid,M ... fidelity parameter (M -dep.)
A. Teckentrup (Bath) MLMC May 23, 2013 16 / 23
ALGORITHM 1. (Standard Metropolis Hastings MCMC)
Choose Z0J .
At state ZnJ generate proposal Z′J from distribution q(Z′J |ZnJ)(for simplicity symmetric, e.g. random walk).
Accept sample Z′J with probability αM,J = min(
1,πM,J(Z′J)πM,J(ZnJ)
),
i.e. Zn+1J = Z′J with probability αM,J ; otherwise stay at Zn+1
J = ZnJ .
Samples ZnJ used as usual for inference (even though not i.i.d.):
EπM,J [Q] ≈ EπM,J [QM,J ] ≈ 1N
N∑n=1
Q(n)M,J := QMetH
Pros:
Produces a Markov chain{ZnJ}n∈N, with ZnJ ∼ πM,Jas n→∞.
Cons:
Evaluating αM,J expensive for large M .Acceptance rate αM,J very low for large J(< 10%).
A. Teckentrup (Bath) MLMC May 23, 2013 17 / 23
ALGORITHM 1. (Standard Metropolis Hastings MCMC)
Choose Z0J .
At state ZnJ generate proposal Z′J from distribution q(Z′J |ZnJ)(for simplicity symmetric, e.g. random walk).
Accept sample Z′J with probability αM,J = min(
1,πM,J(Z′J)πM,J(ZnJ)
),
i.e. Zn+1J = Z′J with probability αM,J ; otherwise stay at Zn+1
J = ZnJ .
Samples ZnJ used as usual for inference (even though not i.i.d.):
EπM,J [Q] ≈ EπM,J [QM,J ] ≈ 1N
N∑n=1
Q(n)M,J := QMetH
Pros:
Produces a Markov chain{ZnJ}n∈N, with ZnJ ∼ πM,Jas n→∞.
Cons:
Evaluating αM,J expensive for large M .Acceptance rate αM,J very low for large J(< 10%).
A. Teckentrup (Bath) MLMC May 23, 2013 17 / 23
Multilevel Markov Chain Monte Carlo
Key ingredients in multilevel method:
Models with less DOFs on coarser levels much cheaper to solve
V[Q` −Q`−1]→ 0 as `→∞ ⇒ fewer samples on finer levels
Telescoping sum: E [QL] = E [Q0] +∑L
`=1 E [Q`]− E [Q`−1]
In MCMC setting target distribution depends on `, so need to definemultilevel estimator carefully!
EπL [QL] = Eπ0 [Q0]︸ ︷︷ ︸standard MCMC
+L∑`=1
Eπ` [Q`]− Eπ`−1 [Q`−1]︸ ︷︷ ︸2 level MCMC (NEW)
QMLMetHL :=
1N0
N0∑n=1
Q0(Zn0 ) +L∑`=1
1N`
N∑n=1
(Q`(Zn` )−Q`−1(Zn`−1)
)Idea: Split Zn` = [zn`,C, z
n`,F], where zn`,C has length J`−1.
A. Teckentrup (Bath) MLMC May 23, 2013 18 / 23
Multilevel Markov Chain Monte Carlo
Key ingredients in multilevel method:
Models with less DOFs on coarser levels much cheaper to solve
V[Q` −Q`−1]→ 0 as `→∞ ⇒ fewer samples on finer levels
Telescoping sum: E [QL] = E [Q0] +∑L
`=1 E [Q`]− E [Q`−1]
In MCMC setting target distribution depends on `, so need to definemultilevel estimator carefully!
EπL [QL] = Eπ0 [Q0] +L∑`=1
Eπ` [Q`]− Eπ`−1 [Q`−1]
EπL [QL] = Eπ0 [Q0]︸ ︷︷ ︸standard MCMC
+L∑`=1
Eπ` [Q`]− Eπ`−1 [Q`−1]︸ ︷︷ ︸2 level MCMC (NEW)
QMLMetHL :=
1N0
N0∑n=1
Q0(Zn0 ) +L∑`=1
1N`
N∑n=1
(Q`(Zn` )−Q`−1(Zn`−1)
)Idea: Split Zn` = [zn`,C, z
n`,F], where zn`,C has length J`−1.
A. Teckentrup (Bath) MLMC May 23, 2013 18 / 23
Multilevel Markov Chain Monte Carlo
Key ingredients in multilevel method:
Models with less DOFs on coarser levels much cheaper to solve
V[Q` −Q`−1]→ 0 as `→∞ ⇒ fewer samples on finer levels
Telescoping sum: E [QL] = E [Q0] +∑L
`=1 E [Q`]− E [Q`−1]
In MCMC setting target distribution depends on `, so need to definemultilevel estimator carefully!
EπL [QL] = Eπ0 [Q0]︸ ︷︷ ︸standard MCMC
+L∑`=1
Eπ` [Q`]− Eπ`−1 [Q`−1]︸ ︷︷ ︸2 level MCMC (NEW)
QMLMetHL :=
1N0
N0∑n=1
Q0(Zn0 ) +L∑`=1
1N`
N∑n=1
(Q`(Zn` )−Q`−1(Zn`−1)
)
Idea: Split Zn` = [zn`,C, zn`,F], where zn`,C has length J`−1.
A. Teckentrup (Bath) MLMC May 23, 2013 18 / 23
Multilevel Markov Chain Monte Carlo
Key ingredients in multilevel method:
Models with less DOFs on coarser levels much cheaper to solve
V[Q` −Q`−1]→ 0 as `→∞ ⇒ fewer samples on finer levels
Telescoping sum: E [QL] = E [Q0] +∑L
`=1 E [Q`]− E [Q`−1]
In MCMC setting target distribution depends on `, so need to definemultilevel estimator carefully!
EπL [QL] = Eπ0 [Q0]︸ ︷︷ ︸standard MCMC
+L∑`=1
Eπ` [Q`]− Eπ`−1 [Q`−1]︸ ︷︷ ︸2 level MCMC (NEW)
QMLMetHL :=
1N0
N0∑n=1
Q0(Zn0 ) +L∑`=1
1N`
N∑n=1
(Q`(Zn` )−Q`−1(Zn`−1)
)Idea: Split Zn` = [zn`,C, z
n`,F], where zn`,C has length J`−1.
A. Teckentrup (Bath) MLMC May 23, 2013 18 / 23
ALGORITHM 2 (Two-level Metropolis Hastings MCMC for Q` −Q`−1)
At states Zn`−1,Zn` (of the independent level ` and level `− 1 Markov chains)
1 Generate new state Zn+1`−1 using Algorithm 1.
2 Propose Z′` = [Zn+1`−1 , z
′`,F] with z′`,F generated via random walk.
(novel transition prob. qML depends on level `− 1 acceptance prob. α`−1)
3 Accept Z′` with probability
α`(Z′` |Zn` ) = min(
1,π`(Z′`) qML(Zn` |Z′`)π`(Zn` ) qML(Z′` |Zn` )
)
Follows quite easily & both terms have been computed previously.
A. Teckentrup (Bath) MLMC May 23, 2013 19 / 23
ALGORITHM 2 (Two-level Metropolis Hastings MCMC for Q` −Q`−1)
At states Zn`−1,Zn` (of the independent level ` and level `− 1 Markov chains)
1 Generate new state Zn+1`−1 using Algorithm 1.
2 Propose Z′` = [Zn+1`−1 , z
′`,F] with z′`,F generated via random walk.
(novel transition prob. qML depends on level `− 1 acceptance prob. α`−1)
3 Accept Z′` with probability
α`(Z′` |Zn` ) = min(
1,π`(Z′`) qML(Zn` |Z′`)π`(Zn` ) qML(Z′` |Zn` )
)
Follows quite easily & both terms have been computed previously.
A. Teckentrup (Bath) MLMC May 23, 2013 19 / 23
ALGORITHM 2 (Two-level Metropolis Hastings MCMC for Q` −Q`−1)
At states Zn`−1,Zn` (of the independent level ` and level `− 1 Markov chains)
1 Generate new state Zn+1`−1 using Algorithm 1.
2 Propose Z′` = [Zn+1`−1 , z
′`,F] with z′`,F generated via random walk.
(novel transition prob. qML depends on level `− 1 acceptance prob. α`−1)
3 Accept Z′` with probability
α`(Z′` |Zn` ) = min(
1,π`(Z′`) qML(Zn` |Z′`)π`(Zn` ) qML(Z′` |Zn` )
)
Follows quite easily & both terms have been computed previously.
A. Teckentrup (Bath) MLMC May 23, 2013 19 / 23
ALGORITHM 2 (Two-level Metropolis Hastings MCMC for Q` −Q`−1)
At states Zn`−1,Zn` (of the independent level ` and level `− 1 Markov chains)
1 Generate new state Zn+1`−1 using Algorithm 1.
2 Propose Z′` = [Zn+1`−1 , z
′`,F] with z′`,F generated via random walk.
(novel transition prob. qML depends on level `− 1 acceptance prob. α`−1)
3 Accept Z′` with probability
α`(Z′` |Zn` ) = min
(1,
π`(Z′`)π`−1(zn`,C)
π`(Zn` )π`−1(Zn+1`−1 )
)Follows quite easily & both terms have been computed previously.
A. Teckentrup (Bath) MLMC May 23, 2013 19 / 23
Example: one step
Given Zn`−1 and Zn` , the possible (n+ 1)th states of the two chains are:
Level `− 1 test Level ` test Zn+1`−1 Zn+1
`
reject accept Zn`−1 [Zn`−1, z′`,F]
accept accept Z′`−1 [Z′`−1, z′`,F]
reject reject Zn`−1 [zn`,C, zn`,F]
accept reject Z′`−1 [zn`,C, zn`,F]
A. Teckentrup (Bath) MLMC May 23, 2013 20 / 23
Convergence analysis of MLMCMC
What can we prove?
We have a genuine Markov chain on every level.
Multilevel algorithm is consistent (no bias between levels).
Multilevel algorithm converges for any initial state.
Same Complexity Theorem as for Multilevel Monte Carlo.(completely abstract and applicable also in DA for NWP)
I For typical 2D model problems in subsurface flow, we have α = 1/2,β = 1/2. With an optimal linear solver (γ = 1), the computationalcosts are bounded by:
d MLMCMC MCMC
2 O(ε−3) O(ε−4)
A. Teckentrup (Bath) MLMC May 23, 2013 21 / 23
Convergence analysis of MLMCMC
What can we prove?
We have a genuine Markov chain on every level.
Multilevel algorithm is consistent (no bias between levels).
Multilevel algorithm converges for any initial state.
Same Complexity Theorem as for Multilevel Monte Carlo.(completely abstract and applicable also in DA for NWP)
I For typical 2D model problems in subsurface flow, we have α = 1/2,β = 1/2. With an optimal linear solver (γ = 1), the computationalcosts are bounded by:
d MLMCMC MCMC
2 O(109) O(1012)
0.1 % accuracy −→ ε = 10−3
A. Teckentrup (Bath) MLMC May 23, 2013 21 / 23
Numerical Example (multilevel MCMC)D = (0, 1)2, Q = keff , JL = 169, M0 = 162.Data (artificial): Pressure p at 9 random points in domain.
0 1 2 3−20
−18
−16
−14
−12
−10
−8
−6
−4
−2
0
Level
log 2 V
aria
nce
Ql
Ql − Q
l−1
0 1 2 3−8
−7
−6
−5
−4
−3
−2
−1
0
1
2
3
Level
log 2 |M
ean|
Ql
Ql − Q
l−1
0 1 2 310
2
103
104
105
106
107
108
Level
Sa
mp
les
ε = 0.005
ε = 0.001
ε = 0.0005
ε = 0.0003
10−3
101
102
Accuracy ε
ε2 Cos
t
Standard MCMCMultilevel MCMC
A. Teckentrup (Bath) MLMC May 23, 2013 22 / 23
Conclusions
Standard (Markov chain) Monte Carlo algorithms are oftenprohibitively expensive.
Multilevel versions greatly reduce the cost.
Multilevel algorithms are generally applicable.
Full convergence analysis available.
A. Teckentrup (Bath) MLMC May 23, 2013 23 / 23