multilevel monte carlo methods for uncertainty

41
Multilevel Monte Carlo methods for uncertainty quantification in subsurface flow Aretha Teckentrup Department of Mathematical Sciences University of Bath Joint work with: Rob Scheichl and Elisabeth Ullmann (Bath), Julia Charrier (Marseille), Mike Giles (Oxford), Andrew Cliffe, Minho Park (Nottingham), Christian Ketelsen and Panayot Vassilevski (LLNL) May 23, 2013 A. Teckentrup (Bath) MLMC May 23, 2013 1 / 23

Upload: others

Post on 30-Jan-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Multilevel Monte Carlo methods for uncertainty

Multilevel Monte Carlo methods for uncertaintyquantification in subsurface flow

Aretha Teckentrup

Department of Mathematical SciencesUniversity of Bath

Joint work with:

Rob Scheichl and Elisabeth Ullmann (Bath),

Julia Charrier (Marseille), Mike Giles (Oxford),

Andrew Cliffe, Minho Park (Nottingham),

Christian Ketelsen and Panayot Vassilevski (LLNL)

May 23, 2013

A. Teckentrup (Bath) MLMC May 23, 2013 1 / 23

Page 2: Multilevel Monte Carlo methods for uncertainty

Outline

Motivation and model problem: uncertainty quantification inradioactive waste disposal

Standard Monte Carlo

Multilevel Monte Carlo

Multilevel Markov chain Monte Carlo

A. Teckentrup (Bath) MLMC May 23, 2013 2 / 23

Page 3: Multilevel Monte Carlo methods for uncertainty

Application: WIPP test caseUS Dept Energy Radioactive Waste Isolation Pilot Plant (WIPP) in New Mexico

Cross-section at WIPP

SANTA ROSA FORMATION

D E W E Y L A K E F O R M A T I O N

NA

SH

D

RA

W

S A L A D O F O R M A T I O N

R U S T L E R F O R M A T I O N

Forty NinerTamari sk

Unna med Lower Member

WIP

P-2

5

WIP

P-3

3

H-1

8

WIP

P-2

1SH

AFT

S

WIP

P-1

2

H-1

5

P-1

8

vertical exaggeration 20:1

Phill

ips,

19

87

Magenta Dolomite

Culebra Dolomite

Cross section through the rock at theWIPP site

Crucial to assess the risk ofradionuclides reentering thehuman environment

Culebra Dolomite layer acts asprincipal pathway for transportof radionuclides(2D to reasonable approximation)

A. Teckentrup (Bath) MLMC May 23, 2013 3 / 23

Page 4: Multilevel Monte Carlo methods for uncertainty

Uncertainty in Groundwater Flow

QUATERNARY

MERCIA MUDSTONE

VN-S CALDER

FAULTED VN-S CALDER

N-S CALDER

FAULTED N-S CALDER

DEEP CALDER

FAULTED DEEP CALDER

VN-S ST BEES

FAULTED VN-S ST BEES

N-S ST BEES

FAULTED N-S ST BEES

DEEP ST BEES

FAULTED DEEP ST BEES

BOTTOM NHM

FAULTED BNHM

SHALES + EVAP

BROCKRAM

FAULTED BROCKRAM

COLLYHURST

FAULTED COLLYHURST

CARB LST

FAULTED CARB LST

N-S BVG

FAULTED N-S BVG

UNDIFF BVG

FAULTED UNDIFF BVG

F-H BVG

FAULTED F-H BVG

BLEAWATH BVG

FAULTED BLEAWATH BVG

TOP M-F BVG

FAULTED TOP M-F BVG

N-S LATTERBARROW

DEEP LATTERBARROW

N-S SKIDDAW

DEEP SKIDDAW

GRANITE

FAULTED GRANITE

WASTE VAULTS

CROWN SPACE

EDZ

c©NIREX UK Ltd.

Modelling and simulation essential to assess repositoryperformance

Darcy’s law for an incompressible fluid → elliptic partialdifferential equations

−∇ · (k∇p) = f

Lack of data → uncertainty in model parameters

Quantify impact of uncertainty on outputs throughstochastic modelling (→ random variables)

A. Teckentrup (Bath) MLMC May 23, 2013 4 / 23

Page 5: Multilevel Monte Carlo methods for uncertainty

Uncertainty in Groundwater Flow

QUATERNARY

MERCIA MUDSTONE

VN-S CALDER

FAULTED VN-S CALDER

N-S CALDER

FAULTED N-S CALDER

DEEP CALDER

FAULTED DEEP CALDER

VN-S ST BEES

FAULTED VN-S ST BEES

N-S ST BEES

FAULTED N-S ST BEES

DEEP ST BEES

FAULTED DEEP ST BEES

BOTTOM NHM

FAULTED BNHM

SHALES + EVAP

BROCKRAM

FAULTED BROCKRAM

COLLYHURST

FAULTED COLLYHURST

CARB LST

FAULTED CARB LST

N-S BVG

FAULTED N-S BVG

UNDIFF BVG

FAULTED UNDIFF BVG

F-H BVG

FAULTED F-H BVG

BLEAWATH BVG

FAULTED BLEAWATH BVG

TOP M-F BVG

FAULTED TOP M-F BVG

N-S LATTERBARROW

DEEP LATTERBARROW

N-S SKIDDAW

DEEP SKIDDAW

GRANITE

FAULTED GRANITE

WASTE VAULTS

CROWN SPACE

EDZ

c©NIREX UK Ltd.

Modelling and simulation essential to assess repositoryperformance

Darcy’s law for an incompressible fluid → elliptic partialdifferential equations

−∇ · (k∇p) = f

Lack of data → uncertainty in model parameters

Quantify impact of uncertainty on outputs throughstochastic modelling (→ random variables)

A. Teckentrup (Bath) MLMC May 23, 2013 4 / 23

Page 6: Multilevel Monte Carlo methods for uncertainty

Typical model

Typical simplified model for k is a log–normal random field,k = exp[g], where g is a scalar, isotropic Gaussian field.

To sample from k, use Karhunen-Loeve expansion:

log k(x, ω) ≈J∑j=1

√µjφj(x)Zj(ω),

with Zj(ω) i.i.d. N(0, 1).

A. Teckentrup (Bath) MLMC May 23, 2013 5 / 23

Page 7: Multilevel Monte Carlo methods for uncertainty

Stochastic modelling

Many reasons for stochastic modelling in earth sciences:I lack of data (e.g. data assimilation for weather prediction)I unresolvable scales (e.g. atmospheric dispersion modelling)

Input: best knowledge about system, statistics of input parameters,measured data with error statistics, etc...

Output: statistics of quantities of interest or of entire state space

ZJ(ω) ∈ RJ Model(M)−→ XM (ω) ∈ RM Output−→ QM,J(ω) ∈ Rrandom input state vector quantity of interest

e.g. ZJ multivariate Gaussian; XM numerical solution of PDE; QM,J

a (non)linear functional of XM

Q(ω) inaccessible random variable s.t. E[QM,J ]M,J→∞−→ E[Q]

A. Teckentrup (Bath) MLMC May 23, 2013 6 / 23

Page 8: Multilevel Monte Carlo methods for uncertainty

Stochastic modelling

Many reasons for stochastic modelling in earth sciences:I lack of data (e.g. data assimilation for weather prediction)I unresolvable scales (e.g. atmospheric dispersion modelling)

Input: best knowledge about system, statistics of input parameters,measured data with error statistics, etc...

Output: statistics of quantities of interest or of entire state space

ZJ(ω) ∈ RJ Model(M)−→ XM (ω) ∈ RM Output−→ QM,J(ω) ∈ Rrandom input state vector quantity of interest

e.g. ZJ multivariate Gaussian; XM numerical solution of PDE; QM,J

a (non)linear functional of XM

Q(ω) inaccessible random variable s.t. E[QM,J ]M,J→∞−→ E[Q]

A. Teckentrup (Bath) MLMC May 23, 2013 6 / 23

Page 9: Multilevel Monte Carlo methods for uncertainty

Stochastic modelling

Many reasons for stochastic modelling in earth sciences:I lack of data (e.g. data assimilation for weather prediction)I unresolvable scales (e.g. atmospheric dispersion modelling)

Input: best knowledge about system, statistics of input parameters,measured data with error statistics, etc...

Output: statistics of quantities of interest or of entire state space

ZJ(ω) ∈ RJ Model(M)−→ XM (ω) ∈ RM Output−→ QM,J(ω) ∈ Rrandom input state vector quantity of interest

e.g. ZJ multivariate Gaussian; XM numerical solution of PDE; QM,J

a (non)linear functional of XM

Q(ω) inaccessible random variable s.t. E[QM,J ]M,J→∞−→ E[Q]

A. Teckentrup (Bath) MLMC May 23, 2013 6 / 23

Page 10: Multilevel Monte Carlo methods for uncertainty

Standard Monte CarloUsually interested in finding the expected value (or higher order moments)of output functional Q.

The standard Monte Carlo estimator for this is

E [Q] ≈ E [QM,J ] ≈1N

N∑i=1

Q(i)M,J := QMC,

where Q(i)M,J is the ith i.i.d sample computed with Model(M).

The mean square error can be shown to equal

E[(QMCh − E[Q]

)2] = V[QMCh ] +

(E[QMC

h ]− E[Q])2

=V[QM,J ]

N︸ ︷︷ ︸sampling error

+(E[QM,J −Q]

)2

︸ ︷︷ ︸model error (“bias”)

⇒ very large N and M!

A. Teckentrup (Bath) MLMC May 23, 2013 7 / 23

Page 11: Multilevel Monte Carlo methods for uncertainty

Standard Monte CarloUsually interested in finding the expected value (or higher order moments)of output functional Q.

The standard Monte Carlo estimator for this is

E [Q] ≈ E [QM,J ] ≈1N

N∑i=1

Q(i)M,J := QMC,

where Q(i)M,J is the ith i.i.d sample computed with Model(M).

The mean square error can be shown to equal

E[(QMCh − E[Q]

)2] = V[QMCh ] +

(E[QMC

h ]− E[Q])2

=V[QM,J ]

N︸ ︷︷ ︸sampling error

+(E[QM,J −Q]

)2

︸ ︷︷ ︸model error (“bias”)

⇒ very large N and M!

A. Teckentrup (Bath) MLMC May 23, 2013 7 / 23

Page 12: Multilevel Monte Carlo methods for uncertainty

Standard Monte CarloUsually interested in finding the expected value (or higher order moments)of output functional Q.

The standard Monte Carlo estimator for this is

E [Q] ≈ E [QM,J ] ≈1N

N∑i=1

Q(i)M,J := QMC,

where Q(i)M,J is the ith i.i.d sample computed with Model(M).

The mean square error can be shown to equal

E[(QMCh − E[Q]

)2] = V[QMCh ] +

(E[QMC

h ]− E[Q])2

=V[QM,J ]

N︸ ︷︷ ︸sampling error

+(E[QM,J −Q]

)2

︸ ︷︷ ︸model error (“bias”)

⇒ very large N and M!A. Teckentrup (Bath) MLMC May 23, 2013 7 / 23

Page 13: Multilevel Monte Carlo methods for uncertainty

Complexity of Standard Monte Carlo

Assuming that

(A1)∣∣E[QM,J −Q]

∣∣ = O(M−α) (model error)

(A2) Cost(Q(i)M,J) = O(Mγ) (PDE solver)

there exist M and N such that the total cost to obtain a mean squareerror

E[(QMC − E[Q])2

]= O(ε2)

is

Cost(QMC) = O(ε−2−γ/α)

Typically α = 1/2, γ = 1: for ε = 10−3 we have Cost = O(1012)!

A. Teckentrup (Bath) MLMC May 23, 2013 8 / 23

Page 14: Multilevel Monte Carlo methods for uncertainty

Complexity of Standard Monte Carlo

Assuming that

(A1)∣∣E[QM,J −Q]

∣∣ = O(M−α) (model error)

(A2) Cost(Q(i)M,J) = O(Mγ) (PDE solver)

there exist M and N such that the total cost to obtain a mean squareerror

E[(QMC − E[Q])2

]= O(ε2)

is

Cost(QMC) = O(ε−2−γ/α)

Typically α = 1/2, γ = 1: for ε = 10−3 we have Cost = O(1012)!

A. Teckentrup (Bath) MLMC May 23, 2013 8 / 23

Page 15: Multilevel Monte Carlo methods for uncertainty

Multilevel Monte Carlo [Heinrich, ’01], [Giles, ’07]

The multilevel method works on a sequence of levels, s.t. M` = sM`−1

and J` = sJ`−1, ` = 0, 1, . . . , L, and set Q` = QM`,J` .

Linearity of expectation gives us

E [QL] = E [Q0] +L∑`=1

E [Q` −Q`−1]

Define the following multilevel MC estimator for E[Q]:

QMLL := QMC

0 +L∑`=1

(Q` −Q`−1)MC

Terms are estimated independently, with N` samples on level `.

A. Teckentrup (Bath) MLMC May 23, 2013 9 / 23

Page 16: Multilevel Monte Carlo methods for uncertainty

Multilevel Monte Carlo [Heinrich, ’01], [Giles, ’07]

The multilevel method works on a sequence of levels, s.t. M` = sM`−1

and J` = sJ`−1, ` = 0, 1, . . . , L, and set Q` = QM`,J` .

Linearity of expectation gives us

E [QL] = E [Q0] +L∑`=1

E [Q` −Q`−1]

Define the following multilevel MC estimator for E[Q]:

QMLL := QMC

0 +L∑`=1

(Q` −Q`−1)MC

Terms are estimated independently, with N` samples on level `.

A. Teckentrup (Bath) MLMC May 23, 2013 9 / 23

Page 17: Multilevel Monte Carlo methods for uncertainty

Multilevel Monte Carlo [Heinrich, ’01], [Giles, ’07]

The mean square error of the this estimator is

E[(QMLL − E[Q]

)2] = V[QMLL ]︸ ︷︷ ︸

sampling error

+(E[QML

L ]− E[Q])2︸ ︷︷ ︸

model error

=V[Q0]N0

+L∑`=1

V[Q` −Q`−1]N`

+(E[QL −Q]

)2N0 still needs to be large, but samples are much cheaper to obtain oncoarser level

N` (` > 0) much smaller, since V[Q` −Q`−1]→ 0 as M` →∞

A. Teckentrup (Bath) MLMC May 23, 2013 10 / 23

Page 18: Multilevel Monte Carlo methods for uncertainty

Multilevel Monte Carlo [Heinrich, ’01], [Giles, ’07]

The mean square error of the this estimator is

E[(QMLL − E[Q]

)2] = V[QMLL ]︸ ︷︷ ︸

sampling error

+(E[QML

L ]− E[Q])2︸ ︷︷ ︸

model error

=V[Q0]N0

+L∑`=1

V[Q` −Q`−1]N`

+(E[QL −Q]

)2N0 still needs to be large, but samples are much cheaper to obtain oncoarser level

N` (` > 0) much smaller, since V[Q` −Q`−1]→ 0 as M` →∞

A. Teckentrup (Bath) MLMC May 23, 2013 10 / 23

Page 19: Multilevel Monte Carlo methods for uncertainty

Complexity of Multilevel Monte Carlo

Assume (A1) (model error O(M−α` )), (A2) (cost/sample O(Mγ` )) and

(A3) V[Q` −Q`−1] = O(M−β` )

with 2α ≥ min(β, γ). Then there exist L and {N`} such that the total

cost to obtain a mean square error

E[(QML

L − E[Q])2]

= O(ε2)

is

Cost(QMLL ) =

O(ε−2) if β > γ

O(ε−2 log(ε)2) if β = γ

O(ε−2−(γ−β)/α) if β < γ

{N`} chosen to minimise cost for a fixed variance

A. Teckentrup (Bath) MLMC May 23, 2013 11 / 23

Page 20: Multilevel Monte Carlo methods for uncertainty

Convergence analysis of MLMC

Can prove that for typical 2D model problems in subsurface flow, (A1)and (A3) are satisfied with α = 1/2, β = 1. With an optimal linear solver(γ = 1), the computational costs are bounded by:

d MLMC MC

2 O(ε−2) O(ε−4)

A. Teckentrup (Bath) MLMC May 23, 2013 12 / 23

Page 21: Multilevel Monte Carlo methods for uncertainty

Convergence analysis of MLMC

Can prove that for typical 2D model problems in subsurface flow, (A1)and (A3) are satisfied with α = 1/2, β = 1. With an optimal linear solver(γ = 1), the computational costs are bounded by:

d MLMC MC

2 O(106) O(1012)

0.1 % accuracy −→ ε = 10−3

A. Teckentrup (Bath) MLMC May 23, 2013 12 / 23

Page 22: Multilevel Monte Carlo methods for uncertainty

Numerical Example (multilevel MC)D = (0, 1)2, Q = ‖p‖L2(D), JL =∞, M0 = 82.

Typical 2D model problem.

Left: Number of samples per level. Right: Total computational cost.

A. Teckentrup (Bath) MLMC May 23, 2013 13 / 23

Page 23: Multilevel Monte Carlo methods for uncertainty

Can the multilevel idea be extended to Markov chain Monte Carlo?

A. Teckentrup (Bath) MLMC May 23, 2013 14 / 23

Page 24: Multilevel Monte Carlo methods for uncertainty

Incorporating data - Bayesian approach

Recall:

ZJ(ω) ∈ RJ Model(M)−→ XM (ω) ∈ RM Output−→ QM,J(ω) ∈ Rrandom input state vector quantity of interest

“Prior” in our model was multivariate Gaussian ZJ := [Z1, . . . , ZJ ]:

P(ZJ) h (2π)−J/2∏Jj=1 exp

(−Z2

j

2

)

Usually data Fobs related to outputs (e.g. pressure) also available.To reduce uncertainty, incorporate Fobs → the “posterior”

A. Teckentrup (Bath) MLMC May 23, 2013 15 / 23

Page 25: Multilevel Monte Carlo methods for uncertainty

Incorporating data - Bayesian approach

Bayes’ Theorem: (RHS computable! Proportionality constant 1/P(Fobs) not!)

πM,J(ZJ)︸ ︷︷ ︸posterior

:= P(ZJ |Fobs) h LM (Fobs |ZJ)︸ ︷︷ ︸likelihood

P(ZJ)︸ ︷︷ ︸prior

Likelihood model (e.g. Gaussian):

LM (Fobs |ZJ) h exp“−‖Fobs − FM (ZJ)‖2

σ2fid,M

”FM (ZJ) ... model response; σfid,M ... fidelity parameter (M -dep.)

A. Teckentrup (Bath) MLMC May 23, 2013 16 / 23

Page 26: Multilevel Monte Carlo methods for uncertainty

Incorporating data - Bayesian approach

Bayes’ Theorem: (RHS computable! Proportionality constant 1/P(Fobs) not!)

πM,J(ZJ)︸ ︷︷ ︸posterior

:= P(ZJ |Fobs) h LM (Fobs |ZJ)︸ ︷︷ ︸likelihood

P(ZJ)︸ ︷︷ ︸prior

Likelihood model (e.g. Gaussian):

LM (Fobs |ZJ) h exp“−‖Fobs − FM (ZJ)‖2

σ2fid,M

”FM (ZJ) ... model response; σfid,M ... fidelity parameter (M -dep.)

A. Teckentrup (Bath) MLMC May 23, 2013 16 / 23

Page 27: Multilevel Monte Carlo methods for uncertainty

ALGORITHM 1. (Standard Metropolis Hastings MCMC)

Choose Z0J .

At state ZnJ generate proposal Z′J from distribution q(Z′J |ZnJ)(for simplicity symmetric, e.g. random walk).

Accept sample Z′J with probability αM,J = min(

1,πM,J(Z′J)πM,J(ZnJ)

),

i.e. Zn+1J = Z′J with probability αM,J ; otherwise stay at Zn+1

J = ZnJ .

Samples ZnJ used as usual for inference (even though not i.i.d.):

EπM,J [Q] ≈ EπM,J [QM,J ] ≈ 1N

N∑n=1

Q(n)M,J := QMetH

Pros:

Produces a Markov chain{ZnJ}n∈N, with ZnJ ∼ πM,Jas n→∞.

Cons:

Evaluating αM,J expensive for large M .Acceptance rate αM,J very low for large J(< 10%).

A. Teckentrup (Bath) MLMC May 23, 2013 17 / 23

Page 28: Multilevel Monte Carlo methods for uncertainty

ALGORITHM 1. (Standard Metropolis Hastings MCMC)

Choose Z0J .

At state ZnJ generate proposal Z′J from distribution q(Z′J |ZnJ)(for simplicity symmetric, e.g. random walk).

Accept sample Z′J with probability αM,J = min(

1,πM,J(Z′J)πM,J(ZnJ)

),

i.e. Zn+1J = Z′J with probability αM,J ; otherwise stay at Zn+1

J = ZnJ .

Samples ZnJ used as usual for inference (even though not i.i.d.):

EπM,J [Q] ≈ EπM,J [QM,J ] ≈ 1N

N∑n=1

Q(n)M,J := QMetH

Pros:

Produces a Markov chain{ZnJ}n∈N, with ZnJ ∼ πM,Jas n→∞.

Cons:

Evaluating αM,J expensive for large M .Acceptance rate αM,J very low for large J(< 10%).

A. Teckentrup (Bath) MLMC May 23, 2013 17 / 23

Page 29: Multilevel Monte Carlo methods for uncertainty

Multilevel Markov Chain Monte Carlo

Key ingredients in multilevel method:

Models with less DOFs on coarser levels much cheaper to solve

V[Q` −Q`−1]→ 0 as `→∞ ⇒ fewer samples on finer levels

Telescoping sum: E [QL] = E [Q0] +∑L

`=1 E [Q`]− E [Q`−1]

In MCMC setting target distribution depends on `, so need to definemultilevel estimator carefully!

EπL [QL] = Eπ0 [Q0]︸ ︷︷ ︸standard MCMC

+L∑`=1

Eπ` [Q`]− Eπ`−1 [Q`−1]︸ ︷︷ ︸2 level MCMC (NEW)

QMLMetHL :=

1N0

N0∑n=1

Q0(Zn0 ) +L∑`=1

1N`

N∑n=1

(Q`(Zn` )−Q`−1(Zn`−1)

)Idea: Split Zn` = [zn`,C, z

n`,F], where zn`,C has length J`−1.

A. Teckentrup (Bath) MLMC May 23, 2013 18 / 23

Page 30: Multilevel Monte Carlo methods for uncertainty

Multilevel Markov Chain Monte Carlo

Key ingredients in multilevel method:

Models with less DOFs on coarser levels much cheaper to solve

V[Q` −Q`−1]→ 0 as `→∞ ⇒ fewer samples on finer levels

Telescoping sum: E [QL] = E [Q0] +∑L

`=1 E [Q`]− E [Q`−1]

In MCMC setting target distribution depends on `, so need to definemultilevel estimator carefully!

EπL [QL] = Eπ0 [Q0] +L∑`=1

Eπ` [Q`]− Eπ`−1 [Q`−1]

EπL [QL] = Eπ0 [Q0]︸ ︷︷ ︸standard MCMC

+L∑`=1

Eπ` [Q`]− Eπ`−1 [Q`−1]︸ ︷︷ ︸2 level MCMC (NEW)

QMLMetHL :=

1N0

N0∑n=1

Q0(Zn0 ) +L∑`=1

1N`

N∑n=1

(Q`(Zn` )−Q`−1(Zn`−1)

)Idea: Split Zn` = [zn`,C, z

n`,F], where zn`,C has length J`−1.

A. Teckentrup (Bath) MLMC May 23, 2013 18 / 23

Page 31: Multilevel Monte Carlo methods for uncertainty

Multilevel Markov Chain Monte Carlo

Key ingredients in multilevel method:

Models with less DOFs on coarser levels much cheaper to solve

V[Q` −Q`−1]→ 0 as `→∞ ⇒ fewer samples on finer levels

Telescoping sum: E [QL] = E [Q0] +∑L

`=1 E [Q`]− E [Q`−1]

In MCMC setting target distribution depends on `, so need to definemultilevel estimator carefully!

EπL [QL] = Eπ0 [Q0]︸ ︷︷ ︸standard MCMC

+L∑`=1

Eπ` [Q`]− Eπ`−1 [Q`−1]︸ ︷︷ ︸2 level MCMC (NEW)

QMLMetHL :=

1N0

N0∑n=1

Q0(Zn0 ) +L∑`=1

1N`

N∑n=1

(Q`(Zn` )−Q`−1(Zn`−1)

)

Idea: Split Zn` = [zn`,C, zn`,F], where zn`,C has length J`−1.

A. Teckentrup (Bath) MLMC May 23, 2013 18 / 23

Page 32: Multilevel Monte Carlo methods for uncertainty

Multilevel Markov Chain Monte Carlo

Key ingredients in multilevel method:

Models with less DOFs on coarser levels much cheaper to solve

V[Q` −Q`−1]→ 0 as `→∞ ⇒ fewer samples on finer levels

Telescoping sum: E [QL] = E [Q0] +∑L

`=1 E [Q`]− E [Q`−1]

In MCMC setting target distribution depends on `, so need to definemultilevel estimator carefully!

EπL [QL] = Eπ0 [Q0]︸ ︷︷ ︸standard MCMC

+L∑`=1

Eπ` [Q`]− Eπ`−1 [Q`−1]︸ ︷︷ ︸2 level MCMC (NEW)

QMLMetHL :=

1N0

N0∑n=1

Q0(Zn0 ) +L∑`=1

1N`

N∑n=1

(Q`(Zn` )−Q`−1(Zn`−1)

)Idea: Split Zn` = [zn`,C, z

n`,F], where zn`,C has length J`−1.

A. Teckentrup (Bath) MLMC May 23, 2013 18 / 23

Page 33: Multilevel Monte Carlo methods for uncertainty

ALGORITHM 2 (Two-level Metropolis Hastings MCMC for Q` −Q`−1)

At states Zn`−1,Zn` (of the independent level ` and level `− 1 Markov chains)

1 Generate new state Zn+1`−1 using Algorithm 1.

2 Propose Z′` = [Zn+1`−1 , z

′`,F] with z′`,F generated via random walk.

(novel transition prob. qML depends on level `− 1 acceptance prob. α`−1)

3 Accept Z′` with probability

α`(Z′` |Zn` ) = min(

1,π`(Z′`) qML(Zn` |Z′`)π`(Zn` ) qML(Z′` |Zn` )

)

Follows quite easily & both terms have been computed previously.

A. Teckentrup (Bath) MLMC May 23, 2013 19 / 23

Page 34: Multilevel Monte Carlo methods for uncertainty

ALGORITHM 2 (Two-level Metropolis Hastings MCMC for Q` −Q`−1)

At states Zn`−1,Zn` (of the independent level ` and level `− 1 Markov chains)

1 Generate new state Zn+1`−1 using Algorithm 1.

2 Propose Z′` = [Zn+1`−1 , z

′`,F] with z′`,F generated via random walk.

(novel transition prob. qML depends on level `− 1 acceptance prob. α`−1)

3 Accept Z′` with probability

α`(Z′` |Zn` ) = min(

1,π`(Z′`) qML(Zn` |Z′`)π`(Zn` ) qML(Z′` |Zn` )

)

Follows quite easily & both terms have been computed previously.

A. Teckentrup (Bath) MLMC May 23, 2013 19 / 23

Page 35: Multilevel Monte Carlo methods for uncertainty

ALGORITHM 2 (Two-level Metropolis Hastings MCMC for Q` −Q`−1)

At states Zn`−1,Zn` (of the independent level ` and level `− 1 Markov chains)

1 Generate new state Zn+1`−1 using Algorithm 1.

2 Propose Z′` = [Zn+1`−1 , z

′`,F] with z′`,F generated via random walk.

(novel transition prob. qML depends on level `− 1 acceptance prob. α`−1)

3 Accept Z′` with probability

α`(Z′` |Zn` ) = min(

1,π`(Z′`) qML(Zn` |Z′`)π`(Zn` ) qML(Z′` |Zn` )

)

Follows quite easily & both terms have been computed previously.

A. Teckentrup (Bath) MLMC May 23, 2013 19 / 23

Page 36: Multilevel Monte Carlo methods for uncertainty

ALGORITHM 2 (Two-level Metropolis Hastings MCMC for Q` −Q`−1)

At states Zn`−1,Zn` (of the independent level ` and level `− 1 Markov chains)

1 Generate new state Zn+1`−1 using Algorithm 1.

2 Propose Z′` = [Zn+1`−1 , z

′`,F] with z′`,F generated via random walk.

(novel transition prob. qML depends on level `− 1 acceptance prob. α`−1)

3 Accept Z′` with probability

α`(Z′` |Zn` ) = min

(1,

π`(Z′`)π`−1(zn`,C)

π`(Zn` )π`−1(Zn+1`−1 )

)Follows quite easily & both terms have been computed previously.

A. Teckentrup (Bath) MLMC May 23, 2013 19 / 23

Page 37: Multilevel Monte Carlo methods for uncertainty

Example: one step

Given Zn`−1 and Zn` , the possible (n+ 1)th states of the two chains are:

Level `− 1 test Level ` test Zn+1`−1 Zn+1

`

reject accept Zn`−1 [Zn`−1, z′`,F]

accept accept Z′`−1 [Z′`−1, z′`,F]

reject reject Zn`−1 [zn`,C, zn`,F]

accept reject Z′`−1 [zn`,C, zn`,F]

A. Teckentrup (Bath) MLMC May 23, 2013 20 / 23

Page 38: Multilevel Monte Carlo methods for uncertainty

Convergence analysis of MLMCMC

What can we prove?

We have a genuine Markov chain on every level.

Multilevel algorithm is consistent (no bias between levels).

Multilevel algorithm converges for any initial state.

Same Complexity Theorem as for Multilevel Monte Carlo.(completely abstract and applicable also in DA for NWP)

I For typical 2D model problems in subsurface flow, we have α = 1/2,β = 1/2. With an optimal linear solver (γ = 1), the computationalcosts are bounded by:

d MLMCMC MCMC

2 O(ε−3) O(ε−4)

A. Teckentrup (Bath) MLMC May 23, 2013 21 / 23

Page 39: Multilevel Monte Carlo methods for uncertainty

Convergence analysis of MLMCMC

What can we prove?

We have a genuine Markov chain on every level.

Multilevel algorithm is consistent (no bias between levels).

Multilevel algorithm converges for any initial state.

Same Complexity Theorem as for Multilevel Monte Carlo.(completely abstract and applicable also in DA for NWP)

I For typical 2D model problems in subsurface flow, we have α = 1/2,β = 1/2. With an optimal linear solver (γ = 1), the computationalcosts are bounded by:

d MLMCMC MCMC

2 O(109) O(1012)

0.1 % accuracy −→ ε = 10−3

A. Teckentrup (Bath) MLMC May 23, 2013 21 / 23

Page 40: Multilevel Monte Carlo methods for uncertainty

Numerical Example (multilevel MCMC)D = (0, 1)2, Q = keff , JL = 169, M0 = 162.Data (artificial): Pressure p at 9 random points in domain.

0 1 2 3−20

−18

−16

−14

−12

−10

−8

−6

−4

−2

0

Level

log 2 V

aria

nce

Ql

Ql − Q

l−1

0 1 2 3−8

−7

−6

−5

−4

−3

−2

−1

0

1

2

3

Level

log 2 |M

ean|

Ql

Ql − Q

l−1

0 1 2 310

2

103

104

105

106

107

108

Level

Sa

mp

les

ε = 0.005

ε = 0.001

ε = 0.0005

ε = 0.0003

10−3

101

102

Accuracy ε

ε2 Cos

t

Standard MCMCMultilevel MCMC

A. Teckentrup (Bath) MLMC May 23, 2013 22 / 23

Page 41: Multilevel Monte Carlo methods for uncertainty

Conclusions

Standard (Markov chain) Monte Carlo algorithms are oftenprohibitively expensive.

Multilevel versions greatly reduce the cost.

Multilevel algorithms are generally applicable.

Full convergence analysis available.

A. Teckentrup (Bath) MLMC May 23, 2013 23 / 23