wavelet methods peter f. craigmile

24
Wavelet methods Peter F. Craigmile http://www.stat.osu.edu/ ~ pfc/ Summer School on Extreme Value Modeling and Water Resources Universite Lyon 1, France. 13-24 Jun 2016 Thank you to The Camille Jordan Institute, the MultiRisk LEFE project, and the Labex Milyon. My research is partially supported by the US National Science Foundation under grants NSF-DMS-1407604 and NSF-SES-1424481. 1

Upload: others

Post on 12-May-2022

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Wavelet methods Peter F. Craigmile

Wavelet methods

Peter F. Craigmile

http://www.stat.osu.edu/~pfc/

Summer School on Extreme Value Modeling and Water Resources

Universite Lyon 1, France. 13-24 Jun 2016

Thank you to The Camille Jordan Institute, the MultiRisk LEFE project,

and the Labex Milyon. My research is partially supported by the US National Science

Foundation under grants NSF-DMS-1407604 and NSF-SES-1424481.

1

Page 2: Wavelet methods Peter F. Craigmile

Introduction (again)

• We use spectral analysis to view a time series in terms of a sum of sinusoids at different

frequencies.

– Unless we do something “sneaky”, we lose all information about time.

• A wavelet transformation uses wavelets (“small waves”) to decompose a time series

into:

1. averages over a specified (typically long) time scale, and

2. changes of averages over a range of shorter time scales.

• It is a time-scale (some say frequency) decomposition.

• Thus wavelet methods are a natural way to explore and model non-stationarity.

2

Page 3: Wavelet methods Peter F. Craigmile

Some useful references

• For time series: Vidakovic [1998] and Percival and Walden [2000].

• For gridded spatial processes (images): Gonzalez and Woods [2001], Mondal and Percival

[2012], Geilhufe et al. [2013]

• Berliner et al. [1999], Wikle et al. [2001], and Craigmile and Guttorp [2011] use hierarchical

Bayesian space-time wavelet-based models.

• Examples from the analysis of longitudinal data: Brown et al. [2001], Morris et al. [2003],

Vannucci et al. [2003], Morris and Carroll [2006], Morris et al. [2006], Ramirez and Vidakovic

[2007].

3

Page 4: Wavelet methods Peter F. Craigmile

The discrete wavelet transform (DWT)

• Suppose {Xt} is a time series, observed from time 0 to N − 1 (slight change of notation).

Let X = (X0 . . . XN−1)T .

(assume N divisible by 2J , for some J).

• Most simply, the DWT coefficients are

W = WX,

where W = N ×N is an orthonormal DWT matrix.

• In practice use pyramid algorithm [Mallat, 1989] to calculate W .

4

Page 5: Wavelet methods Peter F. Craigmile

Defining the wavelet and scaling filter

• We define two filters:

1. {hl} is a wavelet (mother) filter of even length L,

i.e.,∑

l hl = 0,∑

l hlhl+2k = 1[k=0].

2. {gl = (−1)lhL−1−l} is the scaling (father) filter.

• There are many choices of wavelet filter we can use in practice.

– Each filter has different properties.

5

Page 6: Wavelet methods Peter F. Craigmile

Daubechies wavelet filters

• A popular choice is of filters is theDaubechies class [Daubechies, 1992, Sect. 6.2], which

includes the extremal phase (D) and least asymmetric (LA) sets of wavelet filters.

– This class of orthogonal filters are characterized by the maximum number of vanishing

moments.

– The wavelet filter {hl}, of even width L, passes high frequencies but filters out low ones.

– It has L/2 vanishing moments.

6

Page 7: Wavelet methods Peter F. Craigmile

Daubechies wavelet filters, cont.

−1.0

0.0

1.0

Wave

let filter ●

−1.0

0.0

1.0

Haar

Scalin

g filt

er ● ●

0.0 0.2 0.4

0.0

0.5

1.0

1.5

2.0

f

Wave

let S

q. G

ain

−1.0

0.0

1.0

Wave

let filter

●●

−1.0

0.0

1.0

D4

Scalin

g filt

er

0.0 0.2 0.4

0.0

0.5

1.0

1.5

2.0

f

Wave

let S

q. G

ain

−1.0

0.0

1.0

Wave

let filter

● ● ●●

−1.0

0.0

1.0

D8

Scalin

g filt

er

●●

●●

● ● ●

0.0 0.2 0.4

0.0

0.5

1.0

1.5

2.0

f

Wave

let S

q. G

ain

−1.0

0.0

1.0

Wave

let filter

● ●●

●●

−1.0

0.0

1.0

LA8

Scalin

g filt

er

● ●

●● ●

0.0 0.2 0.4

0.0

0.5

1.0

1.5

2.0

f

Wave

let S

q. G

ain

7

Page 8: Wavelet methods Peter F. Craigmile

The pyramid algorithm

• Let V0,t = Xt for each t.

• For each j = 1, . . . , J calculate

Wj,t =

L−1∑

l=0

hl Vj−1,2t+1−l mod Nj;

Vj,t =

L−1∑

l=0

gl Vj−1,2t+1−l mod Nj,

for Nj = N/2j and t = 0, . . . , Nj − 1.

8

Page 9: Wavelet methods Peter F. Craigmile

Partitioning the DWT coefficients

• Decomposition of W :

W = (W 1,W 2, . . . ,W J ,V J).

• W j are the Nj level j wavelet coefficients:

– associated with changes in averages on scale 2j−1;

– relates to times spaced 2j units

– is associated with the frequency interval

λj = [−1/2j,−1/2j+1) ∪ (1/2j+1, 1/2j]

(this follows by examining Fourier transforms of the wavelet and scaling filters).

• V J are the NJ scaling coefficients:

– associated with averages on scale 2J ;

– relates to times spaced 2J units apart.

9

Page 10: Wavelet methods Peter F. Craigmile

Example: Malindi coral

1800 1850 1900 1950 2000

4.1

4.3

4.5

4.7

Year

Neg.

change in 1

8O

0 5 10 15 20

−0.2

0.2

0.6

1.0

Lag

AC

F

5 10 15 20

−0.2

0.2

0.6

1.0

Lag

Part

ial A

CF

• 194 year record of the δ18O oxygen isotope measured from a 4m high coral colony growing

at a depth of 6m (low tide) in Malindi, Kenya [Cole et al., 2000].

• A decrease in the oxygen value corresponds to an increase in the sea surface tem-

perature (SST) (roughly -0.24 per mil concentration corresponds to 1oC).

10

Page 11: Wavelet methods Peter F. Craigmile

DWT plot of the Malindi coral series

1800 1850 1900 1950 2000

time

XW

1W

2W

3W

4V

4

11

Page 12: Wavelet methods Peter F. Craigmile

Decomposing trend using wavelets

• Can also decompose W as

W = W s +W b +W nb.

W s : scaling coefficients and zeros elsewhere.

W b : wavelet coefficients affected by boundaries.

W nb : wavelet coefficients not affected by boundaries.

• Since X = WTW

X = WT (W s +W b) +WT (W nb)

= µ + ǫ,

where

µ : an estimate of trend;

ǫ : “tapered” estimate of error (missing some of the dependence).

12

Page 13: Wavelet methods Peter F. Craigmile

A trend model with FD process errors

• Suppose that a model for the Malindi oxygen isotope series is

Xt = µt + ǫt,

where µt is a polynomial of degree K, and ǫt is a mean zero stationary Gaussian error

process.

• Suppose {ǫt} is a long memory process – the fractionally differenced (FD) process with

parameters d and σ2.

13

Page 14: Wavelet methods Peter F. Craigmile

Estimating the model using a wavelet decomposition

• We use a filter of length L/2 ≥ max{K + 1, ⌊d + 1/2⌋}.

• Since the wavelet filter acts as a differencing filter:

– W nb does not contain the trend component;

– E(W nb) = 0;

– Can useW nb to estimate the FD process parameters (e.g., via least squares or maximum

likelihood).

• Can employ approximate schemes to estimate the process parameters [e.g., McCoy and

Walden, 1996, Craigmile et al., 2005, Moulines et al., 2008].

14

Page 15: Wavelet methods Peter F. Craigmile

Returning to the Malindi example

• Using an approximate ML estimate:

d = 0.359 (95% CI of [0.143,0.597])

σ = 0.0667.

• Strong evidence of long range dependence.

– But cannot discriminate between a stationary (d < 1/2) or nonstationary FD process

(d ≥ 1/2).

15

Page 16: Wavelet methods Peter F. Craigmile

Estimating the trend

1800 1850 1900 1950 2000

4.1

4.2

4.3

4.4

4.5

4.6

4.7

Year

Neg.

change in 1

8O

• Evidence that the oxygen concentration is above average for the latter years. (Monte Carlo

test for trend P-value < 0.001.)

• Can extend to nonpolynomial {µt} using wavelet shrinkage methods [e.g., Donoho and

Johnstone, 1994, Percival and Walden, 2000].

16

Page 17: Wavelet methods Peter F. Craigmile

Limitations of the DWT

• Not shift invariant [e.g., Percival and Mofjeld, 1997].

• We lose time resolution by downsampling (the use of the ’2t’ in the filtering operation).

• Instead, we can use themaximum overlap discrete wavelet transform (MODWT).

• Let gl = gl/√2 and hl = hl/

√2. Then the pyramid algorithm that yields the MODWT

scaling and wavelet coefficient on level j is given by:

Vj,t =∑

l

gl Vj−1,t−l mod N ;

Wj,t =∑

l

hl Vj−1,t−l mod N ,

respectively.

• This transformation is shift invariant. On each level j, the wavelet or scaling coefficient

can be assigned to a specific time point (not t but a circular shift of t).

17

Page 18: Wavelet methods Peter F. Craigmile

MODWT plot of the Malindi coral series

1800 1850 1900 1950 2000

time

XW

1W

2W

3W

4V

4

18

Page 19: Wavelet methods Peter F. Craigmile

Decomposing the variance using wavelets

• Suppose we perform the MODWT (ignoring circularity) on the entire process, {Xt}, ana-

lyzing to all possible scales.

• Let Hj(·) denote the transfer function (Fourier transform) of the wavelet filter that would

yield the level j wavelet coefficients directly from {Xt}.

• Then the wavelet variance decomposes the variance of Xt:

var(Xt) =

∫ 1/2

−1/2

SX(f )df =

∫ 1/2

−1/2

∞∑

j=1

|Hj(f )|2SX(f )df

=

∞∑

j=1

∫ 1/2

−1/2

|Hj(f )|2SX(f )df

=

∞∑

j=1

var(Wj,t).

• This is a time-scale (frequency) decomposition of the variance since each wavelet coefficient

is associated with a certain time point and scale, λj.

19

Page 20: Wavelet methods Peter F. Craigmile

Decomposing the variance continued

• Given data, simple to estimate var(Wj,t) using averages of the squared wavelet coefficients.

• Properties of these estimators for stationary (and some nonstationary) processes are well

known [Percival, 1995, Serroukh et al., 2000]

• Can also extend to gappy time series [Mondal and Percival, 2010] and to images [Mondal

and Percival, 2012, Geilhufe et al., 2013].

• This extends to the decomposition of

cov(Xt, Xt+h) and cov(Xt, Yt+h)

(and associated correlations) for certain processes {Xt} and {Yt} [Serroukh and Walden,

2000a,b, Whitcher et al., 2000].

• Interesting spatio-temporal extensions!

20

Page 21: Wavelet methods Peter F. Craigmile

Wavelet-based analysis of variance for the daily Swedish temperatures

(See the last set of lectures!)

0 2 4 6 8 10

−2

02

4

log(ν

j2)

6: Sundsvalls FlygplatsDaily average temperature

log(τj)

0 2 4 6 8

−2

02

4

log(ν

j2)

10: Films Kyrkby ADaily average temperature

log(τj)

0 2 4 6 8 10

−2

02

4

log(ν

j2)

16: Kilsbergen−SuttarbodaDaily average temperature

log(τj)

0 2 4 6 8 10

−7

−5

−3

−1

log(ν

j2)

log(τj)

6: Sundsvalls FlygplatsDeseasonalized and standardized temp.

0 2 4 6 8

−7

−5

−3

−1

log(ν

j2)

log(τj)

10: Films Kyrkby ADeseasonalized and standardized temp.

0 2 4 6 8 10−

7−

5−

3−

1

log(ν

j2)

log(τj)

16: Kilsbergen−SuttarbodaDeseasonalized and standardized temp

21

Page 22: Wavelet methods Peter F. Craigmile

Wavelet-based analysis of variance for the daily Swedish temperatures

5 10 15

0.1

00

.20

0.3

00

.40

Location ID

LR

D e

stim

ate

s

(a)

12 14 16 18

56

58

60

62

64

Longitude

La

titu

de

.202

.254

.166

.238 .2410

.2013 .3114

.1915 .2316

.191

.173

.165

.227

.2412

.1717

.209.2311

(b)

22

Page 23: Wavelet methods Peter F. Craigmile

References

L. M. Berliner, C. K. Wikle, and R. F. Milliff. Multiresolution wavelet analyses in hierarchical Bayesian turbulence models. In P. Muller and B. Vidakovic, editors,

Bayesian Inference in Wavelet-based Models, pages 341–359, New York, 1999. Springer.

P. J. Brown, T. Fearn, and M. Vannucci. Bayesian wavelet regression on curves with application to a spectroscopic calibration problem. Journal of the American

Statistical Association, 96:398–408, 2001.

J. E. Cole, R. B. Dunbar, T. R. McClanahan, and N. A. Muthiga. Tropical pacific forcing of decadal SST variability in the western Indian ocean over the past

two centuries. Science, 287:617–619, 2000.

P. F. Craigmile and P. Guttorp. Space-time modelling of trends in temperature series. Journal of Time Series Analysis, 32:378–395, 2011.

P. F. Craigmile, D. B. Percival, and P. Guttorp. Wavelet based estimation for polynomial contaminated fractionally differenced processes. IEEE Transactions on

Signal Processing, 53:3151–3161, 2005.

I. Daubechies. Ten Lectures on Wavelets. CBMS-NSF Series in Applied Mathematics. SIAM, Philadelphia, 1992.

D. L. Donoho and I. M. Johnstone. Ideal spatial adaptation by wavelet shrinkage. Biometrika, 81(3):425–455, 1994.

M. Geilhufe, D. B. Percival, and H. L. Stern. Two-dimensional wavelet variance estimation with application to sea ice SAR images. Computers and Geosciences,

54(C):351–360, Apr. 2013.

R. C. Gonzalez and R. E. Woods. Digital Image Processing (Second Edition). Prentice Hall, Upper Saddle River, New Jersey, 2001.

S. Mallat. A theory for multiresolution signal decomposition: The wavelet representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 11:

674–693, 1989.

E. J. McCoy and A. T. Walden. Wavelet Analysis and Synthesis of Stationary Long-Memory Processes. Journal of Computational and Graphical Statistics, 5:

26–56, Mar. 1996.

D. Mondal and D. Percival. Wavelet variance analysis for gappy time series. Annals of the Institute of Statistical Mathematics, 62:943–966, 2010.

D. Mondal and D. B. Percival. Wavelet variance analysis for random fields on a regular lattice. Image Processing, IEEE Transactions on, 21(2):537–549, 2012.

J. S. Morris and R. J. Carroll. Wavelet-based functional mixed models. Journal of the Royal Statistical Society, Series B: Statistical Methodology, 68:179–199,

2006.

J. S. Morris, M. Vannucci, P. J. Brown, and R. J. Carroll. Wavelet-based nonparametric modeling of hierarchical functions in colon carcinogenesis. Journal of

the American Statistical Association, 98:573–583, 2003.

J. S. Morris, C. Arroyo, B. Coull, L. M. Ryan, R. Herrick, and S. L. Gortmaker. Using wavelet-based functional mixed models to characterize population

23

Page 24: Wavelet methods Peter F. Craigmile

heterogeneity in accelerometer profiles: A case study. Journal of the American Statistical Association, 101:1352–1364, 2006.

E. Moulines, F. Roueff, and M. S. Taqqu. A wavelet whittle estimator of the memory parameter of a nonstationary Gaussian time series. The Annals of Statistics,

36:1925–1956, 2008.

D. Percival and A. Walden. Wavelet Methods for Time Series Analysis. Cambridge University Press, Cambridge, England, 2000.

D. B. Percival. On Estimation of the Wavelet Variance. Biometrika, 3:619–631, 1995.

D. B. Percival and H. Mofjeld. Analysis of Subtidal Coastal Sea Level Fluctuations Using Wavelets. JASA, 92(439):868–880, 1997.

P. Ramirez and B. Vidakovic. Wavelet-based 2D multifractal spectrum with applications in analysis of digital mammography images. Technical report, Department

of Biomedical Engineering, Georgia Tech., 2007.

A. Serroukh and A. T. Walden. Wavelet scale analysis of bivariate time series I: Motivation and estimation. Journal of Statistical Planning and Inference, 13:

1–36, 2000a.

A. Serroukh and A. T. Walden. Wavelet scale analysis of bivariate time series II: Statistical properties for linear processes. Journal of Statistical Planning and

Inference, 13:37–56, 2000b.

A. Serroukh, A. T. Walden, and D. B. Percival. Statistical properties and uses of the wavelet variance estimator for the scale analysis of time series. JASA, 95:

184–196, 2000.

M. Vannucci, P. Brown, and T. Fearn. A decision theoretical approach to wavelet regression on curves with a high number of regressors. Journal of Statistical

Planning and Inference, 112:195–212, 2003.

B. Vidakovic. Statistical Modeling by Wavelets. Wiley, New York, NY, 1998.

B. Whitcher, P. Guttorp, and D. B. Percival. Wavelet analysis of covariance with application to atmospheric time series. Journal of Geophysical Research, 105

(D11):14941–14962, 2000.

C. K. Wikle, R. F. Milliff, D. Nychka, and L. M. Berliner. Spatiotemporal hierarchical Bayesian modeling tropical ocean surface winds. Journal of the American

Statistical Association, 96:382–397, 2001.

24