(with rob j. hyndman, nikos kourentzes and fotis ...€¦ · kourentzes, petropoulos, trapero...

49
Annual Q 1 M 1 M 2 M 3 Q 2 M 4 M 5 M 6 Q 3 M 7 M 8 M 9 Q 4 M 10 M 11 M 12 1 George Athanasopoulos (with Rob J. Hyndman, Nikos Kourentzes and Fotis Petropoulos) Forecasting with temporal hierarchies

Upload: others

Post on 23-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Annual

Q1

M1 M2 M3

Q2

M4 M5 M6

Q3

M7 M8 M9

Q4

M10 M11 M12

1

George Athanasopoulos(with Rob J. Hyndman, Nikos Kourentzes andFotis Petropoulos)

Forecasting with temporalhierarchies

Page 2: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Outline

1 Introduction

2 Temporal hierarchies

3 Optimal combination forecasts

4 A Monte-Carlo simulation study

5 Conclusion

Forecasting with temporal hierarchies Introduction 2

Page 3: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Temporal aggregation

Key issue:å Aggregating model/forecasts versus

modelling/forecasting the aggregate.

Temporal aggregation literature: Amemiyaand Wu (1972), Tiao (1972), Brewer(1973), Wei (1978, 1980), Rosanna andSeater (1992, 1995),..., Silvestrini et al.(2008), Silvestrini and Veredas (2008).

All within the ARIMA framework.

Forecasting with temporal hierarchies Introduction 3

Page 4: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Temporal aggregation1 Effect on the structure of dynamics.

Aggregation complicates/contaminates/changesdynamics.Loss of information of components.

2 Parameter estimation efficiency.

Losses always happen here no matter what modelyou are considering.

3 Effect on forecasting. What is the optimal level ofaggregation?

Results vary both empirically and in simulations.Impossible to set some guidelines for the empiricalanalyst/forecaster.Can use disaggregate series to forecast aggregatesbut not visa versa.

Forecasting with temporal hierarchies Introduction 4

Page 5: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Temporal aggregation1 Effect on the structure of dynamics.

Aggregation complicates/contaminates/changesdynamics.Loss of information of components.

2 Parameter estimation efficiency.

Losses always happen here no matter what modelyou are considering.

3 Effect on forecasting. What is the optimal level ofaggregation?

Results vary both empirically and in simulations.Impossible to set some guidelines for the empiricalanalyst/forecaster.Can use disaggregate series to forecast aggregatesbut not visa versa.

Forecasting with temporal hierarchies Introduction 4

Page 6: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Temporal aggregation1 Effect on the structure of dynamics.

Aggregation complicates/contaminates/changesdynamics.Loss of information of components.

2 Parameter estimation efficiency.

Losses always happen here no matter what modelyou are considering.

3 Effect on forecasting. What is the optimal level ofaggregation?

Results vary both empirically and in simulations.Impossible to set some guidelines for the empiricalanalyst/forecaster.Can use disaggregate series to forecast aggregatesbut not visa versa.

Forecasting with temporal hierarchies Introduction 4

Page 7: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Outline

1 Introduction

2 Temporal hierarchies

3 Optimal combination forecasts

4 A Monte-Carlo simulation study

5 Conclusion

Forecasting with temporal hierarchies Temporal hierarchies 5

Page 8: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Basic idea

Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF.

å For a series observed at the highest possiblefrequency, construct aggregate series up to theannual level.

å Do not choose a level of aggregation. Forecastall series and optimally combine resulting in aset of reconciled forecasts at all frequencies.

Key implication:

å Reconciled forecasts align managerialobjectives. How do we do in forecast accuracy?

Forecasting with temporal hierarchies Temporal hierarchies 6

Page 9: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Basic idea

Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF.

å For a series observed at the highest possiblefrequency, construct aggregate series up to theannual level.

å Do not choose a level of aggregation. Forecastall series and optimally combine resulting in aset of reconciled forecasts at all frequencies.

Key implication:

å Reconciled forecasts align managerialobjectives. How do we do in forecast accuracy?

Forecasting with temporal hierarchies Temporal hierarchies 6

Page 10: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Basic idea

Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF.

å For a series observed at the highest possiblefrequency, construct aggregate series up to theannual level.

å Do not choose a level of aggregation. Forecastall series and optimally combine resulting in aset of reconciled forecasts at all frequencies.

Key implication:

å Reconciled forecasts align managerialobjectives. How do we do in forecast accuracy?

Forecasting with temporal hierarchies Temporal hierarchies 6

Page 11: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Basic idea

Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF.

å For a series observed at the highest possiblefrequency, construct aggregate series up to theannual level.

å Do not choose a level of aggregation. Forecastall series and optimally combine resulting in aset of reconciled forecasts at all frequencies.

Key implication:

å Reconciled forecasts align managerialobjectives. How do we do in forecast accuracy?

Forecasting with temporal hierarchies Temporal hierarchies 6

Page 12: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Basic idea

Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF.

å For a series observed at the highest possiblefrequency, construct aggregate series up to theannual level.

å Do not choose a level of aggregation. Forecastall series and optimally combine resulting in aset of reconciled forecasts at all frequencies.

Key implication:

å Reconciled forecasts align managerialobjectives. How do we do in forecast accuracy?

Forecasting with temporal hierarchies Temporal hierarchies 6

Page 13: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

General notationWe set aggregation levels k to be a factor of m, thehighest sampling frequency per year. E.g., forquarterly series, m = 4, we consider three levels ofaggregation: k = {4,2,1}.

Annual = y[4]i

Semi-Annual1 = y[2]2i−1

Q1 = y[1]4i−3 Q2 = y[1]4i−2

Semi-Annual2 = y[2]2i

Q3 = y[1]4i−1 Q4 = y[1]4i

Forecasting with temporal hierarchies Temporal hierarchies 7

Page 14: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

General notationWe set aggregation levels k to be a factor of m, thehighest sampling frequency per year. E.g., forquarterly series, m = 4, we consider three levels ofaggregation: k = {4,2,1}.

Annual = y[4]i

Semi-Annual1 = y[2]2i−1

Q1 = y[1]4i−3 Q2 = y[1]4i−2

Semi-Annual2 = y[2]2i

Q3 = y[1]4i−1 Q4 = y[1]4i

Forecasting with temporal hierarchies Temporal hierarchies 7

Page 15: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

General notation

Collecting these in one column vector,

yi =(y[4]i ,y

[2]′

i ,y[1]′

i

)′.

Hence,yi = Sy[1]

i .

For m = 4,

S =

1 1 1 11 1 0 00 0 1 1

I4

Forecasting with temporal hierarchies Temporal hierarchies 8

Page 16: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

General notation

Collecting these in one column vector,

yi =(y[4]i ,y

[2]′

i ,y[1]′

i

)′.

Hence,yi = Sy[1]

i .

For m = 4,

S =

1 1 1 11 1 0 00 0 1 1

I4

Forecasting with temporal hierarchies Temporal hierarchies 8

Page 17: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

General notation: monthly data

Annual

Semi-Annual1

Q1

M1 M2 M3

Q2

M4 M5 M6

Semi-Annual2

Q3

M7 M8 M9

Q4

M10 M11 M12

k = {12, 6, 3, 1};k = {12, 4, 2, 1};k = {12, 6, 4, 3, 2, 1}.

Forecasting with temporal hierarchies Temporal hierarchies 9

Page 18: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

General notation: monthly data

Annual

FourM1

BiM1

M1 M2

BiM2

M3 M4

FourM2

BiM3

M5 M6

BiM4

M7 M8

FourM3

BiM5

M9 M10

BiM6

M11 M12

k = {12, 6, 3, 1};k = {12, 4, 2, 1};k = {12, 6, 4, 3, 2, 1}.

Forecasting with temporal hierarchies Temporal hierarchies 9

Page 19: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

General notation: monthly data

Annual

FourM1

BiM1

M1 M2

BiM2

M3 M4

FourM2

BiM3

M5 M6

BiM4

M7 M8

FourM3

BiM5

M9 M10

BiM6

M11 M12

k = {12, 6, 3, 1};k = {12, 4, 2, 1};k = {12, 6, 4, 3, 2, 1}.

Forecasting with temporal hierarchies Temporal hierarchies 9

Page 20: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

General notation: monthly data

ASemiA1

SemiA2

FourM1

FourM2

FourM3

Q1

...Q4

BiM1

...BiM6

M1

...M12

︸ ︷︷ ︸

yi

=

1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 0 0 0 0 0 00 0 0 0 0 0 1 1 1 1 1 11 1 1 1 0 0 0 0 0 0 0 00 0 0 0 1 1 1 1 0 0 0 00 0 0 0 0 0 0 0 1 1 1 11 1 1 0 0 0 0 0 0 0 0 0

...0 0 0 0 0 0 0 0 0 1 1 11 1 0 0 0 0 0 0 0 0 0 0

...0 0 0 0 0 0 0 0 0 0 1 1

I12

︸ ︷︷ ︸

S

M1

M2

M3

M4

M5

M6

M7

M8

M9

M10

M11

M12

︸ ︷︷ ︸

y[1]i

Forecasting with temporal hierarchies Temporal hierarchies 10

Page 21: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Outline

1 Introduction

2 Temporal hierarchies

3 Optimal combination forecasts

4 A Monte-Carlo simulation study

5 Conclusion

Forecasting with temporal hierarchies Optimal combination forecasts 11

Page 22: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Forecasting framework

Let h be the required forecast horizon at the annuallevel. For each aggregation level k we generatem/k × h base forecasts and stack them the sameway as the data,

yh = (y[m]h , . . . , y[k3]′

h , y[k2]′

h , y[1]′

h )′.

Reconciliation regression,

yh = Sβ(h) + εh

where β(h) = E[y[1]bT/mc+h|y1, . . . , yT] and εh is the

reconciliation error with mean zero and covarianceΣh.

Forecasting with temporal hierarchies Optimal combination forecasts 12

Page 23: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Forecasting framework

Let h be the required forecast horizon at the annuallevel. For each aggregation level k we generatem/k × h base forecasts and stack them the sameway as the data,

yh = (y[m]h , . . . , y[k3]′

h , y[k2]′

h , y[1]′

h )′.

Reconciliation regression,

yh = Sβ(h) + εh

where β(h) = E[y[1]bT/mc+h|y1, . . . , yT] and εh is the

reconciliation error with mean zero and covarianceΣh.

Forecasting with temporal hierarchies Optimal combination forecasts 12

Page 24: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Approx. optimal forecasts

Solution 1: OLS

Approximate Σh by σ2I.

Solution 2: WLS (variance scaling)

Let Λ =[diagonal

(Σ1

)]contain the one-step

forecast error variances.

Yh = S(S′Λ−1S)−1S′Λ−1Yh

Easy to estimate, and places more weightwhere we have best forecasts.

Forecasting with temporal hierarchies Optimal combination forecasts 13

yh = Sβ(h) = S(S′Σ−1h S)−1S′Σ−1

h yh

Page 25: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Approx. optimal forecasts

Solution 1: OLS

Approximate Σh by σ2I.

Solution 2: WLS (variance scaling)

Let Λ =[diagonal

(Σ1

)]contain the one-step

forecast error variances.

Yh = S(S′Λ−1S)−1S′Λ−1Yh

Easy to estimate, and places more weightwhere we have best forecasts.

Forecasting with temporal hierarchies Optimal combination forecasts 13

yh = Sβ(h) = S(S′Σ−1h S)−1S′Σ−1

h yh

Page 26: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Approx. optimal forecasts

Solution 1: OLS

Approximate Σh by σ2I.

Solution 2: WLS (variance scaling)

Let Λ =[diagonal

(Σ1

)]contain the one-step

forecast error variances.

Yh = S(S′Λ−1S)−1S′Λ−1Yh

Easy to estimate, and places more weightwhere we have best forecasts.

Forecasting with temporal hierarchies Optimal combination forecasts 13

yh = Sβ(h) = S(S′Σ−1h S)−1S′Σ−1

h yh

Page 27: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Approx. optimal forecasts

Forecasting with temporal hierarchies Optimal combination forecasts 14

Yh = S(S′Σ−1h S)−1S′Σ−1

h Yh

Solution 3: WLS (structural scaling)

Bottom level reconciliation errors haveapproximately the same variances.

Assuming that they are approximatelyuncorrelated then Σh is proportional to thenumber of series contributing to each node.

So set Σh = σ2Λ where

Λ = diag(S× 1)

where 1 = (1,1, . . . ,1)′.

Page 28: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Outline

1 Introduction

2 Temporal hierarchies

3 Optimal combination forecasts

4 A Monte-Carlo simulation study

5 Conclusion

Forecasting with temporal hierarchies A Monte-Carlo simulation study 15

Page 29: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup

Silvestrini and Veredas (2008, JoES):Survey paper on temporal aggregation.

Theoretical derivation of temporarilyaggregated ARIMA models.

Empirical application Belgian cash deficitseries, 252 monthly observations,ARIMA(0,0,1)(0,1,1)12 with an intercept.

Discussion on estimation efficiency loss.

Two simulation setups.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 16

Page 30: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1

Freq ARIMA orders c θ1 θ2 Θ1 σe

Theoretically derived parameters

Annual (0,1,2) 112.3 -0.43 0.01SemAnn (0,0,1)(0,1,1)2 28.1 -0.05 -0.4FourM (0,0,1)(0,1,1)3 12.4 -0.06 -0.4Quart (0,0,1)(0,1,1)4 7.0 -0.10 -0.4BiMonth (0,0,1)(0,1,1)6 3.1 -0.13 -0.4

× 103

Estimated parameters

Monthly (0,0,1)(0,1,1)12 0.78 -0.22 -0.4 4.19× 103 × 10−5

Drawing from εt ∼ N(0, σ2ε ), we generate time series from the monthly DGP

and then aggregate these to the levels above.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 17

Page 31: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

ARIMA(0,0,1)(0,1,1)12 with drift

Time

Ann

ual

5 10 15 20

4.5

5.5

6.5

Time

Qua

rter

ly

5 10 15 20

1.2

1.4

1.6

1.8

Time

Sem

i−A

nnua

l

5 10 15 20

2.4

2.8

3.2

3.6

TimeB

i−M

onth

ly5 10 15 20

0.7

0.9

1.1

Fou

r−M

onth

ly

5 10 15 20

1.6

2.0

2.4

Mon

thly

0 50 100 150 200 250

0.2

0.4

0.6

0.8

Forecasting with temporal hierarchies A Monte-Carlo simulation study 18

Page 32: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1

Four scenarios. Base forecasts for each series ateach aggregation level generated from:

1 the theoretically derived ARIMA DGPs at eachlevel (complete certainty);

2 the theoretically derived correct ARIMAspecification but with estimated parameters(parameter uncertainty);

3 an automatically selected ARIMA model (modeluncertainty);

4 an automatically selected ETS model (partialmodel misspecification).

Forecasting with temporal hierarchies A Monte-Carlo simulation study 19

Page 33: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1

Forecast comparisons

1 Approx. optimal combination using WLS (Variance).

2 Bottom up.

versus base (unreconciled) forecasts.

Negative (positive) entries represent a percentagedecrease (increase) in RMSE compared to base (unreconciled)forecasts.

Iterations

Sample sizes (annual): 4, 12, 20, 40.

Forecast horizons (annual): 1, 3, 5, 10.

1,000 iterations for each sample size.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 20

Page 34: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1

Forecast comparisons

1 Approx. optimal combination using WLS (Variance).

2 Bottom up.

versus base (unreconciled) forecasts.

Negative (positive) entries represent a percentagedecrease (increase) in RMSE compared to base (unreconciled)forecasts.

Iterations

Sample sizes (annual): 4, 12, 20, 40.

Forecast horizons (annual): 1, 3, 5, 10.

1,000 iterations for each sample size.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 20

Page 35: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1

Forecast comparisons

1 Approx. optimal combination using WLS (Variance).

2 Bottom up.

versus base (unreconciled) forecasts.

Negative (positive) entries represent a percentagedecrease (increase) in RMSE compared to base (unreconciled)forecasts.

Iterations

Sample sizes (annual): 4, 12, 20, 40.

Forecast horizons (annual): 1, 3, 5, 10.

1,000 iterations for each sample size.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 20

Page 36: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1: results

Scenario 1 (complete certainty):Base forecasts from theoretically derived ARIMA DGPs.

Sample 4 12 20 40 4 12 20 40(h) (1) (3) (5) (10) (1) (3) (5) (10)

WLS Bottom-up

Annual -0.3 0.0 0.0 0.0 -0.7 -0.1 0.2 0.1SemiA -0.2 -0.1 0.0 0.0 -0.5 -0.1 0.1 0.0FourM -0.1 0.0 0.0 0.0 -0.2 -0.1 0.1 0.0Quart -0.1 0.0 0.0 0.0 -0.2 0.0 0.0 0.0BiM 0.0 0.0 0.0 0.0 -0.1 0.0 0.0 0.0Monthly 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0

Negative (positive) entries represent a percentage decrease(increase) in RMSE compared to base (initial) forecasts.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 21

Page 37: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1: results

Scenario 1 (complete certainty):Base forecasts from theoretically derived ARIMA DGPs.

Sample 4 12 20 40 4 12 20 40(h) (1) (3) (5) (10) (1) (3) (5) (10)

WLS Bottom-up

Annual -0.3 0.0 0.0 0.0 -0.7 -0.1 0.2 0.1SemiA -0.2 -0.1 0.0 0.0 -0.5 -0.1 0.1 0.0FourM -0.1 0.0 0.0 0.0 -0.2 -0.1 0.1 0.0Quart -0.1 0.0 0.0 0.0 -0.2 0.0 0.0 0.0BiM 0.0 0.0 0.0 0.0 -0.1 0.0 0.0 0.0Monthly 0.0 0.0 0.0 0.0 0.0 0.0 0.0 0.0

Negative (positive) entries represent a percentage decrease(increase) in RMSE compared to base (initial) forecasts.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 21

Page 38: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1: results

Scenario 2 (parameter uncertainty):Base forecasts from estimated ARIMA DGPs at each level.

Sample 4 12 20 40 4 12 20 40(h) (1) (3) (5) (10) (1) (3) (5) (10)

WLS Bottom-up

Annual -4.3 -7.9 -6.1 -3.3 -5.3 -9.5 -7.1 -3.4SemiA -5.2 -3.5 -1.6 -0.2 -7.6 -4.8 -2.4 -0.2FourM -3.7 -1.4 -0.4 -0.1 -5.5 -2.6 -0.9 -0.2Quart -3.9 -0.6 -0.2 -0.1 -6.0 -1.8 -0.7 -0.2BiM -1.1 0.0 0.1 0.0 -2.8 -0.9 -0.2 -0.1Monthly 1.0 0.4 0.1 0.0 0.0 0.0 0.0 0.0

Negative (positive) entries represent a percentage decrease(increase) in RMSE compared to base (initial) forecasts.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 22

Page 39: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1: results

Scenario 3 (model uncertainty):Base forecasts from automatically selected ARIMA models.

Sample 4 12 20 40 4 12 20 40(h) (1) (3) (5) (10) (1) (3) (5) (10)

WLS Bottom-up

Annual -66.2 -5.1 -2.6 -0.4 -64.2 -1.2 5.9 27.9SemiA -50.6 -4.9 -2.6 -1.2 -48.5 -2.8 2.3 13.8FourM -10.1 -6.2 -2.0 -1.2 -7.1 -5.1 1.4 8.7Quart -16.4 -4.1 -1.9 -0.8 -14.0 -3.0 0.4 6.5BiM -7.5 -3.3 -0.7 -0.9 -5.8 -2.4 1.2 3.8Monthly -0.9 -0.5 -0.8 -1.9 0.0 0.0 0.0 0.0

Negative (positive) entries represent a percentage decrease(increase) in RMSE compared to base (initial) forecasts.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 23

Page 40: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 1: results

Scenario 4 (partial misspecification):Base forecasts from automatically selected ETS models.

Sample 4 12 20 40 4 12 20 40(h) (1) (3) (5) (10) (1) (3) (5) (10)

WLS Bottom-up

Annual -24.7 1.6 0.5 -1.8 -20.9 69.1 101.5 150.4SemiA -42.5 -5.4 -2.7 -1.1 -40.0 35.4 63.8 105.3FourM -9.4 -6.7 -2.7 -4.3 -5.7 23.4 47.8 73.1Quart -1.2 -8.3 -5.5 -5.9 2.3 15.5 33.3 54.9BiM -0.9 -8.3 -9.3 -8.6 1.9 8.2 16.1 32.7Monthly -1.4 -7.3 -11.3 -16.9 0.0 0.0 0.0 0.0

Negative (positive) entries represent a percentage decrease(increase) in RMSE compared to base (initial) forecasts.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 24

Page 41: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 2

Take one draw from DGP 1 at the monthlylevel and fit an ETS model: ETS(A,Ad,A).

µt = `t−1 + φbt−1 + st−m`t = `t−1 + φbt−1 + αεtbt = φbt−1 + βεtst = st−m + γεtYt+h|t = `t + φhbt + st−m+h+m

where φh = φ+ · · ·+ φh and h+m =

[(h− 1) mod m

]+ 1

α = β = 0.0144, γ = 0.5521, φ = 0.9142.

1 Scenario 1: Forecast with ETS;2 Scenario 2: Forecast with ARIMA;Forecasting with temporal hierarchies A Monte-Carlo simulation study 25

Page 42: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 2

Take one draw from DGP 1 at the monthlylevel and fit an ETS model: ETS(A,Ad,A).

µt = `t−1 + φbt−1 + st−m`t = `t−1 + φbt−1 + αεtbt = φbt−1 + βεtst = st−m + γεtYt+h|t = `t + φhbt + st−m+h+m

where φh = φ+ · · ·+ φh and h+m =

[(h− 1) mod m

]+ 1

α = β = 0.0144, γ = 0.5521, φ = 0.9142.

1 Scenario 1: Forecast with ETS;2 Scenario 2: Forecast with ARIMA;Forecasting with temporal hierarchies A Monte-Carlo simulation study 25

Page 43: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

ETS(A,Ad,A)Annual

5 10 15 20

5.2

5.6

6.0

6.4

Quarterly

5 10 15 20

1.2

1.4

1.6

Semi-Annual

5 10 15 20

2.6

2.8

3.0

3.2

Bi-Monthly

5 10 15 20

0.80.91.01.11.2

Four-Monthly

5 10 15 20

1.6

1.8

2.0

2.2

Monthly

0 50 100 150 200 250

0.3

0.4

0.5

0.6

Forecasting with temporal hierarchies A Monte-Carlo simulation study 26

Page 44: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

ETS(A,Ad,A)Annual

5 10 15 20

4.2

4.6

5.0

Quarterly

5 10 15 20

1.0

1.2

Semi-Annual

5 10 15 20

2.0

2.2

2.4

2.6

Bi-Monthly

5 10 15 20

0.5

0.7

0.9

Four-Monthly

5 10 15 20

1.2

1.4

1.6

1.8

Monthly

0 50 100 150 200 250

0.2

0.4

0.6

Forecasting with temporal hierarchies A Monte-Carlo simulation study 27

Page 45: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 2: results

Scenario 1: DGP is ETS(A,Ad,A). Fitting ETS.

Sample 4 12 20 40 4 12 20 40(h) (1) (3) (5) (10) (1) (3) (5) (10)

WLS Bottom-up

Annual -12.3 -5.3 -7.1 -9.8 -7.0 1.2 -6.7 -6.4SemiA -26.9 -3.5 -5.6 -4.2 -23.5 4.2 -5.2 -0.9FourM -5.2 -3.6 -5.4 -1.5 -1.4 4.3 -5.0 1.6Quart -2.3 -4.5 -5.0 -0.9 1.1 3.2 -4.8 2.1BiM -1.4 -4.0 -1.9 0.3 1.1 3.3 -1.8 3.0Monthly -1.4 -4.7 -0.1 -1.9 0 0 0 0

Negative (positive) entries represent a percentage decrease(increase) in RMSE compared to base (initial) forecasts.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 28

Page 46: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Simulation setup 2: results

Scenario 2: DGP is ETS(A,Ad,A). Fitting ARIMA.

Sample 4 12 20 40 4 12 20 40(h) (1) (3) (5) (10) (1) (3) (5) (10)

WLS Bottom-up

Annual -39.9 -7.6 -9.4 -1.0 -36.4 -2.0 -2.6 5.8SemiA -36.6 -1.3 -2.1 -0.8 -33.6 3.7 4.4 6.1FourM -12.6 -4.0 -3.8 -2.6 -8.8 0.7 2.2 3.9Quart -23.9 -3.9 -4.4 -5.1 -19.8 0.3 1.2 1.3BiM -11.5 -2.9 -3.6 -3.6 -8.2 0.5 1.7 2.6Monthly -2.9 -2.5 -3.8 -5.0 0 0 0 0

Negative (positive) entries represent a percentage decrease(increase) in RMSE compared to base (initial) forecasts.

Forecasting with temporal hierarchies A Monte-Carlo simulation study 29

Page 47: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Outline

1 Introduction

2 Temporal hierarchies

3 Optimal combination forecasts

4 A Monte-Carlo simulation study

5 Conclusion

Forecasting with temporal hierarchies Conclusion 30

Page 48: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Conclusion/Implications

1 Significant forecast gains from applyingtemporal hierarchies both in simulations andempirical evaluations.

2 Beside the forecast gains we achieve thealignment of short, medium and long termforecasts.

3 Significant implications from an operational,managerial point of view.

Forecasting with temporal hierarchies Conclusion 31

Page 49: (with Rob J. Hyndman, Nikos Kourentzes and Fotis ...€¦ · Kourentzes, Petropoulos, Trapero (2014), MAPA, IJF. å For a series observed at the highest possible frequency, construct

Interesting questions

Measurement error versus the level ofaggregation.

Aggregation level versus the effect of outliers.

Taking advantage of high frequency beingupdated more regularly.

Thank you!

Forecasting with temporal hierarchies Conclusion 32