clustering functional data with wavelets · advisors: anestis antoniadisa and jean-michel poggib a...

20
ADVISORS: Anestis Antoniadis a and Jean-Michel Poggi b a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional data with wavelets Jairo Cugliari R39 - OSIRIS - EDF R & D RESP .: XAVIER BROSSAT August 2010 Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Upload: others

Post on 04-Aug-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib

aJoseph Fourier University, GrenoblebParis-Sud University

Clustering functional datawith wavelets

Jairo Cugliari

R39 - OSIRIS - EDF R & D

RESP.: XAVIER BROSSAT

August 2010

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 2: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Plan

1 Motivation

2 Wavelet based feature extraction

3 Results

4 Conclusion

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 3: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

EDF data

Functional data from a time series

Consider a square integrable continuous time stochastic processX = (X (t), t ∈ R) observed over the interval [0,T ], T > 0 at a relatively highsampling frequency. A commonly used approach is to divide the interval[0,T ] into subintervals [lδ, (l + 1)δ], l = 1, . . . , n with δ = T/n, and toconsider the functional-valued discrete time stochastic processZ = (Zi , i ∈ N), associated to X by

Zi(t) = X (iδ + t) t ∈ [0, δ) (1)

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 4: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

EDF data

Functional data from a time series

Consider a square integrable continuous time stochastic processX = (X (t), t ∈ R) observed over the interval [0,T ], T > 0 at a relatively highsampling frequency. A commonly used approach is to divide the interval[0,T ] into subintervals [lδ, (l + 1)δ], l = 1, . . . , n with δ = T/n, and toconsider the functional-valued discrete time stochastic processZ = (Zi , i ∈ N), associated to X by

Zi(t) = X (iδ + t) t ∈ [0, δ) (1)

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 5: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Clustering and FD

I Given a sample of curves, wesearch for homogeneoussubgroups of individuals.

I Clustering is a process forpartitioning a dataset intosub-groups

I The instances within a groupare similar to each other andare very dissimilar to theinstances of other groups.

I In a functional contextclustering helps to identifyrepresentative curve patternsand individuals who are verylikely involved in the same orsimilar processes.

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 6: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Plan

1 Motivation

2 Wavelet based feature extraction

3 Results

4 Conclusion

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 7: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Wavelets

Wavelet transformI domain-transform technique for hierarchical decomposing finite energy

signalsI description in terms of an approximation plus a set of detailsI the broad trend is preserved in the approximation part, while the

localized changes are kept in the detail parts.

For short, a wavelet is asmooth and quicklyvanishing oscillating functionwith good localisationproperties in both frequencyand time.Specially interesting forapproximating time seriescurves that contain localizedstructures !!!

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 8: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Discret Wavelet Transform

We consider an orthonormal basis of waveforms derived from scaling andtranslations of a compactly supported scaling function φ and a compactlysupported mother wavelet ψ. We let

φj,k (t) = 2j/2φ(2j t − k), ψj,k (t) = 2j/2φ(2j t − k).

For any j0 ≥ 0, the collection

{φj0,k , k = 0, 1, . . . , 2j0 − 1;ψj,k , j ≥ j0, k = 0, 1, . . . , 2j − 1}, (2)

is an orthonormal basis of H a real separable Hilbert space.Any z ∈ H can be written as

z(t) =2j0−1∑k=0

cj0,kφj0,k (t) +∞∑

j=j0

2j−1∑k=0

dj,kψj,k (t), (3)

where cj,k and dj,k are the scale and the wavelet coefficients (resp.) of z atthe position k of the scale j defined as

cj,k =< z, φj,k >H dj,k =< z, ψj,k >H .

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 9: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Discret Wavelet Transform

We consider an orthonormal basis of waveforms derived from scaling andtranslations of a compactly supported scaling function φ and a compactlysupported mother wavelet ψ. We let

φj,k (t) = 2j/2φ(2j t − k), ψj,k (t) = 2j/2φ(2j t − k).

For any j0 ≥ 0, the collection

{φj0,k , k = 0, 1, . . . , 2j0 − 1;ψj,k , j ≥ j0, k = 0, 1, . . . , 2j − 1}, (2)

is an orthonormal basis of H a real separable Hilbert space.Any z ∈ H can be written as

z̃J(t) = c0φ0,0(t) +J−1∑j=0

2j−1∑k=0

dj,kψj,k (t). (3)

where cj,k and dj,k are the scale and the wavelet coefficients (resp.) of z atthe position k of the scale j defined as

cj,k =< z, φj,k >H dj,k =< z, ψj,k >H .

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 10: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Energy decomposition of the DWT

Since DWT is based on an L2-orthonormal basis decomposition we haveconservation of the signal’s energy.We can then write for a discretized function z̃ a characterization by the set ofchannel variances estimated at the output of the corresponding filter bank:

Ez ≈ ‖z‖22 = c2

0 +J−1∑j=0

2j−1∑k=0

d2j,k = c2

0 +J−1∑j=0

‖dj‖22. (4)

where Ez = ‖z‖2H.

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 11: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Scale specific AC and RC Contributions

We will use j0 = 0 and we will concentrate on the wavelet coefficients dj,k .We have conservation of the energy

||z(t)||2 = ||c0,0||2 +∑

j

||dj||2

.For each j = 1, . . . , J, we compute the absolute and relative contributionrepresentations (ACR and RCR rp.) by

contj = ||dj||2︸ ︷︷ ︸ACR

relj =||dj||2∑j ||dj||2︸ ︷︷ ︸

RCR

These coefficients resume the relative importance of each scale to the globaldynamic of a trajectory.

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 12: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Plan

1 Motivation

2 Wavelet based feature extraction

3 Results

4 Conclusion

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 13: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Simulated data

We simulate K = 3 clusters of 25 observations sampled by 1024 points each.

a 2-sinus model

b FAR with diagonal covariance operator

c FAR with non diagonal covariance operator

Figure: Mean energy scale’s contribution by model.

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 14: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Schema of procedure

I After approximating functions bydiscretized data, we obtain J handyfeatures.

I We use Steinley & Brusco’s featureselection algorithm

I In order to use k−means we estimatethe number of clusters K by detectingjumps in the distortion energy curvedK (Sugar & James, 2003):

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 15: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Simulated data

Confusion matrix

Model K1 K2 K3

2-sinus 25 – –FAR1 – 20 5FAR2 – 13 12

I Good overall missclaficationrate (18/75)

I Perfect distinction of 2-sinusmodel

I Relatively good performanceon the FAR models

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 16: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

EDF application

Data: 365 daily power demand profiles of french national consumption (48points per day)

Some well known facts of electricitydemand:

I 2 well defined seasons withtransitions

I Weekly cycle due to calendar(WE vs working days)

I Daily cycle: day vs nightI Other features that affect

electricity consumtion: bankholidays, special priced days,strikes, financial crisis, storms

Aim: Detect daily profiles of french national electricity load demand.

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 17: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional
Page 18: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Plan

1 Motivation

2 Wavelet based feature extraction

3 Results

4 Conclusion

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 19: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Conclusion

I We have presented a way of efficiently clustering functions usingwavelet-based dissimilarities.

I Wavelets give a well suited plateform because of their capacity ondetecting highly localized events.

I Feature extraction and feature selection give additional explanaitorycapacity to unsupervised learning.

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes

Page 20: Clustering functional data with wavelets · ADVISORS: Anestis Antoniadisa and Jean-Michel Poggib a Joseph Fourier University, Grenoble b Paris-Sud University Clustering functional

MotivationWavelet based feature extraction

ResultsConclusion

Antoniadis, A. - Fan, J.Regularization of wavelet approximations.Journal of the American Statistical Association, 96:455, 2001.

Antoniadis, A. - Paparoditis, E. - Sapatinas, T.A functional wavelet-kernel approach for time series prediction.Royal Statistical Society, 68:837–857, 2008.

Antoniadis, A. - Paparoditis, E. - Theofanis, S.Bandwidth selection for functional time series prediction.

Poggi, J.M.Prévision non paramétrique de la consommation électrique.Rev. Statistiqué Appliquée, XLII(4):93–98, 1994.

Compstat 2010 | August 2010 | Jairo Cugliari Clustering FD with waveletes