time series analysis - otto von guericke university...

63
Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 1 Time Series Analysis

Upload: others

Post on 24-Apr-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 1

Time Series Analysis

Page 2: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Time Series

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 2

• Motivation

• Decomposition Models

◦ Additive models, multiplicative models

• Global Approaches

◦ Regression◦ With and without seasonal component

• Local Approaches

◦ Moving Averages Smoothing◦ With and without seasonal component

• Summary

Page 3: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Motivation: Temperatures

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 3

Example: Temperatures data set (fictive)

• The plot shows the average temperature per day for 50 years.

• Is there any trend visible?

• How to extract seasonal effects?

Page 4: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Decomposition Models

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 4

• The time series is given as a sequence of values

y1, . . . , yt, . . . , yn

• We assume that every yt is a composition of (some of) the following components:

◦ gt trend component

◦ st seasonal component

◦ ct cyclical variation

◦ ǫt irregular component (random factors, noise)

• Assume a functional dependency:

yt = f (gt, st, ct, ǫt)

Page 5: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Components of Time Series

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 5

Trend Component

• Reflects long-term developments.

• Often assumed to be a monotone function of time.

• Represents the actual component we are interested in.

Cyclic Component

• Reflects mid-term developments.

• Models economical cycles such as booms and recessions.

• Variable cycle length.

• We do not consider this component here.

Remark: Often, both components are combined.

Page 6: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Components of Time Series

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 6

Seasonal Component

• Reflects short-term developments.

• Constant cycle length (i. e., 12 months)

• Represents changes that (re)occur rather regularly.

Irregular Component

• Represents everything else that cannot be related to the other components.

• Combines irregular changes, random noise and local fluctuations.

• We assume that the values are small and have an average of zero.

Page 7: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Decompositions

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 7

Additive Decomposition

yt = gt + st + ǫt

• Pure trend model: yt = gt + ǫt (stock market, no season)

• Possible extension: yt = gt + st + xtβ + ǫt (calendar effects)

Multiplicative Decomposition

yt = gt · st · ǫt

• Seasonal changes may increase with trend.

• Transform into additive model:

yt = log yt + log st + log ǫt

Page 8: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Time Series Analysis

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 8

Goal: Estimate the components from a given time series, i. e.

gt + st + ǫt ≈ yt

Application: With the estimates, we can compute the

• trend-adjusted series: yt − gt

• season-adjusted series: yt − st

• We only consider additive models here.

⇒ Additional assumptions necessary in order to find waysto infer the desired components.

Page 9: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Overview

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 9

• Global approach: There is a fix functional dependencethroughout the entire time range. (⇒ regression models)

• Local approach: We do not postulate a global model andrather use local estimations to describe the respective components.

• Seasonal effects: We have to decide beforehand whetherto assume a seasonal component or not.

Global Local

without Season Regression Smoothing Averages

with Season Dummy Variables Smoothing Averages

Page 10: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Global Approach (without Season)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 10

Model: yt = gt + ǫt

Assumptions:

• No seasonal component: st = 0

• Depending on gt, use regression analysis to estimate theparameter(s) to define the trend component.

◦ linear trend: gt = β0 + β1t

◦ quadratic trend: gt = β0 + β1t + β2t2

◦ polynomial trend: gt = β0 + β1t + · · · + βqtq

◦ exponential trend: gt = β0 exp(β1t)

◦ logistic trend: gt =β0

β1+exp(−β2t)

Page 11: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Global Approach (with Season)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 11

Model: yt = st + ǫt (no trend)

Assumptions:

• No trend component: gt = 0

• Seasonal component does not change from period to period.

• Introduce dummy variables for every time span (here: months) thatserve as indicator functions to determine to whichmonth a specific t belongs:

sm(t) =

1, if t belongs to month m

0, otherwise

• The seasonal component is then set up as st =12∑

m=1

βmsm(t).

• Determine the monthly effects βm with normal least squares method.

Page 12: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Global Approach (with Season)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 12

Model: yt = gt + st + ǫt

Assumptions:

• Estimate gt while temporarily ignoring st.

• Estimate st from the trend-adjusted yt = yt − gt.

Model: yt = α1t + · · · + αqtq + · · · + β1s1(t) + · · · + β12s12(t) + ǫt

Assumptions:

• Seasonal component does not change from period to period.

• Model the seasonal effects with trigonometric functions:

st = β0 +6∑

m=1

βm cos(

2πm

12t)

+5∑

m=1

γm sin(

2πm

12t)

• Determine α1, . . . , αq, β0, . . . , β6 and γ1, . . . , γ5 with normal least squares method.

Page 13: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Local Approach (without Season)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 13

General Idea: Smooth the time series.

• Estimate the trend component gt at time t as the average of the values aroundtime t.

For a given time series y1, . . . , yn, the Smoothing Average y⋆t of order ris defined as follows:

y⋆t =

1

2k + 1·

k∑

j=−k

yt+j, if r = 2k + 1

1

2k· (

1

2yt−k +

k−1∑

j=−k+1

yt+j +1

2yt+k), if r = 2k

Page 14: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Local Approach (without Season)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 14

Model: yt = gt + ǫt

Assumptions:

• In every time frame of width 2k + 1 the time series can be assumed to be linear.

• ǫt averages to zero.

• Then we use the smoothing average to estimate the trend component:

gt = y⋆t

Page 15: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Local Approach (with Season)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 15

Model: yt = gt + st + ǫt

Assumptions:

• Seasonal component has period length p (repeats after p points):

st = st+p, t = 1, . . . , n− p

• Sum of seasonal values is zero:p∑

j=1

sj = 0

• Trend component is linear in time frames of width p (if p is odd)or p + 1 (if p is even).

• Irregular component averages to zero.

Page 16: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Local Approach (with Season)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 16

Let k = p−12 (for odd p) or k = p

2 (for even p).

Then:• Estimate the trend component with smoothing average:

gt = y⋆t , k + 1 ≤ t ≤ n− k

• Estimate the seasonal components s1, . . . , sp as follows:

si = si −1

p

p∑

j=1

sj with st1

mi − li + 1

mi∑

j=l

(yi+jp − y⋆i+jp), 1 ≤ i ≤ p

wheremi = max {m ∈ N0 | i +mp ≤ n− k}

andli = min {l ∈ N0 | i + lp ≥ k + 1}

Page 17: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Example (from motivation)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 17

• We can extract an increase and decrease of 1 degree during 50 years even thoughthe amount of noise is more than twice as large than the actual trend.

Page 18: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Example

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 18

5 years period, trend ±8 degrees, noise amount ±2 degrees

Page 19: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Example

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 19

100 years period, trend ±1 degree, noise amount ±3 degrees

Page 20: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Summary

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 20

• Definition of the problem domain

◦ Consider a time series to be composed of subcomponents.◦ Additive and multiplicative models.

• Global and local approaches

◦ With and without seasonal components.

• Robust to noise

◦ Noise can be higher than the trend component itself.

Page 21: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 21

Frequent Patterns in Temporal Data

Page 22: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Contents

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 22

• Frequent pattern in temporal data

◦ Motivation / Problem◦ Other common methods◦ Algorithms / Example

Page 23: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Quality Surveillance of Vehicles

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 23

Page 24: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Quality Surveillance of Vehicles

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 24

Page 25: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Quality Surveillance of Vehicles

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 25

Page 26: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Pilot Series Vehicles

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 26

Page 27: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Pilot Series Vehicles

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 27

Page 28: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

What is a Temporal Pattern?

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 28

Page 29: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

What is a Temporal Pattern?

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 29

Page 30: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Content

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 30

• Frequent pattern in temporal data

◦ Motivation / Problem◦ Other common methods◦ Algorithms / Example

Page 31: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Agrawal 1995

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 31

• Many customers with a history of item sets (transaction)

• Searches for sequences◦ z.B. {a, b} → {c} → {b, c}

• No rules

• No time window

• Support: Count!◦ Support counter of a sequence is incremented, if it occurs for a customer

Page 32: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Hoppner 2002

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 32

• On a single time line, events have a duration

• Searches for patterns◦ a contains b and meets with c

• Rules are induced from patterns

• Uses a time frame that needs to contain the pattern

• Support: Temporal Support

Page 33: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Related Methods

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 33

Page 34: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Content

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 34

• Frequent patterns in temporal data

◦ Motivation / Problem◦ Other common methods◦ Algorithms / Example

Page 35: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Partial Apriori Criterion

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 35

• All suffixes of a frequent pattern are frequent.

Page 36: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Candidate Generation

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 36

Page 37: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Support Evaluation

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 37

• Exploit the property of normalised patterns by finite automata

Page 38: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Finite automata get stuck

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 38

• A finite automaton gets stuck after accepting the fist ’a’.

• Solution to this problem: Create copies and filter found occurences

Page 39: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Example: Quality surveillance of Vehicles

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 39

• 101 250 vehicles

◦ Workshop stops

◦ Vehicle configuration

◦ 1.4 Mio. temporal intervals

Page 40: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Number of Frequent Patterns

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 40

Page 41: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Runtime

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 41

Page 42: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 42

Efficiently Finding Motifs in Time Series

Page 43: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Content

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 43

• Efficiently Finding Motifs in Time Series

◦ Data Mining in Time Series◦ memory-efficient Representationen◦ Symbolic Aggregat-Approximation (SAX)◦ Finding Motifs in Time Series with SAX◦ Example

Page 44: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Data Mining in Time Series

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 44

• Main Task: Find useful information in time series

• typical problems: Clustering, Classification, Discovery of frequent patterns andrules, visualisisation, anomaly detection

• Problems are reduced to finding repeated, similar subsequences because of theamount of data

• requires: Similarity measure to compare subsequences

• e.g. euclidean distance

d(Q,C) =

n∑

i=1

(qi − ci)2

between 2 standard normal distributed subsequences Q = (q1, . . . , qn)T and C =

(c1, . . . , cn)T

• Problem: many comparisons and memory capacity often too low to load all therequired data

Page 45: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Memory Efficient Representation

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 45

• Problem: Many, slow accesses to data

• Solution: Approximation of time series, which fits into memory and keeps relevant(or interesting) features

• e.g. discrete fourier transformation (DFT), discrete wavelet transformation (DWT),partially linear approximation and adaptive, partially constant approximation(APCA), singular value decomposition (SVD)

• here: symbolic representationen

• Advantage: Algorithms from information retrieval and bioinformatics can be used(Hashing, markovian models, . . .)

Page 46: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Time Series REpresentation

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 46

Representationen

model based

HMM ARMA

not adaptive

Wavelets

Orthonormal

Haar Daubechies

Bi-orthonormal

Coiflets Symlets

random PAA Spectral

DFT DCT Chebyshev

data-driven

prunedphase-

basedGrid

adaptive

Sorted

Coeffi-

cients

partially

polyno-

mial

partially

linear

Interpolation Regression

APCA

SVD Symbolic

NLG Strings

SAXvalue-

based

inclination-

based

Trees

Page 47: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Most Commonly Used Representation

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 47

DFT PLA Haar wavelet APCA

Page 48: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Partially Aggregated Approximation (PAA)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 48

Reduction from 128 to 8 Data points

Page 49: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Symbolic Aggregate Approximation (SAX)

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 49

• for every sequence of length n a word with length w is defined (over an alpabetA = {α1, . . . , αa} with |A| = a

• simple Algorithm:1. Split up subsequence into w equally-sized intervals

2. PAA: For each interval find a representativ (e.g. mean value)C = (c1, . . . , cn)

T is mapped toC = (c1, . . . , cw) durch

ci =w

n

nwi∑

j=nw(i−1)+1

cj

3. Map mean value ci of C to one of the a letters byai = αj ⇔ βj−1 ≤ ci ≤ βj

• Assumption: Range of values of the PAA sequence is normally distributed andevery occurence of a letter is equally likely

• Mapping ci 7→ b ∈ A by “sites of fracture” β1, . . . , βa−1

Page 50: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

“Sites of Fracture” of a Normal Distribution

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 50

|A| 3 4 5 6 7 8 9 10β1 −.43 −.67 −.84 −.97 −1.07 −1.15 −1.22 −1.28β2 0.43 0 0.25 0.43 0.57 0.67 0.76 0.84β3 0.67 0.25 0 −.18 −.32 −.43 −.52β4 0.84 0.43 0.18 0 −.14 −.25β5 0.97 0.57 0.32 0.14 0β6 1.07 0.67 0.43 0.25β7 1.15 0.76 0.52β8 1.22 0.84β9 1.28

• sites of fracture split normal distribution into equally probable regions

Page 51: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Example: SAX

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 51

• here: n = 128, w = 8, a = 3

• Result: baabccbc

Page 52: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Distance Measure for SAX

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 52

• PAA: lower bound to euclidean distance by

dr(Q, C) =

n

w

w∑

i=1

(qi − ci)2

• SAX:

d∗(Q, C) =

n

w

w∑

i=1

d∗a(qi, ci)2

• Distance measure d∗a should be defined by a lookup table, e.g. for a = 4a b c d

a 0 0 0.67 1.34b 0 0 0 0.67c 0.67 0 0 0d 1.340 0.67 0 0

d∗a(r, c) =

0 falls |r − c| ≤ 1,

βmax(r,c)−1 − βmin(r,c) sonst

Page 53: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Comparison of Distance Measures

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 53

Page 54: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

SAX-Advantage: Lower Bound

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 54

• d∗(Q, C) is lower bound of the euclidean distance d(Q,C) of the original se-quences Q and C

d∗(Q, C) ≤ d(Q,C)

• if Q and C are dissimilar then Q and C are dissimilar as well

• SAX-based algorithms produce identical results compared to algorithms that workwith original data

• “only” similar SAX words should be compared in the original feature space

• thus only few accesses to original data

Page 55: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Finding Motifs in Time Series

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 55

• Motifs: Primitive, frequent (similar) patterns, prototypes

• Challenges:◦ Motifs are unknown beforehand◦ exhaustive search is too expensive with a complexity of O(n2)◦ Outliers influence euclidean distance

Page 56: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Creating the SAX Matrix

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 56

• Find all motifs of a time series oflength m by sliding windows

• Window lengths n leads to (m−n+1) subsequences

• Transform every subsequence into aSAX word of length w

• Store in row matrix (so called SAXmatrix)

• Matrix has w columns and (m−n+1) rows

Page 57: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Random Projection

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 57

• Guess motif positions by so-called random projection

• pair-wise comparison of SAX words

• Collision matrix M with (m− n + 1)2 cells for every comparison

• Implement M efficiently by a hash table

• At first M(i, j) = 0 for 1 ≤ i, j ≤ m− n + 1

• Idea: Compare characters of two words in a SAX matrix with each other

• Better assumption: “don’t care symbols” in sequences with unknown location

• E.G. noisy Motif or compression/expansion of a sequence

Page 58: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Random Projection

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 58

• Thus: SAX matrix is projected onto 1 ≤ k < w randomly chosen columns

• Compare all rows of the projection

• if two projected SAX words in row i and j are identical, increment M(i, j)

• Projection is repeated t times, because some motifs will share an entry in M aftersome iterations

• It is unlikely that many random sequences will collide with an alredy found motif

• user-edefined threshhold s with 1 ≤ s ≤ k for collision entries in M

• All M(i, j) ≥ s are candidates for motifs

• But: the local neighbourhood of a sequence i contains many (so-called trivial)matches

• These are filtered at the end!

Page 59: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Random Projection

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 59

• first two iterations of a random projection

Page 60: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Subdimensional Motifs

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 60

• Random projections for motifs in univariate SAX time series can also be used ina multi-variate way

• Idea: increment the collision matrixM for each attribute j ∈ {1, . . . , p} for eachprojected SAX word

• Problem: relevant dimensions of potentional subdimensional motifs are unknown

• Solution:◦ Estimate a distribution P (dj) over distances between non-trivial matches bydrawing a sample

◦ Determine the distances d∗1, . . . , d∗p for each entry M(i, j) ≥ s

◦ if P (dj ≤ d∗j) < rrelj (user-specified dimension relevance), then every jthattribute is relevant

Page 61: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Example

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 61

• Expert identifies p = 9 of a total of 130 channels as important

• Motif lasts at least n = 400ms

• 10 time series are given to search for subdimensional motifs

Page 62: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Subdimensional Motif in Two Time Series

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 62

attr 3 attr 1

attr 3 attr 1

0100

200300

400

1000 1500 2000

Tim

e [s]

attr_0

0100

200300

400

0 10 30 50 70

Tim

e [s]

attr_1

0100

200300

400

0 200 600 1000

Tim

e [s]

attr_2

0100

200300

400

−16 −14 −12

Tim

e [s]

attr_3

0100

200300

400

−24 −20 −16

Tim

e [s]

attr_4

0100

200300

400

0 10 30 50 70

Tim

e [s]

attr_1

0100

200300

400

−18 −16 −14 −12

Tim

e [s]

attr_3

0100

200300

400

13.7 13.8 13.9 14.0

Tim

e [s]

attr_6

0100

200300

400

13.7 13.8 13.9 14.0 14.1

Tim

e [s]

attr_7

0100

200300

400

27.4 27.6 27.8 28.0

Tim

e [s]

attr_8

Page 63: Time Series Analysis - Otto von Guericke University Magdeburgfuzzy.cs.ovgu.de/studium/ida/txt/ida_timeseries.pdf · Time Series Analysis Prof. R. Kruse, Chr. Braune Intelligent Data

Clustering of Motivs

Prof. R. Kruse, Chr. Braune Intelligent Data Analysis 63

DO

0_00

36.c

sv_4

34

DO

0_00

36.c

sv_2

587

DO

0_00

77.c

sv_8

08

DO

0_00

77.c

sv_2

024

DO

0_00

36.c

sv_3

89

DO

0_00

36.c

sv_3

95

DO

0_00

36.c

sv_2

548

DO

0_00

36.c

sv_2

543

020

040

060

080

0

Cluster Dendrogram

hclust (*, "ward")prox

Hei

ght

• Calculate dissimilarity matrix bypairwise comparison of all found pat-terns in the 10 time series based ond∗

• Matrix is symetric, positive and con-tains only zeros on its principal diag-onal

• Can be used for grouping the oc-curences to find motifs that occur inseveral time series

• Here: hierarchical, agglomerativeclustering of all motifs, that con-tained the attributes attr 1 attr 3