functional svd for big data - purdue universitypanc/research/fsvd/talk/fda_presentation.pdfpan chao...

Post on 07-Jan-2020

2 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

..........

.....

.....................................................................

.....

......

.....

.....

.

.

......Functional SVD for Big Data

Pan Chao

April 23, 2014

Pan Chao Functional SVD for Big Data April 23, 2014 1 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Outline

1. One-Way Functional SVD

a) Interpretationb) Robustnessc) CV/GCV

2. Two-Way Problem

3. Big data solution (split-and-recombine)

a) Data splitb) Recombine

4. Simulation

5. Summary and furture work

6. Reference

Pan Chao Functional SVD for Big Data April 23, 2014 2 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

One-Way Problem

To estimate the functional principle component of the data, we can useSVD method. In order to incorporate the functional nature of the data,regularization is imposed on the estimates.

The first regularized principle component for functional data can beestimated by minimizing

ρ(Y − uvT ) + λ∥u∥2∫

v′′2,

where:

Y : data matrix, Yij = Yi(tj).

u: first left vector.

v: first right vector, vj = v(tj).

Pan Chao Functional SVD for Big Data April 23, 2014 3 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Remarks:

1) Smoothness is controlled by the tuning parameter λ.

2) Adding ∥u∥2 to make the problem scale-invariant [Huang et al. 2008].

3) ρ(·) is the loss function which measure the fidelity of the rank-1approximation. If ρ(·) = ∥·∥2, then least square.

4) Due to the theory of smoothing spline [Green & Silverman], theintegral has a matrix expression∫

v′′2 = vTΩvv

where

Ωv = QR−1QT

Q and R are two banded matrices depending on the discretization ofthe data.

Pan Chao Functional SVD for Big Data April 23, 2014 4 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Smoothing Spline View

If Y has only one row, denoted by y, then requiring u = 1 results in astandard smoothing spline problem:

ρ(x− v) + λvTΩvv.

Pan Chao Functional SVD for Big Data April 23, 2014 5 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

PCA View

If ρ(·) = ∥·∥2F , the minimizer of v of the one-way problem without penaltyis

v = argmaxv

vTY TY v

∥v∥2,

which maximizing the variance of the projected data on v. So v is anestimated PC direction.

With one-way penalty imposed, [Huang et al. 2008]

v = argmaxv

vTY TY v

∥v∥2 + λvTΩv

= argmaxv

vTY TY v

vT (I + λΩ)v

Pan Chao Functional SVD for Big Data April 23, 2014 6 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Regression View [Zhang et al. 2013]

Let Y to be a column stack of the data matrix Y , and

U =

u 0 · · · 00 u · · · 0...

.... . . 0

0 0 · · · u

If the loss function ρ(·) = ∥·∥2 and u is given, the estimate of v for theone-way problem without penalty is given by a linear regression:

v =(UTU

)−1 UTY

If the one-way penalty is imposed, then we have Ridge-regression typeproblem. The estimate of v is

v =(UTU + λ∥u∥2Ω

)−1UTY

Pan Chao Functional SVD for Big Data April 23, 2014 7 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Robust Version [Zhang et al. 2013]

Sometimes there may be outliners in the observed data, then we can use amore robust loss function ρ(·) to replace the usual quadratic loss. Forexample, Huber loss

ρ(x) =

x2

2 , if |x| ≤ θ

θ(|x| − θ

2

), o.w.

−3 −2 −1 0 1 2 3

0.0

1.0

2.0

Pan Chao Functional SVD for Big Data April 23, 2014 8 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Then the regression is analogous to a weighted least square problem

v =(UTWU + λ∥u∥2Ω

)−1UTWY ,

where W is constructed from the weight matrix

Wij =ρ′(yij − uivj)

yij − uivj.

Pan Chao Functional SVD for Big Data April 23, 2014 9 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Picking λ

The smoothing parameter λ can be chosen by leave-one-columnCross-Validation. [Huang et al . 2008] [Zhang et al. 2013]

No robust: Let Aλ = (I + λΩ)−1, then the CV and GCV scores are:

CV (λ) =1

m

m∑j=1

((I −Aλ)Y

Tuj1− Aλjj

)2

GCV (λ) =1m

∑mj=1∥(I −Aλ)Y

Tu∥2

(1−Tr (Aλ) /m)2

Remark: If CV is defined to be leave-out-one-column, no computationalsimplification.

Pan Chao Functional SVD for Big Data April 23, 2014 10 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Robust: Let Aλ = U(UTWU + λ∥u∥2Ω

)−1UTW, then

GCV (λ) =1m∥v − v∗∥2

(1−Tr (Aλ) /m)2,

where

v =(UTWU + λ∥u∥2Ω

)−1UTWY

v∗ =(UTWU

)−1 UTWY

Pan Chao Functional SVD for Big Data April 23, 2014 11 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Two-Way Problem

If both directions of data are considered as functions, then a two-wayversion is to minimize

ρ(Y − uvT ) + λv∥u∥2∫

v′′2 + λu∥v∥2∫

u′′2 + λvλu∥u∥∥v∥∫

v′′2∫

u′′2,

where we added the penalty for the second direction and an interaction isintroduced.

ρ(·) can be chosen to be a robust loss.

Pan Chao Functional SVD for Big Data April 23, 2014 12 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Estimation [Zhang et al. 2013]

The estimates of v and u can be updated iteratively (IRLS) as:

v =(UTWU + 2Ωv|u

)−1 UTWYΩv|u = uT (I + λuΩu)u (I + λvΩv)− uTuI

u =(VTW∗V + 2Ωu|v

)−1 VTW∗Y∗

Ωu|v = vT (I + λvΩv)v (I + λuΩu)− vTvI

The hat matrices are:

H = U(UTWU + 2Ωv|u

)−1 UTW

H∗ = V(VTW∗V + 2Ωu|v

)−1 VTW∗

Pan Chao Functional SVD for Big Data April 23, 2014 13 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

GCV

GCV (λv|λu) =1

n

(∥v − v∗∥

1−Tr (H) /n

)2

v∗ =(UTWU

)−1 UTWY

GCV (λu|λv) =1

m

(∥u− u∗∥

1−Tr (H∗) /m

)2

u∗ =(VTW∗V

)−1 VTW∗Y∗

Remark: Leave-out-one-column/row CV criteria.

Pan Chao Functional SVD for Big Data April 23, 2014 14 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Big Data and Parallelism for One-Way Problem

The number of rows is large while the number of columns is moderate.Equally-spaced common grids are assumed. The smoothing parameter isforced to be the same for all subsets.

...1 Split data

...2 Estimation for one block

...3 Recombine

...4 Simulation result

Pan Chao Functional SVD for Big Data April 23, 2014 15 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Split Data

Y =

y11 y12 · · · y1my21 y22 · · · y2m...

yn11 yn12 · · · yn1my(n1+1)1 y(n1+1)2 · · · y(n1+1)m

...y(n1+n2)1 y(n1+n2)2 · · · y(n1+n2)m

...

y(n−nK)1 y(n−nK)2 · · · y(n−nK)m

...yn1 yn2 · · · ynm

n×m

Remarks:...1 Each subset is a block of the data matrix....2 The size of the subests are the same except the last one. (The totalnumber of rows may not be divisible by an integer)

Pan Chao Functional SVD for Big Data April 23, 2014 16 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

When penalty is imposed in one direction (say v), a SVD direction can beestimated for each subset and they can be combined to recover the resultwhen the whole matrix is used.

For kth subset,

vk = (UTk WkUk + 2Ωvk|uk

)−1UTk WkYk.

Ωvk|uk= ∥uk∥2λvΩv

uk = (VTk W∗

kVk + 2Ωuk|vk)−1VT

k W∗kY∗

k .

Ωuk|vk= λvv

Tk Ωvvk

Pan Chao Functional SVD for Big Data April 23, 2014 17 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Recombining

v(c) =

[(K∑k=1

UTk WkUk

)+ 2

K∑k=1

Ωvk|uk

]−1 K∑k=1

[(UTk WkUk + 2Ωvk|uk

)vk

]

u(c) = (u1, u2, · · · , uK)

Pan Chao Functional SVD for Big Data April 23, 2014 18 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Algorithm

initialize u = u(0) using SVD;

split Y and u(0);

iter = 0;tol = 99999;

while localdiff>tol and iter<maxiter doinitialize parallelism;

estimate v(i)k using u

(i)k ;

stop parallelism;

recombine v(i)k ’s to get v(i+1);

initialize parallelism;

estimate u(i+1)k ;

stop parallelism;

recombine u(i+1)k ’s to get u(i+1);

Y = u(i+1)v(i+1)T ;

localdiff = distance(Y − Y );end

Pan Chao Functional SVD for Big Data April 23, 2014 19 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Simulation

7 replicates, 1000× 200 data matrix, number of cores 1− 10, smoothingparameters seq(0, 2, by=0.1).

2 4 6 8 10

4050

6070

8090

Average Elapsed Time vs CPUS

CPUs

Avg

. Ela

psed

Tim

e

Pan Chao Functional SVD for Big Data April 23, 2014 20 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

0 200 400 600 800 1000 1200 1400

−35

−30

−25

−20

−15

−10

Logarithm of Max Pointwise Discrepency (Fixed Data Set and Fixed Smoothing Parameter)

Index

max

poi

ntw

ise

disc

repe

ncy

Pan Chao Functional SVD for Big Data April 23, 2014 21 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Summary

1. One-way functional SVD and its interpretation.

2. Estimation and smoothing parameter selection for a one-way problem.

3. Big data problem, i.e. too many curves, and parallelism.

4. Simulation results, efficiency and accuracy.

Pan Chao Functional SVD for Big Data April 23, 2014 22 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

Future Work

1. Adding GCV for the one-way problem.

2. Parallelizing two-way problems.

Pan Chao Functional SVD for Big Data April 23, 2014 23 / 24

..........

.....

.....................................................................

.....

......

.....

.....

.

References

Xueying Chen and Min-ge Xie.

A split-and-conquer approach for analysis of extraordinarily large data.

pages 1–35, 2012.

Jianhua Z. Huang, Haipeng Shen, and Andreas Buja.

Functional principal components analysis via penalized rank one approximation.

Electronic Journal of Statistics, 2(March):678–695, 2008.

Jianhua Z. Huang, Haipeng Shen, and Andreas Buja.

The Analysis of Two-Way Functional Data Using Two-Way Regularized SingularValue Decompositions.

Journal of the American Statistical Association, 104(488):1609–1620, December2009.

L Zhang, H Shen, and JZ Huang.

Robust regularized singular value decomposition with application to mortality data.

The Annals of Applied Statistics, 2013.

Pan Chao Functional SVD for Big Data April 23, 2014 24 / 24

top related