functional svd for big data - purdue universitypanc/research/fsvd/talk/fda_presentation.pdfpan chao...

24
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Functional SVD for Big Data Pan Chao April 23, 2014 Pan Chao Functional SVD for Big Data April 23, 2014 1 / 24

Upload: others

Post on 07-Jan-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

.

......Functional SVD for Big Data

Pan Chao

April 23, 2014

Pan Chao Functional SVD for Big Data April 23, 2014 1 / 24

Page 2: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Outline

1. One-Way Functional SVD

a) Interpretationb) Robustnessc) CV/GCV

2. Two-Way Problem

3. Big data solution (split-and-recombine)

a) Data splitb) Recombine

4. Simulation

5. Summary and furture work

6. Reference

Pan Chao Functional SVD for Big Data April 23, 2014 2 / 24

Page 3: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

One-Way Problem

To estimate the functional principle component of the data, we can useSVD method. In order to incorporate the functional nature of the data,regularization is imposed on the estimates.

The first regularized principle component for functional data can beestimated by minimizing

ρ(Y − uvT ) + λ∥u∥2∫

v′′2,

where:

Y : data matrix, Yij = Yi(tj).

u: first left vector.

v: first right vector, vj = v(tj).

Pan Chao Functional SVD for Big Data April 23, 2014 3 / 24

Page 4: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Remarks:

1) Smoothness is controlled by the tuning parameter λ.

2) Adding ∥u∥2 to make the problem scale-invariant [Huang et al. 2008].

3) ρ(·) is the loss function which measure the fidelity of the rank-1approximation. If ρ(·) = ∥·∥2, then least square.

4) Due to the theory of smoothing spline [Green & Silverman], theintegral has a matrix expression∫

v′′2 = vTΩvv

where

Ωv = QR−1QT

Q and R are two banded matrices depending on the discretization ofthe data.

Pan Chao Functional SVD for Big Data April 23, 2014 4 / 24

Page 5: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Smoothing Spline View

If Y has only one row, denoted by y, then requiring u = 1 results in astandard smoothing spline problem:

ρ(x− v) + λvTΩvv.

Pan Chao Functional SVD for Big Data April 23, 2014 5 / 24

Page 6: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

PCA View

If ρ(·) = ∥·∥2F , the minimizer of v of the one-way problem without penaltyis

v = argmaxv

vTY TY v

∥v∥2,

which maximizing the variance of the projected data on v. So v is anestimated PC direction.

With one-way penalty imposed, [Huang et al. 2008]

v = argmaxv

vTY TY v

∥v∥2 + λvTΩv

= argmaxv

vTY TY v

vT (I + λΩ)v

Pan Chao Functional SVD for Big Data April 23, 2014 6 / 24

Page 7: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Regression View [Zhang et al. 2013]

Let Y to be a column stack of the data matrix Y , and

U =

u 0 · · · 00 u · · · 0...

.... . . 0

0 0 · · · u

If the loss function ρ(·) = ∥·∥2 and u is given, the estimate of v for theone-way problem without penalty is given by a linear regression:

v =(UTU

)−1 UTY

If the one-way penalty is imposed, then we have Ridge-regression typeproblem. The estimate of v is

v =(UTU + λ∥u∥2Ω

)−1UTY

Pan Chao Functional SVD for Big Data April 23, 2014 7 / 24

Page 8: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Robust Version [Zhang et al. 2013]

Sometimes there may be outliners in the observed data, then we can use amore robust loss function ρ(·) to replace the usual quadratic loss. Forexample, Huber loss

ρ(x) =

x2

2 , if |x| ≤ θ

θ(|x| − θ

2

), o.w.

−3 −2 −1 0 1 2 3

0.0

1.0

2.0

Pan Chao Functional SVD for Big Data April 23, 2014 8 / 24

Page 9: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Then the regression is analogous to a weighted least square problem

v =(UTWU + λ∥u∥2Ω

)−1UTWY ,

where W is constructed from the weight matrix

Wij =ρ′(yij − uivj)

yij − uivj.

Pan Chao Functional SVD for Big Data April 23, 2014 9 / 24

Page 10: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Picking λ

The smoothing parameter λ can be chosen by leave-one-columnCross-Validation. [Huang et al . 2008] [Zhang et al. 2013]

No robust: Let Aλ = (I + λΩ)−1, then the CV and GCV scores are:

CV (λ) =1

m

m∑j=1

((I −Aλ)Y

Tuj1− Aλjj

)2

GCV (λ) =1m

∑mj=1∥(I −Aλ)Y

Tu∥2

(1−Tr (Aλ) /m)2

Remark: If CV is defined to be leave-out-one-column, no computationalsimplification.

Pan Chao Functional SVD for Big Data April 23, 2014 10 / 24

Page 11: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Robust: Let Aλ = U(UTWU + λ∥u∥2Ω

)−1UTW, then

GCV (λ) =1m∥v − v∗∥2

(1−Tr (Aλ) /m)2,

where

v =(UTWU + λ∥u∥2Ω

)−1UTWY

v∗ =(UTWU

)−1 UTWY

Pan Chao Functional SVD for Big Data April 23, 2014 11 / 24

Page 12: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Two-Way Problem

If both directions of data are considered as functions, then a two-wayversion is to minimize

ρ(Y − uvT ) + λv∥u∥2∫

v′′2 + λu∥v∥2∫

u′′2 + λvλu∥u∥∥v∥∫

v′′2∫

u′′2,

where we added the penalty for the second direction and an interaction isintroduced.

ρ(·) can be chosen to be a robust loss.

Pan Chao Functional SVD for Big Data April 23, 2014 12 / 24

Page 13: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Estimation [Zhang et al. 2013]

The estimates of v and u can be updated iteratively (IRLS) as:

v =(UTWU + 2Ωv|u

)−1 UTWYΩv|u = uT (I + λuΩu)u (I + λvΩv)− uTuI

u =(VTW∗V + 2Ωu|v

)−1 VTW∗Y∗

Ωu|v = vT (I + λvΩv)v (I + λuΩu)− vTvI

The hat matrices are:

H = U(UTWU + 2Ωv|u

)−1 UTW

H∗ = V(VTW∗V + 2Ωu|v

)−1 VTW∗

Pan Chao Functional SVD for Big Data April 23, 2014 13 / 24

Page 14: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

GCV

GCV (λv|λu) =1

n

(∥v − v∗∥

1−Tr (H) /n

)2

v∗ =(UTWU

)−1 UTWY

GCV (λu|λv) =1

m

(∥u− u∗∥

1−Tr (H∗) /m

)2

u∗ =(VTW∗V

)−1 VTW∗Y∗

Remark: Leave-out-one-column/row CV criteria.

Pan Chao Functional SVD for Big Data April 23, 2014 14 / 24

Page 15: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Big Data and Parallelism for One-Way Problem

The number of rows is large while the number of columns is moderate.Equally-spaced common grids are assumed. The smoothing parameter isforced to be the same for all subsets.

...1 Split data

...2 Estimation for one block

...3 Recombine

...4 Simulation result

Pan Chao Functional SVD for Big Data April 23, 2014 15 / 24

Page 16: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Split Data

Y =

y11 y12 · · · y1my21 y22 · · · y2m...

yn11 yn12 · · · yn1my(n1+1)1 y(n1+1)2 · · · y(n1+1)m

...y(n1+n2)1 y(n1+n2)2 · · · y(n1+n2)m

...

y(n−nK)1 y(n−nK)2 · · · y(n−nK)m

...yn1 yn2 · · · ynm

n×m

Remarks:...1 Each subset is a block of the data matrix....2 The size of the subests are the same except the last one. (The totalnumber of rows may not be divisible by an integer)

Pan Chao Functional SVD for Big Data April 23, 2014 16 / 24

Page 17: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

When penalty is imposed in one direction (say v), a SVD direction can beestimated for each subset and they can be combined to recover the resultwhen the whole matrix is used.

For kth subset,

vk = (UTk WkUk + 2Ωvk|uk

)−1UTk WkYk.

Ωvk|uk= ∥uk∥2λvΩv

uk = (VTk W∗

kVk + 2Ωuk|vk)−1VT

k W∗kY∗

k .

Ωuk|vk= λvv

Tk Ωvvk

Pan Chao Functional SVD for Big Data April 23, 2014 17 / 24

Page 18: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Recombining

v(c) =

[(K∑k=1

UTk WkUk

)+ 2

K∑k=1

Ωvk|uk

]−1 K∑k=1

[(UTk WkUk + 2Ωvk|uk

)vk

]

u(c) = (u1, u2, · · · , uK)

Pan Chao Functional SVD for Big Data April 23, 2014 18 / 24

Page 19: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Algorithm

initialize u = u(0) using SVD;

split Y and u(0);

iter = 0;tol = 99999;

while localdiff>tol and iter<maxiter doinitialize parallelism;

estimate v(i)k using u

(i)k ;

stop parallelism;

recombine v(i)k ’s to get v(i+1);

initialize parallelism;

estimate u(i+1)k ;

stop parallelism;

recombine u(i+1)k ’s to get u(i+1);

Y = u(i+1)v(i+1)T ;

localdiff = distance(Y − Y );end

Pan Chao Functional SVD for Big Data April 23, 2014 19 / 24

Page 20: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Simulation

7 replicates, 1000× 200 data matrix, number of cores 1− 10, smoothingparameters seq(0, 2, by=0.1).

2 4 6 8 10

4050

6070

8090

Average Elapsed Time vs CPUS

CPUs

Avg

. Ela

psed

Tim

e

Pan Chao Functional SVD for Big Data April 23, 2014 20 / 24

Page 21: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

0 200 400 600 800 1000 1200 1400

−35

−30

−25

−20

−15

−10

Logarithm of Max Pointwise Discrepency (Fixed Data Set and Fixed Smoothing Parameter)

Index

max

poi

ntw

ise

disc

repe

ncy

Pan Chao Functional SVD for Big Data April 23, 2014 21 / 24

Page 22: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Summary

1. One-way functional SVD and its interpretation.

2. Estimation and smoothing parameter selection for a one-way problem.

3. Big data problem, i.e. too many curves, and parallelism.

4. Simulation results, efficiency and accuracy.

Pan Chao Functional SVD for Big Data April 23, 2014 22 / 24

Page 23: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

Future Work

1. Adding GCV for the one-way problem.

2. Parallelizing two-way problems.

Pan Chao Functional SVD for Big Data April 23, 2014 23 / 24

Page 24: Functional SVD for Big Data - Purdue Universitypanc/research/fsvd/talk/fda_presentation.pdfPan Chao Functional SVD for Big Data April 23, 2014 16 / 24 When penalty is imposed in one

..........

.....

.....................................................................

.....

......

.....

.....

.

References

Xueying Chen and Min-ge Xie.

A split-and-conquer approach for analysis of extraordinarily large data.

pages 1–35, 2012.

Jianhua Z. Huang, Haipeng Shen, and Andreas Buja.

Functional principal components analysis via penalized rank one approximation.

Electronic Journal of Statistics, 2(March):678–695, 2008.

Jianhua Z. Huang, Haipeng Shen, and Andreas Buja.

The Analysis of Two-Way Functional Data Using Two-Way Regularized SingularValue Decompositions.

Journal of the American Statistical Association, 104(488):1609–1620, December2009.

L Zhang, H Shen, and JZ Huang.

Robust regularized singular value decomposition with application to mortality data.

The Annals of Applied Statistics, 2013.

Pan Chao Functional SVD for Big Data April 23, 2014 24 / 24