nonlinear dimensionality reduction approach (isomap, lle)

25
Computer Vision Lab. SNU Young Ki Baik Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Upload: moira

Post on 11-Feb-2016

96 views

Category:

Documents


4 download

DESCRIPTION

Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE). Young Ki Baik. Computer Vision Lab. SNU. References. ISOMAP A global geometric framework for nonlinear dimensionality reduction J.B.Tenenbaum, V.De Silva, J.C.Langford (science 2000) LLE - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Computer Vision Lab. SNUYoung Ki Baik

Nonlinear Dimensionality Reduction Approach

(ISOMAP, LLE)

Page 2: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

References• ISOMAP

• A global geometric framework for nonlinear dimensionality reduction

• J.B.Tenenbaum, V.De Silva, J.C.Langford (science 2000)

• LLE• Nonlinear Dimensionality Reduction by Locally Linear

Embedding• Sam T. Roweis and Lawrence K. Saul (science 2000)

• ISOMAP and LLE

• LLE and Isomap Analysis of Spectra and Colour Images• Dejan Kulpinski (Thesis 1999)

• Out-of-Sample Extensions for LLE, Isomap, MDS, Eignemaps, and Spectral Clustering

• Yoshua Bengio et. Al. (TR2003)

Page 3: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Contents

• Introduction• PCA and MDS• ISOMAP and LLE• Conclusion

Page 4: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Dimensionality Reduction

• Problem• Complex stimuli can be represented by points

in a high-dimensional vector space.• They typically have a much more compact

description.

• The goal• The meaningful low-dimensional structures

hidden in their high-dimensional observations in order to compress the signals in size and discover compact representations of their variable.

Page 5: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Dimensionality Reduction• Simple example

• 3-D data

X

Y

Z

X

Y

Page 6: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Dimensionality Reduction

• Linear method• PCA (Principle Component Analysis)

• Preserves the variance• MDS (Multi Dimensional Scaling)

• Preserves inter-point distance

• Non-linear method• ISOMAP• LLE• …

Page 7: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Linear Dimensionality Reduction• PCA

• Find a low-dimensional embedding of the data points that best preserves their variance as measured in the high-dimensional input space.

• Eigenvectors are the principal directions, and eigen- values represent the variance of the data along each principal direction.

0x

1x

0x

1x

11e

22e

is the marginal variance along the principle direction k ke

Page 8: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Linear Dimensionality Reduction• PCA

• Projecting onto e1 captures the majority of the variance and hence it minimizes the error.

• Choosing subspace dimension M:• Large M means lower expected error in the subspace data approximation

0x

1x

11e

22e

0x

1x

11e

22e

Reduction

Page 9: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Linear Dimensionality Reduction

• MDS• Find an embedding that preserves the inter-

point distances, equivalent to PCA when the distances are Euclidean.

0x

1x

0x

1x

PCA MDS

Page 10: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

• MDS• distances

• Relation

Linear Dimensionality Reduction

ijd

221 ijdA

matrix centering theis H , HAHB

)()( xxxxb jT

iij T

T XX(HX)(HX)Bthen

654321

3

2

1

T

T

T

xxx

X

08328083280

21A

808000808

B

)( 1NIH

)()( 2ji

Tjiij xxxxd

Page 11: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

• MDS• Providing dimension reduction.• Relating tools

Linear Dimensionality Reduction

PCA MDS

Method 1

Method 2

Method …

Dimension Reduction

Page 12: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Nonlinear Dimensionality Reduction

• Many data sets contain essential nonlinear structures that invisible to PCA and MDS.

• Resort to some nonlinear dimensionality reduction approaches.

Page 13: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

ISOMAP

• Example of non-linear structure(swiss roll)• Only the geodesic distances reflect the true low-

dimensional geometry of the manifold.

• ISOMAP (Isometric feature Mapping)• Preserves the intrinsic geometry of the data.• Uses the geodesic manifold distances between all pairs.

Page 14: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

ISOMAP (algorithm description)• Step 1

• Determining neighboring points within a fixed radius based on the input space distance

• These neighborhood relation are represented as a weighted graph G over the data points.

• Step 2• Estimating the geodesic distances between all pairs of

points on the manifold by computing their shortest path distances in the graph G.

• Step 3• Constructing an embedding of the data in d-dimensional

Euclidean space Y that best preserves the manifold’s geometry.

jid ,X

jidG ,

Page 15: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

ISOMAP (algorithm description)

jid ,X

ε

K=4

i j

k

jid ,X

kid ,X

• Step 1• Determining neighboring points within a fixed radius based on

the input space distance # ε-radius # K-nearest neighbors

• These neighborhood relations are represented as a weighted graph G over the data points.

Page 16: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

ISOMAP (algorithm description)• Step 2

• Estimating the geodesic distances between all pairs of points on the manifold by computing their shortest path distances in the graph G.

• Can be done using Floyd’s algorithm or Dijkstra’s algorithm

jidG ,

)},(),( ),,(min{),(N1,2,...,k

othewise ),(ji, gneighborin ),(),(

jkdkidjidjidfor

jidjidjid

XXXG

G

XG

i

jk

jkdX , kidX ,

Page 17: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

ISOMAP (algorithm description)• Step 3

• Constructing an embedding of the data in d-dimensional Euclidean space Y that best preserves the manifold’s geometry.

• Minimize the cost function

)()()(

),(),(

),(

12.121

NN

GG

jiY

IDID

andjidjiD

yyjiDwhere

2)()(LYG DDE

Solution: take top d eigenvectors of the

matrix )( GD

Page 18: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Manifold Recovery Guarantee of ISOMAP

• Isomap is guaranteed asymptotically to recover the true dimensionality and geometric structure of nonlinear manifolds.

• As the sample data points increases, the graph distances provide increasingly better approximations to the intrinsic geodesic distances.

Page 19: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Experimental Results (ISOMAP) # Face # Hand writing : face pose and illumination : bottom loop and top arch

MDS : open triangles

Isomap : filled circles

Page 20: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

LLE• LLE (Locally Linear Embedding)

• Neighborhood preserving embeddings.• Mapping to global coordinate system of low dimensionality.• Recovering global nonlinear structure from locally linear fits.• Each data point and it’s neighbors is expected to lie on or close

to a locally linear patch.• Each data point is constructed by it’s neighbors:

• Where Wij summarize the contribution of j-th data point to the i-th data reconstruction and is what we will estimated by optimizing the error.

• Reconstructed from only its neighbors.

ijij

jjiji

XXW

XWX

ofneighbor anot is if 0

ˆ

Page 21: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

LLE (algorithm description)• We want to minimize the error function

• With the constraints :

• Solution (using lagrange multipliers):

2

)( i j

jiji XWXW

jij

iXjXij

W

W

1

0 ofneighbor anot is if

jkjk

jkkjk

kkjkj

CXC

XCW

11

1

)(1

)(

Page 22: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

LLE (algorithm description)

• Choose d-dimensional coordinates, Y, to minimize:

• Under :

• Solution : compute bottom d+1 eigenvectors of M. (discard the last one)

2

)( i j

jiji YWYY

IYYYi

T

ii

N1 ,0

Quadratic form:

where:

ij

ij )(M)( jiYYY

)()( WIWIM T

Page 23: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

LLE (algorithm summary)• Step 1

• Compute the neighbors of each data point, Xi

• Step 2• Compute the weight Wij that best

reconstruct each data point Xi from its neighbors, minimizing the cost in eq(1) by constrainted linear fits.

• Step 3• Compute the vectors Yi best

reconstructed by the weights Wij, minimizing the quadratic form in eq(2) by its bottom nonzero eigenvectors.

jidG ,2

)( i j

jiji XWXW

1

22

)( i j

jiji YWYY

Page 24: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Experimental Results (LLE)• Lips

# PCA # LLE

Page 25: Nonlinear Dimensionality Reduction Approach (ISOMAP, LLE)

Conclusion• ISOMAP

• Use the geodesic manifold distances between all pairs.

• LLE• Recovers global nonlinear structure from locally linear fits.

• ISOMAP vs LLE• Preserving the neighborhoods and their geometric relation.• LLE requires massive input data sets and it must have same

weight dimension.• Merit of Isomap is fast processing time with dijkstra’s

algorithm.• Isomap is more practical than LLE.