the persistent homology of distance functions under random projection

69
The Persistent Homology of Distance Functions under Random Projection Don Sheehy University of Connecticut

Upload: don-sheehy

Post on 18-Jul-2015

61 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: The Persistent Homology of Distance Functions under Random Projection

The Persistent Homology of Distance Functions

under Random Projection

Don Sheehy University of Connecticut

Page 2: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Page 3: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Finite Point Set

Page 4: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Finite Point Set Union of Balls

Page 5: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Finite Point Set Union of Balls

Topologically uninteresting Potentially Interesting

Page 6: The Persistent Homology of Distance Functions under Random Projection

Unions of Balls

Finite Point Set Union of Balls

Topologically uninteresting Potentially Interesting

Idea: Fill in the gaps in the ambient space. Examples: Molecules and Manifolds

Page 7: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

Page 8: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P ⇢ RdInput:

Page 9: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P

↵ =[

p2P

ball(p,↵) = {x 2 Rd | d(x, P ) ↵}

P ⇢ RdInput:

Page 10: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P

↵ =[

p2P

ball(p,↵) = {x 2 Rd | d(x, P ) ↵}

Persistent Homology was invented to track changesin the homology of P↵ as ↵ ranges from 0 to 1.

P ⇢ RdInput:

Page 11: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P

↵ =[

p2P

ball(p,↵) = {x 2 Rd | d(x, P ) ↵}

Persistent Homology was invented to track changesin the homology of P↵ as ↵ ranges from 0 to 1.

Pers({P↵})

P ⇢ RdInput:

Page 12: The Persistent Homology of Distance Functions under Random Projection

Unions of balls are sublevels of the distance.

P

↵ =[

p2P

ball(p,↵) = {x 2 Rd | d(x, P ) ↵}

Persistent Homology was invented to track changesin the homology of P↵ as ↵ ranges from 0 to 1.

Pers({P↵})

P ⇢ RdInput:

Page 13: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

Page 14: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

Page 15: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Page 16: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Page 17: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Rips Complex: RP (↵) = {� ✓ P | diam(�) ↵}

ˇ

Cech Filtration: {CP (↵)}↵�0

Page 18: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Rips Complex: RP (↵) = {� ✓ P | diam(�) ↵}

ˇ

Cech Filtration: {CP (↵)}↵�0

Rips Filtration: {RP (↵)}↵�0

Page 19: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Rips Complex: RP (↵) = {� ✓ P | diam(�) ↵}

ˇ

Cech Filtration: {CP (↵)}↵�0

Rips Filtration: {RP (↵)}↵�0

CP (↵) ✓ RP (↵) ✓ CP (p2↵)

Page 20: The Persistent Homology of Distance Functions under Random Projection

Filtered Simplicial Complexes

For � ✓ P ,

rad(�) = radius of the min. encl. ball of �.diam(�) = max

p,q2Pkp� qk2.

ˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}

Rips Complex: RP (↵) = {� ✓ P | diam(�) ↵}

ˇ

Cech Filtration: {CP (↵)}↵�0

Rips Filtration: {RP (↵)}↵�0

CP (↵) ✓ RP (↵) ✓ CP (p2↵)

Pers({RP (↵)}) is ap2-approximation to Pers({CP (↵)}).

Page 21: The Persistent Homology of Distance Functions under Random Projection

Representing sublevels of distances

Page 22: The Persistent Homology of Distance Functions under Random Projection

Representing sublevels of distances

ˇ

Cech Complex: size O(nd+1).

↵-complex (a.k.a. Delaunay Filtration): size O(ndd/2e).

Quality Meshes: size 2

(d2)n.(Sparse

ˇ

Cech Complex: 2

(d2)n).*

Page 23: The Persistent Homology of Distance Functions under Random Projection

Representing sublevels of distances

ˇ

Cech Complex: size O(nd+1).

↵-complex (a.k.a. Delaunay Filtration): size O(ndd/2e).

Quality Meshes: size 2

(d2)n.(Sparse

ˇ

Cech Complex: 2

(d2)n).*

Page 24: The Persistent Homology of Distance Functions under Random Projection

Representing sublevels of distances

ˇ

Cech Complex: size O(nd+1).

↵-complex (a.k.a. Delaunay Filtration): size O(ndd/2e).

Quality Meshes: size 2

(d2)n.(Sparse

ˇ

Cech Complex: 2

(d2)n).*

Key Point: Ambient Dimension Matters!

Page 25: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Page 26: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

Page 27: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

Let f : RD ! Rdbe a linear map where d = O(log n/"2) such that:

Page 28: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

(1� ")ka� bk2 kf(a)� f(b)k2 (1 + ")ka� bk2Squared distances preserved up to multiplicative factor.1

Let f : RD ! Rdbe a linear map where d = O(log n/"2) such that:

Page 29: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

(1� ")ka� bk2 kf(a)� f(b)k2 (1 + ")ka� bk2Squared distances preserved up to multiplicative factor.1

|(b� a)>(c� a)� (f(b)� f(a))>(f(b)� f(a))| "kb� akkc� ak.Inner products preserved up to additive factor.2

Let f : RD ! Rdbe a linear map where d = O(log n/"2) such that:

Page 30: The Persistent Homology of Distance Functions under Random Projection

Johnson Lindenstrauss Projection

Idea: Project to lower dimensions. Preserve pairwise distances.

a

b

c f(c)

f(b)

f(a)

(1� ")ka� bk2 kf(a)� f(b)k2 (1 + ")ka� bk2Squared distances preserved up to multiplicative factor.1

|(b� a)>(c� a)� (f(b)� f(a))>(f(b)� f(a))| "kb� akkc� ak.Inner products preserved up to additive factor.2

Let f : RD ! Rdbe a linear map where d = O(log n/"2) such that:

Page 31: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Page 32: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.

Page 33: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.Distance function itself is not preserved.

Page 34: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.Distance function itself is not preserved.Pairwise distances in sublevels are not preserved.

Page 35: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.Distance function itself is not preserved.Pairwise distances in sublevels are not preserved.Is topology preserved? Maybe yes, maybe no.

Page 36: The Persistent Homology of Distance Functions under Random Projection

Can we use JL for P.H. of distances?

Yes, for Rips filtrations, but not a tight approximation.Distance function itself is not preserved.Pairwise distances in sublevels are not preserved.Is topology preserved? Maybe yes, maybe no.Is persistent homology preserved? YES.

Page 37: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximation

Page 38: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Page 39: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Page 40: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

Page 41: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

Page 42: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

For S ✓ P , (1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Page 43: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

For S ✓ P , (1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

For all ↵ � 0, CP (p1� 4") ✓ Cf(P )(↵) ✓ CP (

p1� 4")

Page 44: The Persistent Homology of Distance Functions under Random Projection

Cech Filtration, MEBs, and Approximationˇ

Cech Complex: CP (↵) = {� ✓ P | rad(�) 2↵}ˇ

Cech Filtration: {CP (↵)}↵�0

Let P ⇢ RDand let f be any map from RD

to Rd.

Idea: If f “preserves M.E.B. radii”, then it preserves

the persistent homology of the distance function.

For S ✓ P , (1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

For all ↵ � 0, CP (p1� 4") ✓ Cf(P )(↵) ✓ CP (

p1� 4")

So, Pers(d(·, f(P ))) is a (1 +O("))-approximation

to Pers(d(·, P )).

Page 45: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

Page 46: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

Let S = {p1, . . . , pr} and let x 2 conv(S).

Page 47: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

x =rX

i=1

�ipi, whererX

i=1

�i = 1.

Let S = {p1, . . . , pr} and let x 2 conv(S).

Page 48: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

x =rX

i=1

�ipi, whererX

i=1

�i = 1.

kx� pk2 =

�����

rX

i=1

�i(pi � p)

�����

2

=rX

i=1

rX

j=1

�i�j(pi � p)>(pj � p).For any p 2 S,

Let S = {p1, . . . , pr} and let x 2 conv(S).

Page 49: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

��kp� xk2 � kf(p)� f(x)k2�� =

rX

i=1

rX

j=1

�i�j

��(pi � p)>(pj � p)� (f(pi)� f(p))>(f(pj)� f(p))��

rX

i=1

rX

j=1

�i�j"kpi � pkkpj � pk

rX

i=1

rX

j=1

�i�j4" rad(S)2

= 4" rad(S)2.

x =rX

i=1

�ipi, whererX

i=1

�i = 1.

kx� pk2 =

�����

rX

i=1

�i(pi � p)

�����

2

=rX

i=1

rX

j=1

�i�j(pi � p)>(pj � p).For any p 2 S,

Let S = {p1, . . . , pr} and let x 2 conv(S).

Page 50: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projection

Page 51: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Page 52: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Let x = center(S).

Page 53: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound:

Let x = center(S).

Page 54: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Let x = center(S).

Page 55: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Lower Bound:

Let x = center(S).

Page 56: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Lower Bound:

Let x = center(S).

Let q 2 S be such that kq � xk = rad(S) andkf(q)� center(f(S))k � kf(q)� f(x)k.

Page 57: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Lower Bound:

Let x = center(S).

Let q 2 S be such that kq � xk = rad(S) andkf(q)� center(f(S))k � kf(q)� f(x)k.

Page 58: The Persistent Homology of Distance Functions under Random Projection

MEBs under JL projectionTheorem: Let P be a set of points in RD

and let f : RD ! Rdbe an "-JL

projection for P . For every subset S of P ,

(1� 4")rad(S)2 rad(f(S))2 (1 + 4")rad(S)2.

Upper Bound: rad(f(S))

2 max

p2P(kx� pk2 + 4" rad(S)

2)

max

p2P((1 + 4")rad(S)

2)

= (1 + 4")rad(S)

2.

Lower Bound:

Let x = center(S).

Let q 2 S be such that kq � xk = rad(S) andkf(q)� center(f(S))k � kf(q)� f(x)k.

rad(f(S))2 � kf(q)� center(f(S))k2

� kf(q)� f(x)k2

� kq � xk2 � 4" rad(S)2

= (1� 4")rad(S)2.

Page 59: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

Page 60: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

Page 61: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

d

kP (x) = distance from x to k points of P .

Page 62: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

d

kP (x) = distance from x to k points of P .

Corollary: If f is an "-JL projection then for all k,Pers(d

kf(P )) is a 1 +O(") approximation to Pers(d

kP ).

Page 63: The Persistent Homology of Distance Functions under Random Projection

Extension to k-NN distances.

d

kP (x) = distance from x to k points of P .

Corollary: If f is an "-JL projection then for all k,Pers(d

kf(P )) is a 1 +O(") approximation to Pers(d

kP ).

Bonus: Also works for weighted points.

Page 64: The Persistent Homology of Distance Functions under Random Projection

Going forward…

Page 65: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.

Page 66: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.• Eliminate constant factor (4)

Page 67: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.• Eliminate constant factor (4)• Eliminate linearity condition.

Page 68: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.• Eliminate constant factor (4)• Eliminate linearity condition.• Extend to distances to measures.

Page 69: The Persistent Homology of Distance Functions under Random Projection

Going forward…

• Eliminate inner product condition.• Eliminate constant factor (4)• Eliminate linearity condition.• Extend to distances to measures.

Thank you.