persistent homology tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · persistent...

139
Persistent Homology Tutorial Xiaojin Zhu Department of Computer Sciences University of Wisconsin-Madison [email protected] 2013 (Zhu, University of Wisconsin-Madison) Persistent homology 1 / 37

Upload: others

Post on 18-Jan-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent Homology Tutorial

Xiaojin Zhu

Department of Computer SciencesUniversity of Wisconsin-Madison

[email protected]

(Zhu, University of Wisconsin-Madison) Persistent homology 1 / 37

Page 2: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

A rapidly growing branch of topology

mathematically defines “holes” in data:

I 0th order holes: clustersI 1st order holes: holesI 2nd order holes: voidsI higher order holes, too

Betti numbers: the number of kth order holes

(Zhu, University of Wisconsin-Madison) Persistent homology 2 / 37

Page 3: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

A rapidly growing branch of topology

mathematically defines “holes” in data:

I 0th order holes: clustersI 1st order holes: holesI 2nd order holes: voidsI higher order holes, too

Betti numbers: the number of kth order holes

(Zhu, University of Wisconsin-Madison) Persistent homology 2 / 37

Page 4: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

A rapidly growing branch of topology

mathematically defines “holes” in data:I 0th order holes: clusters

I 1st order holes: holesI 2nd order holes: voidsI higher order holes, too

Betti numbers: the number of kth order holes

(Zhu, University of Wisconsin-Madison) Persistent homology 2 / 37

Page 5: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

A rapidly growing branch of topology

mathematically defines “holes” in data:I 0th order holes: clustersI 1st order holes: holes

I 2nd order holes: voidsI higher order holes, too

Betti numbers: the number of kth order holes

(Zhu, University of Wisconsin-Madison) Persistent homology 2 / 37

Page 6: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

A rapidly growing branch of topology

mathematically defines “holes” in data:I 0th order holes: clustersI 1st order holes: holesI 2nd order holes: voids

I higher order holes, too

Betti numbers: the number of kth order holes

(Zhu, University of Wisconsin-Madison) Persistent homology 2 / 37

Page 7: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

A rapidly growing branch of topology

mathematically defines “holes” in data:I 0th order holes: clustersI 1st order holes: holesI 2nd order holes: voidsI higher order holes, too

Betti numbers: the number of kth order holes

(Zhu, University of Wisconsin-Madison) Persistent homology 2 / 37

Page 8: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

A rapidly growing branch of topology

mathematically defines “holes” in data:I 0th order holes: clustersI 1st order holes: holesI 2nd order holes: voidsI higher order holes, too

Betti numbers: the number of kth order holes

(Zhu, University of Wisconsin-Madison) Persistent homology 2 / 37

Page 9: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Betti number examples

[Reproduced from Singh et al. J. Vision 2008]

(Zhu, University of Wisconsin-Madison) Persistent homology 3 / 37

Page 10: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Plan of this talk

Persistent homology tutorial

An application in natural language processing

(Zhu, University of Wisconsin-Madison) Persistent homology 4 / 37

Page 11: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Plan of this talk

Persistent homology tutorial

An application in natural language processing

(Zhu, University of Wisconsin-Madison) Persistent homology 4 / 37

Page 12: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Holes and equivalent rubber bands

blue ∼ green, not red

two equivalent classes ⇔ one hole.

(Zhu, University of Wisconsin-Madison) Persistent homology 5 / 37

Page 13: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Holes and equivalent rubber bands

blue ∼ green, not red

two equivalent classes ⇔ one hole.

(Zhu, University of Wisconsin-Madison) Persistent homology 5 / 37

Page 14: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Group Theory

Definition

A group 〈G, ∗〉 is a set G with a binary operation ∗ such that

1 (associative) a ∗ (b ∗ c) = (a ∗ b) ∗ c for all a, b, c ∈ G.

2 (identity) ∃e ∈ G so that e ∗ a = a ∗ e = a for all a ∈ G.

3 (inverse) ∀a ∈ G, ∃a′ ∈ G where a ∗ a′ = a′ ∗ a = e.

Examples: 〈Z,+〉, 〈R,+〉, 〈R+,×〉, 〈R\{0},×〉.Z2

+2 0 1

0 0 11 1 0

All our groups G are abelian: ∀a, b ∈ G, a ∗ b = b ∗ a.

(Zhu, University of Wisconsin-Madison) Persistent homology 6 / 37

Page 15: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Group Theory

Definition

A group 〈G, ∗〉 is a set G with a binary operation ∗ such that

1 (associative) a ∗ (b ∗ c) = (a ∗ b) ∗ c for all a, b, c ∈ G.

2 (identity) ∃e ∈ G so that e ∗ a = a ∗ e = a for all a ∈ G.

3 (inverse) ∀a ∈ G, ∃a′ ∈ G where a ∗ a′ = a′ ∗ a = e.

Examples: 〈Z,+〉, 〈R,+〉, 〈R+,×〉, 〈R\{0},×〉.Z2

+2 0 1

0 0 11 1 0

All our groups G are abelian: ∀a, b ∈ G, a ∗ b = b ∗ a.

(Zhu, University of Wisconsin-Madison) Persistent homology 6 / 37

Page 16: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Group Theory

Definition

A group 〈G, ∗〉 is a set G with a binary operation ∗ such that

1 (associative) a ∗ (b ∗ c) = (a ∗ b) ∗ c for all a, b, c ∈ G.

2 (identity) ∃e ∈ G so that e ∗ a = a ∗ e = a for all a ∈ G.

3 (inverse) ∀a ∈ G, ∃a′ ∈ G where a ∗ a′ = a′ ∗ a = e.

Examples: 〈Z,+〉, 〈R,+〉, 〈R+,×〉, 〈R\{0},×〉.Z2

+2 0 1

0 0 11 1 0

All our groups G are abelian: ∀a, b ∈ G, a ∗ b = b ∗ a.

(Zhu, University of Wisconsin-Madison) Persistent homology 6 / 37

Page 17: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Group Theory

Definition

A group 〈G, ∗〉 is a set G with a binary operation ∗ such that

1 (associative) a ∗ (b ∗ c) = (a ∗ b) ∗ c for all a, b, c ∈ G.

2 (identity) ∃e ∈ G so that e ∗ a = a ∗ e = a for all a ∈ G.

3 (inverse) ∀a ∈ G, ∃a′ ∈ G where a ∗ a′ = a′ ∗ a = e.

Examples: 〈Z,+〉, 〈R,+〉, 〈R+,×〉, 〈R\{0},×〉.Z2

+2 0 1

0 0 11 1 0

All our groups G are abelian: ∀a, b ∈ G, a ∗ b = b ∗ a.

(Zhu, University of Wisconsin-Madison) Persistent homology 6 / 37

Page 18: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Group Theory

Definition

A group 〈G, ∗〉 is a set G with a binary operation ∗ such that

1 (associative) a ∗ (b ∗ c) = (a ∗ b) ∗ c for all a, b, c ∈ G.

2 (identity) ∃e ∈ G so that e ∗ a = a ∗ e = a for all a ∈ G.

3 (inverse) ∀a ∈ G, ∃a′ ∈ G where a ∗ a′ = a′ ∗ a = e.

Examples: 〈Z,+〉, 〈R,+〉, 〈R+,×〉, 〈R\{0},×〉.

Z2

+2 0 1

0 0 11 1 0

All our groups G are abelian: ∀a, b ∈ G, a ∗ b = b ∗ a.

(Zhu, University of Wisconsin-Madison) Persistent homology 6 / 37

Page 19: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Group Theory

Definition

A group 〈G, ∗〉 is a set G with a binary operation ∗ such that

1 (associative) a ∗ (b ∗ c) = (a ∗ b) ∗ c for all a, b, c ∈ G.

2 (identity) ∃e ∈ G so that e ∗ a = a ∗ e = a for all a ∈ G.

3 (inverse) ∀a ∈ G, ∃a′ ∈ G where a ∗ a′ = a′ ∗ a = e.

Examples: 〈Z,+〉, 〈R,+〉, 〈R+,×〉, 〈R\{0},×〉.Z2

+2 0 1

0 0 11 1 0

All our groups G are abelian: ∀a, b ∈ G, a ∗ b = b ∗ a.

(Zhu, University of Wisconsin-Madison) Persistent homology 6 / 37

Page 20: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Group Theory

Definition

A group 〈G, ∗〉 is a set G with a binary operation ∗ such that

1 (associative) a ∗ (b ∗ c) = (a ∗ b) ∗ c for all a, b, c ∈ G.

2 (identity) ∃e ∈ G so that e ∗ a = a ∗ e = a for all a ∈ G.

3 (inverse) ∀a ∈ G, ∃a′ ∈ G where a ∗ a′ = a′ ∗ a = e.

Examples: 〈Z,+〉, 〈R,+〉, 〈R+,×〉, 〈R\{0},×〉.Z2

+2 0 1

0 0 11 1 0

All our groups G are abelian: ∀a, b ∈ G, a ∗ b = b ∗ a.

(Zhu, University of Wisconsin-Madison) Persistent homology 6 / 37

Page 21: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Subgroup

Definition

A subset H ⊆ G of a group 〈G, ∗〉 is a subgroup of G if 〈H, ∗〉 is itself agroup.

{e} is the trivial subgroup of any group G

〈R+,×〉 is a subgroup of 〈R\{0},×〉not 〈R−,×〉

(Zhu, University of Wisconsin-Madison) Persistent homology 7 / 37

Page 22: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Subgroup

Definition

A subset H ⊆ G of a group 〈G, ∗〉 is a subgroup of G if 〈H, ∗〉 is itself agroup.

{e} is the trivial subgroup of any group G

〈R+,×〉 is a subgroup of 〈R\{0},×〉not 〈R−,×〉

(Zhu, University of Wisconsin-Madison) Persistent homology 7 / 37

Page 23: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Subgroup

Definition

A subset H ⊆ G of a group 〈G, ∗〉 is a subgroup of G if 〈H, ∗〉 is itself agroup.

{e} is the trivial subgroup of any group G

〈R+,×〉 is a subgroup of 〈R\{0},×〉

not 〈R−,×〉

(Zhu, University of Wisconsin-Madison) Persistent homology 7 / 37

Page 24: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Subgroup

Definition

A subset H ⊆ G of a group 〈G, ∗〉 is a subgroup of G if 〈H, ∗〉 is itself agroup.

{e} is the trivial subgroup of any group G

〈R+,×〉 is a subgroup of 〈R\{0},×〉not 〈R−,×〉

(Zhu, University of Wisconsin-Madison) Persistent homology 7 / 37

Page 25: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Coset

Definition

Given a subgroup H of an abelian group G, for any a ∈ G, the seta ∗H = {a ∗ h | h ∈ H} is the coset of H represented by a.

H = R+, G = R\{0}3.14× R+ is a coset which is the same as R+

−1× R+ = R− is another coset (not a subgroup)

cosets have equal sizes and partition G.

(Zhu, University of Wisconsin-Madison) Persistent homology 8 / 37

Page 26: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Coset

Definition

Given a subgroup H of an abelian group G, for any a ∈ G, the seta ∗H = {a ∗ h | h ∈ H} is the coset of H represented by a.

H = R+, G = R\{0}

3.14× R+ is a coset which is the same as R+

−1× R+ = R− is another coset (not a subgroup)

cosets have equal sizes and partition G.

(Zhu, University of Wisconsin-Madison) Persistent homology 8 / 37

Page 27: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Coset

Definition

Given a subgroup H of an abelian group G, for any a ∈ G, the seta ∗H = {a ∗ h | h ∈ H} is the coset of H represented by a.

H = R+, G = R\{0}3.14× R+ is a coset which is the same as R+

−1× R+ = R− is another coset (not a subgroup)

cosets have equal sizes and partition G.

(Zhu, University of Wisconsin-Madison) Persistent homology 8 / 37

Page 28: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Coset

Definition

Given a subgroup H of an abelian group G, for any a ∈ G, the seta ∗H = {a ∗ h | h ∈ H} is the coset of H represented by a.

H = R+, G = R\{0}3.14× R+ is a coset which is the same as R+

−1× R+ = R− is another coset (not a subgroup)

cosets have equal sizes and partition G.

(Zhu, University of Wisconsin-Madison) Persistent homology 8 / 37

Page 29: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Coset

Definition

Given a subgroup H of an abelian group G, for any a ∈ G, the seta ∗H = {a ∗ h | h ∈ H} is the coset of H represented by a.

H = R+, G = R\{0}3.14× R+ is a coset which is the same as R+

−1× R+ = R− is another coset (not a subgroup)

cosets have equal sizes and partition G.

(Zhu, University of Wisconsin-Madison) Persistent homology 8 / 37

Page 30: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homomorphism

Definition

A map φ : G 7→ G′ is a homomorphism if φ(a ∗ b) = φ(a) ? φ(b) for∀a, b ∈ G.

〈R+,×〉 to 〈Z2,+2〉: trivial homomorphism φ(a) = 0,∀a ∈ R+

negation in natural language: GN

∗ t not

t t notnot not t

homomorphism (isomorphism) from GN to Z2: φ(t) = 0, φ(not) = 1.

(Zhu, University of Wisconsin-Madison) Persistent homology 9 / 37

Page 31: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homomorphism

Definition

A map φ : G 7→ G′ is a homomorphism if φ(a ∗ b) = φ(a) ? φ(b) for∀a, b ∈ G.

〈R+,×〉 to 〈Z2,+2〉: trivial homomorphism φ(a) = 0, ∀a ∈ R+

negation in natural language: GN

∗ t not

t t notnot not t

homomorphism (isomorphism) from GN to Z2: φ(t) = 0, φ(not) = 1.

(Zhu, University of Wisconsin-Madison) Persistent homology 9 / 37

Page 32: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homomorphism

Definition

A map φ : G 7→ G′ is a homomorphism if φ(a ∗ b) = φ(a) ? φ(b) for∀a, b ∈ G.

〈R+,×〉 to 〈Z2,+2〉: trivial homomorphism φ(a) = 0, ∀a ∈ R+

negation in natural language: GN

∗ t not

t t notnot not t

homomorphism (isomorphism) from GN to Z2: φ(t) = 0, φ(not) = 1.

(Zhu, University of Wisconsin-Madison) Persistent homology 9 / 37

Page 33: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Kernel

Definition

The kernel of a homomorphism φ : G 7→ G′ iskerφ = {a ∈ G | φ(a) = e′}.

In the φ : GN 7→ Z2 example, kerφ = {t}.Another example: φ : 〈R\{0},×〉 7→ GN by φ(a) = t if a > 0 and“not” if a < 0. kerφ = R+

For any homomorphism φ : G 7→ G′, kerφ is a subgroup of G.

Cosets a ∗ kerφ partition GG

G’

e’kerφ

φ

(Zhu, University of Wisconsin-Madison) Persistent homology 10 / 37

Page 34: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Kernel

Definition

The kernel of a homomorphism φ : G 7→ G′ iskerφ = {a ∈ G | φ(a) = e′}.

In the φ : GN 7→ Z2 example, kerφ = {t}.

Another example: φ : 〈R\{0},×〉 7→ GN by φ(a) = t if a > 0 and“not” if a < 0. kerφ = R+

For any homomorphism φ : G 7→ G′, kerφ is a subgroup of G.

Cosets a ∗ kerφ partition GG

G’

e’kerφ

φ

(Zhu, University of Wisconsin-Madison) Persistent homology 10 / 37

Page 35: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Kernel

Definition

The kernel of a homomorphism φ : G 7→ G′ iskerφ = {a ∈ G | φ(a) = e′}.

In the φ : GN 7→ Z2 example, kerφ = {t}.Another example: φ : 〈R\{0},×〉 7→ GN by φ(a) = t if a > 0 and“not” if a < 0. kerφ = R+

For any homomorphism φ : G 7→ G′, kerφ is a subgroup of G.

Cosets a ∗ kerφ partition GG

G’

e’kerφ

φ

(Zhu, University of Wisconsin-Madison) Persistent homology 10 / 37

Page 36: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Kernel

Definition

The kernel of a homomorphism φ : G 7→ G′ iskerφ = {a ∈ G | φ(a) = e′}.

In the φ : GN 7→ Z2 example, kerφ = {t}.Another example: φ : 〈R\{0},×〉 7→ GN by φ(a) = t if a > 0 and“not” if a < 0. kerφ = R+

For any homomorphism φ : G 7→ G′, kerφ is a subgroup of G.

Cosets a ∗ kerφ partition GG

G’

e’kerφ

φ

(Zhu, University of Wisconsin-Madison) Persistent homology 10 / 37

Page 37: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Kernel

Definition

The kernel of a homomorphism φ : G 7→ G′ iskerφ = {a ∈ G | φ(a) = e′}.

In the φ : GN 7→ Z2 example, kerφ = {t}.Another example: φ : 〈R\{0},×〉 7→ GN by φ(a) = t if a > 0 and“not” if a < 0. kerφ = R+

For any homomorphism φ : G 7→ G′, kerφ is a subgroup of G.

Cosets a ∗ kerφ partition GG

G’

e’kerφ

φ

(Zhu, University of Wisconsin-Madison) Persistent homology 10 / 37

Page 38: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Quotient group

Let 〈H, ∗〉 be a subgroup of an abelian group 〈G, ∗〉.

A new operation on the cosets of H:(a ∗H) ? (b ∗H) = (a ∗ b) ∗H,∀a, b ∈ G.

Definition

The cosets {a ∗H | a ∈ G} under the operation ? form a group, called thequotient group G/H.

Example: G = R\{0} and kerφ = R+, two cosets: R+ and R−.

The quotient group (R\{0})/R+ has the two coset elements.

R−?R− = (−1×R+)?(−1×R+) = (−1×−1)×R+ = 1×R+ = R+.

This quotient group (R\{0})/R+ is isomorphic to Z2.

(Zhu, University of Wisconsin-Madison) Persistent homology 11 / 37

Page 39: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Quotient group

Let 〈H, ∗〉 be a subgroup of an abelian group 〈G, ∗〉.A new operation on the cosets of H:(a ∗H) ? (b ∗H) = (a ∗ b) ∗H,∀a, b ∈ G.

Definition

The cosets {a ∗H | a ∈ G} under the operation ? form a group, called thequotient group G/H.

Example: G = R\{0} and kerφ = R+, two cosets: R+ and R−.

The quotient group (R\{0})/R+ has the two coset elements.

R−?R− = (−1×R+)?(−1×R+) = (−1×−1)×R+ = 1×R+ = R+.

This quotient group (R\{0})/R+ is isomorphic to Z2.

(Zhu, University of Wisconsin-Madison) Persistent homology 11 / 37

Page 40: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Quotient group

Let 〈H, ∗〉 be a subgroup of an abelian group 〈G, ∗〉.A new operation on the cosets of H:(a ∗H) ? (b ∗H) = (a ∗ b) ∗H,∀a, b ∈ G.

Definition

The cosets {a ∗H | a ∈ G} under the operation ? form a group, called thequotient group G/H.

Example: G = R\{0} and kerφ = R+, two cosets: R+ and R−.

The quotient group (R\{0})/R+ has the two coset elements.

R−?R− = (−1×R+)?(−1×R+) = (−1×−1)×R+ = 1×R+ = R+.

This quotient group (R\{0})/R+ is isomorphic to Z2.

(Zhu, University of Wisconsin-Madison) Persistent homology 11 / 37

Page 41: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Quotient group

Let 〈H, ∗〉 be a subgroup of an abelian group 〈G, ∗〉.A new operation on the cosets of H:(a ∗H) ? (b ∗H) = (a ∗ b) ∗H,∀a, b ∈ G.

Definition

The cosets {a ∗H | a ∈ G} under the operation ? form a group, called thequotient group G/H.

Example: G = R\{0} and kerφ = R+, two cosets: R+ and R−.

The quotient group (R\{0})/R+ has the two coset elements.

R−?R− = (−1×R+)?(−1×R+) = (−1×−1)×R+ = 1×R+ = R+.

This quotient group (R\{0})/R+ is isomorphic to Z2.

(Zhu, University of Wisconsin-Madison) Persistent homology 11 / 37

Page 42: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Quotient group

Let 〈H, ∗〉 be a subgroup of an abelian group 〈G, ∗〉.A new operation on the cosets of H:(a ∗H) ? (b ∗H) = (a ∗ b) ∗H,∀a, b ∈ G.

Definition

The cosets {a ∗H | a ∈ G} under the operation ? form a group, called thequotient group G/H.

Example: G = R\{0} and kerφ = R+, two cosets: R+ and R−.

The quotient group (R\{0})/R+ has the two coset elements.

R−?R− = (−1×R+)?(−1×R+) = (−1×−1)×R+ = 1×R+ = R+.

This quotient group (R\{0})/R+ is isomorphic to Z2.

(Zhu, University of Wisconsin-Madison) Persistent homology 11 / 37

Page 43: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Quotient group

Let 〈H, ∗〉 be a subgroup of an abelian group 〈G, ∗〉.A new operation on the cosets of H:(a ∗H) ? (b ∗H) = (a ∗ b) ∗H,∀a, b ∈ G.

Definition

The cosets {a ∗H | a ∈ G} under the operation ? form a group, called thequotient group G/H.

Example: G = R\{0} and kerφ = R+, two cosets: R+ and R−.

The quotient group (R\{0})/R+ has the two coset elements.

R−?R− = (−1×R+)?(−1×R+) = (−1×−1)×R+ = 1×R+ = R+.

This quotient group (R\{0})/R+ is isomorphic to Z2.

(Zhu, University of Wisconsin-Madison) Persistent homology 11 / 37

Page 44: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Quotient group

Let 〈H, ∗〉 be a subgroup of an abelian group 〈G, ∗〉.A new operation on the cosets of H:(a ∗H) ? (b ∗H) = (a ∗ b) ∗H,∀a, b ∈ G.

Definition

The cosets {a ∗H | a ∈ G} under the operation ? form a group, called thequotient group G/H.

Example: G = R\{0} and kerφ = R+, two cosets: R+ and R−.

The quotient group (R\{0})/R+ has the two coset elements.

R−?R− = (−1×R+)?(−1×R+) = (−1×−1)×R+ = 1×R+ = R+.

This quotient group (R\{0})/R+ is isomorphic to Z2.

(Zhu, University of Wisconsin-Madison) Persistent homology 11 / 37

Page 45: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Rank

Definition

Let S be a subset of a group G. The subgroup generated by S, 〈S〉, is thesubgroup of all elements of G that can expressed as the finite operation ofelements in S and their inverses.

Z is itself the subgroup generated by {1}Even integers is the subgroup generated by {2}.

Definition

The rank of a group G is rank(G) = min{|S| | S ⊆ G, 〈S〉 = G}.

rank(G) is the size of the smallest subset that generates G.

rank(Z) = 1 since Z = 〈{1}〉.rank(Z× Z) = 2 since Z× Z = 〈{(0, 1), (1, 0)}〉.

(Zhu, University of Wisconsin-Madison) Persistent homology 12 / 37

Page 46: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Rank

Definition

Let S be a subset of a group G. The subgroup generated by S, 〈S〉, is thesubgroup of all elements of G that can expressed as the finite operation ofelements in S and their inverses.

Z is itself the subgroup generated by {1}

Even integers is the subgroup generated by {2}.

Definition

The rank of a group G is rank(G) = min{|S| | S ⊆ G, 〈S〉 = G}.

rank(G) is the size of the smallest subset that generates G.

rank(Z) = 1 since Z = 〈{1}〉.rank(Z× Z) = 2 since Z× Z = 〈{(0, 1), (1, 0)}〉.

(Zhu, University of Wisconsin-Madison) Persistent homology 12 / 37

Page 47: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Rank

Definition

Let S be a subset of a group G. The subgroup generated by S, 〈S〉, is thesubgroup of all elements of G that can expressed as the finite operation ofelements in S and their inverses.

Z is itself the subgroup generated by {1}Even integers is the subgroup generated by {2}.

Definition

The rank of a group G is rank(G) = min{|S| | S ⊆ G, 〈S〉 = G}.

rank(G) is the size of the smallest subset that generates G.

rank(Z) = 1 since Z = 〈{1}〉.rank(Z× Z) = 2 since Z× Z = 〈{(0, 1), (1, 0)}〉.

(Zhu, University of Wisconsin-Madison) Persistent homology 12 / 37

Page 48: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Rank

Definition

Let S be a subset of a group G. The subgroup generated by S, 〈S〉, is thesubgroup of all elements of G that can expressed as the finite operation ofelements in S and their inverses.

Z is itself the subgroup generated by {1}Even integers is the subgroup generated by {2}.

Definition

The rank of a group G is rank(G) = min{|S| | S ⊆ G, 〈S〉 = G}.

rank(G) is the size of the smallest subset that generates G.

rank(Z) = 1 since Z = 〈{1}〉.rank(Z× Z) = 2 since Z× Z = 〈{(0, 1), (1, 0)}〉.

(Zhu, University of Wisconsin-Madison) Persistent homology 12 / 37

Page 49: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Rank

Definition

Let S be a subset of a group G. The subgroup generated by S, 〈S〉, is thesubgroup of all elements of G that can expressed as the finite operation ofelements in S and their inverses.

Z is itself the subgroup generated by {1}Even integers is the subgroup generated by {2}.

Definition

The rank of a group G is rank(G) = min{|S| | S ⊆ G, 〈S〉 = G}.

rank(G) is the size of the smallest subset that generates G.

rank(Z) = 1 since Z = 〈{1}〉.rank(Z× Z) = 2 since Z× Z = 〈{(0, 1), (1, 0)}〉.

(Zhu, University of Wisconsin-Madison) Persistent homology 12 / 37

Page 50: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Rank

Definition

Let S be a subset of a group G. The subgroup generated by S, 〈S〉, is thesubgroup of all elements of G that can expressed as the finite operation ofelements in S and their inverses.

Z is itself the subgroup generated by {1}Even integers is the subgroup generated by {2}.

Definition

The rank of a group G is rank(G) = min{|S| | S ⊆ G, 〈S〉 = G}.

rank(G) is the size of the smallest subset that generates G.

rank(Z) = 1 since Z = 〈{1}〉.

rank(Z× Z) = 2 since Z× Z = 〈{(0, 1), (1, 0)}〉.

(Zhu, University of Wisconsin-Madison) Persistent homology 12 / 37

Page 51: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Rank

Definition

Let S be a subset of a group G. The subgroup generated by S, 〈S〉, is thesubgroup of all elements of G that can expressed as the finite operation ofelements in S and their inverses.

Z is itself the subgroup generated by {1}Even integers is the subgroup generated by {2}.

Definition

The rank of a group G is rank(G) = min{|S| | S ⊆ G, 〈S〉 = G}.

rank(G) is the size of the smallest subset that generates G.

rank(Z) = 1 since Z = 〈{1}〉.rank(Z× Z) = 2 since Z× Z = 〈{(0, 1), (1, 0)}〉.

(Zhu, University of Wisconsin-Madison) Persistent homology 12 / 37

Page 52: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

The group of rubber bands

To count “holes” in homology, consider the group of cycles (therubber bands)

The kernel: “uninteresting rubber bands” that do not surround holes

The quotient group “all rubber bands”/“uninteresting rubber bands”will identify holes.

Computation: need discrete rubber bands ⇒ simplicial complex

(Zhu, University of Wisconsin-Madison) Persistent homology 13 / 37

Page 53: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

The group of rubber bands

To count “holes” in homology, consider the group of cycles (therubber bands)

The kernel: “uninteresting rubber bands” that do not surround holes

The quotient group “all rubber bands”/“uninteresting rubber bands”will identify holes.

Computation: need discrete rubber bands ⇒ simplicial complex

(Zhu, University of Wisconsin-Madison) Persistent homology 13 / 37

Page 54: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

The group of rubber bands

To count “holes” in homology, consider the group of cycles (therubber bands)

The kernel: “uninteresting rubber bands” that do not surround holes

The quotient group “all rubber bands”/“uninteresting rubber bands”will identify holes.

Computation: need discrete rubber bands ⇒ simplicial complex

(Zhu, University of Wisconsin-Madison) Persistent homology 13 / 37

Page 55: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

The group of rubber bands

To count “holes” in homology, consider the group of cycles (therubber bands)

The kernel: “uninteresting rubber bands” that do not surround holes

The quotient group “all rubber bands”/“uninteresting rubber bands”will identify holes.

Computation: need discrete rubber bands ⇒ simplicial complex

(Zhu, University of Wisconsin-Madison) Persistent homology 13 / 37

Page 56: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Simplex

Definition

A p-simplex σ is the convex hull of p+ 1 affinely independent pointsx0, x1, . . . , xp ∈ Rd. We denote σ = conv{x0, . . . , xp}. The dimension ofσ is p.

p = 0, 1, 2, 3

(Zhu, University of Wisconsin-Madison) Persistent homology 14 / 37

Page 57: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Simplex

Definition

A p-simplex σ is the convex hull of p+ 1 affinely independent pointsx0, x1, . . . , xp ∈ Rd. We denote σ = conv{x0, . . . , xp}. The dimension ofσ is p.

p = 0, 1, 2, 3

(Zhu, University of Wisconsin-Madison) Persistent homology 14 / 37

Page 58: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Simplicial complex

Definition

A simplicial complex K is a finite collection of simplices such that σ ∈ Kand τ being a face of σ implies τ ∈ K, and σ, σ′ ∈ K implies σ ∩ σ′ iseither empty or a face of both σ and σ′.

Properly aligned

Simplicial complex = the yellow space in the rubber band picture

(Zhu, University of Wisconsin-Madison) Persistent homology 15 / 37

Page 59: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Simplicial complex

Definition

A simplicial complex K is a finite collection of simplices such that σ ∈ Kand τ being a face of σ implies τ ∈ K, and σ, σ′ ∈ K implies σ ∩ σ′ iseither empty or a face of both σ and σ′.

Properly aligned

Simplicial complex = the yellow space in the rubber band picture

(Zhu, University of Wisconsin-Madison) Persistent homology 15 / 37

Page 60: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Simplicial complex

Definition

A simplicial complex K is a finite collection of simplices such that σ ∈ Kand τ being a face of σ implies τ ∈ K, and σ, σ′ ∈ K implies σ ∩ σ′ iseither empty or a face of both σ and σ′.

Properly aligned

Simplicial complex = the yellow space in the rubber band picture

(Zhu, University of Wisconsin-Madison) Persistent homology 15 / 37

Page 61: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain

Definition

A p-chain is a subset of p-simplices in a simplicial complex K.

Example: K=tetrahedron.

A 2-chain is a subset of the four triangles.

24 distinct 2-chains.

26 distinct 1-chains (subsets of edges).

Left: a 2-chain, right: a 1-chain

A p-chain does not have to be connected.

(Zhu, University of Wisconsin-Madison) Persistent homology 16 / 37

Page 62: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain

Definition

A p-chain is a subset of p-simplices in a simplicial complex K.

Example: K=tetrahedron.

A 2-chain is a subset of the four triangles.

24 distinct 2-chains.

26 distinct 1-chains (subsets of edges).

Left: a 2-chain, right: a 1-chain

A p-chain does not have to be connected.

(Zhu, University of Wisconsin-Madison) Persistent homology 16 / 37

Page 63: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain

Definition

A p-chain is a subset of p-simplices in a simplicial complex K.

Example: K=tetrahedron.

A 2-chain is a subset of the four triangles.

24 distinct 2-chains.

26 distinct 1-chains (subsets of edges).

Left: a 2-chain, right: a 1-chain

A p-chain does not have to be connected.

(Zhu, University of Wisconsin-Madison) Persistent homology 16 / 37

Page 64: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain

Definition

A p-chain is a subset of p-simplices in a simplicial complex K.

Example: K=tetrahedron.

A 2-chain is a subset of the four triangles.

24 distinct 2-chains.

26 distinct 1-chains (subsets of edges).

Left: a 2-chain, right: a 1-chain

A p-chain does not have to be connected.

(Zhu, University of Wisconsin-Madison) Persistent homology 16 / 37

Page 65: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain

Definition

A p-chain is a subset of p-simplices in a simplicial complex K.

Example: K=tetrahedron.

A 2-chain is a subset of the four triangles.

24 distinct 2-chains.

26 distinct 1-chains (subsets of edges).

Left: a 2-chain, right: a 1-chain

A p-chain does not have to be connected.

(Zhu, University of Wisconsin-Madison) Persistent homology 16 / 37

Page 66: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain

Definition

A p-chain is a subset of p-simplices in a simplicial complex K.

Example: K=tetrahedron.

A 2-chain is a subset of the four triangles.

24 distinct 2-chains.

26 distinct 1-chains (subsets of edges).

Left: a 2-chain, right: a 1-chain

A p-chain does not have to be connected.

(Zhu, University of Wisconsin-Madison) Persistent homology 16 / 37

Page 67: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain

Definition

A p-chain is a subset of p-simplices in a simplicial complex K.

Example: K=tetrahedron.

A 2-chain is a subset of the four triangles.

24 distinct 2-chains.

26 distinct 1-chains (subsets of edges).

Left: a 2-chain, right: a 1-chain

A p-chain does not have to be connected.

(Zhu, University of Wisconsin-Madison) Persistent homology 16 / 37

Page 68: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain group

Definition

The set of p-chains of a simplicial complex K form a p-chain group Cp.

Mod-2 addition

+ =

(Zhu, University of Wisconsin-Madison) Persistent homology 17 / 37

Page 69: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Chain group

Definition

The set of p-chains of a simplicial complex K form a p-chain group Cp.

Mod-2 addition

+ =

(Zhu, University of Wisconsin-Madison) Persistent homology 17 / 37

Page 70: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary

Definition

The boundary of a p-simplex is the set of (p− 1)-simplices faces.

boundary of a tetrahedron = the four triangles faces

boundary of a triangle = the three edges

boundary of an edge = its two vertices

(Zhu, University of Wisconsin-Madison) Persistent homology 18 / 37

Page 71: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary

Definition

The boundary of a p-simplex is the set of (p− 1)-simplices faces.

boundary of a tetrahedron = the four triangles faces

boundary of a triangle = the three edges

boundary of an edge = its two vertices

(Zhu, University of Wisconsin-Madison) Persistent homology 18 / 37

Page 72: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary

Definition

The boundary of a p-simplex is the set of (p− 1)-simplices faces.

boundary of a tetrahedron = the four triangles faces

boundary of a triangle = the three edges

boundary of an edge = its two vertices

(Zhu, University of Wisconsin-Madison) Persistent homology 18 / 37

Page 73: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary

Definition

The boundary of a p-simplex is the set of (p− 1)-simplices faces.

boundary of a tetrahedron = the four triangles faces

boundary of a triangle = the three edges

boundary of an edge = its two vertices

(Zhu, University of Wisconsin-Madison) Persistent homology 18 / 37

Page 74: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary of a p-chain

Definition

The boundary of a p-chain is the Mod-2 sum of the boundaries of itssimplices. Taking the boundary is a group homomorphism ∂p from Cp toCp−1.

Faces shared by an even number of p-simplices in the chain willcancel out:

+ =2

(Zhu, University of Wisconsin-Madison) Persistent homology 19 / 37

Page 75: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary of a p-chain

Definition

The boundary of a p-chain is the Mod-2 sum of the boundaries of itssimplices. Taking the boundary is a group homomorphism ∂p from Cp toCp−1.

Faces shared by an even number of p-simplices in the chain willcancel out:

+ =2

(Zhu, University of Wisconsin-Madison) Persistent homology 19 / 37

Page 76: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Cycles

Definition

A p-cycle c is a p-chain with empty boundary: ∂pc = 0 (the identity inCp−1).

Discrete p-dimensional “rubber bands”

Left: a 1-cycle; Right: not a cycle

Zp= all p-cycles (all rubber bands)

∂pZp = 0: Zp is the kernel ker∂p and a subgroup of Cp.

(Zhu, University of Wisconsin-Madison) Persistent homology 20 / 37

Page 77: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Cycles

Definition

A p-cycle c is a p-chain with empty boundary: ∂pc = 0 (the identity inCp−1).

Discrete p-dimensional “rubber bands”

Left: a 1-cycle; Right: not a cycle

Zp= all p-cycles (all rubber bands)

∂pZp = 0: Zp is the kernel ker∂p and a subgroup of Cp.

(Zhu, University of Wisconsin-Madison) Persistent homology 20 / 37

Page 78: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Cycles

Definition

A p-cycle c is a p-chain with empty boundary: ∂pc = 0 (the identity inCp−1).

Discrete p-dimensional “rubber bands”

Left: a 1-cycle; Right: not a cycle

Zp= all p-cycles (all rubber bands)

∂pZp = 0: Zp is the kernel ker∂p and a subgroup of Cp.

(Zhu, University of Wisconsin-Madison) Persistent homology 20 / 37

Page 79: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Cycles

Definition

A p-cycle c is a p-chain with empty boundary: ∂pc = 0 (the identity inCp−1).

Discrete p-dimensional “rubber bands”

Left: a 1-cycle; Right: not a cycle

Zp= all p-cycles (all rubber bands)

∂pZp = 0: Zp is the kernel ker∂p and a subgroup of Cp.

(Zhu, University of Wisconsin-Madison) Persistent homology 20 / 37

Page 80: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Cycles

Definition

A p-cycle c is a p-chain with empty boundary: ∂pc = 0 (the identity inCp−1).

Discrete p-dimensional “rubber bands”

Left: a 1-cycle; Right: not a cycle

Zp= all p-cycles (all rubber bands)

∂pZp = 0: Zp is the kernel ker∂p and a subgroup of Cp.

(Zhu, University of Wisconsin-Madison) Persistent homology 20 / 37

Page 81: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary-Cycle

The boundary of any (p+ 1)-chain is always a p-cycles

c c c1 2 3

Definition

A p-boundary-cycle is a p-cycle that is also the boundary of some(p+ 1)-chain.

Let Bp = ∂p+1Cp+1, the p-boundary-cycles.

Bp are the uninteresting rubber bands (e.g., B1 = {0, c1})Bp is a subgroup of Zp (all rubber bands).

(Zhu, University of Wisconsin-Madison) Persistent homology 21 / 37

Page 82: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary-Cycle

The boundary of any (p+ 1)-chain is always a p-cycles

c c c1 2 3

Definition

A p-boundary-cycle is a p-cycle that is also the boundary of some(p+ 1)-chain.

Let Bp = ∂p+1Cp+1, the p-boundary-cycles.

Bp are the uninteresting rubber bands (e.g., B1 = {0, c1})Bp is a subgroup of Zp (all rubber bands).

(Zhu, University of Wisconsin-Madison) Persistent homology 21 / 37

Page 83: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary-Cycle

The boundary of any (p+ 1)-chain is always a p-cycles

c c c1 2 3

Definition

A p-boundary-cycle is a p-cycle that is also the boundary of some(p+ 1)-chain.

Let Bp = ∂p+1Cp+1, the p-boundary-cycles.

Bp are the uninteresting rubber bands (e.g., B1 = {0, c1})Bp is a subgroup of Zp (all rubber bands).

(Zhu, University of Wisconsin-Madison) Persistent homology 21 / 37

Page 84: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary-Cycle

The boundary of any (p+ 1)-chain is always a p-cycles

c c c1 2 3

Definition

A p-boundary-cycle is a p-cycle that is also the boundary of some(p+ 1)-chain.

Let Bp = ∂p+1Cp+1, the p-boundary-cycles.

Bp are the uninteresting rubber bands (e.g., B1 = {0, c1})

Bp is a subgroup of Zp (all rubber bands).

(Zhu, University of Wisconsin-Madison) Persistent homology 21 / 37

Page 85: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Boundary-Cycle

The boundary of any (p+ 1)-chain is always a p-cycles

c c c1 2 3

Definition

A p-boundary-cycle is a p-cycle that is also the boundary of some(p+ 1)-chain.

Let Bp = ∂p+1Cp+1, the p-boundary-cycles.

Bp are the uninteresting rubber bands (e.g., B1 = {0, c1})Bp is a subgroup of Zp (all rubber bands).

(Zhu, University of Wisconsin-Madison) Persistent homology 21 / 37

Page 86: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Interesting rubber bands

c c c1 2 3

c2 and c3 in Z1 but not in B1

We can drag rubber band c2 over the yellow triangle to make c3

Formally, c3 = c2 + c1.

c2 and c3 are equivalent in the hole they surround.

The equivalence class: c+Bp

(Zhu, University of Wisconsin-Madison) Persistent homology 22 / 37

Page 87: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Interesting rubber bands

c c c1 2 3

c2 and c3 in Z1 but not in B1

We can drag rubber band c2 over the yellow triangle to make c3

Formally, c3 = c2 + c1.

c2 and c3 are equivalent in the hole they surround.

The equivalence class: c+Bp

(Zhu, University of Wisconsin-Madison) Persistent homology 22 / 37

Page 88: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Interesting rubber bands

c c c1 2 3

c2 and c3 in Z1 but not in B1

We can drag rubber band c2 over the yellow triangle to make c3

Formally, c3 = c2 + c1.

c2 and c3 are equivalent in the hole they surround.

The equivalence class: c+Bp

(Zhu, University of Wisconsin-Madison) Persistent homology 22 / 37

Page 89: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Interesting rubber bands

c c c1 2 3

c2 and c3 in Z1 but not in B1

We can drag rubber band c2 over the yellow triangle to make c3

Formally, c3 = c2 + c1.

c2 and c3 are equivalent in the hole they surround.

The equivalence class: c+Bp

(Zhu, University of Wisconsin-Madison) Persistent homology 22 / 37

Page 90: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Interesting rubber bands

c c c1 2 3

c2 and c3 in Z1 but not in B1

We can drag rubber band c2 over the yellow triangle to make c3

Formally, c3 = c2 + c1.

c2 and c3 are equivalent in the hole they surround.

The equivalence class: c+Bp

(Zhu, University of Wisconsin-Madison) Persistent homology 22 / 37

Page 91: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homology group

Definition

The p-th homology group is the quotient group Hp = Zp/Bp.

Example:

c c c1 2 3

All the 1-cycles : Z1 = {0, c1, c2, c3}.The uninteresting 1-cycles: B1 = {0, c1}, a subgroup of Z1.

The interesting 1-cycles: c2 +B1 = c3 +B1 = {c2, c3}The homology group H1 = Z1/B1 isomorphic to Z2

(Zhu, University of Wisconsin-Madison) Persistent homology 23 / 37

Page 92: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homology group

Definition

The p-th homology group is the quotient group Hp = Zp/Bp.

Example:

c c c1 2 3

All the 1-cycles : Z1 = {0, c1, c2, c3}.The uninteresting 1-cycles: B1 = {0, c1}, a subgroup of Z1.

The interesting 1-cycles: c2 +B1 = c3 +B1 = {c2, c3}The homology group H1 = Z1/B1 isomorphic to Z2

(Zhu, University of Wisconsin-Madison) Persistent homology 23 / 37

Page 93: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homology group

Definition

The p-th homology group is the quotient group Hp = Zp/Bp.

Example:

c c c1 2 3

All the 1-cycles : Z1 = {0, c1, c2, c3}.

The uninteresting 1-cycles: B1 = {0, c1}, a subgroup of Z1.

The interesting 1-cycles: c2 +B1 = c3 +B1 = {c2, c3}The homology group H1 = Z1/B1 isomorphic to Z2

(Zhu, University of Wisconsin-Madison) Persistent homology 23 / 37

Page 94: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homology group

Definition

The p-th homology group is the quotient group Hp = Zp/Bp.

Example:

c c c1 2 3

All the 1-cycles : Z1 = {0, c1, c2, c3}.The uninteresting 1-cycles: B1 = {0, c1}, a subgroup of Z1.

The interesting 1-cycles: c2 +B1 = c3 +B1 = {c2, c3}The homology group H1 = Z1/B1 isomorphic to Z2

(Zhu, University of Wisconsin-Madison) Persistent homology 23 / 37

Page 95: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homology group

Definition

The p-th homology group is the quotient group Hp = Zp/Bp.

Example:

c c c1 2 3

All the 1-cycles : Z1 = {0, c1, c2, c3}.The uninteresting 1-cycles: B1 = {0, c1}, a subgroup of Z1.

The interesting 1-cycles: c2 +B1 = c3 +B1 = {c2, c3}

The homology group H1 = Z1/B1 isomorphic to Z2

(Zhu, University of Wisconsin-Madison) Persistent homology 23 / 37

Page 96: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Homology group

Definition

The p-th homology group is the quotient group Hp = Zp/Bp.

Example:

c c c1 2 3

All the 1-cycles : Z1 = {0, c1, c2, c3}.The uninteresting 1-cycles: B1 = {0, c1}, a subgroup of Z1.

The interesting 1-cycles: c2 +B1 = c3 +B1 = {c2, c3}The homology group H1 = Z1/B1 isomorphic to Z2

(Zhu, University of Wisconsin-Madison) Persistent homology 23 / 37

Page 97: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Betti number

Definition

The p-th Betti number is the rank of the homology group: βp = rank(Hp).

In our example, β1 = rank(Z2) = 1 (one 1st-order hole)

βp is the number of independent p-th holes.

A tetrahedron has β0 = 1 (connected), β1 = β2 = 0 (no holes orvoids)

A hollow tetrahedron has β0 = 1, β1 = 0, β2 = 1

Removing the four triangle faces, the edge skeleton has β0 = 1,β1 = 3 (one is the sum of the other three), β2 = 0 (no more void).

Removing the edges, β0 = 4 (4 vertices) and β1 = β2 = 0.

(Zhu, University of Wisconsin-Madison) Persistent homology 24 / 37

Page 98: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Betti number

Definition

The p-th Betti number is the rank of the homology group: βp = rank(Hp).

In our example, β1 = rank(Z2) = 1 (one 1st-order hole)

βp is the number of independent p-th holes.

A tetrahedron has β0 = 1 (connected), β1 = β2 = 0 (no holes orvoids)

A hollow tetrahedron has β0 = 1, β1 = 0, β2 = 1

Removing the four triangle faces, the edge skeleton has β0 = 1,β1 = 3 (one is the sum of the other three), β2 = 0 (no more void).

Removing the edges, β0 = 4 (4 vertices) and β1 = β2 = 0.

(Zhu, University of Wisconsin-Madison) Persistent homology 24 / 37

Page 99: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Betti number

Definition

The p-th Betti number is the rank of the homology group: βp = rank(Hp).

In our example, β1 = rank(Z2) = 1 (one 1st-order hole)

βp is the number of independent p-th holes.

A tetrahedron has β0 = 1 (connected), β1 = β2 = 0 (no holes orvoids)

A hollow tetrahedron has β0 = 1, β1 = 0, β2 = 1

Removing the four triangle faces, the edge skeleton has β0 = 1,β1 = 3 (one is the sum of the other three), β2 = 0 (no more void).

Removing the edges, β0 = 4 (4 vertices) and β1 = β2 = 0.

(Zhu, University of Wisconsin-Madison) Persistent homology 24 / 37

Page 100: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Betti number

Definition

The p-th Betti number is the rank of the homology group: βp = rank(Hp).

In our example, β1 = rank(Z2) = 1 (one 1st-order hole)

βp is the number of independent p-th holes.

A tetrahedron has β0 = 1 (connected), β1 = β2 = 0 (no holes orvoids)

A hollow tetrahedron has β0 = 1, β1 = 0, β2 = 1

Removing the four triangle faces, the edge skeleton has β0 = 1,β1 = 3 (one is the sum of the other three), β2 = 0 (no more void).

Removing the edges, β0 = 4 (4 vertices) and β1 = β2 = 0.

(Zhu, University of Wisconsin-Madison) Persistent homology 24 / 37

Page 101: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Betti number

Definition

The p-th Betti number is the rank of the homology group: βp = rank(Hp).

In our example, β1 = rank(Z2) = 1 (one 1st-order hole)

βp is the number of independent p-th holes.

A tetrahedron has β0 = 1 (connected), β1 = β2 = 0 (no holes orvoids)

A hollow tetrahedron has β0 = 1, β1 = 0, β2 = 1

Removing the four triangle faces, the edge skeleton has β0 = 1,β1 = 3 (one is the sum of the other three), β2 = 0 (no more void).

Removing the edges, β0 = 4 (4 vertices) and β1 = β2 = 0.

(Zhu, University of Wisconsin-Madison) Persistent homology 24 / 37

Page 102: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Betti number

Definition

The p-th Betti number is the rank of the homology group: βp = rank(Hp).

In our example, β1 = rank(Z2) = 1 (one 1st-order hole)

βp is the number of independent p-th holes.

A tetrahedron has β0 = 1 (connected), β1 = β2 = 0 (no holes orvoids)

A hollow tetrahedron has β0 = 1, β1 = 0, β2 = 1

Removing the four triangle faces, the edge skeleton has β0 = 1,β1 = 3 (one is the sum of the other three), β2 = 0 (no more void).

Removing the edges, β0 = 4 (4 vertices) and β1 = β2 = 0.

(Zhu, University of Wisconsin-Madison) Persistent homology 24 / 37

Page 103: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Betti number

Definition

The p-th Betti number is the rank of the homology group: βp = rank(Hp).

In our example, β1 = rank(Z2) = 1 (one 1st-order hole)

βp is the number of independent p-th holes.

A tetrahedron has β0 = 1 (connected), β1 = β2 = 0 (no holes orvoids)

A hollow tetrahedron has β0 = 1, β1 = 0, β2 = 1

Removing the four triangle faces, the edge skeleton has β0 = 1,β1 = 3 (one is the sum of the other three), β2 = 0 (no more void).

Removing the edges, β0 = 4 (4 vertices) and β1 = β2 = 0.

(Zhu, University of Wisconsin-Madison) Persistent homology 24 / 37

Page 104: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

From data to simplicial complex

Given data x1, . . . , xn ∈ Rd.

If any subset of p+ 1 points are within diameter ε, we add ap-simplex generated by those points.

Definition

A Vietoris-Rips complex of diameter ε is the simplicial complexV R(ε) = {σ | diam(σ) ≤ ε}.

Example

VR( 5)VR(1) VR(2)

(Zhu, University of Wisconsin-Madison) Persistent homology 25 / 37

Page 105: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

From data to simplicial complex

Given data x1, . . . , xn ∈ Rd.

If any subset of p+ 1 points are within diameter ε, we add ap-simplex generated by those points.

Definition

A Vietoris-Rips complex of diameter ε is the simplicial complexV R(ε) = {σ | diam(σ) ≤ ε}.

Example

VR( 5)VR(1) VR(2)

(Zhu, University of Wisconsin-Madison) Persistent homology 25 / 37

Page 106: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

From data to simplicial complex

Given data x1, . . . , xn ∈ Rd.

If any subset of p+ 1 points are within diameter ε, we add ap-simplex generated by those points.

Definition

A Vietoris-Rips complex of diameter ε is the simplicial complexV R(ε) = {σ | diam(σ) ≤ ε}.

Example

VR( 5)VR(1) VR(2)

(Zhu, University of Wisconsin-Madison) Persistent homology 25 / 37

Page 107: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

From data to simplicial complex

Given data x1, . . . , xn ∈ Rd.

If any subset of p+ 1 points are within diameter ε, we add ap-simplex generated by those points.

Definition

A Vietoris-Rips complex of diameter ε is the simplicial complexV R(ε) = {σ | diam(σ) ≤ ε}.

Example

VR( 5)VR(1) VR(2)

(Zhu, University of Wisconsin-Madison) Persistent homology 25 / 37

Page 108: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Filtration

Which ε should we pick?

Don’t pick – look at all ε’s

Definition

An increasing sequence of ε produces a filtration, i.e., a sequence ofincreasing simplicial complexes V R(ε1) ⊆ V R(ε2) ⊆ . . ., with the propertythat a simplex enters the sequence no earlier than all its faces.

(Zhu, University of Wisconsin-Madison) Persistent homology 26 / 37

Page 109: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Filtration

Which ε should we pick?

Don’t pick – look at all ε’s

Definition

An increasing sequence of ε produces a filtration, i.e., a sequence ofincreasing simplicial complexes V R(ε1) ⊆ V R(ε2) ⊆ . . ., with the propertythat a simplex enters the sequence no earlier than all its faces.

(Zhu, University of Wisconsin-Madison) Persistent homology 26 / 37

Page 110: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Filtration

Which ε should we pick?

Don’t pick – look at all ε’s

Definition

An increasing sequence of ε produces a filtration, i.e., a sequence ofincreasing simplicial complexes V R(ε1) ⊆ V R(ε2) ⊆ . . ., with the propertythat a simplex enters the sequence no earlier than all its faces.

(Zhu, University of Wisconsin-Madison) Persistent homology 26 / 37

Page 111: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

In a filtration, at what value of ε does a hole appear, and how longdoes it persist till it is filled in?

Barcode

VR( 5)VR(1) VR(2)

0 0.5 1 1.5 2 2.5

barcode (dimension 0)

0 0.5 1 1.5 2 2.5

barcode (dimension 1)

(Zhu, University of Wisconsin-Madison) Persistent homology 27 / 37

Page 112: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Persistent homology

In a filtration, at what value of ε does a hole appear, and how longdoes it persist till it is filled in?

Barcode

VR( 5)VR(1) VR(2)

0 0.5 1 1.5 2 2.5

barcode (dimension 0)

0 0.5 1 1.5 2 2.5

barcode (dimension 1)

(Zhu, University of Wisconsin-Madison) Persistent homology 27 / 37

Page 113: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Applications to natural language processing

Good articles “tie back.”

Introduction

conclusion

next paragraph

...

How can we capture such loopy structure in text documents?

(Zhu, University of Wisconsin-Madison) Persistent homology 28 / 37

Page 114: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Applications to natural language processing

Some documents “straight,” others “twist and turn”

Divide a document into small units x1, . . . , xn (e.g., sentences,paragraphs).

Given distance function D(xi, xj) ≥ 0 (e.g., Euclidean, cosine)

We will focus on the 0-th (clusters) and 1st (holes) order homologyclasses.

(Zhu, University of Wisconsin-Madison) Persistent homology 29 / 37

Page 115: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Applications to natural language processing

Some documents “straight,” others “twist and turn”

Divide a document into small units x1, . . . , xn (e.g., sentences,paragraphs).

Given distance function D(xi, xj) ≥ 0 (e.g., Euclidean, cosine)

We will focus on the 0-th (clusters) and 1st (holes) order homologyclasses.

(Zhu, University of Wisconsin-Madison) Persistent homology 29 / 37

Page 116: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Applications to natural language processing

Some documents “straight,” others “twist and turn”

Divide a document into small units x1, . . . , xn (e.g., sentences,paragraphs).

Given distance function D(xi, xj) ≥ 0 (e.g., Euclidean, cosine)

We will focus on the 0-th (clusters) and 1st (holes) order homologyclasses.

(Zhu, University of Wisconsin-Madison) Persistent homology 29 / 37

Page 117: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Applications to natural language processing

Some documents “straight,” others “twist and turn”

Divide a document into small units x1, . . . , xn (e.g., sentences,paragraphs).

Given distance function D(xi, xj) ≥ 0 (e.g., Euclidean, cosine)

We will focus on the 0-th (clusters) and 1st (holes) order homologyclasses.

(Zhu, University of Wisconsin-Madison) Persistent homology 29 / 37

Page 118: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Example: Itsy bitsy spider

The Itsy Bitsy Spider climbed up the water spoutDown came the rain and washed the spider outOut came the sun and dried up all the rain

And the Itsy Bitsy Spider climbed up the spout again

bag-of-words

vertices

(Zhu, University of Wisconsin-Madison) Persistent homology 30 / 37

Page 119: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Example: Itsy bitsy spider

The Itsy Bitsy Spider climbed up the water spoutDown came the rain and washed the spider outOut came the sun and dried up all the rain

And the Itsy Bitsy Spider climbed up the spout again

bag-of-words

vertices

(Zhu, University of Wisconsin-Madison) Persistent homology 30 / 37

Page 120: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Similarity Filtration (SIF)

Dmax = maxD(xi, xj), ∀i, j = 1 . . . nFOR m = 0, 1, . . .M

Add V R(mMDmax

)to the filtration

ENDCompute persistent homology on the filtration

larger diameter, looser tie-backs

order of x1 . . . xn ignored

(Zhu, University of Wisconsin-Madison) Persistent homology 31 / 37

Page 121: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Similarity Filtration (SIF)

Dmax = maxD(xi, xj), ∀i, j = 1 . . . nFOR m = 0, 1, . . .M

Add V R(mMDmax

)to the filtration

ENDCompute persistent homology on the filtration

larger diameter, looser tie-backs

order of x1 . . . xn ignored

(Zhu, University of Wisconsin-Madison) Persistent homology 31 / 37

Page 122: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Similarity Filtration with Time Skeleton (SIFTS)

D(xi, xi+1) = 0 for i = 1, . . . , n− 1Dmax = maxD(xi, xj), ∀i, j = 1 . . . nFOR m = 0, 1, . . .M

Add V R(mMDmax

)to the filtration

ENDCompute persistent homology on the filtration

time edges allow tie-back in time

(Zhu, University of Wisconsin-Madison) Persistent homology 32 / 37

Page 123: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

SIF vs. SIFTS on Itsy bitsy spider

0 1 2

SIF (dimension 0)

0 1 2

SIF (dimension 1)0 1 2

SIFTS (dimension 0)

0 1 2

SIFTS (dimension 1)

(Zhu, University of Wisconsin-Madison) Persistent homology 33 / 37

Page 124: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Nursery Rhymes and Other Stories

0 2 4

SIF (dimension 0)

0 2 4

SIF (dimension 1)0 2 4

SIFTS (dimension 0)

0 2 4

SIFTS (dimension 1)0 2 4

SIF (dimension 0)

0 2 4

SIF (dimension 1)0 2 4

SIFTS (dimension 0)

0 2 4

SIFTS (dimension 1)

Row Row Row Your Boat London Bridge

0 0.5 1 1.5

SIF (dimension 0)

0 0.5 1 1.5

SIF (dimension 1)0 0.5 1 1.5

SIFTS (dimension 0)

0 0.5 1 1.5

SIFTS (dimension 1)0 0.5 1 1.5

SIF (dimension 0)

0 0.5 1 1.5

SIF (dimension 1)0 0.5 1 1.5

SIFTS (dimension 0)

0 0.5 1 1.5

SIFTS (dimension 1)

Little Red-Cap Alice in Wonderland

London Bridge:“My fair Lady” repeats 12 times.

Little Red-Cap: “The better to see you with, my dear” and “Thebetter to eat you with!”

(Zhu, University of Wisconsin-Madison) Persistent homology 34 / 37

Page 125: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Nursery Rhymes and Other Stories

0 2 4

SIF (dimension 0)

0 2 4

SIF (dimension 1)0 2 4

SIFTS (dimension 0)

0 2 4

SIFTS (dimension 1)0 2 4

SIF (dimension 0)

0 2 4

SIF (dimension 1)0 2 4

SIFTS (dimension 0)

0 2 4

SIFTS (dimension 1)

Row Row Row Your Boat London Bridge

0 0.5 1 1.5

SIF (dimension 0)

0 0.5 1 1.5

SIF (dimension 1)0 0.5 1 1.5

SIFTS (dimension 0)

0 0.5 1 1.5

SIFTS (dimension 1)0 0.5 1 1.5

SIF (dimension 0)

0 0.5 1 1.5

SIF (dimension 1)0 0.5 1 1.5

SIFTS (dimension 0)

0 0.5 1 1.5

SIFTS (dimension 1)

Little Red-Cap Alice in Wonderland

London Bridge:“My fair Lady” repeats 12 times.

Little Red-Cap: “The better to see you with, my dear” and “Thebetter to eat you with!”

(Zhu, University of Wisconsin-Madison) Persistent homology 34 / 37

Page 126: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Child and Adolescent Writing

Older writers have more complex barcodes?

LUCY corpus: children (ages 9–12, 150 essays), undergraduates (48essays)

average article length: child=11.6 sentences, adolescent=25.8

SIFTS barcode summary statistics:

I holes?: what percentage of articles have H1 holesI |H1|: number of holes in the articleI ε∗: the smallest ε when the first hole in H1 forms.

child adolescent adol. trunc.holes? 87% 100%∗ 98%∗

|H1| 3.0 (±0.2) 17.6 (±0.9)∗ 3.9 (±0.2)∗

ε∗ 1.35 (±.02) 1.27 (±.02)∗ 1.38 (±.01)∗: statistically significantly different from “child”

(Zhu, University of Wisconsin-Madison) Persistent homology 35 / 37

Page 127: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Child and Adolescent Writing

Older writers have more complex barcodes?

LUCY corpus: children (ages 9–12, 150 essays), undergraduates (48essays)

average article length: child=11.6 sentences, adolescent=25.8

SIFTS barcode summary statistics:

I holes?: what percentage of articles have H1 holesI |H1|: number of holes in the articleI ε∗: the smallest ε when the first hole in H1 forms.

child adolescent adol. trunc.holes? 87% 100%∗ 98%∗

|H1| 3.0 (±0.2) 17.6 (±0.9)∗ 3.9 (±0.2)∗

ε∗ 1.35 (±.02) 1.27 (±.02)∗ 1.38 (±.01)∗: statistically significantly different from “child”

(Zhu, University of Wisconsin-Madison) Persistent homology 35 / 37

Page 128: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Child and Adolescent Writing

Older writers have more complex barcodes?

LUCY corpus: children (ages 9–12, 150 essays), undergraduates (48essays)

average article length: child=11.6 sentences, adolescent=25.8

SIFTS barcode summary statistics:

I holes?: what percentage of articles have H1 holesI |H1|: number of holes in the articleI ε∗: the smallest ε when the first hole in H1 forms.

child adolescent adol. trunc.holes? 87% 100%∗ 98%∗

|H1| 3.0 (±0.2) 17.6 (±0.9)∗ 3.9 (±0.2)∗

ε∗ 1.35 (±.02) 1.27 (±.02)∗ 1.38 (±.01)∗: statistically significantly different from “child”

(Zhu, University of Wisconsin-Madison) Persistent homology 35 / 37

Page 129: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Child and Adolescent Writing

Older writers have more complex barcodes?

LUCY corpus: children (ages 9–12, 150 essays), undergraduates (48essays)

average article length: child=11.6 sentences, adolescent=25.8

SIFTS barcode summary statistics:

I holes?: what percentage of articles have H1 holesI |H1|: number of holes in the articleI ε∗: the smallest ε when the first hole in H1 forms.

child adolescent adol. trunc.holes? 87% 100%∗ 98%∗

|H1| 3.0 (±0.2) 17.6 (±0.9)∗ 3.9 (±0.2)∗

ε∗ 1.35 (±.02) 1.27 (±.02)∗ 1.38 (±.01)∗: statistically significantly different from “child”

(Zhu, University of Wisconsin-Madison) Persistent homology 35 / 37

Page 130: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Child and Adolescent Writing

Older writers have more complex barcodes?

LUCY corpus: children (ages 9–12, 150 essays), undergraduates (48essays)

average article length: child=11.6 sentences, adolescent=25.8

SIFTS barcode summary statistics:I holes?: what percentage of articles have H1 holes

I |H1|: number of holes in the articleI ε∗: the smallest ε when the first hole in H1 forms.

child adolescent adol. trunc.holes? 87% 100%∗ 98%∗

|H1| 3.0 (±0.2) 17.6 (±0.9)∗ 3.9 (±0.2)∗

ε∗ 1.35 (±.02) 1.27 (±.02)∗ 1.38 (±.01)∗: statistically significantly different from “child”

(Zhu, University of Wisconsin-Madison) Persistent homology 35 / 37

Page 131: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Child and Adolescent Writing

Older writers have more complex barcodes?

LUCY corpus: children (ages 9–12, 150 essays), undergraduates (48essays)

average article length: child=11.6 sentences, adolescent=25.8

SIFTS barcode summary statistics:I holes?: what percentage of articles have H1 holesI |H1|: number of holes in the article

I ε∗: the smallest ε when the first hole in H1 forms.child adolescent adol. trunc.

holes? 87% 100%∗ 98%∗

|H1| 3.0 (±0.2) 17.6 (±0.9)∗ 3.9 (±0.2)∗

ε∗ 1.35 (±.02) 1.27 (±.02)∗ 1.38 (±.01)∗: statistically significantly different from “child”

(Zhu, University of Wisconsin-Madison) Persistent homology 35 / 37

Page 132: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

On Child and Adolescent Writing

Older writers have more complex barcodes?

LUCY corpus: children (ages 9–12, 150 essays), undergraduates (48essays)

average article length: child=11.6 sentences, adolescent=25.8

SIFTS barcode summary statistics:I holes?: what percentage of articles have H1 holesI |H1|: number of holes in the articleI ε∗: the smallest ε when the first hole in H1 forms.

child adolescent adol. trunc.holes? 87% 100%∗ 98%∗

|H1| 3.0 (±0.2) 17.6 (±0.9)∗ 3.9 (±0.2)∗

ε∗ 1.35 (±.02) 1.27 (±.02)∗ 1.38 (±.01)∗: statistically significantly different from “child”

(Zhu, University of Wisconsin-Madison) Persistent homology 35 / 37

Page 133: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Is Homology Merely Counting Repeats?

On x1 x2 x3 where x1, x2, x3 SIFTS will find two holes:x1 x2, x2 x3

k such repeats of x will generate k − 1 holes. The Betti numberβ1 = k − 1?

No.x x

xx

1 2

34

yz

x1

x13ε

I Left: k − 1 = 3, SIFTS correctly finds β1 = 1I Right: k − 1 = 12, merging x 0 holes, SIFTS correctly finds β1 = 2

(Zhu, University of Wisconsin-Madison) Persistent homology 36 / 37

Page 134: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Is Homology Merely Counting Repeats?

On x1 x2 x3 where x1, x2, x3 SIFTS will find two holes:x1 x2, x2 x3

k such repeats of x will generate k − 1 holes. The Betti numberβ1 = k − 1?

No.x x

xx

1 2

34

yz

x1

x13ε

I Left: k − 1 = 3, SIFTS correctly finds β1 = 1I Right: k − 1 = 12, merging x 0 holes, SIFTS correctly finds β1 = 2

(Zhu, University of Wisconsin-Madison) Persistent homology 36 / 37

Page 135: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Is Homology Merely Counting Repeats?

On x1 x2 x3 where x1, x2, x3 SIFTS will find two holes:x1 x2, x2 x3

k such repeats of x will generate k − 1 holes. The Betti numberβ1 = k − 1?

No.x x

xx

1 2

34

yz

x1

x13ε

I Left: k − 1 = 3, SIFTS correctly finds β1 = 1I Right: k − 1 = 12, merging x 0 holes, SIFTS correctly finds β1 = 2

(Zhu, University of Wisconsin-Madison) Persistent homology 36 / 37

Page 136: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Is Homology Merely Counting Repeats?

On x1 x2 x3 where x1, x2, x3 SIFTS will find two holes:x1 x2, x2 x3

k such repeats of x will generate k − 1 holes. The Betti numberβ1 = k − 1?

No.x x

xx

1 2

34

yz

x1

x13ε

I Left: k − 1 = 3, SIFTS correctly finds β1 = 1

I Right: k − 1 = 12, merging x 0 holes, SIFTS correctly finds β1 = 2

(Zhu, University of Wisconsin-Madison) Persistent homology 36 / 37

Page 137: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Is Homology Merely Counting Repeats?

On x1 x2 x3 where x1, x2, x3 SIFTS will find two holes:x1 x2, x2 x3

k such repeats of x will generate k − 1 holes. The Betti numberβ1 = k − 1?

No.x x

xx

1 2

34

yz

x1

x13ε

I Left: k − 1 = 3, SIFTS correctly finds β1 = 1I Right: k − 1 = 12, merging x 0 holes, SIFTS correctly finds β1 = 2

(Zhu, University of Wisconsin-Madison) Persistent homology 36 / 37

Page 138: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Summary

Persistent homology may offer new representations for machinelearning

How to best use it?

To read more, see the references inXiaojin Zhu. Persistent homology: An introduction and a new textrepresentation for natural language processing. IJCAI, 2013.

(Zhu, University of Wisconsin-Madison) Persistent homology 37 / 37

Page 139: Persistent Homology Tutorialpages.cs.wisc.edu/~jerryzhu/ssl/pub/cvrghomology.pdf · Persistent homology A rapidly growing branch of topology mathematically de nes \holes" in data:

Summary

Persistent homology may offer new representations for machinelearning

How to best use it?

To read more, see the references inXiaojin Zhu. Persistent homology: An introduction and a new textrepresentation for natural language processing. IJCAI, 2013.

(Zhu, University of Wisconsin-Madison) Persistent homology 37 / 37