uniform sampling for matrix approximationcmusco/personal_site/pdfs/... · 2019-04-09 · uniform...

183
Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher Musco, Richard Peng, Aaron Sidford Massachusetts Institute of Technology November 20, 2014

Upload: others

Post on 27-Jun-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Sampling for Matrix Approximation

Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher Musco,Richard Peng, Aaron Sidford

Massachusetts Institute of Technology

November 20, 2014

Page 2: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Overview

Goal

Reduce large matrix A to some smaller matrix A. Use A toapproximate solution to some problem - e.g. regression.

Main Result

Simple and efficient iterative sampling algorithms for matrixapproximation.

Alternatives to Johnson-Lindenstrauss (random projection) typeapproaches

Main technique

Understanding what information is preserved when we sample rows ofmatrix uniformly at random

Page 3: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Overview

Goal

Reduce large matrix A to some smaller matrix A. Use A toapproximate solution to some problem - e.g. regression.

Main Result

Simple and efficient iterative sampling algorithms for matrixapproximation.

Alternatives to Johnson-Lindenstrauss (random projection) typeapproaches

Main technique

Understanding what information is preserved when we sample rows ofmatrix uniformly at random

Page 4: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Overview

Goal

Reduce large matrix A to some smaller matrix A. Use A toapproximate solution to some problem - e.g. regression.

Main Result

Simple and efficient iterative sampling algorithms for matrixapproximation.

Alternatives to Johnson-Lindenstrauss (random projection) typeapproaches

Main technique

Understanding what information is preserved when we sample rows ofmatrix uniformly at random

Page 5: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Outline

1 Spectral Matrix Approximation

2 Leverage Score Sampling

3 Iterative Leverage Score Computation

4 Coherence Reducing Reweighting

Page 6: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Overview

1 Spectral Matrix Approximation

2 Leverage Score Sampling

3 Iterative Leverage Score Computation

4 Coherence Reducing Reweighting

Page 7: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Spectral Matrix Approximation

Goal

(1− ε)‖Ax‖22 ≤ ‖Ax‖22 ≤ (1 + ε)‖Ax‖22

Page 8: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Spectral Matrix Approximation

Goal

(1− ε)‖Ax‖22 ≤ ‖Ax‖22 ≤ (1 + ε)‖Ax‖22

Page 9: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Spectral Matrix Approximation

Goal

(1− ε)‖Ax‖22 ≤ ‖Ax‖22 ≤ (1 + ε)‖Ax‖22

Page 10: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Applications

Approximate linear algebra - e.g. regression

Preconditioning

Spectral Graph Sparsification

Etc...

Page 11: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Applications

Approximate linear algebra - e.g. regression

Preconditioning

Spectral Graph Sparsification

Etc...

Page 12: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Applications

Approximate linear algebra - e.g. regression

Preconditioning

Spectral Graph Sparsification

Etc...

Page 13: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Applications

Approximate linear algebra - e.g. regression

Preconditioning

Spectral Graph Sparsification

Etc...

Page 14: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Applications

Approximate linear algebra - e.g. regression

Preconditioning

Spectral Graph Sparsification

Etc...

Page 15: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Goal:

minx‖Ax− b‖22

Solve:

Ax = b =⇒ A>(Ax) = A>b

Set x to:

(A>A)−1A>b

Problem:A is very tall. Too slow to compute A>A.

Page 16: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Goal:

minx‖Ax− b‖22

Solve:

Ax = b =⇒ A>(Ax) = A>b

Set x to:

(A>A)−1A>b

Problem:A is very tall. Too slow to compute A>A.

Page 17: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Goal:

minx‖Ax− b‖22

Solve:

Ax = b =⇒ A>(Ax) = A>b

Set x to:

(A>A)−1A>b

Problem:A is very tall. Too slow to compute A>A.

Page 18: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Goal:

minx‖Ax− b‖22

Solve:

Ax = b =⇒ A>(Ax) = A>b

Set x to:

(A>A)−1A>b

Problem:A is very tall. Too slow to compute A>A.

Page 19: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Goal:

minx‖Ax− b‖22

Solve:

Ax = b =⇒ A>(Ax) = A>b

Set x to:

(A>A)−1A>b

Problem:A is very tall. Too slow to compute A>A.

Page 20: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Solution:

‖Ax− b‖22 =

∥∥∥∥[ A b] [ x−1

]∥∥∥∥22

≈ε∥∥∥∥[ A b

] [ x−1

]∥∥∥∥22

x = argminx‖Ax− b‖22

Page 21: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Solution:

‖Ax− b‖22 =

∥∥∥∥[ A b] [ x−1

]∥∥∥∥22

≈ε∥∥∥∥[ A b

] [ x−1

]∥∥∥∥22

x = argminx‖Ax− b‖22

Page 22: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Solution:

‖Ax− b‖22 =

∥∥∥∥[ A b] [ x−1

]∥∥∥∥22

≈ε∥∥∥∥[ A b

] [ x−1

]∥∥∥∥22

x = argminx‖Ax− b‖22

Page 23: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Regression

Solution:

Or, use preconditioned iterative method:

κ(

(A>A)−1(A>A))

= O(1)

.

Page 24: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Spectral Matrix Approximation

All equivalent:

Norm:‖Ax‖22 = (1± ε)‖Ax‖22

Quadratic Form:

x>(A>A)x = (1± ε)x>(A>A)x

Loewner Ordering:

(1− ε)A>A � A>A � (1 + ε)A>A

Inverse:

1

(1 + ε)(A>A)−1 � (A>A)−1 � 1

(1− ε)(A>A)−1

Page 25: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Spectral Matrix Approximation

All equivalent:

Norm:‖Ax‖22 = (1± ε)‖Ax‖22

Quadratic Form:

x>(A>A)x = (1± ε)x>(A>A)x

Loewner Ordering:

(1− ε)A>A � A>A � (1 + ε)A>A

Inverse:

1

(1 + ε)(A>A)−1 � (A>A)−1 � 1

(1− ε)(A>A)−1

Page 26: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Spectral Matrix Approximation

All equivalent:

Norm:‖Ax‖22 = (1± ε)‖Ax‖22

Quadratic Form:

x>(A>A)x = (1± ε)x>(A>A)x

Loewner Ordering:

(1− ε)A>A � A>A � (1 + ε)A>A

Inverse:

1

(1 + ε)(A>A)−1 � (A>A)−1 � 1

(1− ε)(A>A)−1

Page 27: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Spectral Matrix Approximation

All equivalent:

Norm:‖Ax‖22 = (1± ε)‖Ax‖22

Quadratic Form:

x>(A>A)x = (1± ε)x>(A>A)x

Loewner Ordering:

(1− ε)A>A � A>A � (1 + ε)A>A

Inverse:

1

(1 + ε)(A>A)−1 � (A>A)−1 � 1

(1− ε)(A>A)−1

Page 28: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Spectral Matrix Approximation

All equivalent:

Norm:‖Ax‖22 = (1± ε)‖Ax‖22

Quadratic Form:

x>(A>A)x = (1± ε)x>(A>A)x

Loewner Ordering:

(1− ε)A>A � A>A � (1 + ε)A>A

Inverse:

1

(1 + ε)(A>A)−1 � (A>A)−1 � 1

(1− ε)(A>A)−1

Page 29: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find a Spectral Approximation?

Just take a matrix square root of A>A!

U>U = A>A =⇒ ‖Ux‖22 = ‖Ax‖22

Cholesky decomposition, SVD, etc. give U ∈ Rd×d

Runs in something like O(nd2) time.

Page 30: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find a Spectral Approximation?

Just take a matrix square root of A>A!

U>U = A>A =⇒ ‖Ux‖22 = ‖Ax‖22

Cholesky decomposition, SVD, etc. give U ∈ Rd×d

Runs in something like O(nd2) time.

Page 31: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find a Spectral Approximation?

Just take a matrix square root of A>A!

U>U = A>A =⇒ ‖Ux‖22 = ‖Ax‖22

Cholesky decomposition, SVD, etc. give U ∈ Rd×d

Runs in something like O(nd2) time.

Page 32: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find a Spectral Approximation?

Just take a matrix square root of A>A!

U>U = A>A =⇒ ‖Ux‖22 = ‖Ax‖22

Cholesky decomposition, SVD, etc. give U ∈ Rd×d

Runs in something like O(nd2) time.

Page 33: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find one Faster?

Use Subspace Embeddings [Sarlos ‘06], [Nelson, Nguyen ‘13], [Clarkson,Woodruff ‘13], [Mahoney, Meng ‘13]

Left multiply by sparse ‘Johnson-Lindenstrauss matrix’.

Can apply in O(nnz(A)) time.

Reduce to O(d/ε2) dimensions.

Page 34: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find one Faster?

Use Subspace Embeddings [Sarlos ‘06], [Nelson, Nguyen ‘13], [Clarkson,Woodruff ‘13], [Mahoney, Meng ‘13]

Left multiply by sparse ‘Johnson-Lindenstrauss matrix’.

Can apply in O(nnz(A)) time.

Reduce to O(d/ε2) dimensions.

Page 35: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find one Faster?

Use Subspace Embeddings [Sarlos ‘06], [Nelson, Nguyen ‘13], [Clarkson,Woodruff ‘13], [Mahoney, Meng ‘13]

Left multiply by sparse ‘Johnson-Lindenstrauss matrix’.

Can apply in O(nnz(A)) time.

Reduce to O(d/ε2) dimensions.

Page 36: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find one Faster?

Use Subspace Embeddings [Sarlos ‘06], [Nelson, Nguyen ‘13], [Clarkson,Woodruff ‘13], [Mahoney, Meng ‘13]

Left multiply by sparse ‘Johnson-Lindenstrauss matrix’.

Can apply in O(nnz(A)) time.

Reduce to O(d/ε2) dimensions.

Page 37: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

How to Find one Faster?

‘Squishes’ together rows

Page 38: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What if we want to preserve structure/sparsity?

Use Row Sampling

Page 39: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Overview

1 Spectral Matrix Approximation

2 Leverage Score Sampling

3 Iterative Leverage Score Computation

4 Coherence Reducing Reweighting

Page 40: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Leverage Score Sampling

Sample rows with probability proportional to leverage scores.

Leverage Score

τi (A) = a>i (A>A)−1ai

Page 41: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Leverage Score Sampling

Sample rows with probability proportional to leverage scores.

Leverage Score

τi (A) = a>i (A>A)−1ai

Page 42: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Leverage Score Sampling

Sample each row independently with pi = O(τi (A) log d/ε2).∑i τi (A) = d giving reduction to O(d log d/ε2) rows.

Straight-forward analysis with matrix Chernoff bounds.

Page 43: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Leverage Score Sampling

Sample each row independently with pi = O(τi (A) log d/ε2).∑i τi (A) = d giving reduction to O(d log d/ε2) rows.

Straight-forward analysis with matrix Chernoff bounds.

Page 44: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Leverage Score Sampling

Sample each row independently with pi = O(τi (A) log d/ε2).∑i τi (A) = d giving reduction to O(d log d/ε2) rows.

Straight-forward analysis with matrix Chernoff bounds.

Page 45: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Statistics: Outlier detection

Spectral Graph Theory: Effective resistance, commute time

Matrix Approximation: Row’s importance in composing the quadraticform of A>A.

Page 46: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Statistics: Outlier detection

Spectral Graph Theory: Effective resistance, commute time

Matrix Approximation: Row’s importance in composing the quadraticform of A>A.

Page 47: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Statistics: Outlier detection

Spectral Graph Theory: Effective resistance, commute time

Matrix Approximation: Row’s importance in composing the quadraticform of A>A.

Page 48: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Statistics: Outlier detection

Spectral Graph Theory: Effective resistance, commute time

Matrix Approximation: Row’s importance in composing the quadraticform of A>A.

Page 49: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

τi (A) = a>i (A>A)−1ai

How to Understand this equation?

Row norms of orthonormal basis for A’s columns

Correct ‘scaling’ to make matrix Chernoff bound work

How easily a row can be reconstructed from other rows.

Page 50: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

τi (A) = a>i (A>A)−1ai

How to Understand this equation?

Row norms of orthonormal basis for A’s columns

Correct ‘scaling’ to make matrix Chernoff bound work

How easily a row can be reconstructed from other rows.

Page 51: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

τi (A) = a>i (A>A)−1ai

How to Understand this equation?

Row norms of orthonormal basis for A’s columns

Correct ‘scaling’ to make matrix Chernoff bound work

How easily a row can be reconstructed from other rows.

Page 52: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

τi (A) = a>i (A>A)−1ai

How to Understand this equation?

Row norms of orthonormal basis for A’s columns

Correct ‘scaling’ to make matrix Chernoff bound work

How easily a row can be reconstructed from other rows.

Page 53: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

How easily a row can be reconstructed from other rows

min ‖x‖22 = τi (A).

ai has component orthogonal to all other rows: x = ei , ‖x‖2 = 1.

There are many rows pointing ‘in the direction of’ ai : x is well spreadand has small norm.

Page 54: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

How easily a row can be reconstructed from other rows

min ‖x‖22 = τi (A).

ai has component orthogonal to all other rows: x = ei , ‖x‖2 = 1.

There are many rows pointing ‘in the direction of’ ai : x is well spreadand has small norm.

Page 55: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

How easily a row can be reconstructed from other rows

min ‖x‖22 = τi (A).

ai has component orthogonal to all other rows: x = ei , ‖x‖2 = 1.

There are many rows pointing ‘in the direction of’ ai : x is well spreadand has small norm.

Page 56: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

How easily a row can be reconstructed from other rows

min ‖x‖22 = τi (A).

ai has component orthogonal to all other rows: x = ei , ‖x‖2 = 1.

There are many rows pointing ‘in the direction of’ ai : x is well spreadand has small norm.

Page 57: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

How easily a row can be reconstructed from other rows

x = [0, 0, . . . , 14 ,14 ,

14 ,

14 , 0, . . .] works.

τi (A) ≤ ‖x‖22 = 14 .

Page 58: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

How easily a row can be reconstructed from other rows

x = [0, 0, . . . , 14 ,14 ,

14 ,

14 , 0, . . .] works.

τi (A) ≤ ‖x‖22 = 14 .

Page 59: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

How easily a row can be reconstructed from other rows

x = [0, 0, . . . , 14 ,14 ,

14 ,

14 , 0, . . .] works.

τi (A) ≤ ‖x‖22 = 14 .

Page 60: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Mathematically:

min{x|x>A=a>i }

‖x‖22

To minimize, setx> = a>i (A>A)−1A>

So:

‖x‖22 = a>i (A>A)−1ai = τi (A)

Page 61: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Mathematically:

min{x|x>A=a>i }

‖x‖22

To minimize, setx> = a>i (A>A)−1A>

So:

‖x‖22 = a>i (A>A)−1ai = τi (A)

Page 62: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Mathematically:

min{x|x>A=a>i }

‖x‖22

To minimize, setx> = a>i (A>A)−1A>

So:

‖x‖22 = a>i (A>A)−1ai = τi (A)

Page 63: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Mathematically:

min{x|x>A=a>i }

‖x‖22

To minimize, setx> = a>i (A>A)−1A>

So:

‖x‖22 = a>i (A>A)−1ai = τi (A)

Page 64: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Intepretation tells us:

τi (A) ≤ 1

Adding rows to A can only decrease leverage scores of existing rows

Removing rows can only increase leverage scores

In fact always have:∑n

i=1 τi (A) = d (Foster’s Theorem)

Page 65: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Intepretation tells us:

τi (A) ≤ 1

Adding rows to A can only decrease leverage scores of existing rows

Removing rows can only increase leverage scores

In fact always have:∑n

i=1 τi (A) = d (Foster’s Theorem)

Page 66: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Intepretation tells us:

τi (A) ≤ 1

Adding rows to A can only decrease leverage scores of existing rows

Removing rows can only increase leverage scores

In fact always have:∑n

i=1 τi (A) = d (Foster’s Theorem)

Page 67: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Intepretation tells us:

τi (A) ≤ 1

Adding rows to A can only decrease leverage scores of existing rows

Removing rows can only increase leverage scores

In fact always have:∑n

i=1 τi (A) = d (Foster’s Theorem)

Page 68: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

What is the leverage score?

Intepretation tells us:

τi (A) ≤ 1

Adding rows to A can only decrease leverage scores of existing rows

Removing rows can only increase leverage scores

In fact always have:∑n

i=1 τi (A) = d (Foster’s Theorem)

Page 69: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Great! Nicely interpretable scores that allow us to sample down A.

Problem: Computing leverage scores naively requires computing(A>A)−1.

Page 70: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Great! Nicely interpretable scores that allow us to sample down A.

Problem: Computing leverage scores naively requires computing(A>A)−1.

Page 71: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Traditional Solution

Overestimates are good enough. Just increases number of rows taken.

Given a constant factor spectral approximation A we have:

a>i (A>A)−1ai ≈c a>i (A>A)−1ai

∑τi (A) ≤

∑c · τi (A) = c

∑τi (A) = c · d .

So can still sample O(d log d/ε2) rows using τi (A).

But how to obtain a constant factor spectral approximation?

Unfortunately, to compute even a constant factor spectralapproximation still requires either leverage score estimates, a subspaceembedding or a matrix square root.

Page 72: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Traditional Solution

Overestimates are good enough. Just increases number of rows taken.

Given a constant factor spectral approximation A we have:

a>i (A>A)−1ai ≈c a>i (A>A)−1ai

∑τi (A) ≤

∑c · τi (A) = c

∑τi (A) = c · d .

So can still sample O(d log d/ε2) rows using τi (A).

But how to obtain a constant factor spectral approximation?

Unfortunately, to compute even a constant factor spectralapproximation still requires either leverage score estimates, a subspaceembedding or a matrix square root.

Page 73: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Traditional Solution

Overestimates are good enough. Just increases number of rows taken.

Given a constant factor spectral approximation A we have:

a>i (A>A)−1ai ≈c a>i (A>A)−1ai

∑τi (A) ≤

∑c · τi (A) = c

∑τi (A) = c · d .

So can still sample O(d log d/ε2) rows using τi (A).

But how to obtain a constant factor spectral approximation?

Unfortunately, to compute even a constant factor spectralapproximation still requires either leverage score estimates, a subspaceembedding or a matrix square root.

Page 74: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Traditional Solution

Overestimates are good enough. Just increases number of rows taken.

Given a constant factor spectral approximation A we have:

a>i (A>A)−1ai ≈c a>i (A>A)−1ai

∑τi (A) ≤

∑c · τi (A) = c

∑τi (A) = c · d .

So can still sample O(d log d/ε2) rows using τi (A).

But how to obtain a constant factor spectral approximation?

Unfortunately, to compute even a constant factor spectralapproximation still requires either leverage score estimates, a subspaceembedding or a matrix square root.

Page 75: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Traditional Solution

Overestimates are good enough. Just increases number of rows taken.

Given a constant factor spectral approximation A we have:

a>i (A>A)−1ai ≈c a>i (A>A)−1ai

∑τi (A) ≤

∑c · τi (A) = c

∑τi (A) = c · d .

So can still sample O(d log d/ε2) rows using τi (A).

But how to obtain a constant factor spectral approximation?

Unfortunately, to compute even a constant factor spectralapproximation still requires either leverage score estimates, a subspaceembedding or a matrix square root.

Page 76: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Traditional Solution

Overestimates are good enough. Just increases number of rows taken.

Given a constant factor spectral approximation A we have:

a>i (A>A)−1ai ≈c a>i (A>A)−1ai

∑τi (A) ≤

∑c · τi (A) = c

∑τi (A) = c · d .

So can still sample O(d log d/ε2) rows using τi (A).

But how to obtain a constant factor spectral approximation?

Unfortunately, to compute even a constant factor spectralapproximation still requires either leverage score estimates, a subspaceembedding or a matrix square root.

Page 77: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

Traditional Solution

Overestimates are good enough. Just increases number of rows taken.

Given a constant factor spectral approximation A we have:

a>i (A>A)−1ai ≈c a>i (A>A)−1ai

∑τi (A) ≤

∑c · τi (A) = c

∑τi (A) = c · d .

So can still sample O(d log d/ε2) rows using τi (A).

But how to obtain a constant factor spectral approximation?

Unfortunately, to compute even a constant factor spectralapproximation still requires either leverage score estimates, a subspaceembedding or a matrix square root.

Page 78: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

A spectral approximation A gives the for each guarantee:

τi (A) ≤ c · τi (A)

This is stronger than we need though! We just need a bound on thesum: ∑

i

τi (A) ≤ c · d

Page 79: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Computing Leverage Scores

A spectral approximation A gives the for each guarantee:

τi (A) ≤ c · τi (A)

This is stronger than we need though! We just need a bound on thesum: ∑

i

τi (A) ≤ c · d

Page 80: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Overview

1 Spectral Matrix Approximation

2 Leverage Score Sampling

3 Iterative Leverage Score Computation

4 Coherence Reducing Reweighting

Page 81: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Review of what our goal is:

Want to find A such that ‖Ax‖22 ≈ε ‖Ax‖22.

Want to avoid JL Projection - just use random row sampling

Need a way to efficiently compute approximations τi (A) such that∑τi (A) = O(d)

Efficient: O(nnz(A) + R(d , d)) where R(d , d) is the cost of solving ad × d regression problem.

Page 82: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Review of what our goal is:

Want to find A such that ‖Ax‖22 ≈ε ‖Ax‖22.

Want to avoid JL Projection - just use random row sampling

Need a way to efficiently compute approximations τi (A) such that∑τi (A) = O(d)

Efficient: O(nnz(A) + R(d , d)) where R(d , d) is the cost of solving ad × d regression problem.

Page 83: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Review of what our goal is:

Want to find A such that ‖Ax‖22 ≈ε ‖Ax‖22.

Want to avoid JL Projection - just use random row sampling

Need a way to efficiently compute approximations τi (A) such that∑τi (A) = O(d)

Efficient: O(nnz(A) + R(d , d)) where R(d , d) is the cost of solving ad × d regression problem.

Page 84: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Review of what our goal is:

Want to find A such that ‖Ax‖22 ≈ε ‖Ax‖22.

Want to avoid JL Projection - just use random row sampling

Need a way to efficiently compute approximations τi (A) such that∑τi (A) = O(d)

Efficient: O(nnz(A) + R(d , d)) where R(d , d) is the cost of solving ad × d regression problem.

Page 85: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Review of what our goal is:

Want to find A such that ‖Ax‖22 ≈ε ‖Ax‖22.

Want to avoid JL Projection - just use random row sampling

Need a way to efficiently compute approximations τi (A) such that∑τi (A) = O(d)

Efficient: O(nnz(A) + R(d , d)) where R(d , d) is the cost of solving ad × d regression problem.

Page 86: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

In practice, data is typically incoherent - no row has a high leveragescore. [Kumar, Mohri, Talwalkar ‘12], [Avron, Maymounkov, Toledo‘10].

∀i τi (A) ≤ O(d/n)

.

In this case, we can uniformly sample O(d log d) rows to obtain aspectral approximation.∑

i

τi (A) = O

(d

n

)· n = O(d)

Page 87: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

In practice, data is typically incoherent - no row has a high leveragescore. [Kumar, Mohri, Talwalkar ‘12], [Avron, Maymounkov, Toledo‘10].

∀i τi (A) ≤ O(d/n)

.In this case, we can uniformly sample O(d log d) rows to obtain aspectral approximation.

∑i

τi (A) = O

(d

n

)· n = O(d)

Page 88: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

In practice, data is typically incoherent - no row has a high leveragescore. [Kumar, Mohri, Talwalkar ‘12], [Avron, Maymounkov, Toledo‘10].

∀i τi (A) ≤ O(d/n)

.In this case, we can uniformly sample O(d log d) rows to obtain aspectral approximation.∑

i

τi (A) = O

(d

n

)· n = O(d)

Page 89: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

In practice, data is typically incoherent - no row has a high leveragescore. [Kumar, Mohri, Talwalkar ‘12], [Avron, Maymounkov, Toledo‘10].

∀i τi (A) ≤ O(d/n)

.In this case, we can uniformly sample O(d log d) rows to obtain aspectral approximation.∑

i

τi (A) = O

(d

n

)· n = O(d)

Page 90: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

No guarantees on what uniform sampling does on general matrices.

For x =

10...0

, ‖Ax‖22 = 1, but with good probability, ‖Ax‖22 = 0.

Can we still use uniform sampling in some way?

Page 91: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

No guarantees on what uniform sampling does on general matrices.

For x =

10...0

, ‖Ax‖22 = 1, but with good probability, ‖Ax‖22 = 0.

Can we still use uniform sampling in some way?

Page 92: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

No guarantees on what uniform sampling does on general matrices.

For x =

10...0

, ‖Ax‖22 = 1, but with good probability, ‖Ax‖22 = 0.

Can we still use uniform sampling in some way?

Page 93: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

One simple idea for an algorithm:

1 Uniformly sample O(d log d) rows of A to obtain Au

2 Compute (A>u Au)−1, and use it to estimate leverage scores of A

3 Sample rows of A using these estimated leverage scores to obtain Awhich spectrally approximates A.

We want to bound how large A must be.

Page 94: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

One simple idea for an algorithm:

1 Uniformly sample O(d log d) rows of A to obtain Au

2 Compute (A>u Au)−1, and use it to estimate leverage scores of A

3 Sample rows of A using these estimated leverage scores to obtain Awhich spectrally approximates A.

We want to bound how large A must be.

Page 95: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

One simple idea for an algorithm:1 Uniformly sample O(d log d) rows of A to obtain Au

2 Compute (A>u Au)−1, and use it to estimate leverage scores of A

3 Sample rows of A using these estimated leverage scores to obtain Awhich spectrally approximates A.

We want to bound how large A must be.

Page 96: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

One simple idea for an algorithm:1 Uniformly sample O(d log d) rows of A to obtain Au

2 Compute (A>u Au)−1, and use it to estimate leverage scores of A3 Sample rows of A using these estimated leverage scores to obtain A

which spectrally approximates A.

We want to bound how large A must be.

Page 97: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

One simple idea for an algorithm:1 Uniformly sample O(d log d) rows of A to obtain Au

2 Compute (A>u Au)−1, and use it to estimate leverage scores of A3 Sample rows of A using these estimated leverage scores to obtain A

which spectrally approximates A.

We want to bound how large A must be.

Page 98: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

TheoremLet Au be obtained from uniformly sampling m rows of A. Let Au∪i be Au withai appended if not already included. τi (A) = a>i (A>u∪iAu∪i )

−1ai .

τi (A) ≥ τi (A)

E∑i

τi (A) ≤ nd

m

Note:Sherman-Morrison gives equation to compute τi (A) from a>i (A>u Au)−1ai

τi (A) =

a>i (A>u Au)−1ai if ai ∈ Au

11+ 1

a>i

(A>u Au )−1ai

o.w.

Page 99: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

TheoremLet Au be obtained from uniformly sampling m rows of A. Let Au∪i be Au withai appended if not already included. τi (A) = a>i (A>u∪iAu∪i )

−1ai .

τi (A) ≥ τi (A)

E∑i

τi (A) ≤ nd

m

Note:Sherman-Morrison gives equation to compute τi (A) from a>i (A>u Au)−1ai

τi (A) =

a>i (A>u Au)−1ai if ai ∈ Au

11+ 1

a>i

(A>u Au )−1ai

o.w.

Page 100: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

TheoremLet Au be obtained from uniformly sampling m rows of A. Let Au∪i be Au withai appended if not already included. τi (A) = a>i (A>u∪iAu∪i )

−1ai .

τi (A) ≥ τi (A)

E∑i

τi (A) ≤ nd

m

Note:Sherman-Morrison gives equation to compute τi (A) from a>i (A>u Au)−1ai

τi (A) =

a>i (A>u Au)−1ai if ai ∈ Au

11+ 1

a>i

(A>u Au )−1ai

o.w.

Page 101: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

What does this bound give us?

E∑i

τi (A) ≤ nd

m

Set m = n2 . Then: E

∑i τi (A) ≤ 2d .

Reminiscent of the MST algorithm from [Karger, Klein, Tarjan ’95].

Page 102: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

What does this bound give us?

E∑i

τi (A) ≤ nd

m

Set m = n2 . Then: E

∑i τi (A) ≤ 2d .

Reminiscent of the MST algorithm from [Karger, Klein, Tarjan ’95].

Page 103: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

What does this bound give us?

E∑i

τi (A) ≤ nd

m

Set m = n2 . Then: E

∑i τi (A) ≤ 2d .

Reminiscent of the MST algorithm from [Karger, Klein, Tarjan ’95].

Page 104: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

What does this bound give us?

E∑i

τi (A) ≤ nd

m

Set m = n2 . Then: E

∑i τi (A) ≤ 2d .

Reminiscent of the MST algorithm from [Karger, Klein, Tarjan ’95].

Page 105: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

What does this bound give us?

E∑i

τi (A) ≤ nd

m

Set m = n2 . Then: E

∑i τi (A) ≤ 2d .

Reminiscent of the MST algorithm from [Karger, Klein, Tarjan ’95].

Page 106: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Uniform Row Sampling

What does this bound give us?

E∑i

τi (A) ≤ nd

m

Set m = n2 . Then: E

∑i τi (A) ≤ 2d .

Reminiscent of the MST algorithm from [Karger, Klein, Tarjan ’95].

Page 107: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Immediately yields a recursive algorithm for obtaining A.

1 Recursively obtain constant factor spectral approximation to A2.

2 Use this to approximate τi (A).

3 Use these approximate scores to obtain A. (With any (1± ε)multiplicative error you want).

Page 108: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Immediately yields a recursive algorithm for obtaining A.

1 Recursively obtain constant factor spectral approximation to A2.

2 Use this to approximate τi (A).

3 Use these approximate scores to obtain A. (With any (1± ε)multiplicative error you want).

Page 109: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Immediately yields a recursive algorithm for obtaining A.

1 Recursively obtain constant factor spectral approximation to A2.

2 Use this to approximate τi (A).

3 Use these approximate scores to obtain A. (With any (1± ε)multiplicative error you want).

Page 110: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Immediately yields a recursive algorithm for obtaining A.

1 Recursively obtain constant factor spectral approximation to A2.

2 Use this to approximate τi (A).

3 Use these approximate scores to obtain A. (With any (1± ε)multiplicative error you want).

Page 111: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Immediately yields a recursive algorithm for obtaining A.

1 Recursively obtain constant factor spectral approximation to A2.

2 Use this to approximate τi (A).

3 Use these approximate scores to obtain A. (With any (1± ε)multiplicative error you want).

Page 112: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

Immediately yields a recursive algorithm for obtaining A.

1 Recursively obtain constant factor spectral approximation to A2.

2 Use this to approximate τi (A).

3 Use these approximate scores to obtain A. (With any (1± ε)multiplicative error you want).

Page 113: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

This algorithm, along with a few other variations on it are the mainalgorithmic results of our paper.

Obtain spectral approximation to A with arbitrary error.

Avoid JL projecting and densifying A.

O(nnz(A) + R(d , d)) time.

Page 114: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

This algorithm, along with a few other variations on it are the mainalgorithmic results of our paper.

Obtain spectral approximation to A with arbitrary error.

Avoid JL projecting and densifying A.

O(nnz(A) + R(d , d)) time.

Page 115: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

This algorithm, along with a few other variations on it are the mainalgorithmic results of our paper.

Obtain spectral approximation to A with arbitrary error.

Avoid JL projecting and densifying A.

O(nnz(A) + R(d , d)) time.

Page 116: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Iterative Leverage Score Computation

This algorithm, along with a few other variations on it are the mainalgorithmic results of our paper.

Obtain spectral approximation to A with arbitrary error.

Avoid JL projecting and densifying A.

O(nnz(A) + R(d , d)) time.

Page 117: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

TheoremLet Au be obtained from uniformly sampling m rows of A. Let Au∪i be Au withai appended if not already included. τi (A) = a>i (A>u∪iAu∪i )

−1ai .

τi (A) ≥ τi (A) (1)

E∑i

τi (A) ≤ nd

m(2)

(1) follows from the fact that removing rows of A can only increaseleverage scores.

Page 118: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

TheoremLet Au be obtained from uniformly sampling m rows of A. Let Au∪i be Au withai appended if not already included. τi (A) = a>i (A>u∪iAu∪i )

−1ai .

τi (A) ≥ τi (A) (1)

E∑i

τi (A) ≤ nd

m(2)

(1) follows from the fact that removing rows of A can only increaseleverage scores.

Page 119: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Eu

∑i

τi (A) ≤ nd

m

Consider choosing a uniform random row aj .

Eu

∑i

τi (A) = n · Eu,jτj(A)

.

Page 120: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Eu

∑i

τi (A) ≤ nd

m

Consider choosing a uniform random row aj .

Eu

∑i

τi (A) = n · Eu,jτj(A)

.

Page 121: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 122: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 123: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 124: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 125: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 126: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 127: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 128: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 129: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Further, expectation of this process is exactly the same as expectedleverage score if we sample m rows, and then compute the leveragescore of a random one.

Page 130: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Remember,∑

i τi (Au) = d .

m rows, so expected score of randomly chosen row is dm .

Ej τj(A) = dm

Eu∑

i τi (A) = ndm .

Page 131: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Remember,∑

i τi (Au) = d .

m rows, so expected score of randomly chosen row is dm .

Ej τj(A) = dm

Eu∑

i τi (A) = ndm .

Page 132: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Remember,∑

i τi (Au) = d .

m rows, so expected score of randomly chosen row is dm .

Ej τj(A) = dm

Eu∑

i τi (A) = ndm .

Page 133: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Remember,∑

i τi (Au) = d .

m rows, so expected score of randomly chosen row is dm .

Ej τj(A) = dm

Eu∑

i τi (A) = ndm .

Page 134: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Proof of Main Theorem

Remember,∑

i τi (Au) = d .

m rows, so expected score of randomly chosen row is dm .

Ej τj(A) = dm

Eu∑

i τi (A) = ndm .

Page 135: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Review

What did this theorem just tell us?

E∑

i τi (A) = E∑

i a>i (A>u∪iAu∪i )−1ai is bounded. And this is all we

need!

Recall that uniform sampling from A does not give us a spectralapproximation.

We cannot bound a>i (A>u Au)−1ai .

Page 136: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Review

What did this theorem just tell us?

E∑

i τi (A) = E∑

i a>i (A>u∪iAu∪i )−1ai is bounded. And this is all we

need!

Recall that uniform sampling from A does not give us a spectralapproximation.

We cannot bound a>i (A>u Au)−1ai .

Page 137: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Review

What did this theorem just tell us?

E∑

i τi (A) = E∑

i a>i (A>u∪iAu∪i )−1ai is bounded. And this is all we

need!

Recall that uniform sampling from A does not give us a spectralapproximation.

We cannot bound a>i (A>u Au)−1ai .

Page 138: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Overview

1 Spectral Matrix Approximation

2 Leverage Score Sampling

3 Iterative Leverage Score Computation

4 Coherence Reducing Reweighting

Page 139: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Reminder: If our data is incoherent, then all leverage scores are smallO(d/n) and we can uniformly sample rows and obtain a small spectralapproximation.

We show: Even if our matrix is not incoherent, it is ‘close’ to someincoherent matrix.

Page 140: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Reminder: If our data is incoherent, then all leverage scores are smallO(d/n) and we can uniformly sample rows and obtain a small spectralapproximation.We show: Even if our matrix is not incoherent, it is ‘close’ to someincoherent matrix.

Page 141: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Reminder: If our data is incoherent, then all leverage scores are smallO(d/n) and we can uniformly sample rows and obtain a small spectralapproximation.We show: Even if our matrix is not incoherent, it is ‘close’ to someincoherent matrix.

Page 142: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What do we mean by close?

We can reweight d/α rows of A to obtain A′ with τi (A′) ≤ α for all i .

Can reweight n/2 rows so that τi is bounded by 2d/n

Page 143: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What do we mean by close?

We can reweight d/α rows of A to obtain A′ with τi (A′) ≤ α for all i .

Can reweight n/2 rows so that τi is bounded by 2d/n

Page 144: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What do we mean by close?

We can reweight d/α rows of A to obtain A′ with τi (A′) ≤ α for all i .

Can reweight n/2 rows so that τi is bounded by 2d/n

Page 145: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What do we mean by close?

We can reweight d/α rows of A to obtain A′ with τi (A′) ≤ α for all i .

Can reweight n/2 rows so that τi is bounded by 2d/n

Page 146: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What does this result give us (without going into details)?

Uniform sample m rows of A and setτi (A) = max{a>i (A>u Au)−1ai , 1}.This sampling does a good job of approximating A′.

Gives good estimates for leverage scores of the that are notreweighted.

Few rows are reweighted, so overall we are able to significantly reducematrix size.

Don’t need to actually compute W. Just existence is enough.

Page 147: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What does this result give us (without going into details)?

Uniform sample m rows of A and setτi (A) = max{a>i (A>u Au)−1ai , 1}.

This sampling does a good job of approximating A′.

Gives good estimates for leverage scores of the that are notreweighted.

Few rows are reweighted, so overall we are able to significantly reducematrix size.

Don’t need to actually compute W. Just existence is enough.

Page 148: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What does this result give us (without going into details)?

Uniform sample m rows of A and setτi (A) = max{a>i (A>u Au)−1ai , 1}.This sampling does a good job of approximating A′.

Gives good estimates for leverage scores of the that are notreweighted.

Few rows are reweighted, so overall we are able to significantly reducematrix size.

Don’t need to actually compute W. Just existence is enough.

Page 149: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What does this result give us (without going into details)?

Uniform sample m rows of A and setτi (A) = max{a>i (A>u Au)−1ai , 1}.This sampling does a good job of approximating A′.

Gives good estimates for leverage scores of the that are notreweighted.

Few rows are reweighted, so overall we are able to significantly reducematrix size.

Don’t need to actually compute W. Just existence is enough.

Page 150: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What does this result give us (without going into details)?

Uniform sample m rows of A and setτi (A) = max{a>i (A>u Au)−1ai , 1}.This sampling does a good job of approximating A′.

Gives good estimates for leverage scores of the that are notreweighted.

Few rows are reweighted, so overall we are able to significantly reducematrix size.

Don’t need to actually compute W. Just existence is enough.

Page 151: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

What does this result give us (without going into details)?

Uniform sample m rows of A and setτi (A) = max{a>i (A>u Au)−1ai , 1}.This sampling does a good job of approximating A′.

Gives good estimates for leverage scores of the that are notreweighted.

Few rows are reweighted, so overall we are able to significantly reducematrix size.

Don’t need to actually compute W. Just existence is enough.

Page 152: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

How to prove existence of reweighting?

∑i τi (A) = d so at most d/α rows with τi (A) ≥ α.

Can we just delete them?

Page 153: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

How to prove existence of reweighting?∑i τi (A) = d so at most d/α rows with τi (A) ≥ α.

Can we just delete them?

Page 154: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

How to prove existence of reweighting?∑i τi (A) = d so at most d/α rows with τi (A) ≥ α.

Can we just delete them?

Page 155: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Not quite. Deleting some rows may cause leverage scores of other rows toincrease.

Page 156: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Not quite. Deleting some rows may cause leverage scores of other rows toincrease.

Page 157: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Alternative idea

1 Cycle through rows

2 Each time you see a row with τi (A) ≥ α cut its weight so thatτi (A) = α

3 Repeat

Page 158: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Alternative idea

1 Cycle through rows

2 Each time you see a row with τi (A) ≥ α cut its weight so thatτi (A) = α

3 Repeat

Page 159: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Alternative idea

1 Cycle through rows

2 Each time you see a row with τi (A) ≥ α cut its weight so thatτi (A) = α

3 Repeat

Page 160: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Alternative idea

1 Cycle through rows

2 Each time you see a row with τi (A) ≥ α cut its weight so thatτi (A) = α

3 Repeat

Page 161: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Alternative idea

1 Cycle through rows

2 Each time you see a row with τi (A) ≥ α cut its weight so thatτi (A) = α

3 Repeat

Page 162: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Alternative idea

1 Cycle through rows

2 Each time you see a row with τi (A) ≥ α cut its weight so thatτi (A) = α

3 Repeat

Page 163: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

Alternative idea

1 Cycle through rows

2 Each time you see a row with τi (A) ≥ α cut its weight so thatτi (A) = α

3 Repeat

Page 164: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Key observation:

Decreasing the weight of a row will only increase the leverage score ofother rows

At any step, every row that has been reweighted has leverage score≥ α so at most d/α reweighted rows.

Page 165: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Key observation:

Decreasing the weight of a row will only increase the leverage score ofother rows

At any step, every row that has been reweighted has leverage score≥ α so at most d/α reweighted rows.

Page 166: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Key observation:

Decreasing the weight of a row will only increase the leverage score ofother rows

At any step, every row that has been reweighted has leverage score≥ α so at most d/α reweighted rows.

Page 167: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Key observation:

Decreasing the weight of a row will only increase the leverage score ofother rows

At any step, every row that has been reweighted has leverage score≥ α so at most d/α reweighted rows.

Page 168: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Key observation:

Decreasing the weight of a row will only increase the leverage score ofother rows

At any step, every row that has been reweighted has leverage score≥ α so at most d/α reweighted rows.

Page 169: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Key observation:

Decreasing the weight of a row will only increase the leverage score ofother rows

At any step, every row that has been reweighted has leverage score≥ α so at most d/α reweighted rows.

Page 170: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Key observation:

Decreasing the weight of a row will only increase the leverage score ofother rows

At any step, every row that has been reweighted has leverage score≥ α so at most d/α reweighted rows.

Page 171: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Key observation:

Decreasing the weight of a row will only increase the leverage score ofother rows

At any step, every row that has been reweighted has leverage score≥ α so at most d/α reweighted rows.

Page 172: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Page 173: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Page 174: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Page 175: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Page 176: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Rows that keep violating constraint will have weight cut to ≈ 0.

Can remove them without significantly effecting leverage scores ofother rows.

So overall, reweighting d/α rows is enough to cut all leverage scoresbelow α.

Page 177: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Rows that keep violating constraint will have weight cut to ≈ 0.

Can remove them without significantly effecting leverage scores ofother rows.

So overall, reweighting d/α rows is enough to cut all leverage scoresbelow α.

Page 178: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Rows that keep violating constraint will have weight cut to ≈ 0.

Can remove them without significantly effecting leverage scores ofother rows.

So overall, reweighting d/α rows is enough to cut all leverage scoresbelow α.

Page 179: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Rows that keep violating constraint will have weight cut to ≈ 0.

Can remove them without significantly effecting leverage scores ofother rows.

So overall, reweighting d/α rows is enough to cut all leverage scoresbelow α.

Page 180: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Coherence Reducing Reweighting

But does this reweighting process converge?

Rows that keep violating constraint will have weight cut to ≈ 0.

Can remove them without significantly effecting leverage scores ofother rows.

So overall, reweighting d/α rows is enough to cut all leverage scoresbelow α.

Page 181: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Conclusion

Take away:

Very simple analysis shows how leverage scores can be approximatedwith uniform sampling.

Simple iterative spectral approximation algorithms matching state ofthe art runtimes follow.

Open Questions:

Analogous algorithms for low rank approximation?

For other types of regression?

Generally, our result shows that it is possible to go beyond relativecondition number (spectral approximation) invariants for iterativealgorithms. Can this type of analysis be useful elsewhere?

Thanks! Questions?

Page 182: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Conclusion

Take away:

Very simple analysis shows how leverage scores can be approximatedwith uniform sampling.

Simple iterative spectral approximation algorithms matching state ofthe art runtimes follow.

Open Questions:

Analogous algorithms for low rank approximation?

For other types of regression?

Generally, our result shows that it is possible to go beyond relativecondition number (spectral approximation) invariants for iterativealgorithms. Can this type of analysis be useful elsewhere?

Thanks! Questions?

Page 183: Uniform Sampling for Matrix Approximationcmusco/personal_site/pdfs/... · 2019-04-09 · Uniform Sampling for Matrix Approximation Michael Cohen, Yin Tat Lee, Cameron Musco, Christopher

Conclusion

Take away:

Very simple analysis shows how leverage scores can be approximatedwith uniform sampling.

Simple iterative spectral approximation algorithms matching state ofthe art runtimes follow.

Open Questions:

Analogous algorithms for low rank approximation?

For other types of regression?

Generally, our result shows that it is possible to go beyond relativecondition number (spectral approximation) invariants for iterativealgorithms. Can this type of analysis be useful elsewhere?

Thanks! Questions?