mean-field theory and its applications in computer vision3 1

40
Mean-Field Theory and Its Applications In Computer Vision3 1

Upload: aaron-bentley

Post on 28-Mar-2015

221 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: Mean-Field Theory and Its Applications In Computer Vision3 1

Mean-Field Theory and Its Applications In Computer Vision3

1

Page 2: Mean-Field Theory and Its Applications In Computer Vision3 1

Gaussian Pairwise Potential

2

Spatial

Expensive message passing can be performed by cross-bilateral filtering

Range

Page 3: Mean-Field Theory and Its Applications In Computer Vision3 1

Cross bilateral filter

3

qq

pp

output input

pp

reproducedfrom [Durand 02]

S

IIIGGW

IBFq

qqpp

p qp ||||||1

][rs

output input

Page 4: Mean-Field Theory and Its Applications In Computer Vision3 1

Efficient Cross-Bilateral Filtering

• Based on permutohedral lattice (PLBF)2

• Embed the points on the permutohedral lattice• Apply Gaussian Blurring

4

Page 5: Mean-Field Theory and Its Applications In Computer Vision3 1

Efficient Cross-Bilateral Filtering

• Based on permutohedral lattice (PLBF)2

• Embed the points on the permutohedral lattice• Apply Gaussian Blurring

5

• Based on the domain-transform (DTBF)3

• Project the point to lower dimension• Perform filtering in the transformed domain

Page 6: Mean-Field Theory and Its Applications In Computer Vision3 1

Efficient Cross-Bilateral Filtering

• Based on permutohedral lattice (PLBF)2

• Embed the points on the permutohedral lattice• Apply Gaussian Blurring

6

• Based on the domain-transform (DTBF)3

• Project the point to lower dimension• Perform filtering in the transformed domain

• Filtering in frequency domain• Apply fast fourier transform• convolution in (s) domain=multiplication in (f) domain

Page 7: Mean-Field Theory and Its Applications In Computer Vision3 1

Barycentric Interpolation

7

Page 8: Mean-Field Theory and Its Applications In Computer Vision3 1

Efficient Cross-Bilateral Filtering

8

Page 9: Mean-Field Theory and Its Applications In Computer Vision3 1

Permutohedral Lattice based filtering

• For each pixel (x, y)

9

• Downsample all the points (dependent on standard deviations)

rss

),(,,),,(

YXIYX

zyx

Page 10: Mean-Field Theory and Its Applications In Computer Vision3 1

Embed to the permutohedral lattice

• Embed each downsampled points to the lattice

10

Page 11: Mean-Field Theory and Its Applications In Computer Vision3 1

Embed to the permutohedral lattice

• Embed each downsampled points to the lattice

11

Page 12: Mean-Field Theory and Its Applications In Computer Vision3 1

Embed to the permutohedral lattice

• Embed each downsampled points to the lattice

12

Page 13: Mean-Field Theory and Its Applications In Computer Vision3 1

Embed to the permutohedral lattice

• Embed each downsampled points to the lattice

13

Page 14: Mean-Field Theory and Its Applications In Computer Vision3 1

Gaussian blurring

• Apply Gaussian blurring along axes

14

Page 15: Mean-Field Theory and Its Applications In Computer Vision3 1

Gaussian blurring

• Apply Gaussian blurring along axes

15

Page 16: Mean-Field Theory and Its Applications In Computer Vision3 1

Gaussian blurring

• Apply Gaussian blurring along axes

16

Page 17: Mean-Field Theory and Its Applications In Computer Vision3 1

Splatting

• Upsample the points

17

Page 18: Mean-Field Theory and Its Applications In Computer Vision3 1

Splatting

• Upsample the points

18

Page 19: Mean-Field Theory and Its Applications In Computer Vision3 1

PLBF

• Final upsampled points

19

Page 20: Mean-Field Theory and Its Applications In Computer Vision3 1

Domain Transform Filtering

20

• Project points in low-dimension preserving the distance in the high dimension

• Projecting to the original space

• Filtering performed in low-dimension space

Page 21: Mean-Field Theory and Its Applications In Computer Vision3 1

Distance in high-dimension space

21

Page 22: Mean-Field Theory and Its Applications In Computer Vision3 1

Filtering in high-dimension space

22

Spatial

Range

Inefficient

Page 23: Mean-Field Theory and Its Applications In Computer Vision3 1

Projection in low-dimension space

23

• Project to low-dimension • Maintain geodesic distance high-dimension space

Page 24: Mean-Field Theory and Its Applications In Computer Vision3 1

Projection in low-dimension space

24

• Project to low-dimension • Maintain geodesic distance high-dimension space

Page 25: Mean-Field Theory and Its Applications In Computer Vision3 1

Projection in low-dimension space

25

• Project to low-dimension • Maintain geodesic distance high-dimension space

Page 26: Mean-Field Theory and Its Applications In Computer Vision3 1

Gaussian blurring in low-dimension

26

• Apply Gaussian blurring in low-dimension space

Page 27: Mean-Field Theory and Its Applications In Computer Vision3 1

Project

27

• Project the blurred values in the original space

Page 28: Mean-Field Theory and Its Applications In Computer Vision3 1

Project

28

• Project the blurred values in the original space

Page 29: Mean-Field Theory and Its Applications In Computer Vision3 1

PLBF Vs DTBF

29

• Filter parameter:• PLBF runtime is inversely proportional to the kernel size defined over space and range

• Use PLBF with the relatively large (~10) range • Use DTBF with relatively smaller (~1-2) range

• Processing Time:• Both linear in the number of pixels

Page 30: Mean-Field Theory and Its Applications In Computer Vision3 1

Filtering in frequency domain

30

Page 31: Mean-Field Theory and Its Applications In Computer Vision3 1

Convergence

31

• Iteration vs. KL-divergence value• In theory: (since parallel update) convergence is not guaranteed• In practice: converges observe a convergence

Page 32: Mean-Field Theory and Its Applications In Computer Vision3 1

MSRC-21 dataset

32

• 591 colour images, 320x213 size, 21 object classes

Page 33: Mean-Field Theory and Its Applications In Computer Vision3 1

MSRC-21 dataset

33

• 591 colour images, 320x213 size, 21 object classes

Runtime Standard ground truth Accurate ground truth

Global Average Global Average

Unary Classifiers

84.0 76.6 83.2±1.5 80.6±2.3

Grid CRF 1 sec 84.6 77.2 84.8±1.5 82.4±1.8

Robust Pn 30 sec 84.9 77.5 86.5±1.0 83.1±1.5

Dense CRF 0.2 sec 86.0 78.3 88.2±0.7 84.7±0.7

Page 34: Mean-Field Theory and Its Applications In Computer Vision3 1

PascalVOC-10 dataset

34

• 591 colour images, 320x213 size, 21 object classes

Page 35: Mean-Field Theory and Its Applications In Computer Vision3 1

PascalVOC-10 dataset

35

• 591 colour images, 320x213 size, 21 object classes

Runtime Overall Av. Recall Av. I/U

Dense CRF 0.67 sec 71.63 34.53 28.4

Page 36: Mean-Field Theory and Its Applications In Computer Vision3 1

Long-range connections

36

• Accuracy on increasing the spatial and range standard deviations• On MSRC-21 spatial – 61 pixels, range – 11

Page 37: Mean-Field Theory and Its Applications In Computer Vision3 1

Long-range connections

37

• On increasing the spatial and range standard deviations• On MSRC-21 spatial – 61 pixels, range – 11

Page 38: Mean-Field Theory and Its Applications In Computer Vision3 1

Long-range connections

38

• Sometimes propagates misleading information

Page 39: Mean-Field Theory and Its Applications In Computer Vision3 1

Mean-field Vs. Graph-cuts

39

• Measure I/U score on PascalVOC-10 segmentation • Increase standard deviation for mean-field• Increase window size for graph-cuts method

• Both achieve almost similar accuracy

Page 40: Mean-Field Theory and Its Applications In Computer Vision3 1

Mean-field Vs. Graph-cuts

40

• Measure I/U score on PascalVOC-10 segmentation • Increase standard deviation for mean-field• Increase window size for graph-cuts method

•Time complexity very high, making infeasible to work with large neighbourhood system