proximal deep structured models - github pages · performing inference in mrfs with continuous...

26
Proximal Deep Structured Models Shenlong Wang, Sanja Fidler, & Raquel Urtasun University of Toronto NIPS 2016 Presenter: Jack Lanchantin Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured Models NIPS 2016 Presenter: Jack Lanchantin 1 / 26

Upload: others

Post on 25-Jun-2020

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Proximal Deep Structured Models

Shenlong Wang, Sanja Fidler, & Raquel Urtasun

University of Toronto

NIPS 2016Presenter: Jack Lanchantin

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 1

/ 26

Page 2: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Outline

1 Intro

2 Deep Structured Prediction

3 Proximal MethodsProximal MethodsProximal Deep Structured ModelsSolving Proximal Deep Structured Models

4 Experiments and ResultsImage Denoising/Depth RefinementOptical Flow

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 2

/ 26

Page 3: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Structured Prediction

Many problems in real-world applications involve predicting acollection of random variables that are statistically related

Graphical models have been widely exploited to encode theseinteractions, but they are shallow and only a log linear combination ofhand-crafted– features is learned

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 3

/ 26

Page 4: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Structured Prediction

Deep structured models attempt to learn complex features by takinginto account the dependencies between the output variables. Avariety of methods have been developed in the context of predictingdiscrete outputs

However, little to no attention has been given to deep structuredmodels with continuous valued output variables.

One of the main reasons is that inference is much less well studied, andvery few solutions exist

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 4

/ 26

Page 5: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Continuous-Valued Structured Prediction

Given input x ∈ X , let y = (y1, ..., yn) be the set of random variableswe want to predict. The output space is a product space of all theelements y ∈ Y =

∏Ni=1 Yi , Yi ⊂ R

E (x , y ;w) is an energy function which encodes the problem:

E (x , y ;w) =∑i

fi (yi , x ;wu) +∑α

fα(yα, x ,wα) (1)

where fi (yi : x ,wu) : Yi ×X → R is a function that depends on asingle variable (i.e. unary term) and fα(yi ) : Yα ×X → R depends ona subset of variables yα = (yi )i∈α defined on a domain Yα ⊂ Y

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 5

/ 26

Page 6: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Continuous-Valued Structured Prediction

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 6

/ 26

Page 7: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Continuous-Valued Structured Prediction

Given an input x , inference aims at finding the best configuration byminimizing the energy function:

y∗ = argminy∈Y∑i

fα(yi ; x ,wu) +∑α

fα(yα, x ,wα) (2)

Finding the best scoring configuration y∗ is equivalent to maximizingthe posteriori distribution:

p(y |x ;w) =1

Z (x ;w)exp(−E (x , y |w)) (3)

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 7

/ 26

Page 8: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Inference in Deep Structured Prediction

Performing inference in MRFs with continuous variables involvessolving a challenging numerical optimization problem

If certain conditions are satisfied, inference is often tackled by a groupof algorithms called proximal methods

In this paper, they use proximal methods and show that it results in aparticular type of recurrent net

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 8

/ 26

Page 9: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Outline

1 Intro

2 Deep Structured Prediction

3 Proximal MethodsProximal MethodsProximal Deep Structured ModelsSolving Proximal Deep Structured Models

4 Experiments and ResultsImage Denoising/Depth RefinementOptical Flow

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 9

/ 26

Page 10: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Proximal Methods

The proximal operator proxf (x0): R→ R of a function is defined as:

proxf (x0) = argminy (y − x0)2 + f (y) (4)

This involves solving a convex optimization problem, but usually there is aclosed-form solution.

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 10

/ 26

Page 11: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Outline

1 Intro

2 Deep Structured Prediction

3 Proximal MethodsProximal MethodsProximal Deep Structured ModelsSolving Proximal Deep Structured Models

4 Experiments and ResultsImage Denoising/Depth RefinementOptical Flow

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 11

/ 26

Page 12: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Proximal Deep Structured Models

In order to apply proximal algorithms to tackle the inference problemdefined in Eq. (2), we require the energy functions fi and fα to satisfy thefollowing conditions:

1 There exist functions hi and gi s.t. fi (yi , x ;w) = gi (yi , hi (x ,w)),where gi is a distance function

2 There exists a closed-form proximal operator for gi (yi , hi (x ,w)) wrt yi3 There exist functions hα and gα s.t. fα(yα, x ;w) can be re-written as

fα(yα, x ;w) = hα(x ;w)gα(wTα yα)

4 There exists a proximal operator for gα()

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 12

/ 26

Page 13: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Proximal Deep Structured Models

If our potential functions satisfy the conditions above, we can rewrite ourobjective function as follows

E (x , y ;w) =∑i

gi (yi , hi (x ;w)) +∑α

hα(x ;w)gα(wTα yα) (5)

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 13

/ 26

Page 14: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Primal Dual Solvers

The general idea of primal dual solvers is to introduce auxiliary variables zto decompose the high order terms. We can then minimize z and yalternately through computing their proximal operator:

(6)

miny ∈Ymaxz ∈Z∑i

gi (yi , hi (x ;w))

−∑α

hα(x ,w)g∗α(wTα yα) +

∑α

hα(x ,w)〈wTα yα, zα〉

where g∗α is the convex conjugate of g∗

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 14

/ 26

Page 15: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Outline

1 Intro

2 Deep Structured Prediction

3 Proximal MethodsProximal MethodsProximal Deep Structured ModelsSolving Proximal Deep Structured Models

4 Experiments and ResultsImage Denoising/Depth RefinementOptical Flow

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 15

/ 26

Page 16: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Solving Deep Structured Models

The primal-dual method solves the problem in Eq.(6) by iterating thefollowing steps: (i) fix y and minimize the energy wrt z; (ii) fix z andminimize the energy wrt y; (iii) conduct a Nesterov extrapolation gradientstep:

where y (t) is the solution at the t-th iteration, z(t) is an auxiliary variableand h(x ,wu) is the deep unary network

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 16

/ 26

Page 17: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Solving Deep Structured Models

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 17

/ 26

Page 18: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Learning

Given training pairs composed of inputs {xn}Nn=1 and their correspondingoutput {ygtn }Nn=1, learning aims at finding parameters which minimizes aregularized loss function:

w∗ = argminw∑n

`(y∗n , ygtn ) + γ||w ||2 (7)

Where `() is the loss, y∗ is the minimizer of RNN, and γ is a scalar.

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 18

/ 26

Page 19: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Algorithm for Learning

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 19

/ 26

Page 20: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Outline

1 Intro

2 Deep Structured Prediction

3 Proximal MethodsProximal MethodsProximal Deep Structured ModelsSolving Proximal Deep Structured Models

4 Experiments and ResultsImage Denoising/Depth RefinementOptical Flow

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 20

/ 26

Page 21: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Image Denoising

Corrupt each image with Gaussian noise and use the following energyfunction to denoise:

y∗ = argminy∈Y∑i

||yi − xi ||22+∑α

λ||wTho,αyα||1 (8)

where prox`2(y , λ) = x+λy1+λ and prox∗ρ (z) = min(|z |, 1) · sign(z)

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 21

/ 26

Page 22: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Image Denoising

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 22

/ 26

Page 23: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Depth Refinement

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 23

/ 26

Page 24: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Outline

1 Intro

2 Deep Structured Prediction

3 Proximal MethodsProximal MethodsProximal Deep Structured ModelsSolving Proximal Deep Structured Models

4 Experiments and ResultsImage Denoising/Depth RefinementOptical Flow

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 24

/ 26

Page 25: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Optical Flow

Predict the motion between two image frames for each pixel

y∗ = argminy∈Y∑i

||yi − fi (xl , x r ,wu)||1+

∑α

λ||wTho,αyα||1 (9)

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 25

/ 26

Page 26: Proximal Deep Structured Models - GitHub Pages · Performing inference in MRFs with continuous variables involves solving a challenging numerical optimization problem If certain conditions

Optical Flow

Shenlong Wang, Sanja Fidler, & Raquel Urtasun Proximal Deep Structured ModelsNIPS 2016 Presenter: Jack Lanchantin 26

/ 26