“what you saw is not what you get” domain adaptation for deep learning ·  ·...

51
“What you saw is not what you get” Domain adaptation for deep learning Kate Saenko

Upload: buidieu

Post on 06-May-2018

221 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

“What you saw is not what you get”Domain adaptation for deep learning

Kate Saenko

Page 2: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Successes of Deep Learning in AI

Google's DeepMind Masters Atari Games

Face Recognition

TranslateA Learning Advance in Artificial Intelligence Rivals Human Abilities

FLOWER FIELDSKYCLOUDS

2

Deep Learning for self-driving cars

Page 3: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

So is AI solved?

pedestrian detection FAIL

https://www.youtube.com/watch?v=w2pwxv8rFkU

3

Page 4: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Major limitation of deep learningNot data efficient: Learning requires millions of labeled examples,

models do not generalize well to new domains; not like humans!

Newdomain

?

Page 5: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

“What you saw is not what you get”

“Dataset Bias”“Domain Shift”

“Domain Adaptation”“Domain Transfer”

What your net is trained on What it’s asked to label

Page 6: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Example: scene segmentation

Train on Cityscapes, Test on Cityscapes

road

road

people

building

building

sky

FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation, Judy Hoffman, Dequan Wang, Fisher Yu, Trevor Darrell, Arxiv 2016

Page 7: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Domain shift: Cityscapes to SF

Train on Cityscapes, Test on San Francisco Dashcam

car

car

tree

tree

sign

road

FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation, Judy Hoffman, Dequan Wang, Fisher Yu, Trevor Darrell, Arxiv 2016

Page 8: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

No tunnels in CityScapes?...

treebuilding

FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation, Judy Hoffman, Dequan Wang, Fisher Yu, Trevor Darrell, Arxiv 2016

Page 9: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Applications to different types of domain shiftFrom dataset to dataset

From simulated to real control

From RGB to depth

From CAD models to real images

Page 10: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Today Show that deep models can be adapted without labels Propose two deep adaptation methods:

adversarial alignment correlation alignment

Show applications

Page 11: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Background: Domain Adaptation from source to target distribution

backpack chair bike

Adapt

Source Domainlots of labeled data

bike??

Target Domainunlabeled or limited labels

?

Page 12: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Background: unsupervised domain adaptation

• NO labels in target domain• Roughly, three categories of methods

B. Femando, A. Habrard, M. Sebban, and T. Tuytelaars. Unsupervised visual domain adaptation using subspace alignment. In ICCV, 2013.B. Gong, Y. Shi, F. Sha, and K. Grauman. Geodesic flow kernel for unsupervised domain adaptation. In CVPR, 2012Y. Ganin and V. Lempitsky. Unsupervised domain adaptation by back propagation. In ICML 2015

Subspace alignment Deep alignment

Sample re-weighting

13

Sample re-weighting Subspace matching Deep methods

Page 13: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

fc8conv1 conv5 fc

6fc7 classification

loss

How to adapt a deep network?

backpack chair bike

Source Data

Page 14: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

backpack chair bike

fc8conv1 conv5 fc

6fc7

• Applying source classifier to target domain can yield inferior performance…

classificationloss

How to adapt a deep network?

Source Data

Target Databackpack

?

Page 15: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Source Data

backpack chair

Target Databackpack

?

fc8conv1 conv5 fc

6fc7

labeled target data

fc8conv1 conv5 fc

6fc7

classificationlosssh

ared

shar

ed

shar

ed

shar

ed

shar

ed

• Fine tune? …..Zero or few labels in target domain

• Siamese network?…..No paired / aligned instance examples!

How to adapt a deep network?

source data

same

bike

IDEA: align feature distributions

Page 16: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Deep distribution alignment

17Y. Ganin and V. Lempitsky ICML 2015

M. Long, et al. ICML 2015Maximum Mean Discrepancy

E. Tzeng et al. ICCV 2015Domain Confusion Reverse Gradient

Sun and Saenko, AAAI 2016CORrelation ALignment

• by minimizing distance between distributions, e.g.

• …or by adversarial domain alignment, e.g.

Page 17: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Adversarial Domain Adaptation

18

Trevor Darrell

UC Berkeley

JudyHoffman

UC Berkeley

Eric Tzeng

UC Berkeley

Page 18: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Adversarial networks

Page 19: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Adversarial networksP

Q

P

Q

Page 20: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Source Data + Labels

backpack chair bike

Unlabeled Target Data

?

Cla

ssifi

er

classificationloss

Adversarial domain adaptation

conv1 conv5 fc6

fc7

conv1 conv5 fc6

fc7

Page 21: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Source Data + Labels

backpack chair bike

Unlabeled Target Data

?

Encoder

Cla

ssifi

er

Encoder classificationloss

Adversarial domain adaptation

can be shared

Page 22: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Source Data + Labels

backpack chair bike

Unlabeled Target Data

?

Encoder

Cla

ssifi

er

Encoder classificationloss

Adversarial domain adaptation

Discriminator Adversarial loss

can be shared

Page 23: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Source Data + Labels

backpack chair bike

Unlabeled Target Data

?

Encoder

Cla

ssifi

er

Encoder classificationloss

Adversarial domain adaptation

Discriminator Adversarial loss

can be shared

Page 24: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Source Data + Labels

backpack chair bike

Unlabeled Target Data

?

Encoder

Cla

ssifi

er

Encoder classificationloss

Design choices in adversarial adaptation

Discriminator Adversarial loss

Which loss?Shared or not?Generative ordiscriminative?

“confusion”

Ming-Yu Liu and Oncel Tuzel. Coupled generative adversarial networks, NIPS 2016

Page 25: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Source Data

backpack chair bike

fc8conv1 conv5 fc

6fc7

labeled target data

fc8conv1 conv5

source data

fcD

fc6

fc7

classificationloss

domain confusion

loss

domain classifier

loss

shar

ed

shar

ed

shar

ed

shar

ed

shar

ed

Domain Label Cross-entropy with uniform distribution

Adversarial Training of domain label predictor and domain confusion loss:

Deep domain confusion

Unlabeled Target Data

?

[Tzeng ICCV15 ]

Page 26: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Deep domain confusionTrain a network to minimize classification loss AND confuse two domains

[Tzeng ICCV15 ]

source inputs

targetinputs

networkparameters(fixed)

domainclassifier(learn) domain classifier loss

domain classifier prediction

network parameters (learn)

domain confusion loss

= 𝑝𝑝(𝑦𝑦𝐷𝐷 = 1|𝑥𝑥)

(cross-entropy with uniform distribution)

domainclassifier (fixed)

27

iterate

Page 27: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Applications to different types of domain shiftFrom dataset to dataset

From simulated to real control

From RGB to depth

From CAD models to real images

Page 28: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

ImageNet adapted to Caltech

400 120 200

Number of labeled target examples

Mul

ticla

ss A

ccur

acy

[Tzeng ICCV15 ]

Page 29: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Before domain confusion

After domain confusion

Results on Cityscapes to SF adaptation

FCNs in the Wild: Pixel-level Adversarial and Constraint-based Adaptation, Judy Hoffman, Dequan Wang, Fisher Yu, Trevor Darrell, Arxiv 2016

Page 30: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Adversarial Loss Functions

Minimax loss

Confusion loss

GAN loss [Goodfellow 2014]

[Tzeng 2015]

[Ganin 2015]

“stronger gradients”

Page 31: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Source Data + Labels

backpack chair bike

Unlabeled Target Data

?

Encoder

Cla

ssifi

er

Encoder classificationloss

Adversarial Discriminative Domain Adaptation (ADDA)

Discriminator Adversarial loss

Which loss?Shared or not?Generative ordiscriminative?

GAN

(in submission)

Page 32: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

ADDA: Adaptation on digits (in submission)

Page 33: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Applications to different types of domain shiftFrom dataset to dataset

From simulated to real control

From RGB to depth

From CAD models to real images

Page 34: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

ADDA: Adaptation on RGB-D (in submission)

Train on RGB

Test on depth

Page 35: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

ADDA: Adaptation on RGB-D (in submission)

Page 36: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Applications to different types of domain shiftFrom dataset to dataset

From simulated to real control

From RGB to depth

From CAD models to real images

Page 37: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Adapting Deep VisuomotorRepresentations with Weak Pairwise

ConstraintsEric Tzeng1, Coline Devin1, Judy Hoffman1, Chelsea Finn1,

Pieter Abbeel1, Sergey Levine1, Kate Saenko2, Trevor Darrell11 University of California, Berkeley

2 Boston University

Page 38: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

From simulation to real world control [Tzeng, Devin, et al 16]

Page 39: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Weak pairwise constraints [Tzeng, Devin, et al 16]

Page 40: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Robotic task: place rope on scale [Tzeng, Devin, et al 16]

Page 41: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Applications to different types of domain shiftFrom dataset to dataset

From simulated to real control

From RGB to depth

From CAD models to real images

Page 42: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Domain Adaptation via Correlation Alignment

49

Baochen Sun

Microsoft

XingchaoPeng

Boston University

Page 43: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Deep CORAL: Correlation Alignment for Deep Domain Adaptation[Sun 2016]

Page 44: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Deep CORAL: Correlation Alignment for Deep Domain Adaptation[Sun 2016]

Page 45: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Generative CORAL Network (in submission)

Page 46: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Synthetic to real adaptation for object recognition

Train on synthetic

Test on real

(in submission)

Page 47: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Synthetic to Real Adaptation with Deep Generative Correlation Alignment Networks (in submission)

Page 48: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Synthetic to Real Adaptation with Deep Generative Correlation Alignment Networks (in submission)

Page 49: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Synthetic to Real Adaptation with Deep Generative Correlation Alignment Networks (in submission)

Page 50: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Summary Deep models can be adapted to new domains without labels Proposed two deep feature alignment methods:

adversarial alignment correlation alignment

Many potential applications

Page 51: “What you saw is not what you get” Domain adaptation for deep learning ·  · 2017-01-13“What you saw is not what you get” Domain adaptation for deep learning. Kate Saenko

Thank youReferences• Eric Tzeng, Judy Hoffman, Trevor Darrell, Kate Saenko, Simultaneous Deep Transfer Across Domains and Tasks,

ICCV 2015• Eric Tzeng, Coline Devin, Judy Hoffman, Chelsea Finn, Pieter Abbeel, Sergey Levine, Kate Saenko, Trevor Darrell,

Adapting Deep Visuomotor Representations with Weak Pairwise Constraints, WAFR 2016• Baochen Sun, Jiashi Feng, Kate Saenko, Return of Frustratingly Easy Domain Adaptation, AAAI 2016• Baochen Sun, Kate Saenko, Deep CORAL: Correlation Alignment for Deep Domain Adaptation, TASK-CV Workshop

at ICCV 2016• Adversarial Discriminative Domain Adaptation, in submission• Synthetic to Real Adaptation with Deep Generative Correlation Alignment Networks, in submission