unsupervised learning jointly with image clusteringjyang375/jianwei_yang... · unsupervised...

92
Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra https://filebox.ece.vt.edu/~jw2yang/ 1

Upload: others

Post on 05-Mar-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Unsupervised Learning Jointly With Image Clustering

Virginia Tech

Jianwei Yang Devi Parikh Dhruv Batra

https://filebox.ece.vt.edu/~jw2yang/ 1

Page 2: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

2

Page 3: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Huge amount of images!!!

3

Page 4: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Huge amount of images!!!

Learning without annotation efforts

4

Page 5: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Huge amount of images!!!

Learning without annotation efforts

What we need to learn?

5

Page 6: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

An open problem

Huge amount of images!!!

Learning without annotation efforts

What we need to learn?

6

Page 7: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

An open problem

A hot problem

Huge amount of images!!!

Learning without annotation efforts

What we need to learn?

7

Page 8: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Various methodologies

An open problem

A hot problem

Huge amount of images!!!

Learning without annotation efforts

What we need to learn?

8

Page 9: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Jain, Anil K., M. Narasimha Murty, and Patrick J. Flynn. "Data clustering: a review." ACM computing surveys (CSUR) 31.3 (1999): 264-323.

Clustering

9

Page 10: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Jain, Anil K., M. Narasimha Murty, and Patrick J. Flynn. "Data clustering: a review." ACM computing surveys (CSUR) 31.3 (1999): 264-323.

Clustering

K-means (Image Credit: Jesse Johnson)

10

Page 11: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Jain, Anil K., M. Narasimha Murty, and Patrick J. Flynn. "Data clustering: a review." ACM computing surveys (CSUR) 31.3 (1999): 264-323.

Clustering

K-means (Image Credit: Jesse Johnson)

Hierarchical Clustering

11

Page 12: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Jain, Anil K., M. Narasimha Murty, and Patrick J. Flynn. "Data clustering: a review." ACM computing surveys (CSUR) 31.3 (1999): 264-323.

Clustering

K-means (Image Credit: Jesse Johnson)Spectral Clustering

Manor et al, NIPS’04Hierarchical Clustering

12

Page 13: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Jain, Anil K., M. Narasimha Murty, and Patrick J. Flynn. "Data clustering: a review." ACM computing surveys (CSUR) 31.3 (1999): 264-323.

Clustering

K-means (Image Credit: Jesse Johnson)Spectral Clustering

Manor et al, NIPS’04Hierarchical Clustering Graph Cut

Shi et al, TPAMI’00

13

Page 14: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Jain, Anil K., M. Narasimha Murty, and Patrick J. Flynn. "Data clustering: a review." ACM computing surveys (CSUR) 31.3 (1999): 264-323.

Clustering

K-means (Image Credit: Jesse Johnson)

DBSCAN, Ester et al, KDD’96 (Image Credit: Jesse Johnson)

Spectral Clustering Manor et al, NIPS’04

Hierarchical Clustering Graph CutShi et al, TPAMI’00

14

Page 15: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Jain, Anil K., M. Narasimha Murty, and Patrick J. Flynn. "Data clustering: a review." ACM computing surveys (CSUR) 31.3 (1999): 264-323.

Clustering

K-means (Image Credit: Jesse Johnson)

DBSCAN, Ester et al, KDD’96 (Image Credit: Jesse Johnson)

Spectral Clustering Manor et al, NIPS’04

Hierarchical Clustering Graph CutShi et al, TPAMI’00

EM Algorithm, Dempster et al, JRSS’77

15

Page 16: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Jain, Anil K., M. Narasimha Murty, and Patrick J. Flynn. "Data clustering: a review." ACM computing surveys (CSUR) 31.3 (1999): 264-323.

Clustering

K-means (Image Credit: Jesse Johnson)

DBSCAN, Ester et al, KDD’96 (Image Credit: Jesse Johnson)

Spectral Clustering Manor et al, NIPS’04

Hierarchical Clustering Graph CutShi et al, TPAMI’00

EM Algorithm, Dempster et al, JRSS’77NMF, Xu et al, SIGIR‘03 (Image Credit: Conrad Lee)

16

Page 17: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning distribution (structure)

Sub-space Analysis

PCA (Image Credit: Jesse Johnson)ICA (Image Credit: Shylaja et al)

tSNE, Maaten et al, JMLR’08

Subspace Clustering, Vidal et al.

Sparse coding, Olshausen et al. Vision Research’9717

Page 18: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning representation (feature)

Yoshua Bengio, Aaron Courville, and Pierre Vincent. "Representation learning: A review and new perspectives." IEEE Transactions on Pattern Analysis and Machine Intelligence. 35.8 (2013): 1798-1828.

Autoencoder, Hinton et al, Science’06 (Image Credit: Jesse Johnson)

DBN, Hinton et al, Science’06 DBM, Salakhutdinov et al, AISTATS’09

Bengio et al, TPAMI’13

18

Page 19: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Learning representation (feature)

VAE, Kingma et al, arXiv’13(Image Credit: Fast Forward Labs)

GAN, Goodfellow et al, NIPS’14DCGAN, Radford et al, arXiv’15

(Image Credit: Mike Swarbrick Jones)

19

Page 20: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Most Recent CV Works

Spatial context, Doersch et al, ICCV’15Temporal context, Wang et al, ICCV’15

Solving Jigsaw, Noroozi et al, ECCV’16Context Encoder, Deepak et al, CVPR’16

Ego-motion, Jayaraman et al, ICCV’15

20

Page 21: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Most Recent CV Works

Visual concept clustering, Huang et al, CVPR’16

Graph constraint, Li et al, ECCV’16

TAGnet, Wang et al, SDM’16

Deep Embedding, Xie et al, ICML’1621

Page 22: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Our Work

Joint Unsupervised Learning (JULE) of Deep Representations and Image Clusters

22

Page 23: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Outline

• Intuition

• Approach

• Experiments

• Extensions

23

Page 24: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Intuition

Meaningful clusters can provide supervisory signals to learn image representations

24

Page 25: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Intuition

Meaningful clusters can provide supervisory signals to learn image representations

Good representations help to get meaningful clusters

25

Page 26: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Intuition

Cluster images first, and then learn representations

26

Page 27: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Intuition

Cluster images first, and then learn representations

Learn representations first, and then cluster images

27

Page 28: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Intuition

Cluster images and learn representations progressively

Cluster images first, and then learn representations

Learn representations first, and then cluster images

28

Page 29: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

IntuitionGood clusterGood representationsGood clusters

Good representations

Poor clusters

Poor representations 29

Page 30: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

IntuitionGood clusterGood representationsGood clusters

Good representations

Poor clusters

Poor representations 30

Page 31: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

IntuitionGood clusterGood representations

Good representations

Good clusters

Poor clusters

Poor representations 31

Page 32: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

IntuitionGood clusterGood representations

Good representations

Good clusters

Poor clusters

Poor representations 32

Page 33: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach

• Framework

• Objective

• Algorithm & Implementation

33

Page 34: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Framework

Agglomerative Clustering

Convolutional Neural Network

Representation Learning

Agglomerative Clustering

arg min ( | , )y

L y I

arg min ( | , )L y I

,

arg min ( , | )y

L y I

34

Page 35: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Framework

Convolutional Neural Network Agglomerative Clustering

arg min ( | , )L y I

arg min ( | , )y

L y I

35

Page 36: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

36

Page 37: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

37

Page 38: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

38

Page 39: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

39

Page 40: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

40

Page 41: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

41

Page 42: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

Backward at each time-step is time-consuming and prone to over-fitting!

42

Page 43: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

How about updating once for multiple time-steps?

Backward at each time-step is time-consuming and prone to over-fitting!

43

Page 44: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

Partially Unrolling: divide all T time-steps into P periods

In each period, we merge clusters for multiple times and update CNN parameters at the end of period

44

Page 45: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

Partially Unrolling: divide all T time-steps into P periods

In each period, we merge clusters for multiple times and update CNN parameters at the end of period

45

Page 46: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Recurrent Framework

In each period, we merge clusters for multiple times and update CNN parameters at the end of period

Partially Unrolling: divide all T time-steps into P periods

P is determined by a hyper-parameter will be introduced later 46

Page 47: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Overall loss:

,

arg min ( , | )y

L y I

arg min ( | , )y

L y I arg min ( | , )L y I

47

Page 48: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss at time-step t:

Conventional Agg. Clustering Strategy

Proposed Agg. Clustering Strategy

48

Page 49: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss at time-step t:

Conventional Agg. Clustering Strategy

Proposed Agg. Clustering Strategy

Affinity measure

49

Page 50: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss at time-step t:

Conventional Agg. Clustering Strategy

Proposed Agg. Clustering Strategy

i-th cluster

50

Page 51: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss at time-step t:

Conventional Agg. Clustering Strategy

Proposed Agg. Clustering Strategy

K_c nearest neighbor clusters of i-th cluster

51

Page 52: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss at time-step t:

Conventional Agg. Clustering Strategy

Proposed Agg. Clustering Strategy

Affinity between i-thcluster and its NN

52

Page 53: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss at time-step t:

Conventional Agg. Clustering Strategy

Proposed Agg. Clustering Strategy

Affinity between i-thcluster and its NN

Differences between two cluster affinities 53

Page 54: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss at time-step t:

Conventional Agg. Clustering Strategy

Proposed Agg. Clustering Strategy

Affinity between i-thcluster and its NN

Differences between two cluster affinities

Merge these two clusters

54

Page 55: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss at time-step t:

Conventional Agg. Clustering Strategy

Proposed Agg. Clustering Strategy

Affinity between i-thcluster and its NN

Differences between two cluster affinities

Merge these two clusters

55

Page 56: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss in forward pass in period p (merge clusters):

Loss in forward pass in period p (merge clusters):

56

Page 57: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss in forward pass in period p (merge clusters):

Loss in forward pass in period p (merge clusters):

57

Page 58: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss in forward pass in period p (merge clusters):

Loss in forward pass in period p (merge clusters):

CNN parameters are fixed

58

Page 59: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Loss in forward pass in period p (merge clusters):

Loss in forward pass in period p (merge clusters):

CNN parameters are fixed

Cluster labels are fixed

59

Page 60: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Forward Pass:

Simple Greedy Algorithm

Merge two clusters which minimize the loss at each time step

60

Page 61: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Forward Pass:

Simple Greedy Algorithm

Merge two clusters which minimize the loss at each time step

61

Page 62: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Forward Pass:

Simple Greedy Algorithm

Merge two clusters which minimize the loss at each time step

62

Page 63: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective Function

Forward Pass:

Simple Greedy Algorithm

Merge two clusters which minimize the loss at each time step

63

Page 64: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective

Backward Pass:

64

Page 65: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective

Backward Pass:

Consider all previous periods

65

Page 66: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective

Backward Pass:

Cluster based loss is not proper for batch optimization!!!

Consider all previous periods

66

Page 67: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective

Backward Pass:

Cluster based loss is not proper for batch optimization!!!

Consider all previous periods

Approximation:

67

Page 68: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective

Backward Pass:

Convert to sample-based loss:

Consider all previous periods

Intra-sample affinity Inter-sample affinity

Recall cluster-based loss:

68

Page 69: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Objective

Backward Pass:

Convert to sample-based loss:

Consider all previous periods

Intra-sample affinity Inter-sample affinity

Recall cluster-based loss:

Weighted triplet loss

69

Page 70: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Algorithm & Implementation

70

Page 71: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Algorithm & Implementation

Raw image data

71

Page 72: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Algorithm & Implementation

Raw image data

Assume it is known

72

Page 73: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Algorithm & Implementation

Raw image data

Assume it is known

Randomly initialize CNN parameters4 samples in each cluster in average

73

Page 74: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Algorithm & Implementation

Raw image data

Assume it is known

Randomly initialize CNN parameters4 samples in each cluster in average

Train CNN for about 20 epochs

74

Page 75: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Approach: Algorithm & Implementation

Raw image data

Assume it is known

Randomly initialize CNN parameters4 samples in each cluster in average

Train CNN for about 20 epochs

We can go back and retrain the model, but it improve slightly

75

Page 76: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments

• Datasets

• Network Architecture

• Image Clustering

• Representation Learning

76

Page 77: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Datasets

MNIST (70000, 10, 28x28) USPS (11000, 10, 16x16) COIL20 (1440, 20, 128x128) COIL100 (7200, 100, 128x128)

UMist (575, 20, 112x92) FRGC (2462, 20, 32x32) CMU-PIE (2856, 68, 32x32) Youtube Face (1000, 41, 55x55)77

Page 78: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: SettingsTwo important parameters

Set the layer numbers so that theOutput feature map is about 10x10

78

Page 79: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Performance

+6.43% on NMI to best performance of existing approaches averaged over all datasets

79

Page 80: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Performance

+12.76% on AC to best performance of existing approaches averaged over all datasets

80

Page 81: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Performance

Average +21.5% on NMI

81

Page 82: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Performance

Average +25.7% on NMI

82

Page 83: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Performance

Our clustering performance vs. that of existing clustering approaches using raw image data.

Clustering performance using our representation fed to existing clustering algorithms.

Page 84: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Visualization

COIL-20

COIL-100

84

Page 85: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Visualization

USPS

MNIST-test

85

Page 86: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Ablation study

86

Page 87: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Verification

87

Page 88: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Clustering : Time Cost

88

Page 89: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Experiments: Representation Learning

Testing generalization of our learnt (unsupervised) representation to LFW face verification.

Evaluation on CIFAR-10 classification

Representation transfer

Representation learning

89

Page 90: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Extensions: Data Visualization

90

Page 91: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Conclusion

• A new unsupervised learning method jointly with image clustering, cast the problem into a recurrent optimization problem;

• In the recurrent framework, clustering is conducted during forward pass, and representation learning is conducted during backward pass;

• A unified loss function in the forward pass and backward pass;

• Performance outperforms the state-of-the-art over a number of datasets;

• It can also learn plausible representations for image recognition.

91

Page 92: Unsupervised Learning Jointly With Image Clusteringjyang375/Jianwei_Yang... · Unsupervised Learning Jointly With Image Clustering Virginia Tech Jianwei Yang Devi Parikh Dhruv Batra

Thanks!

https://github.com/jwyang/joint-unsupervised-learning 92