outline · outline 1. problem 2. teori ... “rich feature hierarchies for accurate object...

40
Muhammad Ghifary | Transfer Learning Weta Digital Ltd. Outline 1. Problem 2. Teori Terminologi Definisi & klasifikasi transfer learning 3. Review algoritma 4. Scatter Component Analysis (SCA) - [TPAMI 2017] Algoritma transfer learning untuk object recognition (Kernel) Principal Component Analysis Hasil eksperimen 5. Kesimpulan 1

Upload: duongque

Post on 07-Sep-2018

214 views

Category:

Documents


0 download

TRANSCRIPT

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

Outline1. Problem2. Teori

○ Terminologi○ Definisi & klasifikasi transfer learning

3. Review algoritma4. Scatter Component Analysis (SCA) - [TPAMI 2017]

○ Algoritma transfer learning untuk object recognition○ (Kernel) Principal Component Analysis○ Hasil eksperimen

5. Kesimpulan

1

1. Problem

2

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

1. Sketch-based face recognition

B. F. Klare, A. K. Jain. “Heterogenous Face Recognition using Kernel Prototype Similarities”, TPAMI 2013

Bagaimana komputer mencocokan gambar wajah asli dengan sketsa?

3

Sketch

Original

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

1. Dataset biasA.Torralba & A. Efros. “Unbiased look at dataset bias”, CVPR 2011

SUN09 LabelMe PASCAL ImageNet Caltech101 MSRC Self Mean others

Percent drop

SUN09 69.8 50.7 42.2 42.6 54.7 69.4 69.8 51.9 26%

LabelMe 61.8 67.6 40.8 38.5 53.4 67.0 67.6 52.3 23%

PASCAL 55.8 55.2 62.1 56.8 54.2 74.8 62.1 59.4 4%

ImageNet 43.9 31.8 46.9 60.7 59.3 67.8 60.7 49.9 18%

Caltech101 20.2 18.8 11.0 31.4 100 29.3 100 22.2 78%

MSRC 28.6 17.1 32.3 21.5 67.7 74.3 74.3 33.4 55%

Mean others 42.0 34.7 34.6 38.2 57.9 61.7 72.4 44.8 48%

4

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

1. Facial performance capture

source target

5

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

1. Problem

● Data training tidak sepenuhnya mewakili realita/target ! -- domain mismatch● Tidak mudah untuk mengoleksi dan melakukan anotasi data training● Target domain tidak sepenuhnya diketahui

6

Gambar diambil dari Y. Ganin & V. Lempitsky, “Unsupervised Domain Adaptation by Backpropagation”, ICML 2015

2. Teori

7

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

2. Definisi transfer learning

8

“Kemampuan sistem untuk mentransfer pengetahuan/skill yang telah didapatkan dari domain / task tertentu (source), untuk diaplikasikan di domain / task yang lain (target)”

● Psikologi/Neurobiologi:

Indonesian → Malay, Japanese → Korean, Karate → Tinju, Piano → Gitar, C++ → Java, MATLAB → Python (numpy), Math →{Computer Science, Physics, Economics, …}

● Machine learning○ mengidentifikasi kesamaan pola/atribut/fitur antar domain

● Domain:

● Task:

● Training data

2. Definisi transfer learning

9

● Source domain & task:

● Target domain & task:

● Supervised learning:

● Transfer learning:

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

2. Klasifikasi transfer learning

1. Inductive transfer ( )○ Multi-task learning, self-taught learning, one-shot learning, zero-shot learning

2. Transductive transfer learning ( )○ Domain adaptation○ Domain generalization○ Covariate shift○ Sample selection bias

10

S. J. Pan & Q. Yang. “A Survey on Transfer Learning”, TKDE 2009

3. Review Algoritma

11

3. DASVM (1)

12

SVM recap● Data training:

● Classifier:

● Formulasi:

L. Bruzzone, M. Marconcini, “Domain adaptation problems: a DASVM classification technique and a cricular validation strategy.”, TPAMI 2010

Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. DASVM (2) [Bruzzone & Marconcini, TPAMI 2010]

13

Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. DASVM (3) [Bruzzone & Marconcini, TPAMI 2010]

14

Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. DASVM (4) [Bruzzone & Marconcini, TPAMI 2010]

15

Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. DASVM (5) [Bruzzone & Marconcini, TPAMI 2010]

16

Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. DASVM (6) [Bruzzone & Marconcini, TPAMI 2010]

17

Sumber: https://epat2014.sciencesconf.org/conference/epat2014/pages/slides_DA_epat_17.pdf

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. Feature Augmentation

Daume-III, “Frustratingly easy domain adaptation”, ACL 2007

Li et al, “Learning with augmented features for supervised and semi-supervised heterogeneous domain adaptation”, TPAMI 2014

18

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. Transfer Component Analysis

Memanfaatkan fungsi jarak antar distribusi probabilitas

● Maximum Mean Discrepancy:

● Formulasi

19

Sumber: http://informationtransfereconomics.blogspot.co.nz/2015/05/the-economic-allocation-problem.html

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. Geodesic Interpolation

Gopalan et al., “Domain adaptation for object recognition: an unsupervised approach”, ICCV 2011

20

Gong et al., “Geodesic flow kernel for unsupervised domain adaptation”, CVPR 2012

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

3. Deep Learning

Donahue et al., “DeCAF: a deep convolutional activation feature for generic visual recognition”, ICML 2014Girshick et al., “Rich feature hierarchies for accurate object detection and semantic segmentation”, CVPR 2014Yosinski et al., “How transferable are features in deep neural networks”, NIPS 2014Long et al., “Learning transferable features with deep adaptation networks”, ICML 2015Ganin & Lempitsky, “Unsupervised domain adaptation through backpropagation”, ICML 2015Ghifary et al., “Deep reconstruction-classification networks for unsupervised domain adaptation”, ECCV 2016

21

[Girshick et al. CVPR2014]

4. Scatter Component Analysis(SCA)

22

M. Ghifary, D. Balduzzi, W. B. Kleijn, M. Zhang, “Scatter Component Analysis: A Unified Framework for Domain Adaptation and Domain Generalization”, TPAMI 2017

Preprint: https://arxiv.org/abs/1510.04373

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. SCA

23

* persebaran = variance = scatter

Algoritma representation/feature learning yang bertujuan:1. Memaksimalkan persebaran data keseluruhan2. Meminimalkan persebaran data pada kelas yang sama3. Memaksimalkan persebaran data yang memiliki kelas berbeda4. Meminimalkan persebaran domain

Discriminative

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. SCA

SCA menggabungkan unsur yang ada pada Principal Component Analysis (PCA)*, Linear Discriminant Analysis (LDA), Maximum Mean Discrepancy (MMD), dan kernel trick.

24

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. Principal Component Analysis (1)● Mereduksi dimensi data menjadi beberapa komponen● Komponen-komponen tersebut mempertahankan

informasi yang dimiliki data

25

Sumber: http://www.nlpca.org/fig_pca_principal_component_analysis.png

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. PCA (2)

PCA berbentuk model linear

Goal:

26

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. PCA (3)

Tulis

sehingga

PCA berbasis linear projection → matrix W ortogonal ( ) →

Asumsi , persebaran / variance dari Z dapat dinyatakan dengan

27

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. PCA (4)

Optimisasi: hitung matriks W

Ekspansi Var(Z)

Bentuk Lagrangian

dimana variabel-variabel Lagrange

28

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. PCA (5)

Hitung

Eigen-decomposition:

Eigen-decomposition dari covariance matrix input/data menghasilkan basis yang memaksimalkan !

Komputasi representasi/component scores:

29

4. PCA (6)

(Trained) loadings/basis:

Diberikan sebuah data

30

Training● Input

○ Pastikan memiliki rata-rata 0

● Bentuk matrix input ● Hitung covariance matrix

● Hitung eigen-decomposision:

Test:● Input● Ekstrak representasi/fitur/components:

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. PCA (7)

Python (numpy)

31

Scikit-learn: http://scikit-learn.org/stable/

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. Kernel PCA

B. Scholkopf, A. Smola, “Nonlinear Component Analysis as a Kernel Eigenvalue”, Neural Computation 1998

32

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. PCA vs Kernel PCA

Sumber: http://2.bp.blogspot.com/_slrAR0IXTL0/TF-OZaNbRCI/AAAAAAAAAUo/SdYS3hXd4MI/s1600/figure.png

33

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. Scatter Component Analysis

34

35

Muhammad Ghifary | Transfer LearningWeta Digital Ltd. 36

4. SCA: Experiments

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. SCA: Experiments

37

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. SCA: ExperimentsOffice dataset [Saenko et al., ECCV2010]

38

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

4. SCA: ExperimentsOffice dataset [Saenko et al., ECCV2010]

39

Muhammad Ghifary | Transfer LearningWeta Digital Ltd.

5. Kesimpulan1. Transfer learning bertujuan untuk mengatasi problem ketidaksesuaian

antara data dan target/realita (domain mismatch)○ Pengoleksian data sulit dilakukan○ Target tidak diketahui secara pasti

2. Klasifikasi transfer learning○ Inductive transfer: multi-task learning, self-taught learning, one-shot learning, zero-shot

learing○ Transductive transfer: domain adaptation, domain generalization, covariate shift, sample

selection bias

3. Berbagai algoritma TL untuk aplikasi computer vision telah tersedia, namun masih open problem

40