machine learning: foundations course tau – 2012a prof. yishay mansour

16
MACHINE LEARNING: FOUNDATIONS COURSE TAU – 2012A PROF. YISHAY MANSOUR o TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton*, J. Winn†, C. Rother†, and A. Criminisi† o * University of Cambridge o † Microsoft Research Ltd, Cambridge, UK Yaniv Bar March 2013

Upload: mio

Post on 15-Jan-2016

52 views

Category:

Documents


0 download

DESCRIPTION

Machine Learning: Foundations Course TAU – 2012A Prof. Yishay Mansour. TextonBoost : Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation J. Shotton * , J. Winn † , C. Rother † , and A. Criminisi † * University of Cambridge - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

MACHINE LEARNING: FOUNDATIONS COURSETAU – 2012APROF. YISHAY MANSOUR

o TextonBoost :Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation

J. Shotton*, J. Winn†, C. Rother†, and A. Criminisi†

o * University of Cambridgeo † Microsoft Research Ltd, Cambridge, UK

Yaniv BarMarch 2013

Page 2: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

GOAL

o Simultaneous recognition and segmentation:Efficiently detect a large number of object classes and give a pixel-perfect segmentation of an image into these classes.

Page 3: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour
Page 4: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

DATA AND CLASSES Original Paper: 3 DBs.

Main DB: MSRC 21. MSRC 21-Class Object

Recognition Database 591 hand-labelled images

Original main DB was updated to MSRC 23.

MSRC 23-Class Object Recognition Database 592 hand-labelled images

Page 5: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

HIGH LEVEL APPROACH

High-level description of approach:Learn classifier based on relative texture locations for each class. Classification is then refined.

Given an image, for each pixel:

- Texture-Layout features are calculated

- A boosting classifier gives the probability of the pixel belonging to each class

- The discriminative model combines the boosting output with low-level color, location, and edge information; image receives final label.

Page 6: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

Most important part of the model is the Shape/Context Potential – it is significant for object recognition and very rough segmentation results.

Other potential such as Edge and Color refine the segmentation results.

(a) Original image, (b) Shape, (c) (b)+edge, (d) (c)+color

TEXTURE LAYOUT FEATURES

Page 7: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

For modeling object shape, appearance and context we use a New texton-based features.This feature (texton) compact and efficient characterisation of local texture.

Page 8: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

o The task is to recognize surfaces made from different materials on the basis of their texture appearance.o Different materials show different texture appearance.Moreover, texture appearance of the same material changes dramatically due to different viewpoint/lighting settings (specularities, shadows, and occlusions).

WHAT ARE TEXTONS

Page 9: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

Computing texton maps:

Texton mapColours Texton Indices

Input image

Clustering

Filter BankConvolve 17-D filter bank (composed of gaussians, dogs, logs) with all training images

Responses are clustered with K-means

Each pixel is assigned a texton number

CALCULATING TEXTURE-LAYOUT FEATURES

Page 10: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

Capturing appearance:

Page 11: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

How Texture-Layout features jointly model texture and layout:

Page 12: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

LEARNING

Learning is done with Joint Boost algorithm – A version of Multi class gentle boost algorithm.

I’ve used both AdaBoost.M1 and AdaBoost.Mh (multiclass reduction to binary which is due to the fact that AdaBoosting is only for binary classification).

Page 13: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

THE GOOD AND BAD

The Good: Provides reasonable recognition + segmentation for many classes. Also, combines several good ideas. Most of previous works didn’t tackle the problem as a whole – rather, problems were treated separately.

The Bad: Does not beat past work (in terms of quantitative recognition results) and a bit hacky.

Page 14: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

CODE-SEQUENCE OF EXECUTION

1. imagesTextonization.m(extract efficient images characterization)

2. calcModelFeatures.m(calculate the appearance (shape) potential

context)

3. trainModel.m(build a classification model)

4. testModel.m(test the classification model with test data)

Page 15: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour

RESULTS

Page 16: Machine Learning: Foundations Course TAU – 2012A Prof.  Yishay Mansour