complex feature recognition: a bayesian approach for learning to recognize objects by paul a. viola...

37
Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee Kanwalbir Sekhon Gauri Tembe

Upload: doreen-hopkins

Post on 17-Dec-2015

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Complex Feature Recognition: ABayesian Approach for Learning to

Recognize Objectsby

Paul A. Viola

Presented By:

Emrah CeyhanDivin Proothi

Sherwin ShaideeKanwalbir Sekhon

Gauri Tembe

Page 2: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Abstract

• The overall approach:– applicable to a wide range of object types– makes constructing object models easy– capable of identifying either the class or the

identity of an object– computationally efficient

Page 3: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Introduction

• The essential problem of object recognition is this:– given an image, what known object is most likely to have generated it?

• Among the confounding influences are pose, lighting, clutter and occlusion.

• A typical example of such a feature is an intensity edge.• Three main motivations for using simple features.

– it is assumed that simple features are detectable under a wide variety of pose and lighting changes.

– the resulting image representation is compact and discrete, consisting of a list of features and their positions.

– the position of these features in a novel image of an object can be predicted from knowledge of their positions in other images

Page 4: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Contd..

• A novel approach to image representation that does not use a single predefined feature.

• Use a large set of complex features that are learned from experience with model objects.

• The response of a single complex feature contains much more class information than does a single edge.

• Reduces the number of possible correspondences between the model and the image.

Page 5: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

A Generative Process for Images

• A generative process is much like a computer graphics rendering system.

• Our generative process is really somewhere between the direct and feature based approaches.

• Like feature based approaches, it uses features to represent images. – But, rather than extracting and localizing a single type of simple feature,

a more complex yet still local set of features is defined.

• Like direct techniques, it makes detailed predictions about the intensity of pixels in the image

Page 6: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

What is CFR?• Every image is a collection of distinct complex features

• Complex features are chosen so that they are distinct and stable.

• A distinct feature is one that appears no more than a few times in any image

• Stability has two related meanings:– the position of a stable feature changes slowly as the pose of an object changes

slowly; – a stable feature is present in a range of views of an object about some canonical

view.

Page 7: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Idea behind CFR

• A picture of a person can be a complex feature but it is unstable.

Page 8: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Idea behind CFR contd….

• Local pictures of the object serve as a better complex feature.

Page 9: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Idea behind CFR contd….

Page 10: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Idea behind CFR contd….

Page 11: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Distinct and Stable Complex Features

Page 12: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Oriented Energy

• Complex features in CFR are not matched directly with the image pixels, rather, we use intermediate representation called oriented energy.

• Oriented energy representation is a set of images showing different orientations.

• The value of a particular pixel in the vertical energy image is related to the likelihood that there is a vertical edge near that pixel in the original image.

Page 13: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Oriented Energy Contd….

Page 14: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Characteristics of CFR

• CFR uses variety of objects and poses rather than using a single feature.

• Each feature is detectable from a set of poses.

• Relative positions of the features can be used as additional information for recognition.

Page 15: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

The Theory of Complex Features

•An image is a vector of pixel values which have a bounded range of R.

Page 16: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Let S() be a sub-window function on images such that S(I,Li) is a sub-window of Ithat lies at position Li.

Conditional Probability of a particular image sub-window:

The Theory of Complex Features

Page 17: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

The Theory of Complex FeaturesProbability density of an image given M=(d,l) is:

Probability of a model given an image:

Page 18: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

The Theory of Complex Features

Computing object models:

Picking a single value for di in this case is misleading. The real situation is that di is about equally likely to be 1 or 0. Worse it confuses the two very distinct types of models: P(Di = 1 | I) >> P(Di = 0 | I) and P(Di = 1 | I) =P(Di = 0 | I)+ε. In experiments this type of maximum a posteriori model does not work well.

Page 19: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

The Theory of Complex Features

An alternative type of object model retains explicit information about P(Di | I):

The probability of an image given such a model is now really a mixturedistribution:

recognition algorithm for a probabilistic model.

Page 20: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

The Theory of Complex Features

• Recognition Algorithm CFR-MEM, because it explicitly memorizes the distribution of features in each of the model images.

• Recognition Algorithm CFR-DISC

Page 21: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Learning Features

• For each of a set of training images there should be at least one likely CFR model

• To model an image a set of features are required to fit a particular training image well.

• Good Features– Good features are those that can be used to form

likely models for an entire set of training images.

Page 22: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Technique for finding Good features

• This technique is based on the principle of maximum likelihood.

• Given– A sequence of images {I(t)}, t: index (time)

• If the probabilities of the these images are independent then the maximum likelihood estimate for fi is found by maximizing the likelihood l

Page 23: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Contd….

• Since di(t) and li(t) are unknown, we can either integrate them out or choose the best:

Page 24: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Gradient based Maximization

• Since computing the maximum of l can be quite difficult, gradient based maximization is used.

• Starting with an initial estimate for fi we compute the gradient of l with respect to fi, and take a step in that direction.

Page 25: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Algorithm

• Algorithm:– For each I(t) find the li(t) that maximizes:– This is implemented much like a convolution where

the point of largest response is chosen.– Extract S(I(t), li(t)) for each time step.– Compute the gradient of l with respect to fi.

Or

Page 26: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Contd…

• Take a small step in the direction of the gradient – Where

• Repeat until fi stabilizes.

Page 27: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

An Effective Representation

• It should be insensitive to foreseeable variations observed in images

• It should retain all of the necessary information required for recognition

• As the illumination & pose changes, the image pixels of an object will vary rapidly

• To insure good generalization pixelated representations used should be insensitive to these changes

Page 28: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

An Effective Representation

• Sensitivity to pose is directly related to the spatial smoothness

• If the pixelated images are very smooth, pixel values will change slowly as pose is varied

• It should enforce pixel smoothness without removing the information that is critical for discriminating features

Page 29: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

An Effective Representation

• Should smooth attenuating high-frequencies and reducing information

• Should preserve information about higher frequencies to preserve selectivity

• Oriented energy separates the smoothness of the representation from the frequency sensitivity of the representation

Page 30: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Oriented Energy

• It allows for a selective description of the face, without being overly constraining about the location of important properties

• Noses are strongly vertical pixels surrounded by the strongly horizontal pixels of the eyebrows

• Another major aspect of image variation is illumination

• Value of a pixel can change significantly with changes in lighting

Page 31: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Insights About Object Recognition

• Oriented energy is an effective means of representing images

• Features can be learned that are stable

• Images are well represented with complex features

Page 32: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Experiments - Handwritten Digits

• Oriented energy is a more effective representation than the pixels of an image

• Classify each novel digit to the class of the closest training digit

• Training set had 75 examples of each digit

Page 33: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Experiments - Handwritten Digits

• Using pixels of the images performance was 81%

• Using oriented energy representation performance was 94%.

Page 34: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Experiments - Object Dataset

• Tested CFR-MEM and CFR-DISC and used 20 features

Page 35: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Experiments - Face Dataset

• Tested CFR-MEM and CFR-DISC and used 20 features

Page 36: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Results

• In general CFR is very easy to use• For most part CFR runs without any

intervention• The features are learned, the models are

created and images are recognized without supervision

• Once trained, CFR takes no more than a couple of seconds to recognize each image

Page 37: Complex Feature Recognition: A Bayesian Approach for Learning to Recognize Objects by Paul A. Viola Presented By: Emrah Ceyhan Divin Proothi Sherwin Shaidee

Thank You